Refined memory/cpu cost models for ValueData and UnValueData
#7500
base: master
Conversation
Force-pushed 1bd6254 to 43bbe0a
Add new memory-analysis executable with modules for analyzing memory behavior of Plutus builtins. Includes plotting utilities, regression analysis, and experiment framework for deriving accurate memory models from empirical measurements.
Introduce DataNodeCount newtype that measures Data memory via lazy node traversal rather than serialization size. This provides more accurate memory accounting for UnValueData builtin which operates on the Data structure directly without serializing. The wrapper separates concerns: node counting logic here, cost coefficients in JSON models.
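As a rough sketch of the idea (simplified stand-ins for the real types; the actual Data and CostRose live in PlutusCore.Data and PlutusCore.Evaluation.Machine.ExMemoryUsage):

```haskell
-- Sketch only: illustrates the shape of a lazy "one cost unit per node"
-- traversal, not the real plutus-core implementation.
data Data
  = Constr Integer [Data]
  | Map [(Data, Data)]
  | List [Data]
  | I Integer
  | B [Integer]

data CostRose = CostRose Integer [CostRose]

-- Charge one unit per node; child roses are only built on demand, so the
-- structure is not forced any further than the consumer requires.
nodeRose :: Data -> CostRose
nodeRose d = CostRose 1 (map nodeRose (children d))
  where
    children (Constr _ ds) = ds
    children (Map kvs)     = concat [[k, v] | (k, v) <- kvs]
    children (List ds)     = ds
    children _             = []
```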
Add KnownTypeAst and builtin marshalling instances for DataNodeCount. This enables using the new memory model in builtin definitions while maintaining type safety through the universe system. Also includes minor refactoring (void instead of (() <$)) for clarity.
Force-pushed 43bbe0a to 073fe97
Apply ValueTotalSize to ValueData and DataNodeCount to UnValueData, replacing plain Value/Data types. This enables accurate memory accounting: ValueData uses total serialized size, UnValueData uses node count for measuring input Data complexity.
Update ValueData and UnValueData benchmarks to use createOneTermBuiltinBenchWithWrapper with appropriate memory measurement wrappers (ValueTotalSize and DataNodeCount). This ensures benchmarks measure the same memory behavior as production builtins.
Replace constant memory costs with linear models derived from empirical measurements:
- ValueData memory = 38 × size + 6 (was constant 1)
- UnValueData memory = 8 × nodes + 0 (was constant 1)
- UnValueData CPU = 290658 × nodes + 1000 (was 43200 × arg + 1000)

The linear models better reflect actual memory behavior: ValueData scales with serialized size, UnValueData scales with node count. Benchmark data regenerated with the new memory measurement approach.
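For concreteness, these formulas can be read as simple linear functions (illustrative Haskell; coefficients copied from the models above):

```haskell
-- Illustrative only: the linear cost models quoted above as functions.
valueDataMem :: Integer -> Integer
valueDataMem size = 38 * size + 6             -- memory for valueData

unValueDataMem :: Integer -> Integer
unValueDataMem nodes = 8 * nodes + 0          -- memory for unValueData

unValueDataCpu :: Integer -> Integer
unValueDataCpu nodes = 290658 * nodes + 1000  -- CPU for unValueData

-- e.g. a 100-node Data input: unValueDataMem 100 == 800
```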
Force-pushed 073fe97 to 026e835
@@ -0,0 +1,121 @@
module PlutusBenchmark.RegressionInteger (integerBestFit) where
I don't think we should implement our own linear regression algorithm when R is the domain standard for this.
Neither do I!
  "cpu": {
-   "arguments": 194713,
+   "arguments": 164434,
    "type": "constant_cost"
Obviously, ValueData cannot be constant time!
See the Slack discussion about this. We'll need to do something like using nf instead of whnf for the CPU costing for valueData, the problem being that the implementation of valueData begins with Map . ... and whnf won't cause the stuff in ... (which does all the hard work) to be evaluated.
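A toy illustration of the whnf/nf point (hypothetical names, not the actual benchmark code): forcing to weak head normal form stops at the outermost constructor, so the work hidden inside it is never timed.

```haskell
-- Toy sketch only. 'Wrap' stands in for the Map constructor that valueData
-- applies first; the list inside stands in for the expensive conversion.
data Result = Wrap [Integer]

mkResult :: Integer -> Result
mkResult n = Wrap (map (* 2) [1 .. n])   -- the "hard work" is inside Wrap

-- seq (mkResult 1000000) ()   -- WHNF: stops at Wrap, list never evaluated
-- A benchmark using whnf therefore times almost nothing; using nf (via
-- NFData/deepseq) would force the whole list and measure the real cost.
```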
kwxm
left a comment
Can you open a new PR that just updates the costs and doesn't include the memory cost inference stuff in PlutusBenchmark? I think that determining the memory costs empirically instead of just waving our hands and coming up with a rough estimate is a promising idea, but it's quite a big change and it's kind of orthogonal to what we're trying to achieve at the moment. As long as we have a measure of the memory cost it doesn't matter too much where it came from (although I do have some small reservations: see the comments). Keep the memory usage inference code somewhere though! We can come back and think about this more carefully after the pressure to get everything ready for the HF has relaxed.
However, the main thing that needs to be changed is the CPU costing for valueData: the current constant cost is definitely wrong, but that should be fixable.
| ValueContains'cpu'arguments'intercept
| ValueContains'cpu'arguments'slope
| ValueContains'memory'arguments
| ValueData'cpu'arguments
This will need to have an intercept and slope since the CPU cost of valueData should be linear (and similarly in the other ParamName files).
{-# INLINE memoryUsage #-}

-- Should be 72
-- Helper function to count nodes in a Data object, returning a lazy CostRose
I don't think you need all this. It should be enough to count the nodes (so the same thing, but replacing s with 1), and then runOneArgumentModel will do the scaling for you.
No, hold on. The only time this is called is with s=1 anyway, so why the extra generality?
-- Should be 72
-- Helper function to count nodes in a Data object, returning a lazy CostRose
-- with the slope applied per node. The intercept is applied once at the root
Is this comment correct? The ExMemoryUsage instance doesn't mention the intercept.
The actual memory formula (slope × nodeCount + intercept) is applied in the JSON cost model. -}
newtype DataNodeCount = DataNodeCount Data

instance ExMemoryUsage DataNodeCount where
I was a bit worried about what would happen here when we extend Data to have a Value field and valueData and unValueData use that. However I think that'll be OK. For instance we won't need to care about the number of nodes in the input to unValueData any more, but we can deal with that by updating the CPU and memory costing functions to have zero slope, effectively making them constant (although now I'm wondering if we'll still have to traverse the entire CostRose).
-valueDataBenchmark gen = createOneTermBuiltinBench ValueData [] (generateTestValues gen)
+valueDataBenchmark gen =
+  createOneTermBuiltinBenchWithWrapper
+    ValueTotalSize
Strictly we don't need this because ValueTotalSize gives the same result as the default memory usage instance, but I think we should keep the wrapper because it makes the size measure explicit. So this change is good!
 import PlutusCore.Evaluation.Machine.ExMemoryUsage
-  ( ValueLogOuterSizeAddLogMaxInnerSize (..)
+  ( DataNodeCount (..)
+  , ValueLogOuterSizeAddLogMaxInnerSize (..)
Note that @ana-pantilie's PR changes this to ValueMaxDepth, which is less cumbersome and a lot clearer.
memU :: ExMemoryUsage a => a -> Integer
memU x = fromSatInt (sumCostStream (flattenCostRose (memoryUsage x)))

-- | Measure size by walking the object graph in 64-bit words; resistant to heap churn
I'm a little dubious about this approach because (if I understand correctly) it measures the total size occupied by the result, which, because the result may share some data with the input, may be considerably larger than the amount of new heap space that has to be allocated. For example, in the case of valueData and unValueData there's no need to make new copies of the keys and quantities: if you call valueData v then (I think) the resulting data object will contain pointers to the keys and quantities in the input value v, not new copies of them (and v itself will I think only contain pointers to the actual keys and quantities). Keys take up 4 words in the worst case and quantities take up 2 words, and we probably need to subtract those numbers from the memory usage reported by measureGraphWords to get the true amount of new heap space allocated.
Maybe this isn't really a problem: the numbers returned by the analysis should definitely be upper bounds for the "true" memory usage, so the bounds will be safe. Also, the existing memory costs are generally pretty crude anyway, so a little bit of extra inaccuracy may be tolerable.
I think this issue would be more of a problem for things like tailList, where there really is a lot of sharing: if you have a list with 1000 elements then tailList will just return a pointer to the tail, not a new list with 999 elements. Similarly, bytestrings are implemented as C arrays in the heap together with a pointer to the start of the bytestring and an integer containing the length: sliceByteString doesn't copy any of the bytes in the bytestring, it just returns a small object containing the same array but with the pointer and the length updated. I have a vague memory that we didn't understand this at first and so overestimated the memory usage of sliceByteString. You'll see that the memory usage function is a linear function with slope zero, and I think that's because it originally had a nonzero slope but then we changed it when we realised what was going on.
These examples suggest that it'd be difficult to automatically infer the memory allocated by a builtin because you have to look at the implementation (which may change!) to see how much sharing there is in the inputs.
However, it's quite possible that I've misunderstood what's going on here, so let me know if that's the case.
I'll add that the PR description is very lengthy, but it doesn't explain exactly how the memory inference works out the total memory allocation. A crucial point is that it uses
Context
This PR refines the cost models for the ValueData and UnValueData builtins.

Problem Statement

The current cost models for ValueData and UnValueData use constant memory costs of 1 unit, regardless of input size. This is inaccurate because:
- ValueData converts Value to Data, so memory should scale with serialized size
- UnValueData converts Data to Value, so memory should scale with the node count of the Data structure

Inaccurate memory models can lead to budget misestimation in smart contracts.
Solution Approach
This PR implements a comprehensive solution in 6 logical commits:
1. Add a memory-analysis executable with plotting, regression, and experiment modules
2. Introduce the DataNodeCount newtype for node-based memory tracking
3. Integrate DataNodeCount into the DefaultUni type system
4. Apply memory measurement wrappers (ValueTotalSize, DataNodeCount) to the builtins
5. Update the benchmarks to use the same wrappers
6. Replace the constant cost models with linear models derived from measurements

Memory Measurement Strategy

- ValueTotalSize wrapper (already exists): measures total serialized size
- DataNodeCount wrapper: performs a lazy node traversal of the Data structure

This approach separates concerns:
- node/byte counting logic lives in ExMemoryUsage.hs (slope applied per node/byte)
- cost coefficients live in the JSON cost models

Design Decisions
Why node count for UnValueData?
- UnValueData converts Data → Value by traversing the Data tree structure
- Lazy node counting (via CostRose) ensures accurate accounting

Why separate wrappers?
- Each wrapper makes the size measure used by its builtin explicit
Changes
Memory Analysis Tooling
New executable:
plutus-benchmark:memory-analysis
- PlutusBenchmark.MemoryAnalysis: main analysis framework
- PlutusBenchmark.MemoryAnalysis.Experiments: memory behavior experiments
- PlutusBenchmark.MemoryAnalysis.Generators: test data generators
- PlutusBenchmark.Plotting: chart generation utilities
- PlutusBenchmark.RegressionInteger: regression with asymmetric loss

This tooling enabled empirical measurement of memory behavior to derive the coefficients used in the cost models.
Core Memory Tracking
plutus-core/src/PlutusCore/Evaluation/Machine/ExMemoryUsage.hs
- New DataNodeCount newtype wrapping Data
- ExMemoryUsage instance using countNodesRoseScaled
- New extensions: AllowAmbiguousTypes, BlockArguments, InstanceSigs, KindSignatures, ScopedTypeVariables

plutus-core/src/PlutusCore/Default/Universe.hs
- KnownTypeAst instance for DataNodeCount
- MakeKnownIn and ReadKnownIn instances for marshalling
- void instead of (() <$) for clarity

Builtin Updates
plutus-core/src/PlutusCore/Default/Builtins.hs
- ValueData: changed signature from Value -> Data to ValueTotalSize -> Data
- UnValueData: changed signature from Data -> BuiltinResult Value to DataNodeCount -> BuiltinResult Value

Benchmark Alignment
plutus-core/cost-model/budgeting-bench/Benchmarks/Values.hs
- Updated valueDataBenchmark to use createOneTermBuiltinBenchWithWrapper with ValueTotalSize
- Updated unValueDataBenchmark to use createOneTermBuiltinBenchWithWrapper with DataNodeCount

Cost Model Data
plutus-core/cost-model/data/builtinCostModel{A,B,C}.json
Updated models (all three variants updated identically):
- ValueData memory = 38 × size + 6 (was constant 1)
- UnValueData memory = 8 × nodes + 0 (was constant 1)
- UnValueData CPU = 290658 × nodes + 1000 (was 43200 × arg + 1000)
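For reference, a hedged sketch of what the updated unValueData entry in these JSON files might look like, inferred from the constant_cost entries visible in the diffs above (the exact field layout may differ):

```json
"unValueData": {
  "cpu":    { "arguments": { "intercept": 1000, "slope": 290658 }, "type": "linear_in_x" },
  "memory": { "arguments": { "intercept": 0, "slope": 8 }, "type": "linear_in_x" }
}
```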
plutus-core/cost-model/data/benching-conway.csv
Regenerated benchmark data (404 lines changed) with new memory measurement approach.
Impact
Budget Changes
Scripts using ValueData or UnValueData will see different memory budget consumption.

Conformance Tests

Expect budget differences in conformance tests that use these builtins. The new costs are more accurate than the previous constant models.
4. Memory Analysis
The memory-analysis executable can reproduce the experiments. This generates plots and regression analysis in plutus-benchmark/memory-analysis/data/.

5. Conformance Tests
cabal test plutus-conformance

Expect budget differences but correct behavior.
Notes for Reviewers
Commit Structure
The PR is organized as 6 atomic commits following dependency order.
Each commit is buildable and represents a logical unit of change.
The updated CPU models for ValueData and UnValueData can be previewed here.