Add code size cache to the kvt backend by bhartnett · Pull Request #4214 · status-im/nimbus-eth1

bhartnett · 2026-05-06T07:42:58Z

Geth has a similar code size cache which holds a million values in a LRU. For the kvt rocksdb backend we don't have any caching and the rocksdb row cache is disabled so this should in theory provide a speedup.

…esize-cache

bhartnett · 2026-05-06T14:01:19Z

Ran a block import of 200000 blocks starting from around block 23 million:

master -
elapsed=12m48s897ms

codesize branch -
elapsed=12m22s962ms

Seems like not much improvement since code is already cached in the ledger. For the forkchain block import which doesn't reuse the caches between blocks, the improvement would likely be more.

bhartnett · 2026-05-06T15:29:26Z

@arnetheduck Do you have any thoughts on this PR?

We could either go with this code size cache or just rely on the code cache to return the code sizes.

I had a look at what the other EL clients do for caching code and it looks like only Geth caches code sizes separately. Most of the other ELs just cache code using a variable weighted LRU so that larger code has more weight than smaller code.

Either way we will also need to do something about the current code cache in the ledger which doesn't get reused across blocks because in the forked chain module, a new ledger is created when processing each block.

Perhaps if we use a variable weighted LRU then we can safely fit more code in the cache and then there isn't so much need for a separate code size cache.

arnetheduck · 2026-05-07T06:43:51Z

For the forkchain block import

sounds to me like an opportunity to extend the use of the code cache - a notable source of cpu usage is actually the code scanning done for jump analysis - in the future we might want to cache other "analysis" done on the bytecode (aka jit optimizations). Broadly, we want linear forked chain and import performance to be roughly the same (minus state root verification).

weighted cache

I guess the risk here is that you can damage the code cache with code size requests - the more pre-computation we perform on the code, the greater this risk (afair they were mispriced at some early point in the chain) - that said, I've thought about introducing weighted caches elsewhere (leaf vs branch in the mpt, in particular) so it's certainly a track worth investigating from a perf point of view.

bhartnett added 8 commits May 6, 2026 08:00

Add code size cache to the kvt backend.

440f8ad

Fix comments.

3d85f5c

Add getCodeSize ledger tests.

1eb1340

Fetch from layers before cache.

00e12d7

Merge remote-tracking branch 'origin/kvt-codesize-cache' into kvt-cod…

f7f648e

…esize-cache

Fix introduced bug.

484af5c

Fix test.

63ae8a4

Update const name.

021a5f0

bhartnett requested a review from arnetheduck May 6, 2026 15:21

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add code size cache to the kvt backend#4214

Add code size cache to the kvt backend#4214
bhartnett wants to merge 8 commits into
masterfrom
kvt-codesize-cache

bhartnett commented May 6, 2026 •

edited

Loading

Uh oh!

bhartnett commented May 6, 2026 •

edited

Loading

Uh oh!

bhartnett commented May 6, 2026

Uh oh!

arnetheduck commented May 7, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

bhartnett commented May 6, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

bhartnett commented May 6, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

bhartnett commented May 6, 2026

Uh oh!

arnetheduck commented May 7, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

bhartnett commented May 6, 2026 •

edited

Loading

bhartnett commented May 6, 2026 •

edited

Loading