Iterator time is exponential in number of Cache wrappings #10310

yihuang · 2021-10-06T02:08:04Z

Summary of Bug

evmos/ethermint#626
benchmark: evmos/ethermint#627
code of concern:

cosmos-sdk/store/cachekv/mergeiterator.go

Line 205 in 1c468de

func (iter *cacheMergeIterator) skipUntilExistsOrInvalid() bool {

In ethermint we use a nested cache context stack to support the Snapshot and RevertToSnapshot APIs for EVM.
When the EVM calls Snapshot, we push a new cache context based on the top one; on RevertToSnapshot, we discard the corresponding cache contexts; after execution, we commit all the cache contexts.
What we found is that when the context stack is deep, creating an iterator on it is extremely slow.
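
A minimal sketch of the snapshot stack described above, using plain maps instead of the SDK's cache contexts. The type and method names (`snapshotStack`, `Commit`, and so on) are hypothetical and only illustrate the push/discard/commit pattern; deletions are not modeled.

```go
package main

import "fmt"

// snapshotStack is a toy stand-in for a stack of cache contexts.
// It is NOT the ethermint or cosmos-sdk implementation.
type snapshotStack struct {
	base   map[string]string   // committed state
	layers []map[string]string // one write cache per snapshot, oldest first
}

func newSnapshotStack() *snapshotStack {
	return &snapshotStack{base: map[string]string{}}
}

// Snapshot pushes a new empty layer and returns its index as the snapshot id.
func (s *snapshotStack) Snapshot() int {
	s.layers = append(s.layers, map[string]string{})
	return len(s.layers) - 1
}

// RevertToSnapshot discards the layer created by Snapshot(id) and every layer above it.
func (s *snapshotStack) RevertToSnapshot(id int) {
	s.layers = s.layers[:id]
}

// Set writes to the top layer, or to the base if no snapshot is active.
func (s *snapshotStack) Set(k, v string) {
	if n := len(s.layers); n > 0 {
		s.layers[n-1][k] = v
		return
	}
	s.base[k] = v
}

// Get searches from the top layer down to the base.
func (s *snapshotStack) Get(k string) (string, bool) {
	for i := len(s.layers) - 1; i >= 0; i-- {
		if v, ok := s.layers[i][k]; ok {
			return v, true
		}
	}
	v, ok := s.base[k]
	return v, ok
}

// Commit folds every layer into the base, oldest first, and clears the stack.
func (s *snapshotStack) Commit() {
	for _, layer := range s.layers {
		for k, v := range layer {
			s.base[k] = v
		}
	}
	s.layers = nil
}

func main() {
	s := newSnapshotStack()
	s.Set("balance", "100")
	id := s.Snapshot()
	s.Set("balance", "50")
	s.RevertToSnapshot(id) // the write of "50" is dropped
	v, _ := s.Get("balance")
	fmt.Println(v) // prints 100
}
```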

Version

v0.44.1

Steps to Reproduce

For Admin Use

Not duplicate issue
Appropriate labels applied
Appropriate contributors tagged
Contributor assigned/self-assigned

ValarDragon · 2021-10-07T00:49:37Z

We should get pprof output and analyze what's going on!

I tried getting a pprof, but I can't build the ethermint library on my machine...

But also, it sounds like a misdesign to be wrapping the store 34 times, as mentioned in that PR, and that should be fixed. (There's a lot of overhead in doing so, beyond anything in the CacheKVStore.)

Can you either get pprof output, or make a simplified PR here so I can try to get one? (Just add -cpuprofile cpu.out to your benchmark invocation, then upload that along with the generated binary that ends in .test. You can investigate the output with go tool pprof cpu.out.)
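
If running the benchmark through `go test` is not practical, a CPU profile can also be collected programmatically with the standard `runtime/pprof` package. The sketch below is an assumption about how one might do that; `busyWork` and `profileCPU` are hypothetical helpers, not part of ethermint or the SDK.

```go
package main

import (
	"bytes"
	"fmt"
	"runtime/pprof"
)

// busyWork is a placeholder for the code under investigation.
func busyWork(n int) int {
	sum := 0
	for i := 0; i < n; i++ {
		sum += i % 7
	}
	return sum
}

// profileCPU runs fn while a CPU profile is being recorded and
// returns the raw profile bytes (the same format go tool pprof reads).
func profileCPU(fn func()) ([]byte, error) {
	var buf bytes.Buffer
	if err := pprof.StartCPUProfile(&buf); err != nil {
		return nil, err
	}
	fn()
	pprof.StopCPUProfile() // flushes the profile into buf
	return buf.Bytes(), nil
}

func main() {
	data, err := profileCPU(func() { busyWork(50_000_000) })
	if err != nil {
		fmt.Println("profiling failed:", err)
		return
	}
	fmt.Printf("collected %d bytes of profile data\n", len(data))
}
```

Writing the returned bytes to a file such as cpu.out would give something `go tool pprof` can open.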

ValarDragon · 2021-10-09T04:49:06Z

Ah, in the code, this is definitely exponential time with respect to CacheKVStore wrapping depth. https://github.com/cosmos/cosmos-sdk/blob/master/store/cachekv/store.go#L168-L184

It's creating its own iterator over its own cache, and over its parent. Its parent will then create an iterator over its own cache, and over its parent. This causes a branching factor of 2 at every depth, hence time will follow 2^n.

Can you explain more about why you're nesting the caches?

For a nested cache, you basically want one iterator at every level, but implementing that would require a native concept of 'depth' in the stores, and ways of distinguishing 'thin' store layers (gasKV) from 'thick' ones, e.g. CacheKVStore.

yihuang · 2021-10-11T03:35:04Z

> Ah, in the code, this is definitely exponential time with respect to CacheKVStore wrapping depth. https://github.com/cosmos/cosmos-sdk/blob/master/store/cachekv/store.go#L168-L184
>
> It's creating its own iterator over its own cache, and over its parent. Its parent will then create an iterator over its own cache, and over its parent. This causes a branching factor of 2 at every depth, hence time will follow 2^n.

👍

> Can you explain more about why you're nesting the caches?

Nested caches are a convenient way to implement nested exception reverts, which we need when embedding the EVM. There are other ways to do it, but nested caches are the simplest.
And I think the complexity of get/set on deeply nested caches is predictable: set is constant, and get is O(N), where N is the depth of the cache stack.
The issue here is just that we accidentally did an iteration without being aware of its computational complexity. In our case, we can flatten the stack before iterating to fix the issue at hand.
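
A toy illustration of the O(N) get and of flattening the stack before an expensive operation. `layeredStore` and its methods are hypothetical names, plain maps stand in for the real caches, and deletions are ignored.

```go
package main

import "fmt"

// layeredStore is a toy stack of write caches; layers[0] is the bottom.
type layeredStore struct {
	layers []map[string]string
}

// push adds a new empty layer on top.
func (s *layeredStore) push() {
	s.layers = append(s.layers, map[string]string{})
}

// set writes to the top layer only, so its cost does not depend on depth.
func (s *layeredStore) set(k, v string) {
	s.layers[len(s.layers)-1][k] = v
}

// get walks from the top layer down and reports how many layers it probed.
func (s *layeredStore) get(k string) (value string, probes int, ok bool) {
	for i := len(s.layers) - 1; i >= 0; i-- {
		probes++
		if v, found := s.layers[i][k]; found {
			return v, probes, true
		}
	}
	return "", probes, false
}

// flatten merges all layers, oldest first, into a single layer.
func (s *layeredStore) flatten() {
	merged := map[string]string{}
	for _, layer := range s.layers {
		for k, v := range layer {
			merged[k] = v
		}
	}
	s.layers = []map[string]string{merged}
}

func main() {
	s := &layeredStore{}
	s.push()
	s.set("k", "bottom")
	for i := 0; i < 12; i++ {
		s.push() // 12 empty layers on top of the one holding "k"
	}
	_, before, _ := s.get("k")
	s.flatten()
	_, after, _ := s.get("k")
	fmt.Println(before, after) // prints 13 1
}
```

Note that flattening gives up the ability to revert to earlier snapshots, so it only fits points where no revert is pending.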

> For a nested cache, you basically want one iterator at every level, but implementing that would require a native concept of 'depth' in the stores, and ways of distinguishing thin store layers (gasKV) and 'thick' ones e.g. CacheKVStore.

tomtau · 2021-11-09T02:57:54Z

@ValarDragon for reference, pprof from a related issue:
evmos/ethermint#710 (comment)

robert-zaremba · 2021-12-02T16:44:12Z

@yihuang will you be able to handle this issue?

ValarDragon · 2021-12-03T16:47:25Z

I feel like this is something we should emit a warning for, but otherwise get downstream things to just not do this access pattern.

Also, increasing the IAVL cache size helps significantly here.

yihuang · 2021-12-04T02:58:14Z

> @yihuang will you be able to handle this issue?

In ethermint we already changed our approach to avoid a deep cache context stack.
I haven't looked into the root cause, and I'm not sure it's worthwhile to put much effort into this; maybe we just need to make people aware of the issue so they avoid deep cache context stacks.

robert-zaremba · 2021-12-17T00:21:53Z

So, normally this is not an issue in the core SDK. I'm not sure how we could log a warning without tracking the nesting level: we would need to add one more parameter to the cache and pass it, incremented, whenever we create a new cache.
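
A sketch of what tracking the nesting level could look like, purely as an illustration. `cacheStore`, `newCacheStore`, `iteratorWarning`, and the threshold `maxRecommendedDepth` are all hypothetical and do not exist in the SDK.

```go
package main

import "fmt"

// maxRecommendedDepth is an arbitrary, hypothetical threshold.
const maxRecommendedDepth = 8

// cacheStore is a toy wrapper that only records how deeply it is nested.
type cacheStore struct {
	parent *cacheStore
	depth  int
}

// newCacheStore wraps parent (nil for the first layer) and
// passes the parent's depth along, incremented by one.
func newCacheStore(parent *cacheStore) *cacheStore {
	depth := 1
	if parent != nil {
		depth = parent.depth + 1
	}
	return &cacheStore{parent: parent, depth: depth}
}

// iteratorWarning returns a message when iterating at this depth
// exceeds the recommended limit.
func (s *cacheStore) iteratorWarning() (string, bool) {
	if s.depth > maxRecommendedDepth {
		return fmt.Sprintf("iterating over %d nested cache layers may be slow", s.depth), true
	}
	return "", false
}

func main() {
	var top *cacheStore
	for i := 0; i < 13; i++ {
		top = newCacheStore(top)
	}
	if msg, warn := top.iteratorWarning(); warn {
		fmt.Println("warning:", msg)
	}
}
```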

yihuang · 2021-12-17T03:13:21Z

> So, normally this is not an issue in the core SDK. I'm not sure how we could log a warning without tracking the nesting level: we would need to add one more parameter to the cache and pass it, incremented, whenever we create a new cache.

Maybe just put something in the comments for now?

ValarDragon · 2022-04-21T01:53:40Z

I think my prior comment was incorrect; I don't see how this is exponential time in the number of cache wrappings.
Each layer does two iterations: one over the struct's local cache, and one over the parent.

The parent would then cause two iterations. The local cache would cause no further branching, as it's just over this layer's local sorted cache.
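
The two counting models in this thread can be compared directly. The sketch below is only a model of the argument, not of the SDK code: `leafIterators` counts underlying iterators when only the parent side recurses, and `branchingIterators` counts them when both sides are assumed to recurse.

```go
package main

import "fmt"

// leafIterators models the argument above: each cache layer opens one
// iterator over its local cache (no further recursion) plus its parent's
// iterator. Depth 0 is the base store with a single iterator.
func leafIterators(depth int) int {
	if depth == 0 {
		return 1
	}
	return 1 + leafIterators(depth-1)
}

// branchingIterators models the earlier assumption that both sides
// recurse, which doubles the count at every layer.
func branchingIterators(depth int) int {
	if depth == 0 {
		return 1
	}
	return 2 * branchingIterators(depth-1)
}

func main() {
	for _, d := range []int{1, 10, 13} {
		fmt.Printf("depth %2d: linear model %d, branching model %d\n",
			d, leafIterators(d), branchingIterators(d))
	}
}
```

Under the first model the number of iterators created grows as n+1, so iterator construction alone would not explain an exponential curve.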

yihuang · 2022-04-21T02:10:27Z

> I think my prior comment was incorrect; I don't see how this is exponential time in the number of cache wrappings. Each layer does two iterations: one over the struct's local cache, and one over the parent.
>
> The parent would then cause two iterations. The local cache would cause no further branching, as it's just over this layer's local sorted cache.

I wrote a benchmark for this before: https://github.com/tharsis/ethermint/blob/release/v0.7.x/x/evm/keeper/benchmark_test.go#L100
The result:

$ go test -v ./x/evm/keeper -v -run="^$" -bench="BenchmarkDeepContextStack"
BenchmarkDeepContextStack1
BenchmarkDeepContextStack1-16     	  236114	      5047 ns/op
BenchmarkDeepContextStack10
BenchmarkDeepContextStack10-16    	      30	  34824724 ns/op
BenchmarkDeepContextStack13
BenchmarkDeepContextStack13-16    	       1	2250838178 ns/op

Not sure if it's exponential, but the curve is steep.
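
One way to read these numbers is to compute the average growth factor per added layer between two measurements, assuming the time is multiplied by a constant factor for each layer in that interval. `perLayerFactor` is a hypothetical helper; the inputs are the ns/op values from the benchmark output above.

```go
package main

import (
	"fmt"
	"math"
)

// perLayerFactor returns the geometric-mean slowdown per extra layer
// between two measurements (t1 at depth d1, t2 at depth d2).
func perLayerFactor(t1, t2 float64, d1, d2 int) float64 {
	return math.Pow(t2/t1, 1/float64(d2-d1))
}

func main() {
	// ns/op taken from the benchmark results for depths 1, 10 and 13.
	f1 := perLayerFactor(5047, 34824724, 1, 10)
	f2 := perLayerFactor(34824724, 2250838178, 10, 13)
	fmt.Printf("depth 1 to 10:  x%.2f per layer\n", f1)
	fmt.Printf("depth 10 to 13: x%.2f per layer\n", f2)
}
```

The factors come out at roughly 2.7 and 4.0 per layer. A pure exponential would give the same factor in both intervals, so three data points do not pin down the exact curve, but each extra layer multiplies the cost rather than adding to it.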

yihuang · 2022-11-16T03:13:40Z

I did some profiling and optimization.

before:

BenchmarkDeepContextStack1-12     	  118875	      9095 ns/op
BenchmarkDeepContextStack10-12    	      21	  55358983 ns/op
BenchmarkDeepContextStack13-12    	       1	3444975408 ns/op

after:

BenchmarkDeepContextStack1-12     	  127264	      8550 ns/op
BenchmarkDeepContextStack10-12    	   25220	     48912 ns/op
BenchmarkDeepContextStack13-12    	   19500	     65458 ns/op
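
The linked PR summarizes its fix as "cache the valid status". The toy model below shows why memoizing a validity check can turn an exponential number of calls into a linear one. It is not the SDK's iterator: the `layer` type is hypothetical, and the assumption that each Valid() call consults its parent twice is chosen only for illustration.

```go
package main

import "fmt"

// layer models one nested iterator whose Valid() depends on its parent.
type layer struct {
	parent *layer
	calls  *int // shared counter of Valid() invocations
	cache  bool // whether to memoize the result
	known  bool // true once valid has been computed
	valid  bool
}

// Valid counts itself, then (ASSUMPTION) asks the parent twice,
// standing in for repeated validity checks during one step.
func (l *layer) Valid() bool {
	*l.calls++
	if l.parent == nil {
		return true // base iterator: answers directly
	}
	if l.cache && l.known {
		return l.valid
	}
	v := l.parent.Valid() && l.parent.Valid()
	if l.cache {
		l.known, l.valid = true, v
	}
	return v
}

// countCalls builds a stack of the given depth on top of a base layer
// and returns how many Valid() calls one top-level check triggers.
func countCalls(depth int, cache bool) int {
	calls := 0
	top := &layer{calls: &calls}
	for i := 0; i < depth; i++ {
		top = &layer{parent: top, calls: &calls, cache: cache}
	}
	top.Valid()
	return calls
}

func main() {
	for _, d := range []int{1, 10, 13} {
		fmt.Printf("depth %2d: uncached %d calls, cached %d calls\n",
			d, countCalls(d, false), countCalls(d, true))
	}
}
```

In this model the uncached count is 2^(n+1)-1 and the cached count is 2n+1. A real iterator would also have to reset the cached status whenever it advances, which the sketch leaves out.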

yihuang mentioned this issue Oct 6, 2021

Follow up tracking issue on deep context stack efficiency evmos/ethermint#626

Closed

ValarDragon changed the title ~~Problem: cacheMergeIterator complexity curve is extremely steep against the depth of nested cache contexts~~ Iterator time is exponential in number of Cache wrappings Oct 9, 2021

ValarDragon added C:Store T: Performance Performance improvements labels Oct 9, 2021

tomtau mentioned this issue Nov 9, 2021

Problem: contract call which does lots of message calls is slow to execute evmos/ethermint#729

Merged

11 tasks

ValarDragon mentioned this issue Apr 20, 2022

Catch panics in Non-tx related code osmosis-labs/osmosis#1305

Open

3 tasks

ValarDragon mentioned this issue Apr 21, 2022

Epoch hooks, panic recovery osmosis-labs/osmosis#1308

Merged

This was referenced Oct 7, 2022

Support a cosmos-sdk friendly eth msg evmos/ethermint#1227

Closed

Stateful Precompile Developer UX evmos/ethermint#1363

Closed

yihuang mentioned this issue Nov 16, 2022

perf: optimize iteration on nested cache context #13881

Merged

19 tasks

yihuang added a commit to yihuang/cosmos-sdk that referenced this issue Nov 16, 2022

Optimize iteration on nested cache context

1874a29

Closes: cosmos#10310 Solution: - cache the valid status

tac0turtle closed this as completed in #13881 Dec 16, 2022

tac0turtle pushed a commit that referenced this issue Dec 16, 2022

perf: optimize iteration on nested cache context (#13881)

cbee1b3

Co-authored-by: Aleksandr Bezobchuk <[email protected]> Co-authored-by: Marko <[email protected]> Closes #10310

SpicyLemon mentioned this issue Jan 25, 2023

Nested Store Prefix Iterators Interfere With Each Other #14786

Closed
