
refactor: simplify mpt_trie's API #400

Closed · wants to merge 21 commits
Conversation

@0xaatif (Contributor) commented Jul 16, 2024

trait PartialTrie abstracts over a fully hydrated trie StandardTrie, and a partial trie HashedPartialTrie:

/// Any node in the trie may be replaced by its [hash](Self::hash) in a
/// [Node::Hash], and the root hash of the trie will remain unchanged.
///
/// ```text
/// R R'
/// / \ / \
/// A B H B
/// / \ \ \
/// C D E E
/// ```
///
/// That is, if `H` is `A`'s hash, then the roots of `R` and `R'` are the same.

However, our code only uses HashedPartialTrie - we can simplify our API, which will make moving towards a unified backend for SMT and MPT easier (#275)
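The hash-replacement property in the doc comment above can be sketched with a toy model. The enum and hash function below are illustrative stand-ins (a `u64` hash via `DefaultHasher`, not mpt_trie's actual Keccak-based trie hashing), but the structure mirrors the idea: a `Hash` variant reports a stored hash in place of a full subtree, so eliding a subtree leaves the root hash unchanged.

```rust
use std::collections::hash_map::DefaultHasher;
use std::hash::{Hash, Hasher};

#[derive(Clone)]
enum Node {
    Leaf(Vec<u8>),
    Branch(Box<Node>, Box<Node>),
    Hash(u64), // a subtree elided down to its hash
}

impl Node {
    fn hash(&self) -> u64 {
        match self {
            // An elided subtree simply reports its stored hash.
            Node::Hash(h) => *h,
            Node::Leaf(bytes) => {
                let mut s = DefaultHasher::new();
                bytes.hash(&mut s);
                s.finish()
            }
            // A branch's hash depends only on its children's hashes,
            // which is what makes the replacement transparent.
            Node::Branch(l, r) => {
                let mut s = DefaultHasher::new();
                (l.hash(), r.hash()).hash(&mut s);
                s.finish()
            }
        }
    }
}

fn main() {
    // R = Branch(A, B), as in the diagram above.
    let a = Node::Branch(Box::new(Node::Leaf(vec![1])), Box::new(Node::Leaf(vec![2])));
    let b = Node::Leaf(vec![3]);
    let r = Node::Branch(Box::new(a.clone()), Box::new(b.clone()));
    // R' replaces A with H = A's hash.
    let r2 = Node::Branch(Box::new(Node::Hash(a.hash())), Box::new(b));
    assert_eq!(r.hash(), r2.hash());
}
```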

Changes

  • Remove StandardTrie.
  • Remove trait PartialTrie and HashedPartialTrie, replacing them with a top-level mpt_trie::Node.
  • Arc<Box<Node>> -> Arc<Node>.
  • Introduce FrozenNode to handle hash caching, instead of RwLock-ing on HashedPartialTrie.
    It turns out that evm_arithmetization only uses FrozenNode, which is a good thing for me to know :)
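The freeze/thaw shape proposed above can be sketched as follows. The names FrozenNode, freeze, thaw, and hash come from this PR description, but the implementation here is a guess for illustration: a plain struct with a `u64` hash stands in for the real node type and its Keccak H256 hash.

```rust
// Stand-in for mpt_trie::Node: just a byte buffer for illustration.
#[derive(Clone, Default)]
struct Node(Vec<u8>);

impl Node {
    fn compute_hash(&self) -> u64 {
        // Toy hash, not Keccak.
        self.0
            .iter()
            .fold(0u64, |acc, b| acc.wrapping_mul(31).wrapping_add(*b as u64))
    }

    // Freezing computes the hash once and stores it alongside the node.
    fn freeze(self) -> FrozenNode {
        let hash = self.compute_hash();
        FrozenNode { node: self, hash }
    }
}

struct FrozenNode {
    node: Node,
    hash: u64,
}

impl FrozenNode {
    // Immutable access: the cached hash is returned without rehashing.
    fn hash(&self) -> u64 {
        self.hash
    }

    // Mutation requires thawing, which discards the cached hash.
    fn thaw(self) -> Node {
        self.node
    }
}

fn main() {
    let frozen = Node(vec![1, 2, 3]).freeze();
    let h = frozen.hash();
    let mut thawed = frozen.thaw();
    thawed.0.push(4); // mutate, then re-freeze to hash again
    assert_ne!(h, thawed.freeze().hash());
}
```

The type system thus replaces the RwLock: a FrozenNode is immutable, so its cached hash can never go stale.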

@github-actions bot added labels: crate: trace_decoder, crate: evm_arithmetization, crate: mpt_trie, crate: zero_bin (Jul 16, 2024)
@Nashtare (Collaborator) left a comment

Thanks, overall looking good to me. Mostly nit comments.

Resolved review threads on: evm_arithmetization/benches/fibonacci_25m_gas.rs, mpt_trie/src/partial_trie.rs, mpt_trie/src/utils.rs
@Nashtare Nashtare added this to the System strengthening milestone Jul 16, 2024
@0xaatif 0xaatif mentioned this pull request Jul 16, 2024
@Nashtare (Collaborator) left a comment

Thanks Aatif!
We may want another pair of eyes on the refactor (perhaps @BGluth given the library is his work), but otherwise LGTM.

@BGluth (Contributor) commented Jul 16, 2024

Hmm... So I get what you're trying to do, but we're throwing away one important optimization with this PR.

With freeze(), we're not really taking advantage of caching hashes at all in trace_decoder. We are "freezing" the trie right when all the mutations stop, then never accessing the cached values again. Then we do some more inserts (mutate the tries some more), and recalculate the same hashes that we previously hashed but have thrown out.

let mut curr_block_tries = PartialTrieState {
    state: self.tries.state.clone(),
    storage: self.tries.storage.clone(),
    ..Default::default()
};

let state_trie = create_minimal_state_partial_trie(
    &curr_block_tries.state,
    nodes_used_by_txn.state_accesses.iter().cloned(),
    delta_application_out
        .additional_state_trie_paths_to_not_hash
        .into_iter(),
)?
.freeze();
let txn_k = Nibbles::from_bytes_be(&rlp::encode(&txn_idx)).unwrap();
let transactions_trie =
    create_trie_subset_wrapped(&curr_block_tries.txn, once(txn_k), TrieType::Txn)?.freeze();
let receipts_trie =
    create_trie_subset_wrapped(&curr_block_tries.receipt, once(txn_k), TrieType::Receipt)?
        .freeze();

Also, if we ever call hash() at some point and then mutate the trie a bit, it will still cache any node hashes that were not below any mutated nodes. Essentially, if we're calling hash() in between trie mutations, the current setup is probably going to be more performant.

I personally would not go with the freeze API change and would keep the existing setup (maybe without the Arc<_>, if possible). With how freeze() is being used, we're not getting any benefits from hash caching.

@0xaatif (Contributor, Author) commented Jul 16, 2024

Thanks for your review @BGluth :)

I don't quite follow the claim of optimization loss though - could you help me understand?
On the lines you link, we freeze because that's the type expected by evm_arithmetization (which never makes any mutations); I'm not expecting any optimizations there.

AIUI, HashedPartialTrie is strictly equivalent to FrozenNode.

let mut hpt = HashedPartialTrie::default();
hpt.hash();
hpt.insert(...);
hpt.hash();

let frz = FrozenNode::default();
frz.hash();
let mut thw = frz.thaw();
thw.insert(...);
thw.freeze().hash();

Both contain the exact same number of calls to trie_hash, and the internal nodes don't do any caching themselves.

Am I missing something?

@0xaatif (Contributor, Author) commented Jul 16, 2024

> recalculate the same hashes that we previously hashed but have thrown out.

I see that this causes duplicate work, but I think the previous implementation did this too - Nodes never cached their own hashes.

I feel positive about a Cow<Node> with a cached hash, for example, but that should wait for a future PR, no?

@BGluth (Contributor) commented Jul 16, 2024

> Contain the exact same number of calls to trie_hash - and the internal nodes don't do any caching themselves

> I see that this causes duplicate work, but I think the previous implementation did this too - Nodes never cached their own hashes.

So the number of calls to hash() inside trace_decoder is the same between the two interfaces, but the amount of hashing internally is a lot worse with the freeze setup.

With the old setup, every node actually does cache its own hash. It's not obvious at a glance, since the definition of Node clearly does not do any caching. However, the non-obvious thing here is that the "nodes" in a HashedPartialTrie are actually HashedPartialTries themselves:

/// A partial trie that lazily caches hashes for each node as needed.
/// If you are doing frequent hashing of node, you probably want to use this
/// `Trie` variant.
#[derive(Clone, Debug, Default, Deserialize, Serialize)]
pub struct HashedPartialTrie {
    pub(crate) node: Node<HashedPartialTrie>,
    pub(crate) hash: Arc<RwLock<Option<H256>>>,
}

The idea here is that technically a single Node is a trie, and the trie type (i.e. StandardTrie & HashedPartialTrie) potentially wraps the Node with some additional metadata (i.e. a cached hash). So this ends up meaning that any time we calculate a hash, all of the child nodes also calculate and cache their hashes. If a mutation ever changes a node, then only the current node & parents up to the root have their cached hash invalidated (note that I actually accidentally reversed this in my response above: "it will still cache any node hashes that were not below any mutated nodes" is not correct). It only affects the node and the nodes upwards.
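This invalidation behavior can be sketched in miniature. The names and structure below are illustrative, not mpt_trie's code: each node carries its own cached hash (as HashedPartialTrie does), and a mutation clears the caches only on the path from the mutated node up to the root, leaving sibling subtrees' caches intact.

```rust
use std::cell::RefCell;

struct CachedNode {
    value: u64,
    children: Vec<CachedNode>,
    // Per-node cached hash, mirroring HashedPartialTrie's cached H256.
    cache: RefCell<Option<u64>>,
}

impl CachedNode {
    fn leaf(value: u64) -> Self {
        Self { value, children: vec![], cache: RefCell::new(None) }
    }

    fn branch(children: Vec<CachedNode>) -> Self {
        Self { value: 0, children, cache: RefCell::new(None) }
    }

    fn hash(&self) -> u64 {
        if let Some(h) = *self.cache.borrow() {
            return h; // cache hit: no recursion into children
        }
        let h = self.children.iter().fold(self.value, |acc, c| {
            acc.wrapping_mul(31).wrapping_add(c.hash())
        });
        *self.cache.borrow_mut() = Some(h);
        h
    }

    // Mutate the node at `path`, clearing cached hashes along the way down,
    // which is exactly the path from the mutated node up to the root.
    fn set(&mut self, path: &[usize], value: u64) {
        *self.cache.borrow_mut() = None; // this node's hash is now stale
        match path.split_first() {
            None => self.value = value,
            Some((i, rest)) => self.children[*i].set(rest, value),
        }
    }
}

fn main() {
    let mut root = CachedNode::branch(vec![
        CachedNode::branch(vec![CachedNode::leaf(1), CachedNode::leaf(2)]),
        CachedNode::leaf(3),
    ]);
    root.hash(); // fills every node's cache
    root.set(&[1], 99); // clears only the root and the right leaf's caches
    assert!(root.cache.borrow().is_none()); // root invalidated
    assert!(root.children[0].cache.borrow().is_some()); // left subtree survives
}
```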

This means that two different tries can share a common child. Like consider this:

    B
   / \
  L   E

Where the extension node has a very dense sub-trie beneath it. If we perform an insert that converts the extension node into a branch node:

    B
   / \
  L   B

Then both the extension node and branch node will contain the same child node sub-trie (like in terms of the same piece of memory). If the entire child sub-trie was cached, then they will both have access to the already cached hashes.
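The sharing described here can be sketched with an `Arc<RwLock<Option<_>>>` cache, mirroring HashedPartialTrie's `hash` field; the struct below is a hypothetical stand-in, with a counter added so we can observe that the expensive hash runs only once even though two parents hold the child.

```rust
use std::sync::{Arc, RwLock};

struct CachedChild {
    data: Vec<u8>,
    cache: RwLock<Option<u64>>,
    // Counts how many times the expensive hash actually runs.
    computations: RwLock<u32>,
}

impl CachedChild {
    fn new(data: Vec<u8>) -> Self {
        Self {
            data,
            cache: RwLock::new(None),
            computations: RwLock::new(0),
        }
    }

    fn hash(&self) -> u64 {
        if let Some(h) = *self.cache.read().unwrap() {
            return h; // cache hit: no recomputation
        }
        *self.computations.write().unwrap() += 1;
        // Toy hash standing in for hashing the whole sub-trie.
        let h = self
            .data
            .iter()
            .fold(0u64, |a, b| a.wrapping_mul(31).wrapping_add(*b as u64));
        *self.cache.write().unwrap() = Some(h);
        h
    }
}

fn main() {
    let shared = Arc::new(CachedChild::new(vec![1, 2, 3]));
    let parent_ext = Arc::clone(&shared);    // child under the extension node
    let parent_branch = Arc::clone(&shared); // same child under the new branch
    parent_ext.hash();
    parent_branch.hash(); // cache hit: filled via the other parent
    assert_eq!(*shared.computations.read().unwrap(), 1);
}
```

Both parents point at the same piece of memory, so whichever one hashes first fills a cache the other can reuse.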

Also, I'm fine if you want to remove StandardTrie. However, I also wrote this library with the idea that other users outside of us might be using it, so even if we are not using StandardTrie ourselves, there's a chance that someone else is. I don't think there's a huge performance hit from someone using a HashedPartialTrie vs. a StandardTrie and never calling hash(), so if people want to remove it, I'm ok with that (wdyt @Nashtare?). The support is already there, so I'm leaning towards keeping it personally.

(An earlier comment from @0xaatif was minimized.)

@0xaatif (Contributor, Author) commented Jul 16, 2024

> With the old setup, every node actually does cache its own hash. It's actually not obvious when you glance at the code since the definition of Node clearly does not do any caching. However, the non-obvious thing here is the "nodes" in a HashedPartialTrie are actually HashedPartialTries themselves

Ah I totally missed that! I'll refactor accordingly :) thanks for the patient explanation :)

As for StandardTrie, I'm keen to keep our codebase lean. I think our internals are sufficiently exposed, and there are enough other libraries out there, that a user would find it easy to implement their own.

@0xaatif 0xaatif marked this pull request as draft July 17, 2024 12:37
@@ -1,126 +1,37 @@
//! Definitions for the core types [`PartialTrie`] and [`Nibbles`].
A Member commented: I think there are a few places to update comments from PartialTrie to Node.

 where
     K: Into<Nibbles>,
 {
     TriePathIter {
-        curr_node: trie.clone().into(),
+        curr_node: Arc::new(trie.clone()),
A Member commented:
Non-blocking: Could we skip cloning here and work with references?

A Contributor replied:
This is a bit late, but yeah tries in mpt_trie are all essentially references, so cloning is very cheap.

@atanmarko (Member) commented:

I have skimmed through the PR, seems fine to me. It does simplify navigating the tries implementation.

@BGluth (Contributor) commented Jul 17, 2024
@0xaatif Yeah sounds good! Feel free to remove StandardTrie.

@0xaatif 0xaatif closed this Sep 6, 2024
@0xaatif 0xaatif deleted the 0xaatif/refactor-mpt-trie branch September 19, 2024 16:19