`AccountCode` refactor from MerkleTree to Sequential hash for offset based storage access #763

phklive · 2024-06-19T22:10:16Z

In this PR I propose to refactor storage to add an offset-based storage access.

I am actively working on understanding the whole chain of dependency for this modification for now I follow this plan:

Refactor the AccountCode struct to replace the Smt with a sequential hash commitment
Fix all dependant functions that supported the Smt logic, replace this logic with Vec<Digest, Felt> handling (which as pointed out in the issue should be simpler and more efficient)
Refactor tests
Refactor insertion and verification of data in the AdviceProvider
Refactor MASM code adding storage access authentication and refactor MerkleTree dependant code

Would be glad to have your input on this plan @bobbinth if you think I could go forward in a more efficient way.

Closes: #667

bobbinth

Looks good! Thank you! I left some preliminary comments inline (mostly focusing on changes in account_code module). Also, the overall plan makes sense.

objects/src/accounts/code.rs

bobbinth · 2024-06-20T06:58:21Z

objects/src/accounts/code.rs

+        let procedures: Vec<(Digest, Felt)> = procedures
+            .into_iter()
+            .enumerate()
+            .map(|(i, proc)| (proc, Felt::new(i as u64)))
+            .collect();


This would assign a unique index to each procedure, which is probably not what we want to do. Ideally, the offset would be specified within MASM (and in the future MAST) itself. For example, for the basic wallet contract, it could look something like:

use.miden::contracts::wallets::basic->basic_wallet use.miden::contracts::auth::basic->basic_auth @miden-storage-offset(0) export.basic_wallet::receive_asset @miden-storage-offset(0) export.basic_wallet::send_asset @miden-storage-offset(1) export.basic_auth::auth_tx_rpo_falcon512

And the result of this would be:

[(receive_asset_hash, 0), (send_asset_hash, 0), (auth_tx_rpo_falcon512, 1)]

But we don't have support for attributes in MASM yet, and i'm not sure what would be a good interim solution (e.g., maybe we provide offsets as a construction parameter - it would be brittle but maybe OK for now).

cc @bitwalker and @plafer for another example of how we'd use annotations in MASM.

Two other things to consider:

Should we sort the procedures before computing commitment? I think probably yes.

Should we somehow indicate that some procedures don't need storage access? For example, in the above recieve_asset and send_asset procedures never need to touch storage). I also think yes, but not sure how to do it yet.

I am still thinking through how we could handle the input of the offsets, will add towards the end of the refactor once everything works correctly with dummy values.

We still need to address some of the above questions.

objects/src/accounts/code.rs

miden-tx/src/host/account_procs.rs

miden-lib/src/transaction/inputs.rs

miden-lib/asm/miden/kernels/tx/account.masm

miden-lib/asm/miden/kernels/tx/memory.masm

miden-lib/asm/miden/kernels/tx/prologue.masm

bobbinth

Looks good! Thank you! Not a full review yet, but I did review pretty much everything except for tests. Left some comments inline.

objects/src/accounts/code.rs

bobbinth · 2024-07-20T00:04:37Z

objects/src/accounts/code.rs

+        let procedures: Vec<(Digest, Felt)> = procedures
+            .into_iter()
+            .enumerate()
+            .map(|(i, proc)| (proc, Felt::new(i as u64)))
+            .collect();


We still need to address some of the above questions.

objects/src/accounts/code.rs

miden-tx/src/compiler/mod.rs

miden-tx/src/host/account_procs.rs

miden-lib/src/transaction/inputs.rs

miden-lib/asm/kernels/transaction/api.masm

phklive · 2024-07-25T10:21:22Z

Error: `caller == [0,0,0,0]`

Error context

This bug has been discovered after adding this check in get_procedure_info proc added in this PR:

  # check that index < number of procedures contained in the account code
  dup exec.memory::get_num_account_procedures lt assert.err=ERR_PROC_INDEX_OUT_OF_BOUNDS
  # => [index]

Requested here: #763 (comment)

This check asserts that the procedure index in memory that is trying to be accessed is actually in bounds with the length of the procedure vector ( number of procedures ).

This new assert makes the following tests fail:

All of these tests initiate at one point a syscall for example test_build_recipient_hash creates a note at the end which initiates the following syscall:

#! Creates a new note and returns the index of the note.
#!
#! Inputs: [tag, aux, note_type, RECIPIENT]
#! Outputs: [note_idx]
#!
#! tag is the tag to be included in the note.
#! aux is the auxiliary metadata to be included in the note.
#! note_type is the storage type of the note
#! RECIPIENT is the recipient of the note.
#! note_idx is the index of the crated note.
export.create_note
    syscall.create_note
    # => [note_idx, EMPTY_WORD, 0]

    # clear the padding from the kernel response
    movdn.4 dropw swap drop
    # => [note_idx]
end

This seems to be the common factor in all of the failures.

Error track down

The get_procedure_info procedure is called by authenticate_procedure of the tx kernel:

#! Verifies that the procedure root is part of the account code
#!
#! Stack: [PROC_ROOT]
#! Output: [storage_offset]
#!
#! - PROC_ROOT is the hash of the procedure to authenticate.
#!
#! Panics if
#! - procedure root is not part of the account code.
export.authenticate_procedure
    # load procedure index
    emit.ACCOUNT_PUSH_PROCEDURE_INDEX_EVENT adv_push.1
    # => [index, PROC_ROOT]

    # get procedure info (PROC_ELEMENTS, storage_offset) from memory stored at index
    exec.get_procedure_info
    # => [PROC_ELEMENTS, storage_offset, PROC_ROOT]

    # verify that PROC_ROOT exists in memory at index
    movup.4 movdn.8 assert_eqw.err=ERR_PROC_NOT_PART_OF_ACCOUNT_CODE
    # => [storage_offset]
end

Which is itself called by authenticate_account_origin or the kernel api:

#! Authenticates that the invocation of a kernel procedure originates from the account context.
#!
#! Panics:
#!   - if the invocation of the kernel procedure does not originate from the account context.
#!
#! Stack: [...]
#! Output: [...]
proc.authenticate_account_origin
    # get the hash of the caller
    padw caller
    # => [CALLER, ...]

    # assert that the caller is from the user context
    exec.account::authenticate_procedure
    # => [storage_offset, ...]

    # TODO: use the storage_offset for storage access
    # drop the storage_offset
    drop
    # => [...]
end

What we can read above is that the authenticate_procedure function takes in a PROC_ROOT as input that is provided by the authenticate_account_origin procedure by using the caller environment input which overwrites the top four stack items with the hash of the function which initiated the current SYSCALL.

It seems that all the above tests executes the caller environment input from the root context resulting in the caller returned being [0,0,0,0]. Hence the following chain of events:

authenticate_account_origin calls authenticate_procedure passing in as input caller which is [0,0,0,0]

authenticate_procedure emits the following event: ACCOUNT_PUSH_PROCEDURE_INDEX that gets caught by the TransactionHost in the on_event function:

miden-base/miden-tx/src/host/mod.rs

Lines 434 to 436 in 13724e9

    
           TransactionEvent::AccountPushProcedureIndex => { 
        
               self.on_account_push_procedure_index(process) 
        
           },

on_event calls on_account_push_procedure_index that queries the account_procedure_index_map using get_proc_index with the current process (here [0,0,0,0]).
get_proc_index queries the first word of the operand stack and searches for a value in the AccountProcedureIndexMap matching that key.
The returned value is sent back to the AdviceProvider to be added to the operand stack of the VM.

The problem being here that we are querying the AccountProcedureIndexMap with [0,0,0,0] (Digest::default) which will return us the element at this position and not a valid procedure index in the map.

There are 2 implementations of the AccountProcedureIndexMap a production one and a mock one, the one that is getting called during the tests is the mock one and has additional lines compared to the classical one stating that:

miden-base/miden-tx/src/testing/account_procs.rs

Lines 47 to 62 in 13724e9

    
               pub fn get_proc_index<S: ProcessState>( 
        
                   &self, 
        
                   process: &S, 
        
               ) -> Result<u8, TransactionKernelError> { 
        
                   let proc_root = process.get_stack_word(0).into(); 
        
                   // mock account method for testing from root context 
        
                   // TODO: figure out if we can get rid of this 
        
                   if proc_root == Digest::default() { 
        
                       return Ok(255); 
        
                   } 
        
                   self.0 
        
                       .get(&proc_root) 
        
                       .cloned() 
        
                       .ok_or(TransactionKernelError::UnknownAccountProcedure(proc_root)) 
        
               } 
        
           }

This if clause will hence push 255 on the operand stack:

         if proc_root == Digest::default() { 
             return Ok(255); 
         }

Without the index out of bounds check that was added in get_procedure_info mentioned at the top of this comment the procedure continues execution as follows:

# => [255]

push.2 mul exec.memory::get_account_procedures_section_offset add dup push.1 add
# => [1211, 1210]

# Here 2 possibilities: 
# - There are elements at  location 1210 and 1211 in memory and they are returned
# - There are no elements at location 1210 and 1211 in memory and 0's are returned
mem_load swap padw movup.4 mem_loadw
# => [0,0,0,0,0]

# Next we will be checking if the returned data from the memory matches the PROC_ROOT (which in this case is [0,0,0,0]
movup.4 movdn.8 assert_eqw.err=ERR_PROC_NOT_PART_OF_ACCOUNT_CODE
# => [0]

Hence we understand here that the tests would pass. Not because the logic is valid but because the caller in the root context is [0,0,0,0] and the returned values from the procedure in the memory are also [0,0,0,0].

@bobbinth

A few questions:

Why do we have 2 versions of the AccountProcedureIndexMap ? classical and mock ?
Is it normal that the caller here is [0,0,0,0] ?
From your answer yesterday I understand that it is because it comes from the root context, but shouldn't it be erroring out ?

bobbinth · 2024-07-26T07:15:35Z

Why do we have 2 versions of the AccountProcedureIndexMap ? classical and mock ?

I think the reason was exactly to make this tests work (i.e., to skip the real procedure authentication check).

Is it normal that the caller here is [0,0,0,0] ?

From your answer yesterday I understand that it is because it comes from the root context, but shouldn't it be erroring out ?

It should not be possible to execute the caller instruction if we are not in a syscall, but apparently the VM doesn't check for this. Once this check is added to the VM, all instances where we execute caller like in the test in question will start failing.

But as long as there is no check, caller returning [0, 0, 0, 0] when invoked in the root context is expected (i.e., the root context has no caller and therefore the returned values are all zeros).

bobbinth

Looks good! Thank you! I left some more comments inline - but they are either pretty minor or can be addressed in the next PR (when we actually integrate offsets into storage access procedures).

miden-tx/src/host/account_procs.rs

miden-tx/src/testing/account_procs.rs

miden-lib/asm/miden/kernels/tx/prologue.masm

+  # move procedure data from the advice map to the advice stack and then push the number of
+  # procedures onto the operand stack before storing it in memory
+  adv.push_mapval adv_push.1 dup exec.memory::set_num_account_procedures
+  # => [num_procs, CODE_COMMITMENT]


miden-lib/asm/miden/kernels/tx/account.masm

objects/src/accounts/code/mod.rs

bobbinth

Looks good! Thank you! I left a few more nits inline. Once these are addressed, let's merge.

miden-tx/src/host/mod.rs

miden-tx/src/host/account_procs.rs

miden-lib/asm/miden/kernels/tx/prologue.masm

…gue code in 2

bobbinth

Looks good! Thank you. I left one more comment inline. Also, there seems to be a merge conflict that needs to be resolved.

miden-tx/src/host/account_procs.rs

phklive requested a review from bobbinth June 19, 2024 22:12

bobbinth reviewed Jun 20, 2024

View reviewed changes

phklive commented Jun 27, 2024

View reviewed changes

objects/src/accounts/code.rs Outdated Show resolved Hide resolved

phklive commented Jun 27, 2024

View reviewed changes

miden-tx/src/host/account_procs.rs Outdated Show resolved Hide resolved

phklive commented Jun 27, 2024

View reviewed changes

miden-lib/src/transaction/inputs.rs Outdated Show resolved Hide resolved

phklive commented Jun 28, 2024

View reviewed changes

miden-lib/asm/miden/kernels/tx/account.masm Outdated Show resolved Hide resolved

bobbinth force-pushed the next branch from 9adfc85 to 1cd939e Compare July 4, 2024 06:31

Rebased from next

cd0ef51

phklive force-pushed the phklive-account-code-refactor branch from daf6e06 to cd0ef51 Compare July 18, 2024 09:12

phklive added 3 commits July 18, 2024 16:28

Added rust memory & added prologue memory test for account procs

7d43073

Fixed dropw error in authenticate_account_origin

58d7ee6

Merge branch 'next' into phklive-account-code-refactor

acc9969

phklive commented Jul 19, 2024

View reviewed changes

miden-lib/asm/miden/kernels/tx/memory.masm Outdated Show resolved Hide resolved

phklive commented Jul 19, 2024

View reviewed changes

miden-lib/asm/miden/kernels/tx/memory.masm Show resolved Hide resolved

phklive commented Jul 19, 2024

View reviewed changes

miden-lib/asm/miden/kernels/tx/memory.masm Outdated Show resolved Hide resolved

phklive commented Jul 19, 2024

View reviewed changes

miden-lib/asm/miden/kernels/tx/prologue.masm Show resolved Hide resolved

phklive added 2 commits July 19, 2024 10:57

Cleanup MASM code

f634073

Cleanup Rust code

c712835

phklive marked this pull request as ready for review July 19, 2024 09:19

phklive requested a review from bobbinth July 19, 2024 09:19

phklive changed the title ~~WIP: AccountCode refactor for offset-based storage access~~ AccountCode refactor from MerkleTree to Sequential hash for offset based storage access Jul 19, 2024

bobbinth requested changes Jul 20, 2024

View reviewed changes

phklive added 8 commits July 22, 2024 15:55

Merge branch 'next' into phklive-account-code-refactor

98b4137

Remove procedure_commitment() changed value to reference

40ee7da

Account code_root to code_commitment first pass

392987e

replaced with any()

285e404

Added procedure_roots() method on AccountCode

711d14e

Added TransactionHostError for AccountProcedureIndexMapError

3bd86a7

Account code_root to code_commitment second pass

cc7303d

Added AccountProcedure and TryFrom [Felt; 8]

9cc6661

phklive added 3 commits July 24, 2024 09:43

Added requested changes

f164bbb

Added errors in MASM and rust

08ca435

Optimized procedure

c20d28e

phklive and others added 9 commits July 26, 2024 13:30

Added in-kernel test fix

acf352e

Updated changelog.md

86728c0

Merge branch 'next' into phklive-account-code-refactor

4d9eaf0

Updated validate_account_procedures to use pipe_double_words_to_memory

b44c6c8

Improved comments

be55ddf

Fix cargo doc

95cc6ba

docs: added comments to AccountProcedureInfo

55558ef

docs: update comments for AccountCode

828d734

docs: improved comments in the prologue MASM

069b265

bobbinth approved these changes Jul 27, 2024

View reviewed changes

phklive added 2 commits July 29, 2024 14:11

Added check for MAX_NUM_PROCEDURES

8fd9ecf

Fixed check, added constants, removed testing account_procs

c93c547

phklive requested a review from bobbinth July 29, 2024 13:56

bobbinth approved these changes Jul 29, 2024

View reviewed changes

phklive added 3 commits July 30, 2024 13:38

Updated account_procs, changed pub from module to struct, split prolo…

d073035

…gue code in 2

lint

2eaf813

Added doc comment for helper functions in account code

ff5e8a5

bobbinth approved these changes Jul 30, 2024

View reviewed changes

miden-tx/src/host/account_procs.rs Outdated Show resolved Hide resolved

phklive and others added 2 commits July 30, 2024 22:07

Move num_procs down to prevent index out of bounds error

32dbaab

Merge branch 'next' into phklive-account-code-refactor

68c419b

bobbinth merged commit b2b6621 into next Jul 30, 2024
13 checks passed

bobbinth deleted the phklive-account-code-refactor branch July 30, 2024 21:41

This was referenced Jul 31, 2024

AccountStorage refactor from Smt to sequential hash #811

Closed

Implement offset based storage access #813

Closed

bobbinth mentioned this pull request Sep 3, 2024

Implement offset-based storage access #667

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

`AccountCode` refactor from MerkleTree to Sequential hash for offset based storage access #763

`AccountCode` refactor from MerkleTree to Sequential hash for offset based storage access #763

phklive commented Jun 19, 2024

bobbinth left a comment

bobbinth Jun 20, 2024

phklive Jun 27, 2024

bobbinth Jul 20, 2024

bobbinth left a comment

bobbinth Jul 20, 2024

phklive commented Jul 25, 2024 •

edited

Loading

bobbinth commented Jul 26, 2024

bobbinth left a comment

This comment was marked as resolved.

bobbinth left a comment

bobbinth left a comment

AccountCode refactor from MerkleTree to Sequential hash for offset based storage access #763

AccountCode refactor from MerkleTree to Sequential hash for offset based storage access #763

Conversation

phklive commented Jun 19, 2024

bobbinth left a comment

Choose a reason for hiding this comment

bobbinth Jun 20, 2024

Choose a reason for hiding this comment

phklive Jun 27, 2024

Choose a reason for hiding this comment

bobbinth Jul 20, 2024

Choose a reason for hiding this comment

bobbinth left a comment

Choose a reason for hiding this comment

bobbinth Jul 20, 2024

Choose a reason for hiding this comment

phklive commented Jul 25, 2024 • edited Loading

Error: caller == [0,0,0,0]

Error context

Error track down

bobbinth commented Jul 26, 2024

bobbinth left a comment

Choose a reason for hiding this comment

This comment was marked as resolved.

bobbinth left a comment

Choose a reason for hiding this comment

bobbinth left a comment

Choose a reason for hiding this comment

`AccountCode` refactor from MerkleTree to Sequential hash for offset based storage access #763

`AccountCode` refactor from MerkleTree to Sequential hash for offset based storage access #763

phklive commented Jul 25, 2024 •

edited

Loading

Error: `caller == [0,0,0,0]`