Inclusion Proof Format

Note

This page describes the CSMT inclusion proof format. For MPF inclusion proofs, see MPF (16-ary Trie).

Inclusion proofs allow verifying that a key-value pair exists in a CSMT without access to the full tree. Proofs are serialized using CBOR for compact, portable representation.

Overview

An inclusion proof contains the key, value hash, and Merkle path needed to recompute the root hash. The verifier supplies a trusted root hash separately. Verification is pure computation with no database access required.

Note on tree prefixes: When treePrefix is configured, the tree key used for proof generation is treePrefix(value) <> fromK(key). The proof generator first looks up the value from the KV column to compute the full tree key, then traverses the CSMT column to build the proof. The proofKey in the resulting proof contains the full prefixed key.
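The two-stage key construction described in the note can be sketched as follows. This is a minimal illustration, not the library's code: the function name prefixedTreeKey, the argument shapes chosen for treePrefix and fromK, and the association list standing in for the KV column are all assumptions made for this example.

```haskell
data Direction = L | R deriving (Eq, Show)

type Key = [Direction]

-- Hypothetical helper: builds the full tree key used for proof generation.
-- Assumed shapes: treePrefix maps a value to a path prefix, fromK maps a
-- user-facing key to a path. The association list stands in for the KV column.
prefixedTreeKey
    :: Eq k
    => (v -> Key)   -- treePrefix (assumed signature)
    -> (k -> Key)   -- fromK (assumed signature)
    -> [(k, v)]     -- stand-in for the KV column
    -> k
    -> Maybe Key
prefixedTreeKey treePrefix fromK kvColumn k = do
    v <- lookup k kvColumn          -- look up the value first
    pure (treePrefix v <> fromK k)  -- the prefix precedes the converted key
```

A missing key yields Nothing, since the prefix cannot be computed without the value.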

graph TD
    subgraph "Inclusion Proof"
        K[proof_key]
        V[proof_value]
        S[proof_steps]
        J[proof_root_jump]
    end

    subgraph "Verification"
        TR[trusted_root_hash] --> |compare| C
        V --> |start| C[Compute]
        S --> |siblings| C
        K --> |directions| C
        J --> |apply| C
    end

CDDL Specification

The proof format is formally specified in CDDL (Concise Data Definition Language):

; CDDL specification for CSMT Inclusion Proof
; CBOR encoding used by CSMT.Hashes.CBOR module

; Direction in the merkle tree path (L=0, R=1)
direction = 0 / 1

; Key is a path through the tree as a list of directions
key = [* direction]

; Blake2b-256 hash (32 bytes)
hash = bstr .size 32

; Indirect node: jump path and hash value
indirect = [
    jump: key,
    value: hash
]

; Proof step: bits consumed and sibling node
proof_step = [
    step_consumed: int,
    step_sibling: indirect
]

; Complete inclusion proof
inclusion_proof = [
    proof_key: key,         ; The key being proven
    proof_value: hash,      ; Hash of the value at the key
    proof_root_hash: hash,  ; Root hash this proves against
    proof_steps: [* proof_step],
    proof_root_jump: key    ; Jump path at the root
]
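To make the wire shape of key and hash concrete, here is a small hand-rolled sketch of their CBOR encoding. It assumes definite-length arrays and byte strings; whether the CSMT.Hashes.CBOR module emits exactly these bytes is not confirmed here, and the function names are made up for this example.

```haskell
import Data.Word (Word8)

data Direction = L | R deriving (Eq, Show)

-- CBOR definite-length array header (major type 4), lengths below 65536 only.
arrayHeader :: Int -> [Word8]
arrayHeader n
    | n < 0 || n >= 65536 = error "arrayHeader: length out of range for this sketch"
    | n < 24 = [0x80 + fromIntegral n]
    | n < 256 = [0x98, fromIntegral n]
    | otherwise = [0x99, fromIntegral (n `div` 256), fromIntegral (n `mod` 256)]

-- direction = 0 / 1: small unsigned integers encode as a single byte.
encodeDirection :: Direction -> Word8
encodeDirection L = 0x00
encodeDirection R = 0x01

-- key = [* direction]
encodeKey :: [Direction] -> [Word8]
encodeKey ds = arrayHeader (length ds) ++ map encodeDirection ds

-- hash = bstr .size 32: byte-string header 0x58 0x20, then the 32 bytes.
encodeHash :: [Word8] -> Maybe [Word8]
encodeHash bytes
    | length bytes == 32 = Just (0x58 : 0x20 : bytes)
    | otherwise = Nothing
```

Under these assumptions each direction occupies one byte, so a 256-direction key encodes to 259 bytes (a 3-byte header plus 256 elements).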

Data Types

Direction

A single bit indicating left (0) or right (1) in the tree traversal.

data Direction = L | R

Key

A path through the tree represented as a list of directions. For Blake2b-256 hashed keys, this is 256 directions (one per bit of the hash).

type Key = [Direction]
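As an illustration of how a 32-byte hash becomes 256 directions, the sketch below expands each byte into eight directions. The bit order (most significant bit first) and the function names are assumptions for this example; the mapping 0 to L and 1 to R follows the CDDL above.

```haskell
import Data.Bits (testBit)
import Data.Word (Word8)

data Direction = L | R deriving (Eq, Show)

type Key = [Direction]

-- Expand one byte into eight directions.
-- Assumption: most significant bit first; a set bit means R, a clear bit means L.
byteToDirections :: Word8 -> Key
byteToDirections w = [if testBit w i then R else L | i <- [7, 6 .. 0]]

-- A 32-byte hash yields 32 * 8 = 256 directions.
bytesToKey :: [Word8] -> Key
bytesToKey = concatMap byteToDirections
```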

Indirect

A tree node reference containing:

Field	Type	Description
`jump`	`Key`	Path prefix to skip (sparse tree optimization)
`value`	`Hash`	Hash value of the node

data Indirect a = Indirect { jump :: Key, value :: a }

ProofStep

Each step in the proof records what's needed to compute one level of the Merkle tree:

Field	Type	Description
`stepConsumed`	`Int`	Key bits consumed: 1 (direction) + length of jump
`stepSibling`	`Indirect`	Sibling node for hash combination

The direction and jump path are derived from the key during verification, not stored in the step. This reduces proof size.

data ProofStep a = ProofStep
    { stepConsumed :: Int
    , stepSibling :: Indirect a
    }

InclusionProof

The complete proof record. The trusted root hash is not one of its fields; the caller passes it to the verifier separately:

Field	Type	Description
`proofKey`	`Key`	The key being proven
`proofValue`	`Hash`	Hash of the value at the key
`proofSteps`	`[ProofStep]`	Steps from leaf to root
`proofRootJump`	`Key`	Jump path at the root node

data InclusionProof a = InclusionProof
    { proofKey :: Key
    , proofValue :: a
    , proofSteps :: [ProofStep a]
    , proofRootJump :: Key
    }

Verification Algorithm

Verification recomputes the root hash and compares it to a trusted root hash supplied by the caller:

verifyInclusionProof(trustedRootHash, proof):
    computed = computeRootHash(proof)
    return computed == trustedRootHash

The computeRootHash algorithm:

1. Start with proofValue as the current accumulator
2. Compute keyAfterRoot = drop(length(proofRootJump), proofKey)
3. Reverse keyAfterRoot (proof steps are leaf-to-root, key is root-to-leaf)
4. For each step in proofSteps:
- Take stepConsumed bits from the reversed key
- First bit is the direction, remaining bits are the jump
- Combine current accumulator with stepSibling using the direction:
  - If direction is L: hash(Indirect(jump, acc) || stepSibling)
  - If direction is R: hash(stepSibling || Indirect(jump, acc))
- Result becomes the new accumulator
5. Apply proofRootJump: rootHash(Indirect(proofRootJump, accumulator))
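The steps above can be sketched in Haskell as follows. This is one possible reading, not the library's implementation, and it rests on several assumptions. The Hashing record and its field combineHash are hypothetical stand-ins for the real Blake2b-256 scheme. Each chunk taken from the reversed key is reversed back into root-to-leaf order before it is split into direction and jump, a detail the steps do not spell out. A malformed step yields Nothing, and key bits left over after the last step are ignored.

```haskell
data Direction = L | R deriving (Eq, Show)

type Key = [Direction]

data Indirect a = Indirect { jump :: Key, value :: a }

data ProofStep a = ProofStep
    { stepConsumed :: Int
    , stepSibling :: Indirect a
    }

data InclusionProof a = InclusionProof
    { proofKey :: Key
    , proofValue :: a
    , proofSteps :: [ProofStep a]
    , proofRootJump :: Key
    }

-- Hypothetical hashing interface (assumption for this sketch).
data Hashing a = Hashing
    { combineHash :: Indirect a -> Indirect a -> a  -- left child, then right child
    , rootHash :: Indirect a -> a
    }

computeRootHash :: Hashing a -> InclusionProof a -> Maybe a
computeRootHash h proof =
    fmap applyRootJump (go (proofValue proof) (reverse keyAfterRoot) (proofSteps proof))
  where
    keyAfterRoot = drop (length (proofRootJump proof)) (proofKey proof)
    applyRootJump acc = rootHash h (Indirect (proofRootJump proof) acc)

    -- The node on the proven path goes left for L and right for R.
    combine L node sibling = combineHash h node sibling
    combine R node sibling = combineHash h sibling node

    go acc _ [] = Just acc
    go acc revKey (ProofStep n sibling : rest) =
        case splitAt n revKey of
            (chunk, remaining)
                | length chunk == n
                , (dir : jmp) <- reverse chunk ->  -- assumed: restore root-to-leaf order
                    go (combine dir (Indirect jmp acc) sibling) remaining rest
            _ -> Nothing  -- step consumes no bits or more bits than remain

verifyInclusionProof :: Eq a => Hashing a -> a -> InclusionProof a -> Bool
verifyInclusionProof h trustedRootHash proof =
    computeRootHash h proof == Just trustedRootHash
```

Keeping the hash functions in a record means the traversal logic can be exercised with a toy hash in tests and paired with the real hash in production.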

Example

Consider a tree that stores the key [L, R, L] with value hash 0xabc...:

Tree structure:
    Root (jump=[])
     |
     L
     |
   Node (jump=[])
    / \
   R   L
   |   |
  Leaf Sibling

The proof would contain:

InclusionProof {
    proofKey = [L, R, L],
    proofValue = 0xabc...,
    proofSteps = [
        ProofStep { stepConsumed = 1, stepSibling = Indirect [] 0x111... },
        ProofStep { stepConsumed = 1, stepSibling = Indirect [] 0x222... }
    ],
    proofRootJump = []
}

Verification:

1. Start with acc = 0xabc...
2. Step 1: direction=L, combine acc with sibling 0x222... → acc' = hash(...)
3. Step 2: direction=R, combine acc' with sibling 0x111... → acc'' = hash(...)
4. Apply root jump: rootHash(Indirect [] acc'')
5. Compare with trusted root hash

Security Considerations

The verifier must independently obtain a trusted root hash. Typical trust sources:

Blockchain consensus (root hash stored on-chain)
Trusted third party attestation
Previous verified state

The proof demonstrates only internal consistency: that the key and value hash lead to the given root. It cannot prove that the root hash itself is legitimate.