What Amaru needs to bootstrap

Reverse-engineered from pragma-org/amaru/docker/testnet/amaru-loader.sh (Arnaud Bailly).

Bundle layout

<bundle>/<network>/
├── chain.<network>.db/                    # amaru's chain store, prepopulated
├── ledger.<network>.db/                   # amaru's ledger store, prepopulated
├── snapshots/<slot>.<hash>.cbor           # target plus two prior epoch boundaries
├── snapshots/history.<slot>.<hash>.json   # testnet era history sidecar
├── nonces.json                            # tail points to previous-epoch header hash
└── headers/header.<slot>.<hash>.cbor      # includes the latest snapshot header

The latest snapshot's <slot>.<hash> must have an exact matching header file. Amaru uses that header to align the chain store to the ledger tip when amaru run opens the produced stores.
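The matching-header requirement can be checked mechanically before handing a bundle to Amaru. The sketch below is illustrative only and is not part of amaru-loader.sh; it assumes the file naming shown in the bundle layout above and that hashes are lowercase hex. The function names are hypothetical.

```python
import re
from pathlib import Path

# Assumed naming, taken from the bundle layout above:
#   snapshots/<slot>.<hash>.cbor and headers/header.<slot>.<hash>.cbor
# history.<slot>.<hash>.json sidecars do not match this pattern and are skipped.
SNAPSHOT_RE = re.compile(r"^(\d+)\.([0-9a-f]+)\.cbor$")


def latest_snapshot_point(network_dir: Path) -> tuple[int, str]:
    """Return (slot, hash) of the snapshot with the highest slot."""
    points = []
    for path in (network_dir / "snapshots").iterdir():
        match = SNAPSHOT_RE.match(path.name)
        if match:
            points.append((int(match.group(1)), match.group(2)))
    if not points:
        raise FileNotFoundError(f"no snapshots under {network_dir}")
    return max(points)  # tuples compare by slot first


def has_matching_header(network_dir: Path) -> bool:
    """True if headers/ holds the header named by the latest snapshot."""
    slot, block_hash = latest_snapshot_point(network_dir)
    header = network_dir / "headers" / f"header.{slot}.{block_hash}.cbor"
    return header.is_file()
```

Calling has_matching_header(Path("<bundle>/<network>")) before amaru run would surface a missing header early, rather than during chain-store alignment.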

Pipeline as Arnaud built it

Steps in amaru-loader.sh:

1. Convert ledger states (one per epoch, already dumped by the forked db-synthesizer):

amaru convert-ledger-state \
  --network testnet_42 \
  --snapshot <slot-dir> \
  --target-dir out/snapshots

For startup, Amaru needs the target epoch snapshot and the two prior epoch snapshots. The live ledger opens from ledger.<network>.db/live, and the historical stores are then consulted for the rewards and leader-schedule stake distributions.

2. Compose nonces.json by copying the last snapshot's nonces file and patching the tail field with the last header hash of the previous epoch.
3. Extract headers via db-server query --query list-blocks, then --query "get-header <slot>.<hash>". Two headers are extracted for each of the last and second-to-last snapshots, because the active nonce of an epoch is computed from the parent hash of its tail. The header set must include the exact <slot>.<hash> named by the latest snapshot, because Amaru aligns its chain store to the ledger tip during startup.
4. Import into amaru's stores:

amaru import-ledger-state --network testnet_42 \
  --ledger-dir out/ledger.db \
  --snapshot-dir out/snapshots/

amaru import-headers --network testnet_42 \
  --chain-dir out/chain.db

amaru import-nonces --network testnet_42 \
  --nonces-file out/nonces.json \
  --chain-dir out/chain.db
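The nonces.json composition step in the pipeline above amounts to a copy plus a one-field patch. The following is a minimal sketch, assuming that tail is a top-level key of the nonces file and holds the header hash as a string; the function name and parameters are hypothetical, and every other field is passed through untouched.

```python
import json
from pathlib import Path


def compose_nonces(source: Path, dest: Path, prev_epoch_last_header_hash: str) -> dict:
    """Copy a snapshot's nonces file, overwriting only its `tail` field.

    Assumption: `tail` is a top-level key whose value is the header hash.
    """
    nonces = json.loads(source.read_text())
    nonces["tail"] = prev_epoch_last_header_hash
    dest.write_text(json.dumps(nonces, indent=2) + "\n")
    return nonces
```

The result written to dest is what the import-nonces command above would then read via --nonces-file.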

Where the upstream/fork divergence lives

Step 1's precondition is that snapshots already exist on disk. Stock db-synthesizer does not emit them. Arnaud's abailly/snapshot-generator branch adds four commits that wire snapshot-at-epoch-boundary writing into db-synthesizer's main loop.

The no-fork implementation

Phase 0 proved that stock db-analyser --store-ledger does not emit the exact snapshot shape Amaru imports. The producer therefore replaces Arnaud's forked snapshot writer with two in-repo Haskell tools that consume the stock node libraries:

ledger-state-emitter \
  --db <chain-db> \
  --config <config.json> \
  --target-slot <slot> \
  --out <legacy-ext-ledger-state.cbor>

header-extractor tip-info|list-blocks|get-header ...

ledger-state-emitter targets the repository's pinned cardano-node 10.7.1 dependency set and emits the Amaru bootstrap projection documented in specs/003-amaru-bootstrap-producer/research.md#r-011. The producer calls it three times: target_slot, target_slot - epochLength, and target_slot - 2 * epochLength. amaru convert-ledger-state still owns the final snapshot slicing, history JSON, and nonce JSON formats.
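The three target slots follow directly from the arithmetic above. As a rough sketch (not the producer's actual code), the helper below computes them and builds one ledger-state-emitter argument list per slot. The flags are those shown earlier; the function names and the output file name are hypothetical.

```python
def emitter_target_slots(target_slot: int, epoch_length: int) -> list[int]:
    """Slots for the target epoch boundary and the two boundaries before it."""
    if epoch_length <= 0:
        raise ValueError("epoch_length must be positive")
    slots = [target_slot - k * epoch_length for k in range(3)]
    if slots[-1] < 0:
        raise ValueError("chain is too short for three epoch snapshots")
    return slots


def emitter_argv(db: str, config: str, slot: int, out_dir: str) -> list[str]:
    """Argument list for one ledger-state-emitter call.

    The output file name is a hypothetical choice for illustration.
    """
    return [
        "ledger-state-emitter",
        "--db", db,
        "--config", config,
        "--target-slot", str(slot),
        "--out", f"{out_dir}/ledger-state.{slot}.cbor",
    ]
```

For example, emitter_target_slots(1000, 100) yields the slots 1000, 900, and 800, each of which would be passed to emitter_argv in turn.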

For custom testnets, the producer corrects the converted history.<slot>.<hash>.json files before import: the open-ended current era's epoch_size_slots is set to the mounted Shelley genesis epochLength. This keeps short-epoch networks consistent with the ledger snapshot epoch number that Amaru checks during import.
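The history correction can be pictured as a small JSON rewrite. The sketch below is an assumption-heavy illustration: only the epoch_size_slots field and the notion of an open-ended era come from the text above. The surrounding shape (an eras list whose entries carry end and params keys, with end set to null for the open-ended era) is assumed and should be checked against the files that amaru convert-ledger-state actually writes.

```python
import copy


def fix_open_era_epoch_size(history: dict, epoch_length: int) -> dict:
    """Return a copy of `history` with the open-ended era's epoch size patched.

    Assumed shape (not confirmed by the source):
      {"eras": [{"end": <bound or null>, "params": {"epoch_size_slots": N, ...}}, ...]}
    `epoch_length` is the Shelley genesis `epochLength` value.
    """
    fixed = copy.deepcopy(history)
    open_eras = [era for era in fixed["eras"] if era.get("end") is None]
    if len(open_eras) != 1:
        raise ValueError(f"expected exactly one open-ended era, found {len(open_eras)}")
    open_eras[0]["params"]["epoch_size_slots"] = epoch_length
    return fixed
```

Working on a deep copy keeps the converted file's original contents available for comparison, which helps when checking that only the open-ended era changed.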