Migrating Devnet/Mainnet Archive to Berkeley Archive

Before you start the process to migrate your archive database from the current Mainnet or Devnet format to Berkeley, be sure that you:

Understand the Archive Migration
Meet the foundational requirements in Archive migration prerequisites
Have successfully installed the archive migration package

Migration process

The Devnet/Mainnet migration can take up to a couple of days. Therefore, you can achieve a successful migration by using three stages:

Stage 1: Initial migration
Stage 2: Incremental migration
Stage 3: Remainder migration

Each stage has three migration phases:

Phase 1: Copying data and precomputed blocks from Devnet/Mainnet database using the berkeley_migration app.
Phase 2: Populating new Berkeley tables using the replayer app in migration mode
Phase 3: Additional validation for migrated database
The source database with original Devnet/Mainnet data
The migrated database with original Devnet/Mainnet data converted to the Berkeley schema

Review these phases and stages before you start the migration.

Simplified approach

For convenience, use the berkeley_migration.sh script if you do not need to delve into the details of migration or if your environment does not require a special approach to migration.

Stage 1: Initial migration

mina-berkeley-migration-script  \ 
   initial \
   --genesis-ledger ledger.json \
   --source-db postgres://postgres:postgres@localhost:5432/source \
   --target-db postgres://postgres:postgres@localhost:5432/migrated \
   --blocks-bucket mina_network_block_data \
   --blocks-batch-size 50 \
   --checkpoint-output-path .\
   --precomputed-blocks-local-path .\
   --network NETWORK

where:

-g | --genesis-ledger: path to the genesis ledger file

-s | --source-db: connection string to the database to be migrated

-t | --target-db: connection string to the database that will hold the migrated data

-b | --blocks-bucket: name of the precomputed blocks bucket. Precomputed blocks are assumed to be named with format: {network}-{height}-{state_hash}.json

-bs | --blocks-batch-size: number of precomputed blocks to be fetched at one time from Google Cloud. A larger number, like 1000, can help speed up the migration process.

-n | --network: network name (devnet or mainnet) when determining precomputed blocks. Precomputed blocks are assumed to be named with format: {network}-{height}-{state_hash}.json.

-c | --checkpoint-output-path: path to folder for replayer checkpoint files

-l | --precomputed-blocks-local-path: path to folder for on-disk precomputed blocks location

The command output is the migration-replayer-XXX.json file required for the next run.

Stage 2: Incremental migration

mina-berkeley-migration-script \
   incremental \
   --genesis-ledger ledger.json \
   --source-db postgres://postgres:postgres@localhost:5432/source \
   --target-db postgres://postgres:postgres@localhost:5432/migrated \
   --blocks-bucket mina_network_block_data \
   --blocks-batch-size 50 \
   --network NETWORK \
   --checkpoint-output-path . \
   --precomputed-blocks-local-path . \
   --replayer-checkpoint migration-checkpoint-XXX.json