Name	Name	Last commit message	Last commit date
parent directory ..
src	src
Cargo.toml	Cargo.toml
README.md	README.md

Snapshot

Overview

Data that has reached a finalized state and won't undergo further changes (essentially frozen) should be read without concerns of modification. This makes it unsuitable for traditional databases.

This crate aims to copy this data from the current database to multiple static files, aggregated by block ranges. At every 500_000th block new static files are created.

Below are two diagrams illustrating the processes of creating static files (custom format: NippyJar) and querying them. A glossary is also provided to explain the different (linked) components involved in these processes.

Creation diagram (Snapshotter)

graph TD;
    I("BLOCK_HEIGHT % 500_000 == 0")--triggers-->SP(Snapshotter)
    SP --> |triggers| SH["create_snapshot(block_range, SnapshotSegment::Headers)"]
    SP --> |triggers| ST["create_snapshot(block_range, SnapshotSegment::Transactions)"]
    SP --> |triggers| SR["create_snapshot(block_range, SnapshotSegment::Receipts)"]
    SP --> |triggers| ETC["create_snapshot(block_range, ...)"]
    SH --> CS["create_snapshot::&lt; T &gt;(DatabaseCursor)"]
    ST --> CS
    SR --> CS
    ETC --> CS
    CS --> |create| IF(NippyJar::InclusionFilters)
    CS -- iterates --> DC(DatabaseCursor) -->HN{HasNext} 
    HN --> |true| NJC(NippyJar::Compression)
    NJC --> HN
    NJC --store--> NJ
    HN --> |false| NJ 
    IF --store--> NJ(NippyJar)
    NJ --freeze--> F(File)
    F--"on success"--> SP1(Snapshotter)
    SP1 --"sends BLOCK_HEIGHT"--> HST(HighestSnapshotTracker)
    HST --"read by"-->Pruner
    HST --"read by"-->DatabaseProvider
    HST --"read by"-->SnapsotProvider
    HST --"read by"-->ProviderFactory

Query diagram (Provider)

graph TD;
    RPC-->P
    P("Provider::header(block_number)")-->PF(ProviderFactory)
    PF--shares-->SP1("Arc(SnapshotProvider)")
    SP1--shares-->PD(DatabaseProvider)
    PF--creates-->PD
    PD--check `HighestSnapshotTracker`-->PD
    PD-->DC1{block_number <br> > <br> highest snapshot block}
    DC1 --> |true| PD1("DatabaseProvider::header(block_number)")
    DC1 --> |false| ASP("SnapshotProvider::header(block_number)")
    PD1 --> MDBX
    ASP --find correct jar and creates--> JP("SnapshotJarProvider::header(block_number)")
    JP --"creates"-->SC(SnapshotCursor)
    SC --".get_one&lt; HeaderMask&lt; Header  &gt; &gt;(number)"--->NJC("NippyJarCursor")
    NJC--".row_by_number(row_index, mask)"-->NJ[NippyJar]
    NJ--"&[u8]"-->NJC
    NJC--"&[u8]"-->SC
    SC--"Header"--> JP
    JP--"Header"--> ASP

Glossary

In descending order of abstraction hierarchy:

Snapshotter: A reth background service that copies data from the database to new snapshot files when the block height reaches a certain threshold (e.g., 500_000th). Upon completion, it dispatches a notification about the higher snapshotted block to HighestSnapshotTracker channel. It DOES NOT remove data from the database.

HighestSnapshotTracker: A channel utilized by Snapshotter to announce the newest snapshot block to all components with a listener: Pruner (to know which additional tables can be pruned) and DatabaseProvider (to know which data can be queried from the snapshots).

SnapshotProvider A provider similar to DatabaseProvider, managing all existing snapshot files and selecting the optimal one (by range and segment type) to fulfill a request. A single instance is shared across all components and should be instantiated only once within ProviderFactory. An immutable reference is given everytime ProviderFactory creates a new DatabaseProvider.

SnapshotJarProvider A provider similar to DatabaseProvider that provides access to a single snapshot file.

SnapshotCursor An elevated abstraction of NippyJarCursor for simplified access. It associates the bitmasks with type decoding. For instance, cursor.get_two::<TransactionMask<Tx, Signature>>(tx_number) would yield Tx and Signature, eliminating the need to manage masks or invoke a decoder/decompressor.

SnapshotSegment Each snapshot file only contains data of a specific segment, e.g., Headers, Transactions, or Receipts.

NippyJarCursor Accessor of data in a NippyJar file. It enables queries either by row number (e.g., block number 1) or by a predefined key not part of the file (e.g., transaction hashes). If a file has multiple columns (e.g., Tx | TxSender | Signature), and one wishes to access only one of the column values, this can be accomplished by bitmasks. (e.g., for TxSender, the mask would be 0b010).

NippyJar A create-only file format. No data can be appended after creation. It supports multiple columns, compression (e.g., Zstd (with and without dictionaries), lz4, uncompressed) and inclusion filters (e.g., cuckoo filter: is hash X part of this dataset). Snapshots are organized by block ranges. (e.g., TransactionSnapshot_499_999.jar contains a transaction per row for all transactions from block 0 to block 499_999). For more check the struct documentation.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

snapshot

snapshot

README.md

Snapshot

Overview

Glossary

Files

snapshot

Directory actions

More options

Directory actions

More options

Latest commit

History

snapshot

Folders and files

parent directory

README.md

Snapshot

Overview

Glossary