Scribe is a multi-chain indexing service. Scribe is designed to take a list of contracts specified by chain id and store logs, receipts, and txs for every event, past to present, in a mysql database.
Use cases
Scribe is currently indexing over 900 contracts across 18 different chains.
Scribe comes with a graphql/iql endpoint with various queries to make interacting with the collected data easier. Here are some basic queries
lastIndexed(chain_id, contract_address)
logsRange(chain_id, contract_address, start_block, end_block, page)
blockTime(chain_id, block_number)
txSender(tx_hash, chain_id)
A full list can be found at graphql/server/graph/schema/queries.graphql
Scribe indexer supports indexing on any number of contracts on any chain. For each contract Scribe indexes from the specified start of the contract from the config. Scribe stores every log, receipt, tx, and timestamp for every event until it reaches the specified livefill threshold where it will then continue to index events as they occur in real time. For reorg protection, Scribe does not store any events that are more than the specified number of confirmations from the chain tip into its primary tables. Scribe stores “unconfirmed” events near the tip of the chain in separate and transient tables. The Scribe server has built in queries to query both the primary (confirmed) and transient (unconfirmed) tables at the same time if realtime data needed for querying or streaming.
Run the following command to start Scribe:
$ go run main.go
Then Scribe command line will be exposed. You can use the following commands:
# Start Scribe indexer
$ Scribe --config </Full/Path/To/Config.yaml> --db <sqlite or mysql> --path <path/to/database or database url>
# Start Scribe server
$ server --port <port> --db <sqlite or mysql> --path <path/to/database or database url>
See /charts/scribe for the deployment helm chart for this service
There are many ways to augment Scribe’s behavior. Before running Scribe, please take a look at the different parameters.
rpc_url: The url of the rpc aggregator
chains: A list of chains to index
chain_id: The ID of the chain
get_logs_range: is the number of blocks to request in a single getLogs request.
get_logs_batch_amount: is the number of getLogs requests to include in a batch request.
store_concurrency: is the number of goroutines to use when storing data.
concurrency_threshold: is the max number of block from head in which concurrent operations (store, getlogs) is allowed.
livefill_threshold: number of blocks away from the head to start livefilling (see "Understanding Scribe Indexer" to understand this better)
livefill_range: range in whcih the getLogs request for the livefill contracts will be requesting.
livefill_flush_interval: the interval in which the unconfirmed livefill table will be flushed.
confirmations: the number of blocks from head that the livefiller will livefill up to (and where the unconfirmed livefill indexer will begin)
contracts: stores all the contract information for the chain
address: address of the contract
start_block: block to start indexing the contract from (block with the first tx)
rpc_url: <omnirpc>
chains:
- chain_id: 1
get_logs_range: 500
get_block_batch_amount: 1
store_concurrency: 100
concurrency_threshold: 50000
livefill_threshold: 300
livefill_range: 200
livefill_flush_interval: 10000
confirmations: 200
contracts:
- address: 0xAf41a65F786339e7911F4acDAD6BD49426F2Dc6b
start_block: 18646320
The Scribe indexer is composed of three components
Fetcher
: Takes a list of contracts, a block range, and fetches and feeds logs into a channel to be consumed by an indexer.Indexer
: Takes a list of contracts, a block range, and a config and stores logs, receipts, and txs for all events in that range.ChainIndexer
: Runs 2+ indexers per chain.
ChainIndexer
spun up and runs it in a go routineChainIndexer
checks all contracts for their lastIndexed
block. Contracts without a lastIndexed
or with a lastIndexed
block outside the
specified livefill block range are put into individual indexers (backfill). All other contracts are collected into a single indexer (livefill).Indexer
is initialized with config and a list of contractsIndexer
is started with a block rangeFetcher
that feeds logs into a channel that is then consumed by the indexer. The fetcher retrieves logs in chunks specified in the config
(get_logs_range
and get_logs_batch_amount
).concurrency_threshold
) is set to at least a couple hours
from the head to ensure that data is inserted into the database in order (if streaming).Scribe ├── api: Contains the Swagger API definition and GQL Client tests. ├── backend: The backend implementation for Scribe ├── client: Client implementation for Scribe (embedded/remote) ├── cmd: The command line interface functions for running Scribe and GraphQL server ├── config: Configuration files for Scribe ├── db: The database schema and functions for interacting with the database ├── graphql: GraphQL implementation for Scribe's recorded data │ ├── client: The client interface for the GraphQL server │ ├── contrib: The GraphQL generators for Scribe │ └── server: The server implementation for GraphQL │ └── graph: The server's models, resolvers, and schemas ├── grpc: The gRPC client implementation for Scribe ├── internal: Internal packages for Scribe ├── logger: Handles logging for various events in Scribe. ├── metadata: Provides metadata for building . ├── scripts: Scripts for Scribe ├── service: Service holds Scribe indexer code (Fetcher, Indexer, ChainIndexer) ├── testhelper: Assists testing in downstream services. ├── testutil: Test utilities suite for Scribe └── types: Holds various custom types for Scribe
The schema for each table in the database can be found at db/datastore/sql/base
make generate