This crate contains a set of utilities that can be used to benchmark and weigh FRAME pallets that you develop for your Substrate Runtime.
Substrate's FRAME framework allows you to develop custom logic for your blockchain that can be included in your runtime. This flexibility is key to help you design complex and interactive pallets, but without accurate weights assigned to dispatchables, your blockchain may become vulnerable to denial of service (DoS) attacks by malicious actors.
The Substrate Runtime Benchmarking Framework is a tool you can use to mitigate DoS attacks against
your blockchain network by benchmarking the computational resources required to execute different
functions in the runtime, for example extrinsics, on_initialize
, verify_unsigned
, etc...
The general philosophy behind the benchmarking system is: If your node can know ahead of time how long it will take to execute an extrinsic, it can safely make decisions to include or exclude that extrinsic based on its available resources. By doing this, it can keep the block production and import process running smoothly.
To achieve this, we need to model how long it takes to run each function in the runtime by:
- Creating custom benchmarking logic that executes a specific code path of a function.
- Executing the benchmark in the Wasm execution environment, on a specific set of hardware, with a custom runtime configuration, etc...
- Executing the benchmark across controlled ranges of possible values that may affect the result of the benchmark (called "components").
- Executing the benchmark multiple times at each point in order to isolate and remove outliers.
- Using the results of the benchmark to create a linear model of the function across its components.
With this linear model, we are able to estimate ahead of time how long it takes to execute some logic, and thus make informed decisions without actually spending any significant resources at runtime.
Note that we assume that all extrinsics are assumed to be of linear complexity, which is why we are able to always fit them to a linear model. Quadratic or higher complexity functions are, in general, considered to be dangerous to the runtime as the weight of these functions may explode as the runtime state or input becomes too complex.
The benchmarking framework comes with the following tools:
- A set of macros (
benchmarks!
,add_benchmark!
, etc...) to make it easy to write, test, and add runtime benchmarks. - A set of linear regression analysis functions for processing benchmark data.
- A CLI extension to make it easy to execute benchmarks on your node.
The end-to-end benchmarking pipeline is disabled by default when compiling a node. If you want to
run benchmarks, you need to enable it by compiling with a Rust feature flag runtime-benchmarks
.
More details about this below.
Substrate represents computational resources using a generic unit of measurement called "Weight". It defines 10^12 Weight as 1 second of computation on the physical machine used for benchmarking. This means that the weight of a function may change based on the specific hardware used to benchmark the runtime functions.
By modeling the expected weight of each runtime function, the blockchain is able to calculate how many transactions or system level functions it will be able to execute within a certain period of time. Often, the limiting factor for a blockchain is the fixed block production time for the network.
Within FRAME, each dispatchable function must have a #[weight]
annotation with a function that can
return the expected weight for the worst case scenario execution of that function given its inputs.
This benchmarking framework will result in a file that automatically generates those formulas for
you, which you can then use in your pallet.
Writing a runtime benchmark is much like writing a unit test for your pallet. It needs to be carefully crafted to execute a certain logical path in your code. In tests you want to check for various success and failure conditions, but with benchmarks you specifically look for the most computationally heavy path, a.k.a the "worst case scenario".
This means that if there are certain storage items or runtime state that may affect the complexity
of the function, for example triggering more iterations in a for
loop, to get an accurate result,
you must set up your benchmark to trigger this.
It may be that there are multiple paths your function can go down, and it is not clear which one is
the heaviest. In this case, you should just create a benchmark for each scenario! You may find that
there are paths in your code where complexity may become unbounded depending on user input. This may
be a hint that you should enforce sane boundaries for how a user can use your pallet. For example:
limiting the number of elements in a vector, limiting the number of iterations in a for
loop,
etc...
Examples of end-to-end benchmarks can be found in the pallets provided by Substrate, and the
specific details on how to use the benchmarks!
macro can be found in its
documentation.
You can test your benchmarks using the same test runtime that you created for your pallet's unit
tests. By creating your benchmarks in the benchmarks!
macro, it automatically generates test
functions for you:
fn test_benchmark_[benchmark_name]<T>::() -> Result<(), &'static str>
Simply add these functions to a unit test and ensure that the result of the function is Ok(())
.
Note: If your test runtime and production runtime have different configurations, you may get different results when testing your benchmark and actually running it.
In general, benchmarks returning Ok(())
is all you need to check for since it signals the executed
extrinsic has completed successfully. However, you can optionally include a verify
block with your
benchmark, which can additionally verify any final conditions, such as the final state of your
runtime.
These additional verify
blocks will not affect the results of your final benchmarking process.
To run the tests, you need to enable the runtime-benchmarks
feature flag. This may also mean you
need to move into your node's binary folder. For example, with the Substrate repository, this is how
you would test the Balances pallet's benchmarks:
cargo test -p pallet-balances --features runtime-benchmarks
NOTE: Substrate uses a virtual workspace which does not allow you to compile with feature flags.
error: --features is not allowed in the root of a virtual workspace`
To solve this, navigate to the folder of the node (
cd bin/node/cli
) or pallet (cd frame/pallet
) and run the command there.
The benchmarks included with each pallet are not automatically added to your node. To actually
execute these benchmarks, you need to implement the frame_benchmarking::Benchmark
trait. You can
see an example of how to do this in the included Substrate
node.
Assuming there are already some benchmarks set up on your node, you just need to add another
instance of the add_benchmark!
macro:
/// configuration for running benchmarks
/// | name of your pallet's crate (as imported)
/// v v
add_benchmark!(params, batches, pallet_balances, Balances);
/// ^ ^
/// where all benchmark results are saved |
/// the `struct` created for your pallet by `construct_runtime!`
Once you have done this, you will need to compile your node binary with the runtime-benchmarks
feature flag:
cd bin/node/cli
cargo build --release --features runtime-benchmarks
Finally, once you have a node binary with benchmarks enabled, you need to execute your various benchmarks.
You can get a list of the available benchmarks by running:
./target/release/substrate benchmark --chain dev --pallet "*" --extrinsic "*" --repeat 0
Then you can run a benchmark like so:
./target/release/substrate benchmark \
--chain dev \ # Configurable Chain Spec
--execution=wasm \ # Always test with Wasm
--wasm-execution=compiled \ # Always used `wasm-time`
--pallet pallet_balances \ # Select the pallet
--extrinsic transfer \ # Select the extrinsic
--steps 50 \ # Number of samples across component ranges
--repeat 20 \ # Number of times we repeat a benchmark
--output <path> \ # Output benchmark results into a folder or file
This will output a file pallet_name.rs
which implements the WeightInfo
trait you should include
in your pallet. Each blockchain should generate their own benchmark file with their custom
implementation of the WeightInfo
trait. This means that you will be able to use these modular
Substrate pallets while still keeping your network safe for your specific configuration and
requirements.
The benchmarking CLI uses a Handlebars template to format the final output file. You can optionally
pass the flag --template
pointing to a custom template that can be used instead. Within the
template, you have access to all the data provided by the TemplateData
struct in the
benchmarking CLI writer. You can find the
default template used here.
There are some custom Handlebars helpers included with our output generation:
underscore
: Add an underscore to every 3rd character from the right of a string. Primarily to be used for delimiting large numbers.join
: Join an array of strings into a space-separated string for the template. Primarily to be used for joining all the arguments passed to the CLI.
To get a full list of available options when running benchmarks, run:
./target/release/substrate benchmark --help
License: Apache-2.0