This directory contains configuration definition for IREE's continuous benchmarks suite. Benchmark results are posted to https://perf.iree.dev.
The https://buildkite.com/iree/iree-benchmark Buildkite pipeline runs on each
commit to the main
branch and posts those results to the dashboard. The
pipeline also runs on pull requests with the buildkite:benchmark
label,
posting results compared against their base commit as comments.
├── TensorFlow
│ * models authored in TensorFlow and imported with `iree-import-tf`
└── TFLite
* models converted to TensorFlow Lite and imported with `iree-import-tflite`
-
Pick the model you want to benchmark and find its source, which could be a Python script, TensorFlow SavedModel from https://tfhub.dev/, TensorFlow Lite FlatBuffer, or some other format with a supported path into IREE. The model can optionally include trained weights if those are important for benchmarking.
-
Import the model into an MLIR file that IREE can compile using the core
iree-translate
tool. For TensorFlow models useiree-import-tf
, for TensorFlow Lite models useiree-import-tflite
, etc. Take notes for where the model came from and how it was imported in case the MLIR file needs to be regenerated in the future.We may further automate this over time, such as by importing from Python sources as part of the benchmarks pipeline directly (see iree-org#6942). For now, here are some references:
-
Package the imported .mlir model file(s) for storage (see iree_mlir_benchmark_suite.cmake and download_file.py), then upload them to the
iree-model-artifacts
Google Cloud Storage bucket with the help of a team member. Files currently hosted in that bucket can be viewed at https://storage.googleapis.com/iree-model-artifacts/index.html. -
Edit the appropriate
CMakeLists.txt
file under this directory to include your desired benchmark configuration with theiree_mlir_benchmark_suite
function. You can test your change by running the https://buildkite.com/iree/iree-benchmark pipeline on a GitHub pull request with thebuildkite:benchmark
label. -
Once your changes are merged to the
main
branch, results will start to appear on the benchmarks dashboard at https://perf.iree.dev.
TODO(#6161): Collect metrics for miscellaneous IREE system states
These are ad-hoc notes added for developers to help triage errors.
These steps help reproduce the failures in TFLite models.
-
Install
iree-import-tflite
.$ python -m pip install iree-tools-tflite -f https://github.com/google/iree/releases
-
Confirm the binary
iree-import-tflite
is in your path by running$ iree-import-tflite --help
-
Download the TFLite flatbuffer for the failing benchmarks. The location can be found from this CMakeLists.txt file
-
The input TOSA model can be generated by running
$ iree-import-tflite <tflite file> -o <tosa output file>
-
The exact flags used to compile and run the benchmarks can be found in this CMakeLists.txt file