This directory contains generated test suites for running through IREE's compiler and runtime tools.
The difference between this directory and iree_tests is that models here have the flexibility to use custom flags, tolerances, and configurations per backend and model, whereas in iree_tests we stick with the most basic backend configurations, fixed for every model.
Each model added has one folder containing a few files:
[model name]/
  model.mlirbc          (source MLIR bytecode)
  test_cases.json       (specifies weight and input/output remote files to download from Azure)
  test_<model_name>.py  (Python file invoked for testing; contains all configurations)
Where:

- model.mlirbc is in a bytecode format that is ready for use with iree-compile (e.g. torch-mlir, stablehlo, tosa, linalg).
- test_cases.json specifies the weight and input/output remote files to download from Azure.
- test_<model_name>.py is a Python file that is called for testing and contains all configurations.
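For orientation, here is a minimal sketch of what a test_cases.json might contain; the field names ("test_cases", "name", "remote_files") and the URLs are illustrative assumptions, not the suite's actual schema:

```python
import json

# Hypothetical test_cases.json contents. The key names and URLs below are
# placeholders for illustration, not this suite's real schema.
test_cases = {
    "test_cases": [
        {
            "name": "real_weights",
            "remote_files": [
                "https://<account>.blob.core.windows.net/<container>/inference_input.0.bin",
                "https://<account>.blob.core.windows.net/<container>/inference_output.0.bin",
                "https://<account>.blob.core.windows.net/<container>/real_weights.irpa",
            ],
        }
    ]
}

with open("test_cases.json", "w") as f:
    json.dump(test_cases, f, indent=2)
```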
Testing follows several stages:
graph LR
Import -. "\n(offline)" .-> Compile
Compile --> Run
Importing is run "offline" and the outputs are checked in to the repository for
ease of use in downstream projects and by developers who prefer to work directly
with .mlir
files and native (C/C++) tools. Each test suite or test case may
also have its own import logic, with all test suites converging onto the
standard format described above.
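To make the Compile and Run stages concrete, here is a minimal sketch that shells out to iree-compile and iree-run-module, assuming a CPU target; the file names, entry point name ("main"), and parameter scope are placeholders, and the real test files in this suite pass model-specific flags and tolerances:

```python
import subprocess

# Compile stage: compile the checked-in bytecode for the llvm-cpu backend.
# Per-model flags beyond --iree-hal-target-backends are omitted here.
subprocess.run(
    [
        "iree-compile",
        "model.mlirbc",
        "--iree-hal-target-backends=llvm-cpu",
        "-o", "model.vmfb",
    ],
    check=True,
)

# Run stage: execute with the downloaded weights and inputs and compare
# against the expected outputs. All file names are placeholders.
subprocess.run(
    [
        "iree-run-module",
        "--module=model.vmfb",
        "--device=local-task",
        "--function=main",
        "--parameters=model=real_weights.irpa",
        "--input=@inference_input.0.npy",
        "--expected_output=@inference_output.0.npy",
    ],
    check=True,
)
```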
For the special SD models that have been added so far, the MLIR, input/output, and weight files have all been generated using SHARK-Turbine (see this SHARK-Turbine example).
Some large files are stored using Git LFS. When working with these files, please ensure that you have Git LFS installed:
$ git lfs install
Files that are too large for Git LFS (e.g. model weights) are stored on cloud providers. Download these files with download_remote_files.py:
# All files
$ python download_remote_files.py
# Just files for one subdirectory
$ python download_remote_files.py --root-dir iree_special_models/sdxl/prompt-encoder
Tests are run using the pytest framework: we simply run pytest on the Python test file in each model folder, which defines all the configurations to run the model under.
$ python -m venv .venv
$ source .venv/bin/activate
$ python -m pip install -r iree_tests/requirements.txt
$ python -m pip install -r iree_special_models/requirements.txt
$ pip install --no-compile --pre --upgrade -e common_tools
To use iree-compile and iree-run-module from Python packages:
$ python -m pip install --find-links https://iree.dev/pip-release-links.html \
iree-compiler iree-runtime --upgrade
To use local versions of iree-compile and iree-run-module, put them on your $PATH ahead of your .venv/bin directory:
$ export PATH=path/to/iree-build:$PATH
$ python3 iree_tests/download_remote_files.py --root-dir iree_special_models
Run tests:
$ pytest iree_special_models
Run tests with parallelism (using pytest-xdist):
$ pytest iree_special_models -n auto
Run tests for specific backend:
$ pytest iree_special_models -k rocm
# OR
$ pytest iree_special_models -k cpu
Run tests from a specific subdirectory:
$ pytest iree_special_models/sdxl
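The -k filters above select tests by name, which works because each test file bakes the backend into its test ids. As a hedged illustration (not this suite's actual fixtures or flags), a test file might parameterize its configurations like so:

```python
import pytest

# Hypothetical per-backend compile flags; the real test files define their own
# model-specific flags and tolerances.
COMPILE_FLAGS = {
    "cpu": ["--iree-hal-target-backends=llvm-cpu"],
    "rocm": ["--iree-hal-target-backends=rocm"],
}

@pytest.mark.parametrize("backend", sorted(COMPILE_FLAGS))
def test_model(backend):
    flags = COMPILE_FLAGS[backend]
    # Compile and run as in the earlier sketch. Because the backend name is
    # part of the test id (e.g. test_model[rocm]), `pytest -k rocm` selects it.
    assert flags
```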
Warning: UNDER CONSTRUCTION - this will change!
- Set up a venv for the SHARK-Turbine and iree-turbine repos by following the GitHub workflow file in SHARK-Turbine for test_models:
  $ python3.11 -m venv turbine_venv
  $ source turbine_venv/bin/activate
  $ python3.11 -m pip install --upgrade pip
  # Note: We install in three steps in order to satisfy requirements
  # from non default locations first. Installing the PyTorch CPU
  # wheels saves multiple minutes and a lot of bandwidth on runner setup.
  $ pip install --no-compile -r <path_to_iree-turbine>/pytorch-cpu-requirements.txt
  $ pip install --no-compile --pre --upgrade -r <path_to_iree-turbine>/requirements.txt
  $ pip install --no-compile --pre -e <path_to_iree-turbine>[testing]
  $ pip install --upgrade --pre --no-cache-dir iree-compiler iree-runtime -f https://iree.dev/pip-release-links.html
  $ pip install --no-compile --pre --upgrade -e <path_to_shark-turbine>/models -r <path_to_shark-turbine>/models/requirements.txt
  Notes:
  - You may need to downgrade numpy:
    $ pip uninstall numpy
    $ pip install "numpy<2.0"
- Run the model tools from SHARK-Turbine to generate artifact files:
  SHARK-Turbine$ python models/turbine_models/custom_models/sdxl_inference/clip.py
  SHARK-Turbine$ python models/turbine_models/custom_models/sdxl_inference/clip_runner.py
  We want the program .mlir file, the input/output .npy files, and the weight .irpa file. Make sure to set the appropriate flags here.
- Upload the inference_input, inference_output, and real_weights.irpa files to Azure (e.g. using Azure Storage Explorer).
- Add a test_cases.json pointing at the uploaded remote files and the source MLIR to its own test folder. Also add a test Python file to the folder that tests all the configurations for the model. Take a look at this example.
For examples of how to generate model artifacts from different repos/tools, and an appendix on tools for working with weights and bytecode MLIR files, you can find more information here. Just make sure that you adhere to the three-file structure specified above.
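As one small example of working with the artifacts, the .npy input/output files can be sanity-checked with plain numpy before uploading; the file name below is a placeholder:

```python
import numpy as np

# Inspect a generated input/output artifact before uploading it.
arr = np.load("inference_input.0.npy")
print(arr.shape, arr.dtype)
```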