An onnx-based quantitation tool.

Features

Calibration/Quantization: Load model.onnx, calibrate and store it as int8.onnx.
Reformat optimization: Process reformat automatically and achieve the best speed and minimal accuracy drop.
No pytorch dependencies.

Usage

from onnx_quant import quantonnx
import numpy as np

with quantonnx("yolov5s.onnx", save="yolov5s.int8.onnx") as m:

    # ops
    m.disable("Concat_40", "Conv_41")

    # tensors
    m.disable_afters("462", "422", "382")

    x = np.random.randn(1, 3, 640, 640).astype(np.float32)
    m.collect(x)

print("Done.")

Name		Name	Last commit message	Last commit date
Latest commit History 4 Commits
.DS_Store		.DS_Store
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
main.py		main.py
onnx_quant.py		onnx_quant.py
quant.png		quant.png
yolov5s.onnx		yolov5s.onnx

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

An onnx-based quantitation tool.

Features

Usage

About

Releases

Packages

Languages

License

jhzhang19/onnx_quant_tool

Folders and files

Latest commit

History

Repository files navigation

An onnx-based quantitation tool.

Features

Usage

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages