forked from openvinotoolkit/nncf
-
Notifications
You must be signed in to change notification settings - Fork 0
Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
NF4 per-channel support for AWQ and Scale Estimation (openvinotoolkit…
…#2898) ### Changes Supported NF4 mode for Scale Estimation and AWQ. All results below were collected w/ and w/o Scale estimation algorithms and w/ Lora Correction algorithm. ![image](https://github.com/user-attachments/assets/eaab96a9-f7c1-438c-9aef-99a37794b10f) ### Reason for changes NF4 per-channel with scale estimation may give promising results for NPU, since the accuracy is on par with int4 group-wise quantization. ### Related tickets 150560 ### Tests - [x] OV 2024.5 job/NNCF/job/manual/job/post_training_weight_compression/182 ![image](https://github.com/user-attachments/assets/f76d40b8-70ab-4cb1-8f6b-eba7d9a41c02) - [x] OV 2024.4 job/NNCF/job/manual/job/post_training_weight_compression/181 ![image](https://github.com/user-attachments/assets/899bebc2-4a1a-4ae2-9740-6eba8d3a0fd2) - [x] OV 2024.3 job/NNCF/job/manual/job/post_training_weight_compression/180 ![image](https://github.com/user-attachments/assets/338a0edb-9425-4868-9a1d-7d1b5eec1631)
- Loading branch information
1 parent
d9b3f38
commit 05f37f5
Showing
14 changed files
with
139 additions
and
67 deletions.
There are no files selected for viewing
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Oops, something went wrong.