-
Notifications
You must be signed in to change notification settings - Fork 227
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Andreyan/awq extention #2708
Andreyan/awq extention #2708
Conversation
Could you post a job number with install and weight compression tests? Just to make sure it doesn't lead to degradation. |
nncf/quantization/algorithms/weight_compression/openvino_backend.py
Outdated
Show resolved
Hide resolved
Codecov ReportAttention: Patch coverage is
Additional details and impacted files@@ Coverage Diff @@
## develop #2708 +/- ##
============================================
+ Coverage 47.12% 91.18% +44.06%
============================================
Files 479 483 +4
Lines 46072 46412 +340
============================================
+ Hits 21712 42323 +20611
+ Misses 24360 4089 -20271
... and 339 files with indirect coverage changes
Flags with carried forward coverage won't be shown. Click here to find out more.
|
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
LGTM
Changes
Extended AWQ algorithms for patterns Act->MatMul and Act->Multiply->MatMul with insertion for extra scales after activation.
Reason for changes
Support AWQ for wider family of LLMs
Related tickets
CVS-141131
Tests
Added unit tests