[AWQ] Expand scale dims to match activation dims #3746
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
Changes
AWQ: multiply scale shape is expanded to match the activation shape length: Mamba models have an AWQ pattern which has 3 dims, and insertion of default 2d scale leads to an error during inference
Reason for changes
To support AWQ algo for mamba models
Related tickets
173277
Tests
tests/cross_fw/test_templates/template_test_weights_compression.py::test_awq_scale_reference is updated to test the non mergable AWQ branch, testing the branch + new reshape implementation