Commit 5a77b59
authored
Keep quantization enabled during calibration (#1299)
## Purpose ##
* Revert the behavior regression introduced as a result of #1114
* When calibrating a model using the `QuantizationModifier`,
quantization should be enabled when calibrating
## Changes ##
* Remove "disabling quantization" from the calibration forward pass
* Add "disabling quantization" to the sequential pipelines in order to
continue to disable quantization during calibration for GPTQ and SGPT
* When [calibration pipelines become shared between modifiers](#1279),
the decision of whether to disabling quantization during calibration
will have to be moved to the calibration pipelines themselves. Some work
needs to be done to demonstrate that GPTQ and SGPT do not suffer
accuracy regression from enabling activation quantization during
calibration (in theory, the change should increase accuracy)
---------
Signed-off-by: Kyle Sayers <[email protected]>1 parent 30d45c5 commit 5a77b59
File tree
4 files changed
+8
-9
lines changed- src/llmcompressor
- pipelines
- layer_sequential
- sequential
- utils
- tests/llmcompressor/utils
4 files changed
+8
-9
lines changed| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
12 | 12 | | |
13 | 13 | | |
14 | 14 | | |
15 | | - | |
| 15 | + | |
16 | 16 | | |
17 | 17 | | |
18 | 18 | | |
| |||
51 | 51 | | |
52 | 52 | | |
53 | 53 | | |
54 | | - | |
| 54 | + | |
55 | 55 | | |
56 | 56 | | |
57 | 57 | | |
| |||
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
8 | 8 | | |
9 | 9 | | |
10 | 10 | | |
11 | | - | |
| 11 | + | |
12 | 12 | | |
13 | 13 | | |
14 | 14 | | |
| |||
50 | 50 | | |
51 | 51 | | |
52 | 52 | | |
53 | | - | |
| 53 | + | |
54 | 54 | | |
55 | 55 | | |
56 | 56 | | |
| |||
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
1013 | 1013 | | |
1014 | 1014 | | |
1015 | 1015 | | |
1016 | | - | |
| 1016 | + | |
1017 | 1017 | | |
1018 | 1018 | | |
1019 | 1019 | | |
| |||
1040 | 1040 | | |
1041 | 1041 | | |
1042 | 1042 | | |
1043 | | - | |
1044 | 1043 | | |
1045 | 1044 | | |
1046 | 1045 | | |
1047 | 1046 | | |
1048 | 1047 | | |
1049 | | - | |
1050 | 1048 | | |
1051 | 1049 | | |
1052 | 1050 | | |
| |||
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
134 | 134 | | |
135 | 135 | | |
136 | 136 | | |
| 137 | + | |
137 | 138 | | |
138 | 139 | | |
139 | 140 | | |
140 | | - | |
141 | 141 | | |
| 142 | + | |
142 | 143 | | |
143 | | - | |
144 | 144 | | |
| 145 | + | |
0 commit comments