WARNING 04-15 15:50:49 config.py:211] awq quantization is not fully optimized yet. The speed can be slower than non-quantized models.? #4101
Unanswered
silvacarl2
asked this question in
Q&A
Replies: 0 comments
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
-
Is this anytihng we need to be concerned about during inferencing?
WARNING 04-15 15:50:49 config.py:211] awq quantization is not fully optimized yet. The speed can be slower than non-quantized models.
because it seems REALLY REALLY FAST already.
8-)
Beta Was this translation helpful? Give feedback.
All reactions