Replies: 1 comment
-
@xdd130 Does your model have any weight of shape 10304? Since 10304 = 64 * 161, you should set -gs=64 in the command.
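A quick way to check this reasoning is to test which group sizes evenly divide the reported dimension, and optionally to scan the original (pre-quantization) checkpoint for other dimensions that a group size of 128 would not divide. This is only an illustrative sketch; the checkpoint filename is a placeholder, not taken from this thread.

```python
# Sanity check: 10304 = 64 * 161, so a group size of 128 does not divide it,
# while 64 does -- hence the suggestion to pass -gs=64.
dim = 10304
for group_size in (32, 64, 128, 256):
    print(f"group_size={group_size}: divides {dim} evenly -> {dim % group_size == 0}")

# Optional: scan the original (unquantized) model checkpoint for weight
# dimensions that are not divisible by 128. "model.safetensors" is a
# placeholder path for the downloaded HF checkpoint.
from safetensors import safe_open

with safe_open("model.safetensors", framework="pt") as f:
    for name in f.keys():
        shape = f.get_slice(name).get_shape()
        if any(d % 128 != 0 for d in shape):
            print(name, shape)
```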
-
I tried using the AutoGPTQ tool to quantize the Qwen2.5-3B-Instruct model to 3 bits. I successfully obtained the model in GPTQ format, but when I compiled it with T-MAC:
I got the following error:
Does this indicate a problem with my quantization step, or that T-MAC does not support 3-bit models directly? What additional steps do I need to take?
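If the issue turns out to be the group size rather than 3-bit support itself, re-quantizing with an explicit group size that divides every weight dimension (e.g. 64, matching the -gs=64 suggestion above) may help. Below is a minimal sketch following AutoGPTQ's basic-usage pattern; the output directory and calibration text are placeholders, and the exact settings T-MAC expects should be checked against its documentation.

```python
from transformers import AutoTokenizer
from auto_gptq import AutoGPTQForCausalLM, BaseQuantizeConfig

model_id = "Qwen/Qwen2.5-3B-Instruct"
quantized_dir = "qwen2.5-3b-instruct-gptq-3bit-gs64"  # placeholder output path

tokenizer = AutoTokenizer.from_pretrained(model_id, use_fast=True)

# A real run should use a proper calibration set; a single sentence is only
# here to keep the sketch self-contained.
examples = [tokenizer("T-MAC runs low-bit LLM inference on CPU.", return_tensors="pt")]

quantize_config = BaseQuantizeConfig(
    bits=3,         # 3-bit weights, as in the question
    group_size=64,  # divides every weight dimension, per the reply above
    desc_act=False,
)

model = AutoGPTQForCausalLM.from_pretrained(model_id, quantize_config)
model.quantize(examples)
model.save_quantized(quantized_dir)
```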