
return none when use torch.load to load a quantize_model #126

Open
ev-day opened this issue Sep 20, 2024 · 1 comment
ev-day commented Sep 20, 2024

Describe the bug
When I use quantize_model.py (in the script folder) to quantize the ChatGLM3 model, the model is saved successfully, but when I try to load it with torch.load(), it returns None.

To Reproduce
Steps to reproduce the behavior:
1. Run `python quantize_model.py -m .\chatglm3-6b -b 4 -o .\chatglm3-6b` to quantize the model; it exports the files shown below.
[Screenshot 2024-09-20 030909]
2. Try to load the model.
[Screenshot 2024-09-20 031329]
3. torch.load() returns None.
[Screenshot 2024-09-20 031423]

Expected behavior:
It should return a model object.
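One thing worth checking: `torch.load()` returns exactly the Python object that `torch.save()` wrote, so if the quantization script serialized only a state_dict (or accidentally serialized `None`), loading will not give back a model instance. A minimal sketch of that behavior (the file paths and the small `nn.Linear` module are illustrative only, not part of the quantization script):

```python
import os
import tempfile

import torch
import torch.nn as nn

model = nn.Linear(4, 2)

with tempfile.TemporaryDirectory() as d:
    path = os.path.join(d, "model.pt")

    # Common pattern: only the weights are serialized, not the module.
    torch.save(model.state_dict(), path)
    obj = torch.load(path)
    print(type(obj))  # a dict of tensors, not an nn.Module

    # If the saver wrote None, torch.load faithfully returns None.
    # (weights_only=False because newer torch defaults to weights_only=True.)
    torch.save(None, path)
    print(torch.load(path, weights_only=False))
```

So a `None` return usually points at what was saved, not at `torch.load()` itself; inspecting the exported file's contents would narrow this down.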

ev-day commented Sep 20, 2024

Alternatively, I can quantize the model in memory with `model = intel_npu_acceleration_library.compile(model, dtype=int4)`; however, this has to quantize and compile the model from scratch every time, which takes a long time.
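For context, the in-memory path described above might look roughly like the sketch below. This assumes the top-level `compile` entry point of intel_npu_acceleration_library and an `int4` dtype import (the exact import path for `int4` is an assumption); it also needs an Intel NPU and the local chatglm3-6b checkpoint, so it is illustrative only, not a tested recipe.

```python
import torch
from transformers import AutoModelForCausalLM

import intel_npu_acceleration_library
from intel_npu_acceleration_library.dtypes import int4  # import path assumed

# Load the full-precision checkpoint (local path from the repro steps above).
model = AutoModelForCausalLM.from_pretrained(
    "./chatglm3-6b", torch_dtype=torch.float16, trust_remote_code=True
)

# Quantize and compile for the NPU in memory. This runs on every launch,
# which is why it is much slower than loading a pre-quantized file once.
model = intel_npu_acceleration_library.compile(model, dtype=int4)
```

The trade-off is exactly the one the comment describes: no intermediate file, but the quantization cost is paid on every run.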
