Skip to content

Question Regarding Code Execution in the Calibration Process. #6629

Open
@crinex

Description

@crinex

Dear @cccclai

I’m reviewing the code while following the guide(Export with Spinquant) you provided for converting the Llama3.2-3B-Instruct model with Qualcomm SpinQuant. When I execute the _export_llama function in the export_llama_lib.py file, the pt2e_quantize(quantizers) function is called. Within this function, the pt2e_calibrate function is executed before the convert_pt2e function. Why is pt2e_calibrate performed before convert_pt2e here?
Generally, wouldn't it make more sense to perform calibration after quantization?

Thank you

Metadata

Metadata

Assignees

Labels

module: quantizationIssues related to quantizationtriagedThis issue has been looked at a team member, and triaged and prioritized into an appropriate module

Type

No type

Projects

No projects

Milestone

No milestone

Relationships

None yet

Development

No branches or pull requests

Issue actions