Description
I am getting an error while trying to run the command below.
Step 2: Generate the quantized model with INT4 weights, providing the checkpoint file name via `--low-precision-checkpoint`:

```shell
python single_instance/run_llama_quantization.py --ipex-weight-only-quantization --output-dir "saved_results" --int8-bf16-mixed -m meta-llama/Llama-2-7b-chat-hf --low-precision-checkpoint "saved_results/gptq_checkpoint.pt"
```
Do I need to install IPEX from source rather than via pip?
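As a first debugging step, it may help to confirm which IPEX build is actually installed in the environment before deciding between a pip wheel and a source build. The snippet below is a minimal, hypothetical check using only the standard library; it assumes the PyPI distribution name `intel-extension-for-pytorch` and simply reports the installed version string (or that the package is missing) so it can be compared against the version the quantization script expects.

```python
from importlib import metadata


def installed_version(dist_name: str = "intel-extension-for-pytorch") -> str:
    """Return the installed version of a distribution, or a note if missing.

    dist_name defaults to the assumed PyPI name of IPEX; the same helper
    works for any installed package.
    """
    try:
        return metadata.version(dist_name)
    except metadata.PackageNotFoundError:
        return f"{dist_name} is not installed"


if __name__ == "__main__":
    print(installed_version())
```

Running this in the same environment used for the command above shows whether IPEX is installed at all, and the exact version string can then be matched against the release the example script was written for.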