Commit 38fefd1

Fix the mirror Issue and update the Docs.
1 parent afe8571 commit 38fefd1

2 files changed: 2 additions, 6 deletions

docs/api_reference/quantization.rst

Lines changed: 1 addition & 1 deletion
@@ -351,4 +351,4 @@ Resource Requirements
 | GGUF | Low | Very Fast | Good |
 +-------------+------------+-------------+------------+
 
-For a detailed example of direct quantizer usage, you can adapt the `QuantizerFactory` example by instantiating the chosen quantizer directly and calling its methods.
+For detailed examples of using GGUF quantization, check out the examples in the `examples/` directory or refer to the interactive tutorial in `testing.ipynb`.

quantllm/quant/__init__.py

Lines changed: 1 addition & 5 deletions
@@ -1,19 +1,15 @@
-"""Quantization functionality for LLMs."""
+"""GGUF Quantization for Large Language Models."""
 
 from .quantization_engine import (
     QuantizationConfig,
     QuantizedLinear,
     QuantizationEngine
 )
-from .gptq import GPTQQuantizer
-from .awq import AWQQuantizer
 from .gguf import GGUFQuantizer
 
 __all__ = [
     "QuantizationConfig",
     "QuantizedLinear",
     "QuantizationEngine",
-    "GPTQQuantizer",
-    "AWQQuantizer",
     "GGUFQuantizer"
 ]
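
Note on the effect of this hunk: because `GPTQQuantizer` and `AWQQuantizer` are no longer imported in `quantllm/quant/__init__.py`, a statement such as `from quantllm.quant import GPTQQuantizer` would presumably raise `ImportError` after this commit, unless the names are re-exported elsewhere. The sketch below shows one way downstream code might probe which quantizer names a module exposes. It deliberately does not import the real package: `quant_stub` and the helper `available_quantizers` are hypothetical stand-ins so the snippet runs anywhere.

```python
import importlib
import sys
import types


def available_quantizers(module_name, candidates):
    """Return the candidate names that the named module actually exposes."""
    module = importlib.import_module(module_name)
    return [name for name in candidates if hasattr(module, name)]


# Hypothetical stand-in that mimics the exports left after this commit.
# The real quantllm.quant package is NOT imported here.
_stub = types.ModuleType("quant_stub")
_stub.GGUFQuantizer = type("GGUFQuantizer", (), {})
_stub.__all__ = ["GGUFQuantizer"]
sys.modules["quant_stub"] = _stub

found = available_quantizers(
    "quant_stub", ["GPTQQuantizer", "AWQQuantizer", "GGUFQuantizer"]
)
print(found)  # ['GGUFQuantizer']
```

Against an installed copy of the library, the same helper could be pointed at `quantllm.quant` instead of the stub, assuming the package layout shown in this diff.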
