Skip to content

Pull requests: vllm-project/llm-compressor

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

Updating API docs
#1787 opened Aug 29, 2025 by aireilly Draft
Recovered skipped w8a8 compression related tests ready When a PR is ready for review
#1785 opened Aug 29, 2025 by shanjiaz Draft
[MXFP4] Add mxfp4 support
#1783 opened Aug 28, 2025 by dsikka Draft
[Transform] Support separating v and u transforms of quip ready When a PR is ready for review
#1782 opened Aug 27, 2025 by kylesayrs Loading…
[Transform] Spinquant R3 ready When a PR is ready for review
#1778 opened Aug 27, 2025 by kylesayrs Loading…
Fix wonky dependency range on datasets
#1774 opened Aug 22, 2025 by timkpaine Loading…
[MoE] MoE Calibration with calibrate_all_experts
#1760 opened Aug 19, 2025 by kylesayrs Loading…
[Tracing] Decouple vision tower from first layer ready When a PR is ready for review
#1710 opened Aug 6, 2025 by kylesayrs Loading…
[WIP] [MoE] GPT OSS
#1705 opened Aug 5, 2025 by kylesayrs Draft
[Example] [VLM] Gemma3n
#1696 opened Jul 31, 2025 by kylesayrs Draft
[Autowrapper] Support Gemma3n, autowrapper improvements ready When a PR is ready for review
#1693 opened Jul 30, 2025 by kylesayrs Loading…
1686 Logic matching refactor
#1687 opened Jul 28, 2025 by ved1beta Loading…
add quantization_w4a4_fp4 qwen3 example
#1681 opened Jul 24, 2025 by wangwenmingaa Loading…
[KV Cache] support kv cache int8 per channel quantization ready When a PR is ready for review
#1663 opened Jul 19, 2025 by Eviannn Loading…
[Transform] Online Rotations
#1651 opened Jul 16, 2025 by kylesayrs Draft
[GPTQ] Use torch.compile to speed up gptq algo ready When a PR is ready for review
#1561 opened Jun 17, 2025 by aladerran Loading…
ProTip! Type g p on any issue or pull request to go back to the pull request listing page.