
Pull requests: vllm-project/vllm


Pull requests list

[CI] Install pre-release version of apache-tvm-ffi for flashinfer
Labels: ci/build, ready
#27262 opened Oct 21, 2025 by hmellor
Remove last level references not removed in #26355
Labels: llama
#27260 opened Oct 21, 2025 by hmellor
Bugfix: Cutlass FP8 FusedMoE
#27255 opened Oct 21, 2025 by amirkl94
[MISC] Add prefix cache reset to LMCache CPU offload example
Labels: documentation, kv-connector
#27248 opened Oct 21, 2025 by sakunkun
5 tasks
[WIP][Model] Upstream Deepseek-OCR model
Labels: deepseek, new-model
#27247 opened Oct 21, 2025 by Isotr0py Draft
5 tasks
[WIP] Support DeepSeek-OCR
Labels: deepseek, documentation, new-model
#27246 opened Oct 21, 2025 by Xu-Wenqing Draft
5 tasks
[CPU]Improve cpu fused moe perf
#27244 opened Oct 21, 2025 by xiangze-arm
[Bug] Qwen reasoning parser
Labels: qwen, ready
#27241 opened Oct 21, 2025 by ahao-anyscale Draft
5 tasks
[CPU]Improve dynamic 4bit moe performance
#27240 opened Oct 21, 2025 by xiangze-arm
[ResponseAPI] Fix mcp tool type extraction
Labels: frontend, gpt-oss
#27234 opened Oct 21, 2025 by Jialin
3 of 5 tasks
Adds runai distributed streamer
Labels: ci/build, documentation, rocm
#27230 opened Oct 20, 2025 by bbartels
5 tasks
[Feature] Batch Invariant for R1 TP 8 on Blackwell
Labels: ready
#27229 opened Oct 20, 2025 by yewentao256
[ROCm][MLA] Support block-size > 1 for AITER MLA backend
Labels: rocm, v1
#27224 opened Oct 20, 2025 by ganyi1996ppo
5 tasks
Flashinfer_CUTLASS_MOE fuses quantization for TP
#27223 opened Oct 20, 2025 by wenscarl
5 tasks
[CORE] Support Prefix Caching with Prompt Embeds
Labels: documentation, v1
#27219 opened Oct 20, 2025 by qthequartermasterman
3 of 5 tasks