-
-
Notifications
You must be signed in to change notification settings - Fork 10.7k
Pull requests: vllm-project/vllm
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
[CI] Install pre-release version of ONLY add when PR is ready to merge/full CI is needed
apache-tvm-ffi
for flashinfer
ci/build
ready
#27262
opened Oct 21, 2025 by
hmellor
Loading…
Remove last Related to Llama models
level
references not removed in #26355
llama
#27260
opened Oct 21, 2025 by
hmellor
Loading…
Fix EventPublisherFactory logic for disabled KV cache events and publisher is "null"
#27257
opened Oct 21, 2025 by
usberkeley
Loading…
5 tasks done
Updated xgrammar backend to not deny supported string formats
structured-output
v1
#27253
opened Oct 21, 2025 by
ExtReMLapin
Loading…
3 of 5 tasks
[MISC] Add prefix cache reset to LMCache CPU offload example
documentation
Improvements or additions to documentation
kv-connector
#27248
opened Oct 21, 2025 by
sakunkun
Loading…
5 tasks
[WIP] Support DeepSeek-OCR
deepseek
Related to DeepSeek models
documentation
Improvements or additions to documentation
new-model
Requests to new models
#27246
opened Oct 21, 2025 by
Xu-Wenqing
•
Draft
5 tasks
Mirroring changes in test-pipeline.yaml into test-amd.yaml
ci/build
rocm
Related to AMD ROCm
#27242
opened Oct 21, 2025 by
Alexei-V-Ivanov-AMD
Loading…
[Bug] Qwen reasoning parser
qwen
Related to Qwen models
ready
ONLY add when PR is ready to merge/full CI is needed
#27241
opened Oct 21, 2025 by
ahao-anyscale
•
Draft
5 tasks
v1/kv_cache_utils: Respect num_gpu_blocks_override in memory check
v1
#27238
opened Oct 21, 2025 by
khaled-wsa
Loading…
[CI/Testing] Add basic single node dual batch overlap test
ci/build
v1
#27235
opened Oct 21, 2025 by
LucasWilkinson
Loading…
[ResponseAPI] Fix mcp tool type extraction
frontend
gpt-oss
Related to GPT-OSS models
#27234
opened Oct 21, 2025 by
Jialin
Loading…
3 of 5 tasks
[Bugfix] Ensure calculated KV scales are applied in attention.
v1
#27232
opened Oct 21, 2025 by
adabeyta
Loading…
Adds runai distributed streamer
ci/build
documentation
Improvements or additions to documentation
rocm
Related to AMD ROCm
#27230
opened Oct 20, 2025 by
bbartels
Loading…
5 tasks
[Feature] Batch Invariant for R1 TP 8 on Blackwell
ready
ONLY add when PR is ready to merge/full CI is needed
#27229
opened Oct 20, 2025 by
yewentao256
Loading…
[ROCm][MLA] Support block-size > 1 for AITER MLA backend
rocm
Related to AMD ROCm
v1
#27224
opened Oct 20, 2025 by
ganyi1996ppo
Loading…
5 tasks
Flashinfer_CUTLASS_MOE fuses quantization for TP
#27223
opened Oct 20, 2025 by
wenscarl
Loading…
5 tasks
ARM64 CUDA 12.9 wheels built and uploaded to index incorrectly
ci/build
#27221
opened Oct 20, 2025 by
Gregory-Pereira
Loading…
[Bugfix] Fix dp_chunking enablement logic in FusedMoE layer
#27220
opened Oct 20, 2025 by
alexm-redhat
Loading…
[CORE] Support Prefix Caching with Prompt Embeds
documentation
Improvements or additions to documentation
v1
#27219
opened Oct 20, 2025 by
qthequartermasterman
Loading…
3 of 5 tasks
[Backend][WIP] Integrate MPK (Mirage) compiler as an experimental execution backend to vLLM
v1
#27218
opened Oct 20, 2025 by
NorthmanPKU
•
Draft
1 of 8 tasks
Previous Next
ProTip!
What’s not been updated in a month: updated:<2025-09-21.