Pull requests: vllm-project/vllm
Remove everything scheduled for removal in v0.10.0
documentation
Improvements or additions to documentation
frontend
tool-calling
#20979
opened Jul 15, 2025 by
hmellor
Add full serve CLI reference back to docs
ci/build
documentation
Improvements or additions to documentation
frontend
#20978
opened Jul 15, 2025 by
hmellor
[Frontend] OpenAI Responses API supports input image
frontend
ready
ONLY add when PR is ready to merge/full CI is needed
v1
#20975
opened Jul 15, 2025 by
chaunceyjiang
3 of 4 tasks
fix: Handle unsupported message fields in tool calling
#20973
opened Jul 15, 2025 by
ejrtks1020
Voxtral
ci/build
documentation
Improvements or additions to documentation
frontend
new-model
Requests to new models
[Deprecation] Remove TokenizerPoolConfig
documentation
Improvements or additions to documentation
ready
ONLY add when PR is ready to merge/full CI is needed
add support for qwen3 moe model EPLB
qwen
Related to Qwen models
#20967
opened Jul 15, 2025 by
hsliuustc
2 of 4 tasks
Fix tool_calls to fit with openai client
frontend
#20966
opened Jul 15, 2025 by
relic-yuexi
3 of 4 tasks
Add add_logger API to AsyncLLM
ci/build
v1
#20953
opened Jul 14, 2025 by
eicherseiji
•
Draft
3 of 4 tasks
[Reasoning] Add thinking budget support
deepseek
Related to DeepSeek models
frontend
needs-rebase
v1
#20949
opened Jul 14, 2025 by
rishitdholakia13
•
Draft
4 tasks
[doc] Add more details for Ray-based DP
documentation
Improvements or additions to documentation
#20948
opened Jul 14, 2025 by
ruisearch42
4 tasks
Add Dockerfile argument for VLLM_USE_PRECOMPILED environment
ci/build
documentation
Improvements or additions to documentation
#20943
opened Jul 14, 2025 by
dougbtv
1 task done
[Bugfix] Correct per_act_token in CompressedTensorsW8A8Fp8MoECutlassM…
#20937
opened Jul 14, 2025 by
minosfuture
[Misc] Qwen MoE model supports LoRA
documentation
Improvements or additions to documentation
qwen
Related to Qwen models
#20932
opened Jul 14, 2025 by
jeejeelee
4 tasks
[MODEL] New model support for naver-hyperclovax/HyperCLOVAX-SEED-Vision-Instruct-3B
ci/build
documentation
Improvements or additions to documentation
new-model
Requests to new models
#20931
opened Jul 14, 2025 by
bigshanedogg
6 tasks done
[Chore] Removal V0 structured outputs
ready
ONLY add when PR is ready to merge/full CI is needed
structured-output
[Model] Consolidate pooler implementations
ready
ONLY add when PR is ready to merge/full CI is needed
#20927
opened Jul 14, 2025 by
DarkLight1337
1 of 4 tasks
[Model] Add ModelConfig class for GraniteMoeHybrid to override default max_seq_len_to_capture
#20923
opened Jul 14, 2025 by
tdoublep
3 of 4 tasks
[Bugfix] Fix the FP8 kv cache accuracy issue in flashinfer TRT-LLM backend
v1
#20920
opened Jul 14, 2025 by
elvischenv
•
Draft
3 of 4 tasks
[CI] update typos config for CI pre-commit and fix some spells
ci/build
documentation
Improvements or additions to documentation
frontend
performance
Performance-related issues
ready
ONLY add when PR is ready to merge/full CI is needed
tpu
Related to Google TPUs
v1
#20919
opened Jul 14, 2025 by
panpan0000
3 tasks done