-
Notifications
You must be signed in to change notification settings - Fork 59
Pull requests: vllm-project/vllm-gaudi
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
[New Feature] Add cpu core pinning to vllm-server to improve performance.
#502
opened Oct 29, 2025 by
louie-tsai
Loading…
Port: Fix bucketing of query + num_blocks neighbor expansion #350, #355
#500
opened Oct 29, 2025 by
iboiko-habana
Loading…
Documentation updates - part 1
documentation
Improvements or additions to documentation
skip-gaudi-tests
#493
opened Oct 28, 2025 by
mhelf-intel
Loading…
Port "[Bugfix] Fix bucketing of query + num_blocks neighbor expansion" #350
#482
opened Oct 27, 2025 by
iboiko-habana
Loading…
[Attention Metadata Overhaul 1/N] Add per-layer attention metadata
#475
opened Oct 24, 2025 by
kzawora-intel
•
Draft
Add tests for custom operator implementation correctness
#457
opened Oct 23, 2025 by
Kacper-Pietkun
Loading…
Automatically adjust VLLM_DECODE_BLOCK_BUCKET_MIN if it exceeds max_blocks
#432
opened Oct 20, 2025 by
dsocek
Loading…
Previous Next
ProTip!
Find all pull requests that aren't related to any open issues with -linked:issue.