-
Notifications
You must be signed in to change notification settings - Fork 1.6k
Pull requests: NVIDIA/TensorRT-LLM
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
doc: Add Paged Attention, IFB, and Request Scheduling page
#6532
opened Aug 1, 2025 by
kaiyux
Loading…
doc:[AutoDeploy]: Move AutoDeploy README.md to torch docs
#6528
opened Jul 31, 2025 by
Fridah-nv
Loading…
fix: serialize the window_size in the kv event
Community want to contribute
PRs initiated from Community
#6526
opened Jul 31, 2025 by
richardhuo-nv
Loading…
fix: correct scaling factor calculation in tests/unittest/trt/functional/test_fp4_gemm.py
Community want to contribute
PRs initiated from Community
#6523
opened Jul 31, 2025 by
muse-coder
Loading…
[None][infra] Pin the version for triton to 3.3.1 (#6508)
#6519
opened Jul 31, 2025 by
chzblych
Loading…
[TRTLLM-4279] fix: Add a protection test for checking trtllm custom ops
#6515
opened Jul 31, 2025 by
yali-arch
Loading…
[6683][feat] Support LoRA reload CPU cache evicted adapter
#6510
opened Jul 31, 2025 by
amitz-nv
Loading…
[Draft - WIP] Upgrade dependencies version to avoid security vulnerability
#6506
opened Jul 31, 2025 by
yibinl-nvidia
•
Draft
[https://nvbugspro.nvidia.com/bug/5412562] Allocate MoE workspace only when necessary
#6502
opened Jul 31, 2025 by
nv-yilinf
Loading…
fix: Update cuda memory alignment as 256 to match cublas12.9.1
#6501
opened Jul 31, 2025 by
Wanli-Jiang
•
Draft
feat: Add support for fused gate_up_proj scales for FP8 blockwise
#6496
opened Jul 30, 2025 by
achartier
Loading…
Update Documentation link to point to docs instead of docs source code
Community want to contribute
PRs initiated from Community
#6495
opened Jul 30, 2025 by
asrivas
Loading…
[TRTLLM-6812][feat] Add standardized GitHub issue templates and disable blank issues
#6494
opened Jul 30, 2025 by
venkywonka
Loading…
[feat] add support for Eclairv2 model - cherry-pick changes
#6493
opened Jul 30, 2025 by
yibinl-nvidia
Loading…
Previous Next
ProTip!
Find all pull requests that aren't related to any open issues with -linked:issue.