NVIDIA / TensorRT-LLM Public

Notifications You must be signed in to change notification settings
Fork 1.6k
Star 11.2k

Code
Issues 713
Pull requests 372
Discussions
Actions
Projects 2
Security
Insights

Additional navigation options

Code
Issues
Pull requests
Discussions
Actions
Projects
Security
Insights

Pull requests: NVIDIA/TensorRT-LLM

Labels 44 Milestones 1

New pull request New

372 Open 3,196 Closed

Author

Filter by author

Uh oh!

There was an error while loading. Please reload this page.

Label

Filter by label

Uh oh!

There was an error while loading. Please reload this page.

Use alt + click/return to exclude labels

or ⇧ + click/return for logical OR

Projects

Filter by project

Uh oh!

There was an error while loading. Please reload this page.

Milestones

Filter by milestone

Uh oh!

There was an error while loading. Please reload this page.

Reviews

Filter by reviews

No reviews Review required Approved review Changes requested

Assignee

Filter by who’s assigned

Assigned to nobody

Uh oh!

There was an error while loading. Please reload this page.

Sort

Sort by

Newest Oldest Most commented Least commented Recently updated Least recently updated Best match

Most reactions

Pull requests list

doc: Add Paged Attention, IFB, and Request Scheduling page

#6532 opened Aug 1, 2025 by kaiyux

Loading…

[https://nvbugs/5423962][fix] Fix broken links

#6531 opened Jul 31, 2025 by chenopis

Loading…

doc:[AutoDeploy]: Move AutoDeploy README.md to torch docs

#6528 opened Jul 31, 2025 by Fridah-nv

Loading…

fix: serialize the window_size in the kv event Community want to contribute

PRs initiated from Community

#6526 opened Jul 31, 2025 by richardhuo-nv

Loading…

refactor: Simplify finish reasons handling in DecoderState

#6524 opened Jul 31, 2025 by Funatiq • Draft

fix: correct scaling factor calculation in tests/unittest/trt/functional/test_fp4_gemm.py Community want to contribute

PRs initiated from Community

#6523 opened Jul 31, 2025 by muse-coder

Loading…

[6826][feat] Allow sending more than 2GiB through MPI by using mpi4py.util.pkl5

#6522 opened Jul 31, 2025 by amitz-nv • Draft

Remove expand configuration from mamba2 mixer

#6521 opened Jul 31, 2025 by danielafrimi

Loading…

[AutoDeploy] merge feat/ad-2025-07-22

#6520 opened Jul 31, 2025 by lucaslie

Loading…

[None][infra] Pin the version for triton to 3.3.1 (#6508)

#6519 opened Jul 31, 2025 by chzblych

Loading…

[TRTLLM-4279] fix: Add a protection test for checking trtllm custom ops

#6515 opened Jul 31, 2025 by yali-arch

Loading…

add cute dsl deepgemm fp8 blowise op

#6514 opened Jul 31, 2025 by limin2021

Loading…

refactor: Return the first token ahead of time

#6512 opened Jul 31, 2025 by Shixiaowei02

Loading…

chore: Make example SLURM scripts more parameterized

#6511 opened Jul 31, 2025 by kaiyux

Loading…

[6683][feat] Support LoRA reload CPU cache evicted adapter

#6510 opened Jul 31, 2025 by amitz-nv

Loading…

[Draft - WIP] Upgrade dependencies version to avoid security vulnerability

#6506 opened Jul 31, 2025 by yibinl-nvidia • Draft

[https://nvbugspro.nvidia.com/bug/5412562] Allocate MoE workspace only when necessary

#6502 opened Jul 31, 2025 by nv-yilinf

Loading…

fix: Update cuda memory alignment as 256 to match cublas12.9.1

#6501 opened Jul 31, 2025 by Wanli-Jiang • Draft

Feat/mtp opt 2 lamport allgather

#6500 opened Jul 31, 2025 by ameynaik-hub

Loading…

[fix] Fix DeepSeek w4a8 weight loading

#6498 opened Jul 31, 2025 by jinyangyuan-nvidia

Loading…

Refactoring input prep to allow out-of-tree models

#6497 opened Jul 31, 2025 by rakib-hasan • Draft

feat: Add support for fused gate_up_proj scales for FP8 blockwise

#6496 opened Jul 30, 2025 by achartier

Loading…

Update Documentation link to point to docs instead of docs source code Community want to contribute

PRs initiated from Community

#6495 opened Jul 30, 2025 by asrivas

Loading…

[TRTLLM-6812][feat] Add standardized GitHub issue templates and disable blank issues

#6494 opened Jul 30, 2025 by venkywonka

Loading…

[feat] add support for Eclairv2 model - cherry-pick changes

#6493 opened Jul 30, 2025 by yibinl-nvidia

Loading…

Previous 1 2 3 4 5 … 14 15 Next

Previous Next

ProTip! Find all pull requests that aren't related to any open issues with -linked:issue.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

Uh oh!