Skip to content

Pull requests: NVIDIA/TensorRT-LLM

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Sort

Pull requests list

pick Boyang's change to fix multi streams
#3880 opened Apr 26, 2025 by litaotju Loading…
feat: Add py_state member in LlmRequest class
#3872 opened Apr 25, 2025 by QiJune Loading…
fix: [nvbugs/5066257] serialization improvments
#3869 opened Apr 25, 2025 by coldwaterq Loading…
feat: add relaxed acceptance for DS
#3865 opened Apr 25, 2025 by yweng0828 Loading…
tests: skip writing prepare_dataset output to logs
#3864 opened Apr 25, 2025 by ruodil Loading…
fix: fix bug of deepseek gropu_size setting
#3860 opened Apr 25, 2025 by byshiue Loading…
feat: Add multimodal embedding field in LlmRequest
#3855 opened Apr 25, 2025 by katec846 Loading…
feat: enable PP on Llama4
#3854 opened Apr 25, 2025 by v-shobhit Loading…
feat:Low Precision Allreduce for PCIe based GPU
#3851 opened Apr 25, 2025 by kanghui0204 Loading…
feat: AutoDeploy fp8 quantization support for bmm
#3849 opened Apr 25, 2025 by meenchen Loading…
temp
#3844 opened Apr 24, 2025 by netanel-haber Draft
ProTip! Find all pull requests that aren't related to any open issues with -linked:issue.