Pull requests: NVIDIA/TransformerEngine

Pull requests list

[PyTorch] Debug NeMo distributed optimizer (labels: 2.0.0, bug)
#1444 opened Jan 31, 2025 by timmoon10
5 of 13 tasks
Support store_param_remainders feature from Apex in TE Fused Adam (labels: enhancement)
#1443 opened Jan 30, 2025 by timmoon10
6 of 13 tasks
Rename block scaling recipe
#1442 opened Jan 30, 2025 by ksivaman
4 of 14 tasks
[Pytorch] Nvidia-DLFramework-Inspect support
#1441 opened Jan 30, 2025 by pggPL Draft
8 of 13 tasks
[common] Generalized MXFP8 fused kernels w.r.t. input tensor dimensions (labels: 2.0.0, enhancement)
#1437 opened Jan 29, 2025 by Oleg-Goncharov
8 of 13 tasks
[PyTorch] Revert tensor dimensions in MXFP8 tests (labels: 2.0.0, bug, testing)
#1435 opened Jan 29, 2025 by timmoon10
7 of 14 tasks
Add test for Lightning Thunder integration (labels: testing)
#1433 opened Jan 28, 2025 by timmoon10 Draft
6 of 14 tasks
Introduce NVSHMEM based communication API for pytorch
#1430 opened Jan 28, 2025 by gdengk
13 tasks
Adding remove_caches API to Float8Tensor class
#1425 opened Jan 27, 2025 by youngeunkwon0405
13 tasks
Initial Support Blackwell Build
#1418 opened Jan 21, 2025 by johnnynunez
9 of 13 tasks
[PyTorch] cuBLAS workspace size fix for TP overlap unit test (labels: bug)
#1415 opened Jan 17, 2025 by denera
8 of 13 tasks
Fix Linear Weight Initialization in the PaddlePaddle Implementation
#1413 opened Jan 17, 2025 by GuoxiaWang
4 of 13 tasks
Better cuBLAS handle management
#1389 opened Jan 2, 2025 by ptrendx
8 of 13 tasks
Update README.rst
#1385 opened Dec 23, 2024 by sbhavani
1 of 6 tasks
Don't touch nor send messages to the root logger.
#1380 opened Dec 19, 2024 by sagostinho-nvidia
4 of 13 tasks
Add paged attention support
#1355 opened Dec 4, 2024 by cyanguwa
8 of 13 tasks
[PyTorch] Bugfix for wgrad bulk overlap conflict when dgrad overlap is reduce-scatter (labels: bug)
#1341 opened Nov 18, 2024 by denera
6 of 13 tasks
[C/JAX] Comm+GEMM Overlap API for TE/JAX (labels: enhancement, jax)
#1337 opened Nov 15, 2024 by denera Draft
3 of 13 tasks
Build with uv instead of just pip
#1324 opened Nov 8, 2024 by jennifgcrl
5 of 13 tasks
TP communication overlap: enable the overlap between GEMM chunk at Ho…
#1311 opened Nov 4, 2024 by erhoo82
1 of 13 tasks
[PyTorch] Add heuristics for initializing FP8 params (labels: enhancement)
#1300 opened Oct 30, 2024 by timmoon10
8 of 13 tasks