Skip to content

Pull requests: NVIDIA/TransformerEngine

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Sort

Pull requests list

Generation tutorial for Gemma model
#829 opened May 1, 2024 by pggPL Review required
8 of 11 tasks
Fp8 model init factory
#880 opened May 30, 2024 by sudhakarsingh27 Draft
[pre-commit.ci] pre-commit suggestions wontfix This will not be worked on
#979 opened Jul 2, 2024 by pre-commit-ci bot Draft
[JAX] Sharding Utils
#1003 opened Jul 9, 2024 by mingxu1067 Draft
8 of 13 tasks
Flash attention support softcap.
#1013 opened Jul 14, 2024 by Lzhang-hub Loading…
7 tasks
Change condition for ub tp overlap.
#1055 opened Jul 29, 2024 by Victarry Loading…
1 of 13 tasks
Use pyproject.toml to specify build requirements build Build system
#1061 opened Jul 30, 2024 by ksivaman Loading…
6 of 13 tasks
Add high_precision_init_val to model params when using fp8_model_init
#1121 opened Aug 19, 2024 by kunlunl Loading…
8 of 13 tasks
Fix param input order for cudagraph bug Something isn't working
#1138 opened Aug 27, 2024 by yifeis-nv Loading…
4 of 13 tasks
Norms Refractor
#1140 opened Aug 27, 2024 by phu0ngng Draft
5 of 13 tasks
[PyTorch] Avoid saving fp8_tensors in certain scenarios
#1143 opened Aug 28, 2024 by cyanguwa Loading…
8 of 13 tasks
Fix autocast deprecation warning.
#1167 opened Sep 6, 2024 by jondeaton Loading…
Draft: Use fused push_send_recv kernel for TP AG and RS overlaps
#1200 opened Sep 24, 2024 by erhoo82 Loading…
13 tasks
[PyTorch] Improve CP P2P efficiency
#1208 opened Sep 26, 2024 by yenchenlin Loading…
1 of 6 tasks
Save CUDA Graph memory by reusing input and output tensors
#1234 opened Oct 9, 2024 by buptzyb Loading…
5 of 13 tasks
fused out correction in CP
#1248 opened Oct 14, 2024 by xiaoyao0115 Loading…
12 tasks
Draft: reduce cudagraph mem via preoallcations
#1253 opened Oct 15, 2024 by JimmyZhang12 Loading…
13 tasks
attention_mask fill with -inf for UnfusedDotProductAttention
#1268 opened Oct 18, 2024 by Agoniii Loading…
1 of 13 tasks
ProTip! Type g p on any issue or pull request to go back to the pull request listing page.