- TransformerEngine (Public)
  Forked from NVIDIA/TransformerEngine. A library for accelerating Transformer models on NVIDIA GPUs, including using 8-bit floating point (FP8) precision on Hopper and Ada GPUs, to provide better performance with lower memory utilizatio…
  Language: Python. License: Apache License 2.0. Updated: Jul 11, 2024.
- pkuthss (Public)
  Forked from CasperVector/pkuthss. LaTeX template for dissertations in Peking University.
  Language: TeX. Updated: Apr 25, 2024.
- DeepSpeedExamples (Public)
  Forked from deepspeedai/DeepSpeedExamples. Example models using DeepSpeed.
  Language: Python. License: Apache License 2.0. Updated: Apr 24, 2023.
- Megatron-LM (Public)
  Forked from NVIDIA/Megatron-LM. Ongoing research training transformer language models at scale, including: BERT & GPT-2.
  Language: Python. License: Other. Updated: Oct 5, 2021.
- shufflenetv2-tensorflow2.0 (Public)
  Forked from Zhengtq/shufflenetv2-tensorflow2.0. shufflenet v2 tensorflow2.0 tf2.0 tf-keras
  Language: Python. Updated: Jul 28, 2020.