Add efficient Cross-Entropy by cuda kernel to accelerate training speed and reduce cross-entropy memory usage during training. #995

Open
cb521 wants to merge 18 commits into NVIDIA:main from cb521:add_efficient_cross_entropy

Commits

Commits on Jun 14, 2024

Commits on Jun 16, 2024

Commits on Jul 7, 2024

Commits on Jul 8, 2024

Commits on Jul 24, 2024

Commits on Jul 25, 2024

Commits on Jul 26, 2024

Commits on Jul 29, 2024

Commits on Jul 30, 2024

Commits on Aug 4, 2024

Commits on Aug 5, 2024