Skip to content

[Pytorch] Implement fp32 accumulation for attention with context parallel in both forward and backward pass.#821

Open
Yuxin-CV wants to merge 2 commits intoNVIDIA:mainfrom Yuxin-CV:main