Fix DeepSeek-V3 H100 large scale timeout issue#2401
Fix DeepSeek-V3 H100 large scale timeout issue#2401scsudhakaran wants to merge 1 commit intomainfrom
Conversation
Signed-off-by: Sanju C Sudhakaran <scsudhakaran@nvidia.com>
|
/ok to test 1a8e1d0 |
|
No actionable comments were generated in the recent review. 🎉 📝 WalkthroughWalkthroughThis PR modifies the DeepSeek V3 H100 FP8 SC Large Scale pretrain configuration by adding two parameters: Changes
Estimated code review effort🎯 1 (Trivial) | ⏱️ ~3 minutes Possibly related PRs
Suggested labels
Suggested reviewers
🚥 Pre-merge checks | ✅ 3 | ❌ 2❌ Failed checks (2 warnings)
✅ Passed checks (3 passed)
✏️ Tip: You can configure your own custom pre-merge checks in the settings. ✨ Finishing touches
🧪 Generate unit tests (beta)
Tip Issue Planner is now in beta. Read the docs and try it out! Share your feedback on Discord. Thanks for using CodeRabbit! It's free for OSS, and your support helps us grow. If you like it, consider giving us a shout-out. Comment |
Summary by CodeRabbit