-
Notifications
You must be signed in to change notification settings - Fork 3.1k
Pull requests: PaddlePaddle/PaddleNLP
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
[cherry-pick] add zcc_ema_loss_threshold args to avoid merging models…
contributor
#11053
opened Sep 2, 2025 by
sjy1203
Loading…
2 tasks
[fea] support dp-moe for zcc and global_expert_id
contributor
#11050
opened Sep 2, 2025 by
bo-ke
Loading…
2 tasks
add script for training gpt3 on XPU machine using flagcx as comm backend
contributor
#11014
opened Aug 26, 2025 by
mikethegoblin
Loading…
2 tasks
[NOT MERGE]Pr adapt flex checkpoint
contributor
#10996
opened Aug 25, 2025 by
zty-king
Loading…
2 tasks
[BUG]: fix the bug in PretrainedModel.recompute_disable()
contributor
#10988
opened Aug 21, 2025 by
hongjx175
Loading…
2 tasks
recompute support offload tensor
#10981
opened Aug 21, 2025 by
blacksheep-Aristotle
Loading…
2 tasks
moe_layer support fine_grained_forward
#10980
opened Aug 21, 2025 by
blacksheep-Aristotle
Loading…
2 tasks
update expert parallel init logic
#10966
opened Aug 18, 2025 by
blacksheep-Aristotle
Loading…
2 tasks
Previous Next
ProTip!
Filter pull requests by the default branch with base:develop.