Skip to content

Pull requests: PaddlePaddle/PaddleFormers

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

【GLM】fix ep callback and pp moe_subbatch_token_num
#2687 opened Sep 25, 2025 by danleifeng Loading…
2 tasks
Feat/model unittest ci action contributor
#2683 opened Sep 25, 2025 by huanghengheng Loading…
2 tasks
GLM4.5 support sp + moe aux loss contributor
#2682 opened Sep 24, 2025 by WYB27 Loading…
[DSv3]: Add Tokenizer Config for DSv3
#2650 opened Sep 22, 2025 by hushenwei2000 Loading…
Glm4Moe: fix attn_mask && fused_loss contributor
#2648 opened Sep 20, 2025 by WYB27 Loading…
Update CODE_OF_CONDUCT.md contributor
#2636 opened Sep 18, 2025 by Jagdish2810 Draft
2 tasks done
Glm4moe fix tp+ep+sp contributor
#2621 opened Sep 17, 2025 by WYB27 Loading…
[dsv3]Move dsv3 model from paddlenlp-dsv3-sft
#2593 opened Sep 11, 2025 by Difers Loading…
1 of 7 tasks
【Bug】Fix attn_mask_startend_row_indices shape mismatch
#2564 opened Sep 8, 2025 by cheng221 Loading…
2 tasks
【FlexCP】add Flexcp for trainer
#2541 opened Sep 4, 2025 by xiaoguoguo626807 Loading…
2 tasks
feat(dsv3):Runnable N1C8 configs
#2525 opened Sep 1, 2025 by hushenwei2000 Loading…
feat(dsv3): add dsv3 fast pretrain into paddleformers
#2524 opened Aug 31, 2025 by chen2016013 Loading…
2 tasks
feat(dsv3):Runnable N1C8 configs
#2523 opened Aug 31, 2025 by chen2016013 Loading…
2 tasks
add moe
#2510 opened Aug 28, 2025 by a31413510 Loading…
fix bug support download ernie model contributor
#2509 opened Aug 28, 2025 by fjjF77 Loading…
fix typos contributor
#2500 opened Aug 28, 2025 by co63oc Loading…
2 tasks
feat(dsv3): add dsv3 fast pretrain into paddleformers
#2496 opened Aug 27, 2025 by chen2016013 Loading…
2 tasks
Update lora layer source contributor
#2489 opened Aug 27, 2025 by emmanuel-ferdman Loading…
1 of 2 tasks
Merge dsv3 tainer part
#2487 opened Aug 27, 2025 by hushenwei2000 Draft
change deepseekv2 model
#2486 opened Aug 26, 2025 by chen2016013 Loading…
2 tasks
add pre_train entrance
#2483 opened Aug 26, 2025 by chen2016013 Loading…
2 tasks
Support GPT-OSS contributor
#2478 opened Aug 25, 2025 by WYB27 Loading…
ProTip! Filter pull requests by the default branch with base:develop.