Pull requests: huggingface/open-r1

Pull requests list

Initial GRPO exps on the Numina dataset
#262 opened Feb 10, 2025 by edbeeching
Weighted reward functions
#213 opened Feb 7, 2025 by zeenolife
fix: easier environment setup; pin trl, transformers
#199 opened Feb 6, 2025 by ctjlewis
[Feat] Adding minimal training for multimodal model
#136 opened Jan 31, 2025 by kcz358
Create data from remote api.
#102 opened Jan 29, 2025 by PoTaTo-Mika
feat: Added reward model according to paper.
#78 opened Jan 27, 2025 by ahmeterdempmk
Add Environment Test Script
#52 opened Jan 26, 2025 by sambhavnoobcoder
Add devcontainer configuration for VS Code
#33 opened Jan 25, 2025 by bhack