Skip to content

Pull requests: huggingface/trl

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

Fix handling of f_divergence_type in DPO
#4171 opened Sep 30, 2025 by albertvillanova Loading…
Fix FA loss
#4170 opened Sep 30, 2025 by qgallouedec Loading…
Updated vLLM integration guide
#4162 opened Sep 29, 2025 by sergiopaniego Loading…
5 tasks
🖨️ Print rich table for messages
#4160 opened Sep 28, 2025 by qgallouedec Loading…
[WIP] Tool call
#4151 opened Sep 26, 2025 by qgallouedec Loading…
5 tasks
🅰️ Remove apex
#4139 opened Sep 24, 2025 by qgallouedec Loading…
👾 Use our own require_bitsandbytes
#4137 opened Sep 24, 2025 by qgallouedec Loading…
[WIP] Make the CI faster
#4127 opened Sep 23, 2025 by qgallouedec Loading…
🎁 RewardTrainer refactor
#4093 opened Sep 15, 2025 by qgallouedec Loading…
feat:add support for 'image_grid_thw'(QwenVL) in DPOTrainer
#4091 opened Sep 15, 2025 by ycma8 Loading…
2 of 5 tasks
Update links to docs in README to latest packaged version
#4084 opened Sep 15, 2025 by sergiopaniego Loading…
5 tasks
Add config_init_kwargs option in GRPOConfig
#4069 opened Sep 12, 2025 by hokuyama0106 Loading…
2 of 5 tasks
ProTip! Follow long discussions with comments:>50.