Issues · huggingface/trl

[Project] Training Agents with GRPO

#2723 opened Jan 31, 2025 by August-murr

Open 4

[Tracking issue] Integrate native liger-kernel losses

#2495 opened Dec 17, 2024 by qgallouedec

Open 5

[Tracking issue] Wrong loss scaling when accumulating gradient

#2617 opened Jan 23, 2025 by qgallouedec

Open

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

208 Open 1,250 Closed

✨ enhancement 🏋 SFT

#2821 opened Feb 10, 2025 by August-murr

✨ enhancement 🏋 GRPO

#2820 opened Feb 10, 2025 by haoxiongliu

🏋 GRPO ❓ question

#2809 opened Feb 9, 2025 by August-murr

🐛 bug 🏋 GRPO

#2805 opened Feb 8, 2025 by mdy666

5 tasks done

🐛 bug 🏋 GRPO

#2803 opened Feb 8, 2025 by macheng6

🐛 bug 🏋 GRPO

#2802 opened Feb 8, 2025 by yynil

5 tasks done

🐛 bug ⚡ PEFT 🏋 SFT

#2819 opened Feb 7, 2025 by ibitec7

🐛 bug 🏋 GRPO

#2798 opened Feb 7, 2025 by kawamou

5 tasks done

⚡accelerate 🐛 bug 🏋 GRPO

#2796 opened Feb 7, 2025 by cuong-dyania

5 tasks done

IndexError: pop from an empty deque while using PPO and downgrading accelerate to 0.34.2 ⚡accelerate 🐛 bug ⚡ PEFT 🏋 PPO

#2795 opened Feb 7, 2025 by JohnConnor123

5 tasks done

🐛 bug 🏋 GRPO ⏳ needs more info

#2791 opened Feb 7, 2025 by zhengqigao

5 tasks done

⚡accelerate 🐛 bug ⚡ PEFT

#2788 opened Feb 6, 2025 by Superskyyy

5 tasks done

🐛 bug 🚀 deepspeed

#2787 opened Feb 6, 2025 by zaddy6

5 tasks done

⚡accelerate 🏋 DPO ✨ enhancement 🏋 GRPO

#2786 opened Feb 6, 2025 by tchang1997

⚡accelerate 🐛 bug ⚡ PEFT

#2781 opened Feb 6, 2025 by zhourunlong

5 tasks done

🐛 bug ⚡ PEFT

#2780 opened Feb 6, 2025 by zhangguoxin1

5 tasks done

✨ enhancement 🏋 GRPO

#2775 opened Feb 5, 2025 by cfpark00

ProTip! Adding no:label will show everything without a label.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Issues: huggingface/trl

Issues list