add DPO and SFT of TRL support in Gaudi and example #601
should work with #600
The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.
@regisss do we have a plan to enable TRL on Gaudi? I have enabled PPO, DPO and SFT on my side. If so, I will upload the PRs one by one.
Adding @libinta for comment.
6a016a3 to a2d0927
Can you add a link in each .py file specifying where the file was ported from, and the version as well?
Signed-off-by: Wang, Yi A <[email protected]>
Done. Also upgraded trl to the latest tag, v0.7.6.
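For illustration, a provenance header of the kind requested above could be generated as follows. This is a minimal sketch, not the code in this PR: `provenance_header` is a hypothetical helper, and the repository URL and file path in the usage line are assumptions based on the upstream TRL layout.

```python
def provenance_header(repo_url: str, tag: str, path: str) -> str:
    """Build a comment block recording where a ported file came from.

    Hypothetical helper for illustration only; not part of TRL or this PR.
    """
    # Normalize slashes so the pieces join into a single clean URL.
    source = f"{repo_url.rstrip('/')}/blob/{tag}/{path.lstrip('/')}"
    return "\n".join(
        [
            f"# Ported from: {source}",
            f"# Upstream version: {tag}",
        ]
    )


if __name__ == "__main__":
    # Assumed upstream location; adjust to the file actually ported.
    print(
        provenance_header(
            "https://github.com/huggingface/trl",
            "v0.7.6",
            "trl/trainer/dpo_trainer.py",
        )
    )
```

Pinning the link to a tag rather than a branch keeps the reference stable when upstream moves on.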
Signed-off-by: Wang, Yi A <[email protected]>
* add DPO and SFT of TRL support in Gaudi and example
Signed-off-by: Wang, Yi A <[email protected]>
* upgrade SFTTrainer/DPO trainer and stack_llama_2 example to v0.7.6
Signed-off-by: Wang, Yi A <[email protected]>
---------
Signed-off-by: Wang, Yi A <[email protected]>
What does this PR do?
Fixes # (issue)
Before submitting