Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Is it possible to make the model training log public? #259

Open
asirgogogo opened this issue Feb 10, 2025 · 5 comments
Open

Is it possible to make the model training log public? #259

asirgogogo opened this issue Feb 10, 2025 · 5 comments

Comments

@asirgogogo
Copy link

model:https://huggingface.co/Dongwei/Qwen-2.5-7B_Base_Math_smalllr

Image

@Some-random
Copy link
Contributor

I'm not sure how to make my runs/project public, but here is a screenshot, hope it helps!

Image

@asirgogogo
Copy link
Author

@Some-random
I observed that your training step is 68. When I was experimenting, I used the same parameter configuration and the training step number was 468

trl 0.15.0.dev0
transformers 4.49.0.dev0

@merlinarer
Copy link

@Some-random
Hello, how do you evaluate model in yr setting? I use lighteval in this repo and get a very low acc.

achieves 69.4% accuracy on MATH-500, demonstrating a 17%+ improvement over the base model.

@Some-random
Copy link
Contributor

@asirgogogo I'm only training on MATH. You can check the config I used to train this model

@Some-random
Copy link
Contributor

@merlinarer My lighteval version is lighteval @ git+https://github.com/huggingface/lighteval.git@0e462692436e1f0575bdb4c6ef63453ad9bde7d4, hope that helps. I've also seen many issues around mismatched accuracy numbers... Hope the maintainer will fix it soon

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

3 participants