How to load the SFT fine-tuned model based on the diffusers version. #688

Open
zyhbili opened this issue Jan 27, 2025 · 1 comment
zyhbili commented Jan 27, 2025

Thanks for your great work! The recently released training scripts are quite user-friendly.
I have finished SFT fine-tuning and converted the checkpoints with tools/convert_weight_deepspeed2hf.py.
However, the fine-tuned weights fail to load when I pass the new weight path to --model_path in inference/cli_demo.py.
The cause seems to be that model_index.json is missing from the new path.
What is an efficient and straightforward way to load the fine-tuned model?

zRzRzRzRzRzRzR self-assigned this Jan 28, 2025
@zRzRzRzRzRzRzR
Member

If model_index.json is missing, compare your exported directory with the original model directory to see which files are absent; this file serves as the index for the split safetensors. If your exported model contains the same number of .safetensors files as the original model, you can copy model_index.json directly from the original model.
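
The check-and-copy step described above could be scripted roughly as follows. This is only a sketch using the Python standard library: the helper names `count_safetensors` and `copy_model_index` are hypothetical (they are not part of the repository or of diffusers), and it assumes the two directories you pass in are laid out comparably, so that counting `.safetensors` files recursively in each is a meaningful comparison.

```python
import shutil
from pathlib import Path


def count_safetensors(model_dir):
    """Count .safetensors files anywhere under model_dir (recursive)."""
    return sum(1 for _ in Path(model_dir).rglob("*.safetensors"))


def copy_model_index(original_dir, finetuned_dir, name="model_index.json"):
    """Copy `name` from original_dir to finetuned_dir if it is missing there.

    Hypothetical helper: returns True if the file was copied, False if the
    fine-tuned directory already had it. Raises if the original file is
    absent or if the .safetensors counts differ.
    """
    original_dir, finetuned_dir = Path(original_dir), Path(finetuned_dir)
    src = original_dir / name
    dst = finetuned_dir / name

    if dst.exists():
        return False  # nothing to do, the index is already present

    if not src.is_file():
        raise FileNotFoundError(f"{src} not found in the original model")

    # Only copy when both directories hold the same number of weight files,
    # following the condition given in the reply above.
    n_orig = count_safetensors(original_dir)
    n_new = count_safetensors(finetuned_dir)
    if n_orig != n_new:
        raise ValueError(
            f".safetensors count differs: original={n_orig}, finetuned={n_new}"
        )

    shutil.copy2(src, dst)
    return True
```

For example, `copy_model_index("path/to/original_model", "path/to/finetuned_model")` (both paths are placeholders) would add the missing file, after which the fine-tuned directory can be passed to --model_path in inference/cli_demo.py as before. If the counts differ, the function raises instead of copying, so you can inspect which files are missing first.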
