Description
Hi! We are developing a novel training framework for Reinforcement Learning (RL), following TorchTitan. Recently, we added a feature that supports training directly from Hugging Face (HF) models, loading safetensors online in a sharded fashion. This can substantially cut down the cost of adapting a new model: all you have to do is implement the function that applies parallelism to the model.
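For concreteness, a rough sketch of what that user-implemented function could look like is below. The function name `apply_parallelism` and the assumed calling convention are hypothetical placeholders for our framework's API; the sharding calls themselves use the real FSDP2 API (`fully_shard` from `torch.distributed.fsdp`, available in recent PyTorch), and the module layout follows HF's `LlamaForCausalLM` (`model.model.layers`).

```python
# Hypothetical sketch: the single function a user implements to adapt a new
# HF model. The framework (names assumed, not real APIs) loads the HF model
# with sharded safetensors and then calls this hook with a device mesh.
import torch
from torch.distributed.device_mesh import DeviceMesh
from torch.distributed.fsdp import fully_shard  # FSDP2, PyTorch >= 2.6


def apply_parallelism(model: torch.nn.Module, mesh: DeviceMesh) -> torch.nn.Module:
    # Shard each transformer block individually so parameters can be
    # gathered/freed per layer during forward/backward...
    for block in model.model.layers:  # HF Llama layout: LlamaForCausalLM.model.layers
        fully_shard(block, mesh=mesh)
    # ...then shard the root module (embeddings, final norm, LM head).
    fully_shard(model, mesh=mesh)
    return model
```

Everything model-agnostic (checkpoint streaming, optimizer setup, the training loop) would stay in the framework; only this hook changes per model family.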
Given this, I wonder whether a PR with the relevant code and a training example for Hugging Face's Llama model would be welcome. I think this addition would benefit many people in the community.
By the way, in my testing the HF Llama model achieves tokens-per-second (TPS) throughput competitive with the Llama implementation native to TorchTitan.