You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
I would also love to have the weights. We have a project that can really benefit from this model, however doing the training by ourselves might be too computationally heavy for us. Also there is no guarantees that we will be able to achieve the same results
Thanks for the great work! The results are really amazing. Will you open-source the model weights and the data to train the policy and reward model ?
The text was updated successfully, but these errors were encountered: