Traing problem #81

LKELN · 2024-10-19T14:15:04Z

When I'm using your model, I've noticed that your model converts to float16, but training this way results in a loss=nan, and when I train with mixed-precision floats (the model is loaded with parameters without converting to float16) it also results in a loss =nan. Can you provide some help?

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Traing problem #81

Traing problem #81

LKELN commented Oct 19, 2024

Traing problem #81

Traing problem #81

Comments

LKELN commented Oct 19, 2024