How to train a diffusion model from sratch but not from from_pretrained ? #12163
-
I finetuned the pre-trained model of 'stable-diffusion-inpainting' on image inpainting task and all work well, but when I finetuned with the pre-trained model of 'stable-diffusion-v1-4' on image inpainting task, the loss is always NaN. |
Beta Was this translation helpful? Give feedback.
Replies: 1 comment 1 reply
-
Training a diffusion model from scratch, especially for image inpainting, is absolutely possible, but involves careful setup. Since you're seeing Let's go over: 1. Why You're Getting
|
Beta Was this translation helpful? Give feedback.
Training a diffusion model from scratch, especially for image inpainting, is absolutely possible, but involves careful setup. Since you're seeing
NaN
losses after manually modifying a pre-trained model, it’s likely due to input/output mismatches, improper initialization, or numerical instability in training.Let's go over:
1. Why You're Getting
NaN
LossYou said:
That’s a red flag. A few things to verify:
Potential Issues: