Training Collapses with train_text_to_image.py #7845
Unanswered
nighting0le01 asked this question in Q&A
Replies: 1 comment
-
It just looks to me like a very high learning rate for a batch size of 64! Are you using the --scale_lr option? Have you tried a lower value like 1e-6 or 4e-7?
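To make the learning-rate concern concrete, here is a minimal sketch of linear learning-rate scaling, which is how I understand the script's --scale_lr flag to behave (base rate multiplied by gradient accumulation steps, per-device batch size, and number of processes). The function name and the single-process, no-accumulation setup are my assumptions; the thread does not say whether 64 is the per-device or the total batch size, or whether the flag was enabled.

```python
def effective_learning_rate(
    base_lr,
    train_batch_size,
    gradient_accumulation_steps=1,
    num_processes=1,
    scale_lr=False,
):
    """Return the learning rate the optimizer would receive.

    Hypothetical helper: mirrors linear scaling of the base rate by
    accumulation steps, per-device batch size, and process count.
    """
    if not scale_lr:
        return base_lr
    return base_lr * gradient_accumulation_steps * train_batch_size * num_processes


# Compare the rate from the question with the lower values suggested above,
# assuming a per-device batch size of 64 on a single process.
for base_lr in (1e-5, 1e-6, 4e-7):
    scaled = effective_learning_rate(base_lr, train_batch_size=64, scale_lr=True)
    print(f"base lr {base_lr:.0e} -> effective lr {scaled:.2e}")
```

If scaling is on under these assumptions, a base rate of 1e-5 becomes 64 times larger, which would be an aggressive rate for fine-tuning a pretrained model.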
-
If I use a larger dataset (400,000k image-text pairs) with the provided training script, I see a weird collapse on all prompts. I haven't even finished 1 epoch, and I train with a learning rate of 1e-5. I start with something like this from pretrained SD1.5:
[image]
[image]
but it transitions to collapsed outputs after fewer than 200 steps at batch size 64 (essentially without even seeing the full dataset!).
Is there a bug in train_text_to_image.py?
@sayakpaul, have you seen a collapse like this before?
Training on a small subset of around 200 images does not lead to collapse, but it gives very blurry results.
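For scale, a rough sketch of how much data 200 steps at batch size 64 actually covers. The helper names are hypothetical, and the dataset size of 400,000 is only one assumed reading of the figure quoted above; substitute the real pair count.

```python
def images_seen(steps, batch_size):
    """Number of training samples consumed after a given number of steps."""
    return steps * batch_size


def epoch_fraction(steps, batch_size, dataset_size):
    """Fraction of one pass over the dataset completed so far."""
    return images_seen(steps, batch_size) / dataset_size


seen = images_seen(steps=200, batch_size=64)
# dataset_size is an assumption for illustration, not a confirmed number.
frac = epoch_fraction(steps=200, batch_size=64, dataset_size=400_000)
print(f"images seen: {seen}, fraction of one epoch: {frac:.1%}")
```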