Possible artificing happening for long prompt and combining prompts #6930
Replies: 4 comments
-
Hi, diffusers doesn’t do either of them. By default, Diffusers doesn’t use prompt weighting or allow long prompts is up to the user how to enable this. One popular library to enable prompt weighting is compel so you can ask there. Another good solution is the community pipeline lpw_stable_diffusion_xl That pipeline doesn't use |
Beta Was this translation helpful? Give feedback.
-
The StableDiffusionLongPromptWeightingPipeline does similar though just not on the fragment level but for the whole embedding. |
Beta Was this translation helpful? Give feedback.
-
This is what I see in the line 356 (edit: I found you were referring to the SD 1.5 pipeline): Still, It will always have to do something similar since you need to do some kind of strategy to fit a longer prompt in 75 tokens but going with the article it seems that the problem is with this operation done in automatic1111:
Which is not done in SDXL version of the pipeline since this seems to be a problem with only SDXL and not SD 1.5. I must admit I haven't gone to deep into this since I don't understand why would anyone try to play the lottery with SDXL and prompt with 300 tags to generate something when SDXL understands if you just write what you want. |
Beta Was this translation helpful? Give feedback.
-
Ah, seems you are right. Didn't expect this part to change between sd1 and xl pipelines as it alters outcomes a lot. Just remembered it following auto1111 when I was looking at it. Whether one should be 75-token-frugal with prompts is a philosophical question. Great start for a discussion but I'll see myself out on that. :) |
Beta Was this translation helpful? Give feedback.
-
Hello diffusers team,
Seen this link by hako-mikan: AUTOMATIC1111/stable-diffusion-webui#14874 and their civitai report: Analyzing and Addressing Artifacts in Web-UI with XL-models.
In summary, does diffusers follow a similar generation process like A1111 webui where there is a normalization process happening after the prompt emphasis is calculated? Additionally this seems to be exclusive for SDXL (atleast for A1111 webui), but unsure of the effect it has on the diffusers platform, because the possible different method.
Credits to @hako-mikan for the interesting find!
Beta Was this translation helpful? Give feedback.
All reactions