Skip to content

Conversation

@nolan4
Copy link

@nolan4 nolan4 commented Oct 25, 2025

Pull Request: Add Entity-Level Image Generation (EliGen) for Qwen Image

Summary

This update implements Entity-Level Image Generation (EliGen) for the Qwen Image model, allowing region-specific prompts through spatial masks. The feature provides fine-grained control over image generation by applying separate attention masks for each entity.

Key Features
• Spatial attention masking with isolated entity prompts
• Automatic mask resizing to match latent dimensions
• RoPE embedding implementation aligned with DiffSynth Studio
• Support for batch_size > 1
• Backward compatible with standard Qwen Image workflows

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

Core Core team dependency

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants