You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
* wan sound to video
* fix some problem
* more consistent style and update moviepy
* get rid of decord
* style improvement
* finish todo
* seems that they don't know what is binary search and it's not appropriate to use binary search here
* maybe fix attn problem
* move data to lfs
* test cases
* improve example
* basic doc, leave detailed doc to future
* fix an error in example
* fix some naming issue
* fix bbox transform issue
* seems that regex match is notoriously slow
* seems that regex match is notoriously slow
* no need to split context again
* add rope explanation
* fix pose image permute issue
* amend accord. to pr 157
* remove some comments and unnecessary imports
* remove legacy weight_norm
* remove unused librosa
* fix bufferreader not pickle-able error
* remove non-used import
* revert parallel.py, but we do need to set PARALLEL_FWD_TIMEOUT_SEC to be much longer to avoid not receiving the output after multi-gpu inference complete
* fix testcases
* edit accord. to comments
---------
Copy file name to clipboardExpand all lines: README.md
+1Lines changed: 1 addition & 0 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -23,6 +23,7 @@ and offloading strategies, enabling loading of larger diffusion models (e.g., Fl
23
23
24
24
## News
25
25
26
+
-**[v0.6.0](https://github.com/modelscope/DiffSynth-Engine/releases/tag/v0.6.0)** | **September 9, 2025**:  Supports [Wan2.2-S2V](https://modelscope.cn/models/Wan-AI/Wan2.2-S2V-14B), a video generation model designed for audio-driven cinematic video generation
26
27
-**[v0.5.0](https://github.com/modelscope/DiffSynth-Engine/releases/tag/v0.5.0)** | **August 27, 2025**:  Supports [Qwen-Image-Edit](https://modelscope.cn/models/Qwen/Qwen-Image-Edit), the image editing version of Qwen-Image, enabling semantic/appearance visual editing, and precise text editing
27
28
-**[v0.4.1](https://github.com/modelscope/DiffSynth-Engine/releases/tag/v0.4.1)** | **August 4, 2025**:  Supports [Qwen-Image](https://www.modelscope.cn/models/Qwen/Qwen-Image), an image generation model excels at complex text rendering and creating images in a wide range of artistic styles
0 commit comments