[pull] master from comfyanonymous:master by pull[bot] · Pull Request #298 · code/app-python-comfyui

pull · 2025-09-17T04:27:04Z

See Commits and Changes for more details.

Created by pull[bot] (v2.0.0-alpha.4)

Can you help keep this open source service alive? 💖 Please sponsor : )

* flux: Do the xq and xk ropes one at a time This was doing independendent interleaved tensor math on the q and k tensors, leading to the holding of more than the minimum intermediates in VRAM. On a bad day, it would VRAM OOM on xk intermediates. Do everything q and then everything k, so torch can garbage collect all of qs intermediates before k allocates its intermediates. This reduces peak VRAM usage for some WAN2.2 inferences (at least). * wan: Optimize qkv intermediates on attention As commented. The former logic computed independent pieces of QKV in parallel which help more inference intermediates in VRAM spiking VRAM usage. Fully roping Q and garbage collecting the intermediates before touching K reduces the peak inference VRAM usage.

rattus128 and others added 2 commits September 16, 2025 19:21

Support the HuMo model. (#9903)

9288c78

pull bot locked and limited conversation to collaborators Sep 17, 2025

pull bot added the ⤵️ pull label Sep 17, 2025

pull bot merged commit 9288c78 into code:master Sep 17, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[pull] master from comfyanonymous:master#298

[pull] master from comfyanonymous:master#298
pull[bot] merged 2 commits intocode:masterfrom
Comfy-Org:master

pull bot commented Sep 17, 2025 •

edited

Loading

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Comments

Conversation

pull bot commented Sep 17, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Comments

pull bot commented Sep 17, 2025 •

edited

Loading