[docs] Caching methods #11625
Conversation
The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.
Thanks for looking into it @stevhliu! Just some comments regarding the correctness of the explanations, plus some more technical details (which are fine to skip if you think they place too much burden on the user).
docs/source/en/optimization/cache.md

## FasterCache

[FasterCache](https://huggingface.co/papers/2410.19355) computes and caches attention features at every other timestep instead of directly reusing cached features because it can cause flickering or blurry details in the generated video. The features from the skipped step are calculated from the difference between the adjacent cached features.
Suggested change: [FasterCache](https://huggingface.co/papers/2410.19355) caches and reuses attention features in a similar manner to PAB, since output differences between successive timesteps of the generation process are small. Additionally, when sampling with classifier-free guidance (commonly used in most base models), FasterCache may skip the unconditional branch prediction entirely and estimate it from the conditional branch prediction, if there is significant redundancy in the predicted latent outputs between successive timesteps.
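For readers who want to see what this looks like in practice, here is a minimal sketch of enabling FasterCache on a video pipeline. The CogVideoX checkpoint and the specific skip ranges and weights below are illustrative assumptions, not prescriptive values; they should be tuned per model:

```python
import torch
from diffusers import CogVideoXPipeline, FasterCacheConfig

pipe = CogVideoXPipeline.from_pretrained("THUDM/CogVideoX-5b", torch_dtype=torch.bfloat16)
pipe.to("cuda")

config = FasterCacheConfig(
    # recompute spatial attention every 2 steps; reuse cached features in between
    spatial_attention_block_skip_range=2,
    # only apply attention caching inside this timestep window
    spatial_attention_timestep_skip_range=(-1, 681),
    current_timestep_callback=lambda: pipe.current_timestep,
    # weight applied when combining cached features (illustrative value)
    attention_weight_callback=lambda _: 0.3,
    # skip the unconditional (CFG) branch and estimate it from the conditional one
    unconditional_batch_skip_range=5,
    unconditional_batch_timestep_skip_range=(-1, 781),
    tensor_format="BFCHW",
)
pipe.transformer.enable_cache(config)

video = pipe("A cat playing with a ball of yarn").frames[0]
```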
Cc @sunovivid, you might be interested.
Looks good to me. Thanks to @a-r-r-o-w for the suggestions as well. I would be in favor of keeping the technical details.
Two things:
- Include a table reporting timing and memory numbers so that users know the trade-offs (can happen in a follow-up PR).
- If we know whether the caching methods are generally model-agnostic, an explicit note about that would be useful.
Thanks for the reviews! Happy to include a table in a follow-up if someone can provide me with the timing and memory numbers (or the code to generate them)!
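In case it helps, a minimal sketch of the kind of measurement code that could generate those numbers; it uses only standard PyTorch timing/memory APIs, and `benchmark` is a hypothetical helper rather than an existing utility:

```python
import time
import torch

def benchmark(pipe, **call_kwargs):
    """Return (wall-clock seconds, peak CUDA memory in GB) for one pipeline call."""
    torch.cuda.reset_peak_memory_stats()
    torch.cuda.synchronize()
    start = time.perf_counter()
    pipe(**call_kwargs)
    torch.cuda.synchronize()
    elapsed = time.perf_counter() - start
    peak_gb = torch.cuda.max_memory_allocated() / 1024**3
    return elapsed, peak_gb

# Run once without caching, then enable a cache config and run again,
# and tabulate the two (time, memory) pairs for the docs table.
```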
The comments I brought up can definitely be included in the follow-up.
Make the caching docs more visible and give a little more context behind the methods.