[Community Discussion] Scheduler design

[This](https://github.com/patil-suraj/stable-diffusion-jax/pull/8) is a very nice PR by @pcuenca showing what changes are needed to make the PNDM/PLMS scheduler work with JAX/XLA. It's actually more than anticipated, and it shows that the scheduler now differs substantially from the original implementation that we use for PyTorch.

In the coming days we will integrate these changes into main `diffusers` to make the library compatible with Flax/JAX. Now the big question is, should we:
a) Make each `scheduler` very generic and continue the `set_format("pt")` logic? This would make sense logically, as the schedulers don't really store any trainable weights, but it could lead to quite a few `if - else` statements and overly abstracted code, *e.g.* lots of `self.where(...)` functions in `scheduler_utils.py`. Also, maybe we want schedulers to have trainable weights in the future? And do we anticipate schedulers becoming more or less complex in the future?
b) Make one scheduler file for each framework. Instead of trying to fit all frameworks into one scheduler file, we make one scheduler per framework. The clear advantage is readability. Also, most people probably only ever work in one framework, so for them it might be nicer to have the schedulers separate. **However**: some schedulers will probably be 1-to-1 the same across frameworks (which might not necessarily be a problem).
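
To make the trade-off concrete, here is a rough sketch of the two shapes. It is not the actual `diffusers` code: the class names `GenericScheduler` and `NumpyScheduler` and the `clip` helper are made up for illustration, and NumPy stands in for "one framework" so the snippet runs without PyTorch or JAX installed.

```python
import numpy as np


class GenericScheduler:
    """Option a) sketch: one class, backend picked at runtime (hypothetical)."""

    SUPPORTED = ("np", "pt")

    def __init__(self):
        self.tensor_format = "np"

    def set_format(self, tensor_format):
        if tensor_format not in self.SUPPORTED:
            raise ValueError(f"unsupported format: {tensor_format!r}")
        self.tensor_format = tensor_format
        return self

    def clip(self, sample, limit=1.0):
        # Every tensor op needs a branch like this, one per framework.
        if self.tensor_format == "np":
            return np.clip(sample, -limit, limit)
        if self.tensor_format == "pt":
            import torch  # imported lazily so the NumPy path has no torch dependency

            return torch.clamp(sample, -limit, limit)
        raise ValueError(f"unsupported format: {self.tensor_format!r}")


class NumpyScheduler:
    """Option b) sketch: one class per framework, no dispatch (hypothetical)."""

    def clip(self, sample, limit=1.0):
        return np.clip(sample, -limit, limit)


# A JAX or PyTorch sibling of NumpyScheduler would live in its own file
# and call its framework's clipping function directly.
sample = np.array([-2.0, 0.5, 3.0])
print(GenericScheduler().set_format("np").clip(sample))
print(NumpyScheduler().clip(sample))
```

With a) the branching grows with every helper and every new framework, while with b) each file stays readable at the cost of near-duplicate classes.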

I'm actually starting to lean more and more towards b) here.

Would love to discuss - cc @anton-l @patil-suraj @natolambert @pcuenca 


[Community Discussion] Scheduler design #311
