SD3 IP-Adapter runtime checkpoint conversion #10718

guiyrt · 2025-02-05T00:47:46Z

What does this PR do?

Now IP-Adapter from InstantX/SD3.5-Large-IP-Adapter can be used directly, with checkpoint runtime conversion.

I also changed the structure to have the familiar _convert_ip_adapter_attn_to_diffusers and _convert_ip_adapter_image_proj_to_diffusers functions.

Before submitting

This PR fixes a typo or improves the docs (you can dismiss the other checks if that's the case).
Did you read the contributor guideline?
Did you read our philosophy doc (important for complex PRs)?
Was this discussed/approved via a GitHub issue or the forum? Please add a link to it if that's the case.
Did you make sure to update the documentation with your changes? Here are the
documentation guidelines, and
here are tips on formatting docstrings.
Did you write any new necessary tests?

Who can review?

@hlky @yiyixuxu

guiyrt · 2025-02-05T00:58:45Z

Example output:

Inference code

import os
import torch
from pathlib import Path

from diffusers import StableDiffusion3Pipeline
from diffusers.utils import load_image
from transformers import SiglipVisionModel, SiglipImageProcessor

model_id = "stabilityai/stable-diffusion-3.5-large"
image_encoder_id = "google/siglip-so400m-patch14-384"
ip_adapter_id = "InstantX/SD3.5-Large-IP-Adapter"

feature_extractor = SiglipImageProcessor.from_pretrained(
    image_encoder_id, torch_dtype=torch.bfloat16
)

image_encoder = SiglipVisionModel.from_pretrained(
    image_encoder_id, torch_dtype=torch.bfloat16
)

pipe = StableDiffusion3Pipeline.from_pretrained(
    model_id,
    torch_dtype=torch.bfloat16,
    feature_extractor=feature_extractor,
    image_encoder=image_encoder,
)

# Load IP Adapter
pipe.load_ip_adapter(ip_adapter_id, "ip-adapter.bin")
pipe.set_ip_adapter_scale(0.5)
pipe._exclude_from_cpu_offload.append("image_encoder")
pipe.enable_sequential_cpu_offload()

# Input
ip_adapter_img = load_image("astronaut.jpg")

# please note that SD3.5 Large is sensitive to highres generation like 1536x1536
image = pipe(
    width=1024,
    height=1024,
    prompt="an astronaut on top of a waffle",
    negative_prompt="lowres, low quality, worst quality",
    num_images_per_prompt=1,
    generator=torch.manual_seed(42),
    ip_adapter_image=ip_adapter_img,
    guidance_scale=6,
    num_inference_steps=24,
).images[0]

image.save("result.jpg")

Prompt: "an astronaut on top of a waffle"

Image Prompt:

Output:

hlky

Thanks @guiyrt. We can also change the checkpoint in docs/examples back to the original. You can also go ahead and add Fixes #9966 to close the issue when this is merged.

HuggingFaceDocBuilderDev · 2025-02-05T10:01:22Z

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

guiyrt · 2025-02-05T12:25:26Z

Done and done, thanks for the review @hlky !

guiyrt · 2025-02-13T01:36:20Z

Updated PR accounting for #10728

yiyixuxu · 2025-02-20T20:36:13Z

thanks a lot everyone!!

guiyrt and others added 2 commits February 5, 2025 00:42

Added runtime checkpoint conversion

3a98710

Merge branch 'main' into sd3_ipa_loader

b67723b

hlky approved these changes Feb 5, 2025

View reviewed changes

Updated docs

0835ca1

Fix for quantized model

32207cc

Merge branch 'main' into sd3_ipa_loader

201c875

shethaadit approved these changes Feb 19, 2025

View reviewed changes

yiyixuxu merged commit d9ee387 into huggingface:main Feb 20, 2025
12 checks passed

guiyrt deleted the sd3_ipa_loader branch February 20, 2025 21:56

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

SD3 IP-Adapter runtime checkpoint conversion #10718

SD3 IP-Adapter runtime checkpoint conversion #10718

Uh oh!

guiyrt commented Feb 5, 2025 •

edited

Loading

Uh oh!

guiyrt commented Feb 5, 2025

Uh oh!

hlky left a comment

Uh oh!

HuggingFaceDocBuilderDev commented Feb 5, 2025

Uh oh!

guiyrt commented Feb 5, 2025

Uh oh!

guiyrt commented Feb 13, 2025

Uh oh!

Uh oh!

yiyixuxu commented Feb 20, 2025

Uh oh!

Uh oh!

SD3 IP-Adapter runtime checkpoint conversion #10718

SD3 IP-Adapter runtime checkpoint conversion #10718

Uh oh!

Conversation

guiyrt commented Feb 5, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

What does this PR do?

Before submitting

Who can review?

Uh oh!

guiyrt commented Feb 5, 2025

Uh oh!

hlky left a comment

Choose a reason for hiding this comment

Uh oh!

HuggingFaceDocBuilderDev commented Feb 5, 2025

Uh oh!

guiyrt commented Feb 5, 2025

Uh oh!

guiyrt commented Feb 13, 2025

Uh oh!

Uh oh!

yiyixuxu commented Feb 20, 2025

Uh oh!

Uh oh!

guiyrt commented Feb 5, 2025 •

edited

Loading