Add the ability to parse workflows from AVIF images #4420

FerrahWolfeh · 2025-07-11T00:33:02Z

Adds the ability to import workflows directly from avif files. Specially useful when using specific nodes that allow the user to specify the output format of the saved image or some tools that allow converting png files into AVIF while preserving their metadata.

The added code works normally as expected, though I plan to create a more robust implementation in the future by reusing some code from the isobmff.ts file.

┆Issue is synchronized with this Notion page by Unito

christian-byrne · 2025-07-11T01:22:24Z

Wow, a PR adding support for open, royalty-free image file format that people have requested.

And you are searching by using the binary tags instead of just decoding it all as text and searching for string matches linearly. That's seriously amazing.

Was this inspired by Pillow adding AVIF support last week?

FerrahWolfeh · 2025-07-11T09:21:38Z

Wow, a PR adding support for open, royalty-free image file format that people have requested.

And you are searching by using the binary tags instead of just decoding it all as text and searching for string matches linearly. That's seriously amazing.

Was this inspired by Pillow adding AVIF support last week?

Wait, did Pillow just add AVIF support? That's pretty neat!

Also, thanks for the compliment, I really appreciate that.

Currently I'm using a "brute-forcey" way of finding the exif tag inside the AVIF header. Going the binary route is actually faster and opens a way for me to correctly scan for the boxes in the future, as sometimes, the location of the mdat box and the exif header may vary.

christian-byrne · 2025-07-11T17:32:54Z

Wait, did Pillow just add AVIF support? That's pretty neat!

Yep. I think from python-pillow/Pillow#8858, AVIF is supported without requiring specific wheel. If you run pip install -U Pillow then start ComfyUI, AVIF images will be loadable, it's awesome

Currently I'm using a "brute-forcey" way of finding the exif tag inside the AVIF header. Going the binary route is actually faster and opens a way for me to correctly scan for the boxes in the future, as sometimes, the location of the mdat box and the exif header may vary.

I totally agree. If you look at some of our decoding implementations in https://github.com/Comfy-Org/ComfyUI_frontend/tree/main/src/scripts/metadata, you will find some lazy approaches. Your approach seems proper and should be 2-3 magnitudes of order faster. I will do some research into AVIF format, I wrote the ISOBMFF decoder somewhat recently and had a lot of fun.

christian-byrne · 2025-07-11T21:14:33Z

If you are able to create a simple AVIF with a workflow embedded, you can add a test case by just adding the file to this folder: https://github.com/Comfy-Org/ComfyUI_frontend/tree/main/browser_tests/assets

then just putting the filename in this list:

ComfyUI_frontend/browser_tests/tests/loadWorkflowInMedia.spec.ts

Lines 5 to 19 in 19eaf6e

    
           test.describe('Load Workflow in Media', () => { 
        
             const fileNames = [ 
        
               'workflow.webp', 
        
               'edited_workflow.webp', 
        
               'no_workflow.webp', 
        
               'large_workflow.webp', 
        
               'workflow.webm', 
        
               // Skipped due to 3d widget unstable visual result. 
        
               // 3d widget shows grid after fully loaded. 
        
               // 'workflow.glb', 
        
               'workflow.mp4', 
        
               'workflow.mov', 
        
               'workflow.m4v', 
        
               'workflow.svg' 
        
             ]

It will run the browser emulation and test dropping the AVIF onto the graph and loading the workflow.

… instead

full response body

christian-byrne · 2025-07-12T16:06:38Z

Oh sorry I forgot to mention that we generate the screenshots in the GitHub test runner automatically by adding the "New Browser Test Expectations" label. I will generate the screenshot now!

webfiltered · 2025-07-12T23:35:36Z

N.B. Workflow needs to be approved after adding label.

christian-byrne · 2025-07-13T22:45:50Z

Give us a moment to fix this, apologies.

FerrahWolfeh · 2025-07-14T11:58:38Z

Is there anything I need to do with the code or the snapshots? Because I'm pretty lost about what should be done on my end...

webfiltered · 2025-07-14T19:37:05Z

Is there anything I need to do with the code or the snapshots? Because I'm pretty lost about what should be done on my end...

Nope! Not unless you want to set your monitor to a MUCH smaller res, nuke all your settings, and upload a new screenshot? 😃

I'll just do a quick follow-up afterwards.

webfiltered · 2025-07-14T19:59:07Z

Oh ah, actually - could you delete the existing playwright screenshot? Will be able to merge aftter it's all tested, but we may as well not add it to the commit history seeing as it's about to be deleted.

FerrahWolfeh · 2025-07-14T20:01:46Z

Ok, done.

webfiltered

LGTM w/minor changes - thank you for this!

src/scripts/pnginfo.ts

src/scripts/app.ts

src/scripts/metadata/avif.ts

christian-byrne · 2025-07-15T15:49:06Z

Should we just merge this and add the screenshot in followup? Then again, we want the update expectations to be runnable by any contributor, especially with new PR requirements.

webfiltered · 2025-07-15T17:40:36Z

Was literally halfway through merging this yesterday when internet problems started.

For the root issue, the action would need to run on the fork. Which means it can't be based on a PR label. But should still be very doable.

christian-byrne · 2025-07-15T18:09:32Z

Okay nice, we can merge as part of 1.25 then later today!

FerrahWolfeh · 2025-07-17T08:13:08Z

For the root issue, the action would need to run on the fork. Which means it can't be based on a PR label. But should still be very doable.

Alright, I can do that. But which workflow in the actions menu should I run then if at all?

webfiltered · 2025-07-18T01:36:43Z

Alright, I can do that. But which workflow in the actions menu should I run then if at all?

We'll need to fix our workflows to be fork-friendly. We're good to merge this - just waiting for some bug fixes to go in, so this can land in the version after that.

add avif metadata parser

720f2b4

FerrahWolfeh requested a review from a team as a code owner July 11, 2025 00:33

Merge branch 'main' into avif_parse

54e21fa

add browser test for avif workflows

0f15fc3

FerrahWolfeh requested a review from a team as a code owner July 11, 2025 22:42

FerrahWolfeh added 6 commits July 11, 2025 21:04

isolate avif parser from pnginfo + parse avif header by defined boxes…

a3d891d

… instead

parse and extract comfy metadata from avif files instead of generating

2e7d428

full response body

Implement concrete typing for AVIF boxes + thorough parsing

196d29f

run pre-commit hooks

eed9f9e

Merge branch 'Comfy-Org:main' into avif_parse

6f34d59

add nsapshot to fix CI tests

f454b10

christian-byrne added the New Browser Test Expectations New browser test screenshot should be set by github action label Jul 12, 2025

christian-byrne added New Browser Test Expectations New browser test screenshot should be set by github action and removed New Browser Test Expectations New browser test screenshot should be set by github action labels Jul 13, 2025

remove screenshot

5c217f9

webfiltered requested changes Jul 14, 2025

View reviewed changes

src/scripts/pnginfo.ts Outdated Show resolved Hide resolved

src/scripts/app.ts Outdated Show resolved Hide resolved

src/scripts/metadata/avif.ts Outdated Show resolved Hide resolved

src/scripts/metadata/avif.ts Outdated Show resolved Hide resolved

apply code suggestions

781d4e4

webfiltered approved these changes Jul 14, 2025

View reviewed changes

Add the ability to parse workflows from AVIF images #4420

Are you sure you want to change the base?

Add the ability to parse workflows from AVIF images #4420

Uh oh!

Conversation

FerrahWolfeh commented Jul 11, 2025 • edited by sync-by-unito bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

christian-byrne commented Jul 11, 2025

Uh oh!

FerrahWolfeh commented Jul 11, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

christian-byrne commented Jul 11, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

christian-byrne commented Jul 11, 2025

Uh oh!

christian-byrne commented Jul 12, 2025

Uh oh!

webfiltered commented Jul 12, 2025

Uh oh!

christian-byrne commented Jul 13, 2025

Uh oh!

FerrahWolfeh commented Jul 14, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

webfiltered commented Jul 14, 2025

Uh oh!

webfiltered commented Jul 14, 2025

Uh oh!

FerrahWolfeh commented Jul 14, 2025

Uh oh!

webfiltered left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

christian-byrne commented Jul 15, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

webfiltered commented Jul 15, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

christian-byrne commented Jul 15, 2025

Uh oh!

FerrahWolfeh commented Jul 17, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

webfiltered commented Jul 18, 2025

Uh oh!

Uh oh!

FerrahWolfeh commented Jul 11, 2025 •

edited by sync-by-unito bot

Loading

FerrahWolfeh commented Jul 11, 2025 •

edited

Loading

christian-byrne commented Jul 11, 2025 •

edited

Loading

FerrahWolfeh commented Jul 14, 2025 •

edited

Loading

christian-byrne commented Jul 15, 2025 •

edited

Loading

webfiltered commented Jul 15, 2025 •

edited

Loading

FerrahWolfeh commented Jul 17, 2025 •

edited

Loading