Support vision transformers #18

guangy10 · 2025-02-05T19:53:06Z

Following PR #35124, we will add support for vision transformer models that are suitable for on-device deployment

johnsutor · 2025-02-23T17:10:00Z

What is the timeline for this? Is there any way I can contribute?

guangy10 · 2025-02-25T01:43:45Z

@johnsutor I think we will prioritize to get the key features implemented first so that users can have a good experience to publish .pte models, and load cached one from hub. For connecting the exportable vision models to optimum, it will have to wait until after it. However, it would be super nice if you would like to contribute!

Here are what you need in order to contribute:

Register new tasks for the vision models under https://github.com/huggingface/optimum-executorch/tree/main/optimum/exporters/executorch/tasks, similar to causal_lm for "text-genertion" task.
Modify the existing xnnpack recipe as needed, e.g. https://github.com/huggingface/optimum-executorch/blob/main/optimum/exporters/executorch/recipes/xnnpack.py#L78, the lowering to .pte should just work
To run the .pte model using ExecuTorch runtime via pybind, you will need to implement a new modeling class similar to ExecuTorchModelForCausalLM for the vision tasks.

With step 1 & 2, you will be able to generate the pte models. Step 3, inference, can be done separately.

guangy10 self-assigned this Feb 5, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Support vision transformers #18

Support vision transformers #18

guangy10 commented Feb 5, 2025

johnsutor commented Feb 23, 2025

guangy10 commented Feb 25, 2025

Support vision transformers #18

Support vision transformers #18

Comments

guangy10 commented Feb 5, 2025

johnsutor commented Feb 23, 2025

guangy10 commented Feb 25, 2025