This repository implements a collection of Flash-Linear-Attention models that extend beyond language, supporting vision, video, and more.
- $\texttt{[2025-01-25]}$: This repo is created, with initial support for vision models.
- **vision**: `fla-zoo` currently supports vision models. Simple documentation is available here. TL;DR: use a hybrid model and random scan for better performance and efficiency. "Vision" here refers to image classification tasks.
- **video**: `fla-zoo` currently supports certain video models. Documentation is in progress.
Requirements:
- All the dependencies listed here
- torchvision
For example, you can install all the dependencies with the following commands:

```sh
conda create -n flazoo python=3.12
conda activate flazoo
pip install torch torchvision accelerate
pip install transformers datasets evaluate causal_conv1d einops scikit-learn wandb
pip install flash-attn --no-build-isolation
```

Note that the scikit-learn package must be installed as `scikit-learn`; the old `sklearn` name on PyPI is deprecated and no longer installable.
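After installing, a quick sanity check can confirm the core dependencies are importable. This is a minimal illustrative sketch, not an official `fla-zoo` utility; the module list below is an assumption based on the install commands above.

```python
# Minimal installation sanity check (illustrative; not part of fla-zoo itself).
# Verifies that the core dependencies listed above can be found by Python
# without actually importing them (find_spec only locates the module).
import importlib.util

CORE_DEPS = ["torch", "torchvision", "transformers", "datasets", "einops"]

def find_missing(deps):
    """Return the subset of `deps` that is not importable."""
    return [d for d in deps if importlib.util.find_spec(d) is None]

if __name__ == "__main__":
    missing = find_missing(CORE_DEPS)
    if missing:
        print("Missing dependencies:", ", ".join(missing))
    else:
        print("All core dependencies found.")
```

Using `find_spec` instead of a bare `import` keeps the check cheap: heavy packages like `torch` are located on disk but not loaded.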
Now we can start cooking! 🚀
Note that `fla-zoo` is an actively developed repo and currently provides no released packages.
Roadmap:
- Write documentation for vision models.
- Write documentation for video models.
- Release training scripts for vision models.