GitHub - InterDigitalInc/CompressAI-Vision: CompressAI-Vision helps you design, test and compare Video Compression for Machines pipelines. Compression methods can be either pulled from custom AI-based modules from CompressAI or traditional codecs such as H.266/VVC.

CompressAI-Vision helps you to develop, test and evaluate compression models with standardized tests in the context of compression methods optimized for machine tasks algorithms such as Neural-Network (NN)-based detectors.

It currently focuses on two types of pipeline:

Video compression for remote inference (compressai-remote-inference), which corresponds to the MPEG "Video Coding for Machines" (VCM) activity.
Split inference (compressai-split-inference), which includes an evaluation framework for compressing intermediate features produced in the context of split models. The software supports all thepipelines considered in the related MPEG activity: "Feature Compression for Machines" (FCM).

Features

Detectron2 for Object Detection (Faster-RCNN) and Instance Segmentation (Mask-RCNN)
JDE for Object Tracking
YOLOX-Darknet53 for Object Detection
MMPOSE RTMO for Pose Estimation (Bottom Up)
Segment Anything
Segment Anything 2 (SAM2)

Documentation

A complete documentation is provided here, including installation, CLI usage, as well as tutorials.

installation

The CompressAI library providing learned compresion modules is available as a submodule. It can be initilized by running:

git submodule update --init --recursive

Note: the installation scripts documented below installs compressai from source expects the submodule to be populated.

CompressAI-Vision can be installed using a virtual environment and pip or using uv.

1. Using a virtual environment:

Initialization of the environment

To get started locally and install the development version of CompressAI-Vision, first create a virtual environment with python==3.8:

python3.8 -m venv venv
source ./venv/bin/activate
pip install -U pip

Installation of compressai-vision and supported vision models:

First, if you want to manually export CUDA related paths, please source (e.g. for CUDA 11.8):

bash scripts/env_cuda.sh 11.8

Then, please run:

bash scripts/install.sh

To install the dependencies in conformance with MPEG FCM Test Conditions, run:

bash scripts/install.sh --fcm-cttc (--cpu)

For more otions, check:

bash scripts/install.sh --help

NOTE 1: install.sh gives you the possibility to install vision models' source and weights at specified locations so that mutliple versions of compressai-vision can point to the same installed vision models

NOTE 2: the downlading of JDE pretrained weights might fail. Check that the size of following file is ~558MB. path/to/weights/jde/jde.1088x608.uncertainty.pt The file can be downloaded at the following link (in place of the above file path): "https://docs.google.com/uc?export=download&id=1nlnuYfGNuHWZztQHXwVZSL_FvfE551pA"

NOTE 3: SAM2 requires python>=3.10, torch>=2.5.1 and torchvision>=0.20.1., which are higher versions of the packages needed for the previous models installation. For instance, the installation of models with the ‘-—fcm-cttc’ configuration may be incompatible with SAM2 installation, and vice versa.

2. Using uv:

Within the root folder of compressai-vision:

bash scripts/install_uv.sh

Note: Make sure you pin the desired installed python version before, e.g.,

uv python pin 3.8

Usage

Split inference pipelines

To run split-inference pipelines, please use the following command:

compressai-split-inference --help

Note that the following entry point is kept for backward compability. It runs split inference as well.

compressai-vision-eval --help

For example for testing a full split inference pipelines without any compression, run

compressai-vision-eval --config-name=eval_split_inference_example

Remote inference pipelines

For remote inference (MPEG VCM-like) pipelines, please run:

compressai-remote-inference --help

Configurations

Please check other configuration examples provided in ./cfgs as well as examplary scripts in ./scripts

Test data related to the MPEG FCM activity can be found in ./data/mpeg-fcm/

For developers

After your dev, you can run (and adapt) test scripts from the scripts/tests directory. Please check [scripts/tests/README.md] for more details

Contributing

Code is formatted using black and isort. To format code, type:

make code-format

Static checks with those same code formatters can be run manually with:

make static-analysis

Compiling documentation

To produce the html documentation, from docs/, run:

make html

To check the pages locally, open docs/_build/html/index.html

License

CompressAI-Vision is licensed under the BSD 3-Clause Clear License

Authors

Fabien Racapé, Hyomin Choi, Eimran Eimon, Sampsa Riikonen, Jacky Yat-Hong Lam

Citation

If you use this project, please cite:

@article{compressai_vision,
  title={CompressAI-Vision: Open-source software to evaluate compression methods for computer vision tasks},
  author={Choi, Hyomin and Han, Heeji and Rosewarne, Chris and Racap{\'e}, Fabien},
  journal={arXiv preprint arXiv:2509.20777},
  year={2025}
}

Name		Name	Last commit message	Last commit date
Latest commit History 1,140 Commits
.github		.github
cfgs		cfgs
compressai @ ff16d32		compressai @ ff16d32
compressai_vision		compressai_vision
data		data
docker		docker
docs		docs
examples/vcm		examples/vcm
scripts		scripts
tests		tests
.flake8		.flake8
.gitattributes		.gitattributes
.gitignore		.gitignore
.gitlab-ci.yml		.gitlab-ci.yml
.gitmodules		.gitmodules
CONTRIBUTING.md		CONTRIBUTING.md
LICENSE		LICENSE
MANIFEST.in		MANIFEST.in
Makefile		Makefile
NEWS.md		NEWS.md
README.md		README.md
pyproject.toml		pyproject.toml
ruff.toml		ruff.toml
setup.py		setup.py
uv.lock		uv.lock

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Features

Documentation

installation

1. Using a virtual environment:

Initialization of the environment

Installation of compressai-vision and supported vision models:

2. Using uv:

Usage

Split inference pipelines

Remote inference pipelines

Configurations

For developers

Contributing

Compiling documentation

License

Authors

Citation

Related links

About

Uh oh!

Releases

Packages

Uh oh!

Contributors 11

Languages

License

InterDigitalInc/CompressAI-Vision

Folders and files

Latest commit

History

Repository files navigation

Features

Documentation

installation

1. Using a virtual environment:

Initialization of the environment

Installation of compressai-vision and supported vision models:

2. Using uv:

Usage

Split inference pipelines

Remote inference pipelines

Configurations

For developers

Contributing

Compiling documentation

License

Authors

Citation

Related links

About

Topics

Resources

License

Contributing

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors 11

Languages

Packages