Executable code for "Efficient Transfer Learning driven by Layer-wise Features Aggregation"
In this study, we propose Layer-wise Feature Aggregation (LFA), a novel approach that utilizes features from all layers of a pre-trained model with instance-specific importance. First, LFA captures hierarchical features from low-level to high-level, enabling the extraction of richer and more general representations; as a result, it significantly improves performance under domain shift and in few-shot learning. Second, LFA trains only a small module on top of the large pre-trained model, so optimization is efficient and requires no back-propagation through the backbone. LFA is thus a new transfer learning approach that improves both performance and efficiency.
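The core idea can be sketched in a few lines of PyTorch. The snippet below is a minimal, illustrative sketch only (the gating module, dimensions, and class names are placeholders, not the exact implementation in this repository): per-layer features of a frozen backbone are combined with instance-specific softmax weights and passed to a linear classifier, so only the small head is trained.

```python
import torch
import torch.nn as nn

class LayerwiseFeatureAggregation(nn.Module):
    """Sketch: aggregate per-layer features of a frozen backbone with
    instance-specific weights, then classify. Illustrative design only."""

    def __init__(self, feat_dim: int, num_classes: int):
        super().__init__()
        self.gate = nn.Linear(feat_dim, 1)              # scores each layer's feature per instance
        self.classifier = nn.Linear(feat_dim, num_classes)

    def forward(self, layer_feats: torch.Tensor) -> torch.Tensor:
        # layer_feats: (batch, num_layers, feat_dim), extracted from a frozen
        # pre-trained model, so no gradients flow back through the backbone.
        weights = self.gate(layer_feats).softmax(dim=1)   # (batch, num_layers, 1)
        aggregated = (weights * layer_feats).sum(dim=1)   # (batch, feat_dim)
        return self.classifier(aggregated)

# Example: per-layer [CLS] features from the 12 blocks of a ViT-B/32 image encoder.
feats = torch.randn(4, 12, 512)
head = LayerwiseFeatureAggregation(feat_dim=512, num_classes=7)
logits = head(feats)   # (4, 7)
```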
- We introduce the Layer-wise Feature Aggregation (LFA) method, a novel approach that significantly mitigates the computational burdens commonly associated with the fine-tuning of pre-trained models.
- We empirically validate that the LFA method excels in handling both domain and distribution shifts, thereby establishing its versatility and applicability for a broad spectrum of machine learning tasks.
- We establish the critical importance of our holistic aggregation approach by validating its efficacy across various experiments.
The image depicts an elephant from the PACS dataset, but the LinearProbing CLIP model misclassifies it as a horse. Observing the activation proportions when applying our methodology, we note that the last layer exhibits the highest activation. However, a model utilizing only the features of the last layer classifies this picture as a giraffe; it correctly predicts an elephant only when all layers are used.
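Per-layer features like those discussed above can be collected without modifying the backbone, for example via forward hooks. The sketch below assumes the open-source `clip` package, a ViT-B/32 backbone, and a hypothetical image file `elephant.png`; it is illustrative and not this repository's extraction code.

```python
import clip
import torch
from PIL import Image

model, preprocess = clip.load("ViT-B/32", device="cpu")
layer_feats = []

def grab_cls(_module, _inputs, output):
    # CLIP's ViT blocks run in (seq_len, batch, width) layout; index 0 is the class token.
    layer_feats.append(output[0].detach())

handles = [blk.register_forward_hook(grab_cls)
           for blk in model.visual.transformer.resblocks]

image = preprocess(Image.open("elephant.png")).unsqueeze(0)  # hypothetical file name
with torch.no_grad():
    model.encode_image(image)           # fills layer_feats, one entry per block

for h in handles:
    h.remove()
per_layer = torch.stack(layer_feats, dim=1)  # (batch, num_layers, width)
```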
The best scores are bolded, and the second-best scores are underlined. CLIP + LFA demonstrates superior performance compared to DPLCLIP, DeiT, and HViT across the VLCS, PACS, and OfficeHome datasets. Additionally, our method achieves the best overall average performance.
* indicates that experiments were conducted with a batch size of 8 due to memory constraints. In our model with LFA, peak memory usage is slightly higher than LinearProbing CLIP but still lower than CLIP + DPL and CLIP + QLoRA. Despite the additional trainable parameters, training and inference times remain as short as those of LinearProbing CLIP, demonstrating that our model is both lightweight and efficient while improving generalization performance.
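For reference, peak-memory and timing numbers of this kind can be measured with PyTorch's CUDA statistics. The snippet below is purely illustrative (a dummy linear head and random features stand in for CLIP + LFA) and is not the benchmarking script used for the table.

```python
import time
import torch
import torch.nn as nn
from torch.utils.data import DataLoader, TensorDataset

model = nn.Linear(512, 7).cuda()   # dummy head standing in for the trainable module
loader = DataLoader(TensorDataset(torch.randn(256, 512), torch.randint(0, 7, (256,))),
                    batch_size=32)

torch.cuda.reset_peak_memory_stats()
start = time.time()
for feats, labels in loader:
    loss = nn.functional.cross_entropy(model(feats.cuda()), labels.cuda())
    loss.backward()
torch.cuda.synchronize()
print(f"peak memory: {torch.cuda.max_memory_allocated() / 2**20:.1f} MiB, "
      f"time: {time.time() - start:.2f} s")
```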
- Install the PyTorch library (see the official PyTorch installation page)
- Install the requirements
pip install -r requirements.txt
💡 This paragraph has been borrowed directly from DPLCLIP's official repository.
python -m domainbed.scripts.download --data_dir=/my/datasets/path --dataset pacs
Note: change --dataset pacs to download other datasets (e.g., vlcs, office_home, terra_incognita).
💡 This paragraph has been borrowed directly from CoOp's official repository.
Please follow the instructions at DATASETS.md to prepare all datasets.
Please follow the instructions at README.md for training or evaluation.
If you use our work, please consider citing: