Multi-Mode Online Knowledge Distillation for Self-Supervised Visual Representation Learning

Official Implementation of our paper "Multi-Mode Online Knowledge Distillation for Self-Supervised Visual Representation Learning", in CVPR 2023.

by Kaiyou Song, Jin Xie, Shan Zhang and Zimeng Luo.

[arXiv] [Paper]

Method

Usage

ImageNet Pre-training

This implementation only supports DistributedDataParallel training; single-GPU and DataParallel training are not supported.

To pre-train a ResNet-50/ViT-S model pair on ImageNet with 16 GPUs (2 nodes with 8 GPUs each), run the following on the first node; on the second node, launch the same command with --node_rank 1:

python3 -m torch.distributed.launch --nproc_per_node=8 \
--nnodes 2 --node_rank 0 --master_addr='100.123.45.67' --master_port='10001'  \
main_mokd.py \
--arch_cnn resnet50 --arch_vit vit_small \
--out_dim 65536 --norm_last_layer False \
--clip_grad_cnn 3 --clip_grad_vit 3 --freeze_last_layer 1 \
--optimizer sgd  \
--lr_cnn 0.1 --lr_vit 0.0003 --warmup_epochs 10 \
--use_fp16 True \
--warmup_teacher_temp 0.04 --teacher_temp 0.07 \
--warmup_teacher_temp_epochs_cnn 50 --warmup_teacher_temp_epochs_vit 30 \
--patch_size 16 --drop_path_rate 0.1 \
--local_crops_number 8 --global_crops_scale 0.25 1 --local_crops_scale 0.05 0.25 \
--momentum_teacher 0.996 \
--num_workers 10 \
--batch_size_per_gpu 16 --epochs 100 \
--lamda_t 0.1 --lamda_c 1.0 \
--data_path /path/to/imagenet/ \
--output_dir output/
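
The --momentum_teacher flag above controls the exponential-moving-average (EMA) rate with which the teacher networks are updated from the students, following DINO, which this project is based on. A minimal sketch of such an EMA update (function and variable names are hypothetical, for illustration only):

import torch

@torch.no_grad()
def ema_update(student, teacher, momentum=0.996):
    # DINO-style EMA: teacher <- m * teacher + (1 - m) * student
    for p_s, p_t in zip(student.parameters(), teacher.parameters()):
        p_t.data.mul_(momentum).add_((1.0 - momentum) * p_s.detach().data)

As in DINO, the momentum is typically ramped from its initial value (0.996 here) toward 1 over the course of training with a cosine schedule.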

ImageNet Linear Classification

With a pre-trained model, you can train a supervised linear classifier on frozen features on an 8-GPU machine. To evaluate the ResNet-50 branch (checkpoint key teacher_cnn), run:

python3 -m torch.distributed.launch --nproc_per_node=8 eval_linear.py \
--arch resnet50 \
--lr 0.01 \
--batch_size_per_gpu 256 \
--num_workers 10 \
--pretrained_weights /path/to/pretrained/checkpoints/xxx.pth \
--checkpoint_key teacher_cnn \
--data_path /path/to/imagenet/ \
--output_dir output/ \
--method mokd

To evaluate the ViT-S branch (checkpoint key teacher_vit), run:

python3 -m torch.distributed.launch --nproc_per_node=8 eval_linear.py \
--arch vit_small \
--n_last_blocks 4 \
--lr 0.001 \
--batch_size_per_gpu 256 \
--pretrained_weights /path/to/pretrained/checkpoints/xxx.pth \
--checkpoint_key teacher_vit \
--data_path /path/to/imagenet/ \
--output_dir output/ \
--method mokd
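
The --checkpoint_key flag selects which branch of the joint MOKD checkpoint to evaluate: teacher_cnn for the ResNet-50 branch, teacher_vit for the ViT-S branch. A minimal sketch of loading one branch manually, assuming the checkpoint is a dict of state dicts keyed by these names (file name and key handling are illustrative, not the exact eval_linear.py code):

import torch
import torchvision

# Hypothetical example: pull the ResNet-50 teacher branch out of a MOKD checkpoint.
ckpt = torch.load("/path/to/pretrained/checkpoints/xxx.pth", map_location="cpu")
state_dict = ckpt["teacher_cnn"]  # use "teacher_vit" for the ViT-S branch
# Strip DistributedDataParallel "module." prefixes, if present.
state_dict = {k.replace("module.", ""): v for k, v in state_dict.items()}
model = torchvision.models.resnet50()
msg = model.load_state_dict(state_dict, strict=False)
print("missing keys:", msg.missing_keys)  # projection-head weights are expected to be unmatched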

Evaluation: k-NN classification on ImageNet

To evaluate a k-NN classifier on the frozen features of a pre-trained model (here on an 8-GPU machine), run:

python3 -m torch.distributed.launch --nproc_per_node=8 eval_knn.py \
--arch resnet50 \
--batch_size_per_gpu 512 \
--pretrained_weights /path/to/pretrained/checkpoints/xxx.pth \
--checkpoint_key teacher_cnn \
--num_workers 20 \
--data_path /path/to/imagenet/ \
--use_cuda True \
--method mokd
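
Since this project is based on DINO, eval_knn.py presumably follows DINO's weighted k-NN protocol: each test image is classified by a temperature-weighted vote over the cosine similarities to its k nearest training features. A minimal, self-contained sketch of that vote (the exact defaults in eval_knn.py may differ; k, temperature, and shapes below are illustrative):

import torch

def weighted_knn_predict(train_feats, train_labels, test_feats,
                         k=20, temperature=0.07, num_classes=1000):
    # Features are assumed L2-normalized, so a matmul gives cosine similarity.
    sim = test_feats @ train_feats.t()            # (n_test, n_train)
    topk_sim, topk_idx = sim.topk(k, dim=1)       # k nearest training samples
    topk_labels = train_labels[topk_idx]          # (n_test, k)
    weights = (topk_sim / temperature).exp()      # similarity-weighted votes
    votes = torch.zeros(test_feats.size(0), num_classes)
    votes.scatter_add_(1, topk_labels, weights)
    return votes.argmax(dim=1)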

Acknowledgement

This project is based on DINO. Thanks for the wonderful work.

License

This project is released under the Apache 2.0 license. See LICENSE for details.

Citation

@InProceedings{Song_2023_CVPR,
    author    = {Song, Kaiyou and Xie, Jin and Zhang, Shan and Luo, Zimeng},
    title     = {Multi-Mode Online Knowledge Distillation for Self-Supervised Visual Representation Learning},
    booktitle = {Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)},
    month     = {June},
    year      = {2023},
    pages     = {11848-11857}
}
