Knowledge Distillation with VRM

This project analyses the impact of various VRM (Vicinal Risk Minimization) techniques, applied to teacher models, on the generalization performance of a distilled student model. The VRM techniques analysed here are:

This work was accepted at the ICML UDL Workshop, 2020.


Step 1: Replicate Conda Environment

conda create -n ml
conda install --name ml --file spec-file.txt
conda activate ml

Step 2: Train Teacher Models

Train a set of teacher models, applying the VRM techniques during training.
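
As an illustration, here is a minimal PyTorch-style sketch of training a teacher with mixup, one common VRM technique. The model, data loader, and alpha value are illustrative assumptions, not the repository's actual configuration.

import numpy as np
import torch
import torch.nn.functional as F

def mixup_batch(x, y, alpha=1.0):
    """Mix pairs of examples; return both label sets and the mixing weight."""
    lam = np.random.beta(alpha, alpha)
    index = torch.randperm(x.size(0), device=x.device)
    mixed_x = lam * x + (1.0 - lam) * x[index]
    return mixed_x, y, y[index], lam

def train_teacher_epoch(model, loader, optimizer, device):
    model.train()
    for x, y in loader:
        x, y = x.to(device), y.to(device)
        mixed_x, y_a, y_b, lam = mixup_batch(x, y)
        logits = model(mixed_x)
        # Loss is the convex combination of the losses on the two label sets.
        loss = lam * F.cross_entropy(logits, y_a) + (1.0 - lam) * F.cross_entropy(logits, y_b)
        optimizer.zero_grad()
        loss.backward()
        optimizer.step()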

Step 3: Train Student Models

Train student models using the dark knowledge (softened output distributions) of the teacher models trained in Step 2.
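
A minimal sketch of a standard Hinton-style distillation loss is shown below; the temperature T and mixing weight beta are illustrative assumptions, not the values used in this repository.

import torch
import torch.nn.functional as F

def distillation_loss(student_logits, teacher_logits, labels, T=4.0, beta=0.9):
    # Soft targets from the teacher, softened by temperature T.
    soft_targets = F.softmax(teacher_logits / T, dim=1)
    log_student = F.log_softmax(student_logits / T, dim=1)
    # The KL term is scaled by T^2 to keep gradient magnitudes comparable
    # to the hard-label cross-entropy term.
    kd_term = F.kl_div(log_student, soft_targets, reduction="batchmean") * (T * T)
    ce_term = F.cross_entropy(student_logits, labels)
    return beta * kd_term + (1.0 - beta) * ce_term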

Step 4: Analyse generalization performance

Use different datasets and performance metrics to analyse the generalization performance of the different student models. To measure generalization, we evaluate the models on the unseen CIFAR-10 test set. In addition, we consider the following datasets (a small evaluation sketch follows the list):

  • CIFAR 10.1 v6: Small natural variations in the dataset
  • CINIC (ImageNet Fold): Distributional shift in images
  • CIFAR 10H: the CIFAR test set with human labels, useful for analysing prediction structure.
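
A hypothetical sketch of evaluating a trained student across several test sets; the data loaders are placeholders, since CIFAR-10.1, CINIC, and CIFAR-10H are distributed separately and are not part of torchvision.

import torch

@torch.no_grad()
def accuracy(model, loader, device):
    model.eval()
    correct, total = 0, 0
    for x, y in loader:
        x, y = x.to(device), y.to(device)
        preds = model(x).argmax(dim=1)
        correct += (preds == y).sum().item()
        total += y.size(0)
    return correct / total

def evaluate_generalization(model, loaders, device):
    """loaders: dict mapping a dataset name (e.g. 'cifar10_test', 'cifar10.1_v6',
    'cinic_imagenet_fold') to its DataLoader."""
    return {name: accuracy(model, loader, device) for name, loader in loaders.items()}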