This is a TensorFlow implementation of VICReg, a self-supervised learning architecture that prevents collapse in an intuitive manner using a loss function that (1) maintains the variance of each embedding dimension over a batch above a threshold and (2) decorrelates pairs of embedding dimensions over a batch by pushing their covariances towards 0. Training was done on a TPU in Colab.

VICReg

The main issue with self-supervised models is how to avoid producing constant, non-informative outputs. Although existing models have been shown to prevent a collapsing solution, they do so in ways that are not fully understood. VICReg (Variance-Invariance-Covariance Regularization) is a method that explicitly prevents a collapsing solution using a loss function with three terms. Below is an image depicting the architecture:

[VICReg architecture diagram]

As with most self-supervised learning models, VICReg training begins by applying random augmentations to each image in the dataset to produce two different views. The two views are fed into a joint-embedding architecture with a ResNet backbone to produce representations. The representations are then fed into a 3-layer MLP called the expander, which transforms them into the embeddings on which the loss terms are applied.
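
Below is a minimal sketch of how this encoder + expander setup might look in TensorFlow/Keras. The choice of ResNet50, the 8192-unit expander layers, and the input size are assumptions taken from the paper's defaults for illustration, not necessarily the exact configuration used in this repository.

```python
import tensorflow as tf
from tensorflow.keras import layers

def build_expander(hidden_dim=8192, output_dim=8192):
    """3-layer MLP expander mapping representations to embeddings (illustrative sizes)."""
    return tf.keras.Sequential([
        layers.Dense(hidden_dim),
        layers.BatchNormalization(),
        layers.ReLU(),
        layers.Dense(hidden_dim),
        layers.BatchNormalization(),
        layers.ReLU(),
        layers.Dense(output_dim),
    ])

# ResNet backbone without its classification head; global average pooling
# yields one representation vector per image.
backbone = tf.keras.applications.ResNet50(
    include_top=False, weights=None, pooling="avg",
    input_shape=(224, 224, 3))
expander = build_expander()

def embed(views, training=True):
    """Map a batch of augmented views to embeddings Z."""
    representations = backbone(views, training=training)
    return expander(representations, training=training)
```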

The loss consists of three terms:

  1. Variance, v(Z), v(Z') - applied separately to each branch, on each embedding dimension over a batch. The purpose of the variance term is to keep the standard deviation of each embedding dimension above a threshold, forcing the network not to produce constant outputs and essentially preventing a collapsing solution. In this sense, the embedding vectors within a batch stay different from one another.
  2. Invariance, s(Z, Z') - a distance metric between embeddings from the two branches. Since both branches are fed augmented versions of the same image, their embeddings should be invariant to the augmentations, and the invariance term enforces this. It is simply the mean squared distance between them.
  3. Covariance, c(Z), c(Z') - also applied separately to each branch. The covariance term is computed on pairs of embedding dimensions over a batch and pushes their covariances towards zero. Essentially it decorrelates the different dimensions of the embedding so they do not encode the same information, thereby preventing informational collapse.

The paper points out that it is the variance and covariance terms that play the active role of preserving information and avoiding a degenerate solution.
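
As a concrete illustration, here is a minimal TensorFlow sketch of the three loss terms described above. The coefficients (25, 25, 1), the variance threshold of 1, and the epsilon follow the defaults reported in the paper; the function and variable names are illustrative assumptions, not necessarily the code used in this repository.

```python
import tensorflow as tf

def vicreg_loss(z_a, z_b, lambda_=25.0, mu=25.0, nu=1.0, eps=1e-4):
    """z_a, z_b: embeddings of the two views, shape (batch, dim)."""
    batch_size = tf.cast(tf.shape(z_a)[0], tf.float32)
    dim = tf.cast(tf.shape(z_a)[1], tf.float32)

    # Invariance: mean squared distance between the two branches.
    sim_loss = tf.reduce_mean(tf.square(z_a - z_b))

    # Variance: hinge loss keeping the std of each embedding dimension above 1.
    std_a = tf.sqrt(tf.math.reduce_variance(z_a, axis=0) + eps)
    std_b = tf.sqrt(tf.math.reduce_variance(z_b, axis=0) + eps)
    std_loss = (tf.reduce_mean(tf.nn.relu(1.0 - std_a))
                + tf.reduce_mean(tf.nn.relu(1.0 - std_b)))

    # Covariance: push the off-diagonal entries of the covariance matrix to zero.
    z_a_c = z_a - tf.reduce_mean(z_a, axis=0)
    z_b_c = z_b - tf.reduce_mean(z_b, axis=0)
    cov_a = tf.matmul(z_a_c, z_a_c, transpose_a=True) / (batch_size - 1.0)
    cov_b = tf.matmul(z_b_c, z_b_c, transpose_a=True) / (batch_size - 1.0)

    def off_diagonal_sq_sum(c):
        return tf.reduce_sum(tf.square(c)) - tf.reduce_sum(tf.square(tf.linalg.diag_part(c)))

    cov_loss = off_diagonal_sq_sum(cov_a) / dim + off_diagonal_sq_sum(cov_b) / dim

    return lambda_ * sim_loss + mu * std_loss + nu * cov_loss
```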

Reports

Pretrain: Pretrain report
Linear Evaluation: Eval report

References

  1. VICReg paper (Adrien Bardes, Jean Ponce, Yann LeCun. ICLR 2022)
  2. Official VICReg implementation in PyTorch
  3. YouTube video explaining VICReg by DeepReader
  4. ChatGPT (provided the starting point to implement the loss function)
  5. Unofficial VICReg implementation in JAX

Citation

@inproceedings{bardes2022vicreg,
  author  = {Adrien Bardes and Jean Ponce and Yann LeCun},
  title   = {VICReg: Variance-Invariance-Covariance Regularization for Self-Supervised Learning},
  booktitle = {ICLR},
  year    = {2022},
}
