Skip to content

Study of a powerful Semi-Supervised Learning model, FixMatch, trained on the dataset CIFAR-10

Notifications You must be signed in to change notification settings

aillaud/Semi-Supervised-Learning-CIFAR10

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

6 Commits
 
 
 
 
 
 
 
 

Repository files navigation

Semi-Supervised Learning for image classification on CIFAR-10

The goal of this project was to study a simple but effective model for Semi-Supervised Learning : FixMatch.
This very good blogpost by Amit Chaudhary explains the main points of the approach, which allows to reach 95% top-1 accuracy for classification by using only 250 labelled images (25 randomly selected images per class) out of the 60 000 of the CIFAR-10 dataset.

Of particular interest to me was understanding what made this implementation so effective. Indeed, FixMatch relies on a Wide ResNet, which is not a particulary exotic model, and two key, but classical, principles of Semi-Supervised Learning :

  • Pseudo-labelling : assignation of "pseudo-label" to unlabelled data after prediction by the model, if the classifier is sufficiently confident about the class to which they belong.
  • Consistency regularization : the perturbation of an image must not modify its label : a severely perturbed image must therefore be classified in the same way as a weakly perturbed image.

I reimplemented a FixMatch version specific to CIFAR-10 with 250 labelled images by using kekmodel's Github as inspiration. This allowed me to gain a better understanding of the functioning of this algorithm and improve my software engineering skills

  • My reimplementation is available in the SSL_CIFAR python file
  • The results of my analysis is available in the Report PDF file
  • The Wide ResNet model used is available in the wideresnet Python file

References

  1. Kihyuk Sohn et al. FixMatch : Simplifying Semi-Supervised Learning with Consistency and Confidence, 2020, eprint : arXiv:2001.07685
  2. Ilya Loshchilov et Frank Hutter. SGDR : Stochastic Gradient Descent with Warm Restarts, 2016, eprint : arXiv:1608.03983
  3. Sergey Zagoruyko et Nikos Komodakis. Wide Residual Networks, 2016, eprint : arXiv:1605.07146
  4. Twan van Laarhoven. L2 Regularization versus Batch and Weight Normalization, 2017, eprint : arXiv: 1706.05350
  5. Nicholas Carlini, Úlfar Erlingsson et Nicolas Papernot. Distribution Density, Tails, and Outliers in Machine Learning : Metrics and Applications, 2019, eprint : arXiv:1910.13427

About

Study of a powerful Semi-Supervised Learning model, FixMatch, trained on the dataset CIFAR-10

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages