Semi-Supervised Learning for image classification on CIFAR-10

The goal of this project was to study a simple but effective model for Semi-Supervised Learning : FixMatch.
This very good blogpost by Amit Chaudhary explains the main points of the approach, which allows to reach 95% top-1 accuracy for classification by using only 250 labelled images (25 randomly selected images per class) out of the 60 000 of the CIFAR-10 dataset.

Of particular interest to me was understanding what made this implementation so effective. Indeed, FixMatch relies on a Wide ResNet, which is not a particulary exotic model, and two key, but classical, principles of Semi-Supervised Learning :

Pseudo-labelling : assignation of "pseudo-label" to unlabelled data after prediction by the model, if the classifier is sufficiently confident about the class to which they belong.
Consistency regularization : the perturbation of an image must not modify its label : a severely perturbed image must therefore be classified in the same way as a weakly perturbed image.

I reimplemented a FixMatch version specific to CIFAR-10 with 250 labelled images by using kekmodel's Github as inspiration. This allowed me to gain a better understanding of the functioning of this algorithm and improve my software engineering skills

My reimplementation is available in the SSL_CIFAR python file
The results of my analysis is available in the Report PDF file
The Wide ResNet model used is available in the wideresnet Python file

References

Kihyuk Sohn et al. FixMatch : Simplifying Semi-Supervised Learning with Consistency and Confidence, 2020, eprint : arXiv:2001.07685
Ilya Loshchilov et Frank Hutter. SGDR : Stochastic Gradient Descent with Warm Restarts, 2016, eprint : arXiv:1608.03983
Sergey Zagoruyko et Nikos Komodakis. Wide Residual Networks, 2016, eprint : arXiv:1605.07146
Twan van Laarhoven. L2 Regularization versus Batch and Weight Normalization, 2017, eprint : arXiv: 1706.05350
Nicholas Carlini, Úlfar Erlingsson et Nicolas Papernot. Distribution Density, Tails, and Outliers in Machine Learning : Metrics and Applications, 2019, eprint : arXiv:1910.13427

Name		Name	Last commit message	Last commit date
Latest commit History 6 Commits
README.md		README.md
Report.pdf		Report.pdf
SSL_CIFAR.py		SSL_CIFAR.py
wideresnet.py		wideresnet.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

README.md

README.md

Report.pdf

Report.pdf

SSL_CIFAR.py

SSL_CIFAR.py

wideresnet.py

wideresnet.py

Repository files navigation

Semi-Supervised Learning for image classification on CIFAR-10

References

About

Releases

Packages

Languages

aillaud/Semi-Supervised-Learning-CIFAR10

Folders and files

Latest commit

History

Repository files navigation

Semi-Supervised Learning for image classification on CIFAR-10

References

About

Topics

Resources

Stars

Watchers

Forks

Languages