Few-Shot-Classification (FSL)

The objective of this project is to evaluate the feasibility of using pre-trained feature extractors to quickly categorize products from images when only limited data is available. To do that, we explore techniques based on metric learning (siamese and prototypical networks) and on meta-learning (model-agnostic meta-learning). Our results are presented in our defense presentation and report.

Dataset

We use real images from seven luxury brands. The dataset, provided by Navee, contains 3,967 classes across those brands; each class represents a fashion article and contains about 5 images. Below is an example of the images available for three different articles.

Image Retrieval task

We approach FSL as a retrieval task evaluated with mean average precision (mAP): our systems embed images in a space where similar articles should be projected close to one another. During training, mAP is computed to monitor the performance of our networks.
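As a rough illustration, the sketch below computes mAP for a batch of query embeddings against a gallery of embeddings, assuming L2-normalised embeddings and one integer article label per image; the function and variable names are ours, not the project's.

```python
# Minimal retrieval-mAP sketch (illustrative, not the project's exact evaluation code).
import torch

def mean_average_precision(query_emb, gallery_emb, query_labels, gallery_labels):
    """Rank the gallery by cosine similarity for each query and average the
    precision at every rank where a relevant (same-article) image appears."""
    sims = query_emb @ gallery_emb.T                     # (Q, G) cosine similarities
    ap_list = []
    for i in range(sims.size(0)):
        order = sims[i].argsort(descending=True)
        relevant = (gallery_labels[order] == query_labels[i]).float()
        if relevant.sum() == 0:
            continue                                     # no positives for this query
        ranks = torch.arange(1, relevant.numel() + 1, dtype=torch.float)
        precision_at_k = relevant.cumsum(0) / ranks
        ap_list.append((precision_at_k * relevant).sum() / relevant.sum())
    return torch.stack(ap_list).mean()
```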

Siamese Networks

Siamese networks consist of two or more identical sub-networks that share the same architecture and parameters and undergo the same updates during training. Two main losses have been used to train our models: the contrastive loss [1] and the triplet loss, notably developed in [2]. The former operates on pairs of images, the latter on triplets. Both losses aim to pull similar images close together and push dissimilar images far from one another in the embedding space.
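The sketch below shows minimal versions of both losses, assuming the embeddings are already L2-normalised; the margin values are illustrative defaults rather than the settings used in this project.

```python
# Minimal contrastive / triplet loss sketch (illustrative hyper-parameters).
import torch
import torch.nn.functional as F

def contrastive_loss(z1, z2, same_class, margin=1.0):
    """Pull pairs of the same article together and push different articles
    at least `margin` apart in the embedding space [1]."""
    d = F.pairwise_distance(z1, z2)
    pos = same_class * d.pow(2)
    neg = (1 - same_class) * F.relu(margin - d).pow(2)
    return 0.5 * (pos + neg).mean()

def triplet_loss(anchor, positive, negative, margin=0.2):
    """Keep the anchor closer to the positive than to the negative by a margin [2];
    equivalent in spirit to torch.nn.TripletMarginLoss."""
    d_pos = F.pairwise_distance(anchor, positive)
    d_neg = F.pairwise_distance(anchor, negative)
    return F.relu(d_pos - d_neg + margin).mean()
```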

Prototypical Networks

Prototypical networks are a metric-learning technique which, in our implementation, uses a ResNet-50 backbone to map fashion images into a metric space. Classification is then performed by computing a prototype (the mean embedding) for each category and measuring the distance from the query image to each prototype. This simple method performs well in the limited-data regime.
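As an illustration, a single few-shot episode could look like the sketch below, assuming `backbone` is a feature extractor (e.g. a ResNet-50 with its classification head removed) that returns one embedding per image; the names are illustrative, not the repository's API.

```python
# Prototypical-network episode sketch (illustrative names and shapes).
import torch

def prototypical_logits(backbone, support_images, support_labels, query_images, n_classes):
    """Compute one prototype (mean embedding) per class from the support set,
    then score each query by negative squared Euclidean distance to the prototypes."""
    support_emb = backbone(support_images)               # (N_support, D)
    query_emb = backbone(query_images)                   # (N_query, D)
    prototypes = torch.stack([
        support_emb[support_labels == c].mean(dim=0) for c in range(n_classes)
    ])                                                    # (n_classes, D)
    distances = torch.cdist(query_emb, prototypes)        # (N_query, n_classes)
    return -distances.pow(2)                              # higher score = closer prototype
```

The returned scores can be fed directly to a cross-entropy loss during training or arg-maxed at inference time.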

Model-Agnostic Meta-Learning

This is an implementation of the paper by Finn et al. [3], which uses meta-learning to train a model on batches of tasks for few-shot image classification. Although the method is model-agnostic (it applies to any model trained with gradient descent), we use a convolutional network. Each meta-training step first adapts the model to every individual task, then minimizes the sum of the post-adaptation losses. This implementation is heavily inspired by this one.
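The sketch below outlines one meta-training step, assuming each task provides a support/query split and using `torch.func.functional_call` for the adapted forward pass; the learning rates and function names are illustrative, not the exact setup of this repository.

```python
# Compact second-order MAML step sketch (illustrative, assumes PyTorch >= 2.0).
import torch
import torch.nn.functional as F
from torch.func import functional_call

def maml_step(model, tasks, meta_optimizer, inner_lr=0.01):
    """One meta-update: adapt to each task on its support set (inner loop),
    then update the shared initialisation on the summed query losses (outer loop)."""
    meta_loss = 0.0
    params = dict(model.named_parameters())
    for support_x, support_y, query_x, query_y in tasks:
        # Inner loop: one gradient step specialised to this task only.
        support_loss = F.cross_entropy(functional_call(model, params, (support_x,)), support_y)
        grads = torch.autograd.grad(support_loss, params.values(), create_graph=True)
        adapted = {name: p - inner_lr * g for (name, p), g in zip(params.items(), grads)}
        # Outer-loop term: loss of the adapted parameters on the query set.
        meta_loss = meta_loss + F.cross_entropy(functional_call(model, adapted, (query_x,)), query_y)
    meta_optimizer.zero_grad()
    meta_loss.backward()
    meta_optimizer.step()
    return meta_loss.detach()
```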

References

[1] Hadsell et al. (2006). Dimensionality Reduction by Learning an Invariant Mapping

[2] Schroff et al. (2015). FaceNet: A Unified Embedding for Face Recognition and Clustering

[3] Finn et al. (2017). Model-Agnostic Meta-Learning for Fast Adaptation of Deep Networks
