Active Learning

In this Python notebook, several Active Learning strategies, combined with diversity criteria, are compared to find the best set of labeled observations for training an SVM classifier.

Problem description

The Semeion Handwritten Digit Data Set is a database of handwritten digits. Each record represents one handwritten digit, originally scanned with a gray scale of 256 levels (2^8). The goal is to reach the best possible classification result while minimizing the number of labeled observations used to train the model, by selecting the samples that provide the most information.

Active Learning strategies

MS (margin sampling)
MCLU (multiclass-level uncertainty)
SSC (significance space construction)
nEQB (normalized entropy query-by-bagging)
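
As a rough illustration of how three of the strategies listed above rank unlabeled samples, the sketch below computes MS, MCLU and nEQB scores with NumPy only. It is not taken from the notebook: the function names, the toy decision values and the toy committee votes are all assumptions made for this example. In practice the per-class decision values could come from something like scikit-learn's `SVC.decision_function` with `decision_function_shape="ovr"`, and the committee votes from SVMs trained on bootstrap resamples of the labeled set.

```python
import numpy as np


def margin_sampling_scores(decision_values):
    """MS: distance to the closest class boundary (smaller = more uncertain)."""
    return np.min(np.abs(decision_values), axis=1)


def mclu_scores(decision_values):
    """MCLU: gap between the two largest decision values (smaller = more uncertain)."""
    top_two = np.sort(decision_values, axis=1)[:, -2:]
    return top_two[:, 1] - top_two[:, 0]


def neqb_scores(committee_votes, n_classes):
    """nEQB: entropy of the committee's votes, normalized by the log of the
    number of distinct classes predicted (larger = more disagreement)."""
    scores = np.zeros(len(committee_votes))
    for i, votes in enumerate(committee_votes):
        counts = np.bincount(votes, minlength=n_classes)
        p = counts[counts > 0] / counts.sum()
        if len(p) > 1:  # a unanimous committee keeps a score of 0
            scores[i] = -(p * np.log(p)).sum() / np.log(len(p))
    return scores


def select_lowest(scores, batch_size):
    """Indices of the batch_size samples with the lowest scores."""
    return np.argsort(scores, kind="stable")[:batch_size]


# Toy decision values (assumed): 4 unlabeled samples, 3 classes.
decision_values = np.array([
    [2.0, -1.5, -1.8],   # confidently class 0
    [0.1, -0.2, -1.0],   # close to two boundaries
    [-0.9, 1.2, 1.1],    # classes 1 and 2 nearly tied
    [-2.0, -1.0, 3.0],   # confidently class 2
])
ms_pick = select_lowest(margin_sampling_scores(decision_values), 2)
mclu_pick = select_lowest(mclu_scores(decision_values), 2)

# Toy committee votes (assumed): 3 samples, 4 committee members.
votes = np.array([
    [0, 0, 0, 0],
    [0, 1, 0, 1],
    [0, 1, 2, 2],
])
neqb = neqb_scores(votes, n_classes=3)
neqb_pick = int(np.argmax(neqb))
```

Note that MS and MCLU pick the samples with the lowest scores, while nEQB picks the ones with the highest. SSC is left out of the sketch.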

Diversity criteria

MAO (most ambiguous and orthogonal)
MAO lambda
Diversity by clustering
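
To illustrate how a diversity criterion changes a batch, here is a minimal sketch of a greedy MAO-style selection with a lambda trade-off. It is an assumption-laden example, not the notebook's code: the function names are hypothetical, similarity is measured as the absolute cosine between feature vectors (a kernel-based similarity would be the more usual choice with an SVM), and the uncertainty scores are assumed to be on a scale comparable to the similarity values.

```python
import numpy as np


def abs_cosine_similarity(X):
    """Pairwise |cos| between rows of X: 0 = orthogonal, 1 = same direction."""
    norms = np.linalg.norm(X, axis=1, keepdims=True)
    Xn = X / np.clip(norms, 1e-12, None)
    return np.abs(Xn @ Xn.T)


def greedy_diverse_batch(X, uncertainty, batch_size, lam=0.5):
    """Greedily build a batch that is both uncertain and diverse.

    uncertainty: smaller = more uncertain (e.g. MS or MCLU scores).
    lam: weight of uncertainty against similarity to the samples already
         selected; lam=1.0 ignores diversity entirely.
    """
    uncertainty = np.asarray(uncertainty, dtype=float)
    sim = abs_cosine_similarity(np.asarray(X, dtype=float))
    # Start from the single most uncertain sample.
    selected = [int(np.argmin(uncertainty))]
    while len(selected) < min(batch_size, len(uncertainty)):
        # Similarity of every candidate to its closest already-selected sample.
        max_sim = sim[:, selected].max(axis=1)
        cost = lam * uncertainty + (1.0 - lam) * max_sim
        cost[selected] = np.inf  # never pick the same sample twice
        selected.append(int(np.argmin(cost)))
    return selected


# Toy candidates (assumed): samples 0 and 1 are almost identical.
X = np.array([[1.0, 0.0], [0.99, 0.1], [0.0, 1.0], [-1.0, 0.0]])
uncertainty = np.array([0.10, 0.12, 0.30, 0.90])

diverse_batch = greedy_diverse_batch(X, uncertainty, batch_size=2, lam=0.5)
uncertain_only = greedy_diverse_batch(X, uncertainty, batch_size=2, lam=1.0)
```

With `lam=1.0` the batch contains the two near-duplicate samples 0 and 1. With `lam=0.5` the second pick moves to sample 2, which is less uncertain but orthogonal to the first.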
