# Augmentation
This is a quick evaluation of how different data-augmentation methods perform on ImageNet-2012.

The architecture is similar to CaffeNet, with the following differences:

  1. Images are resized to small side = 128 for speed reasons.
  2. The fc6 and fc7 layers have 2048 neurons each instead of 4096.
  3. Networks are initialized with LSUV-init (sketched after this list).
  4. No LRN layers.
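LSUV-init (Mishkin & Matas, "All you need is a good init") pre-initializes each layer with orthonormal weights, then rescales them so the layer's output variance on a data batch is close to one. Below is a minimal NumPy sketch for plain fully-connected ReLU layers; the actual experiments use the Caffe implementation, and the function names and the simplified setting here are illustrative assumptions.

```python
import numpy as np

def orthonormal(shape, rng):
    """Orthonormal init: QR decomposition of a Gaussian matrix.
    Assumes shape[0] >= shape[1], so Q has the requested shape."""
    q, _ = np.linalg.qr(rng.standard_normal(shape))
    return q

def lsuv_init(weights, x, tol=0.1, max_iter=10):
    """LSUV: layer by layer, rescale the pre-orthonormalized weights
    until the layer's output variance on a data batch is ~1."""
    for i, w in enumerate(weights):
        for _ in range(max_iter):
            out = np.maximum(x @ w, 0.0)   # forward through layer i (ReLU)
            var = out.var()
            if abs(var - 1.0) < tol:
                break
            w /= np.sqrt(var)              # scale weights toward unit output variance
        weights[i] = w
        x = np.maximum(x @ w, 0.0)         # propagate the batch to the next layer
    return weights

# Example: a two-layer net initialized from a batch of 32 samples.
rng = np.random.default_rng(0)
ws = [orthonormal((256, 128), rng), orthonormal((128, 64), rng)]
ws = lsuv_init(ws, rng.standard_normal((32, 256)))
```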

Default augmentation: a random 128x128 crop from a 144xN image, plus a 50% random horizontal flip. Additional augmentations tested (see the sketch after this list):

  * dropout of the input data, dropout_ratio = 0.1
  * multiscale training
  * random 5-degree rotation
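For concreteness, here is a minimal Python (NumPy/Pillow) sketch of these pipelines. The actual experiments are run in Caffe; the function names, the `SCALES` table, and the exact dropout formulation below are illustrative assumptions, not the repository's code.

```python
import random
import numpy as np
from PIL import Image

# Small-side sizes and sampling weights for the multiscale variant,
# mirroring the "Multiscale" row in the table below.
SCALES = [(144, 0.5), (188, 0.2), (256, 0.2), (130, 0.1)]

def resize_small_side(img, size):
    """Resize so the smaller side equals `size`, keeping the aspect ratio."""
    w, h = img.size
    s = size / min(w, h)
    return img.resize((round(w * s), round(h * s)), Image.BILINEAR)

def augment(img, multiscale=False, rotate=False, input_dropout=0.0):
    """One training sample: random 128x128 crop plus 50% horizontal flip,
    with the optional extra augmentations evaluated in this benchmark."""
    img = img.convert("RGB")
    if rotate:
        img = img.rotate(random.uniform(0.0, 5.0), resample=Image.BILINEAR)
    if multiscale:
        sizes, weights = zip(*SCALES)
        side = random.choices(sizes, weights=weights)[0]
    else:
        side = 144
    img = resize_small_side(img, side)
    w, h = img.size
    x, y = random.randint(0, w - 128), random.randint(0, h - 128)
    img = img.crop((x, y, x + 128, y + 128))
    if random.random() < 0.5:
        img = img.transpose(Image.FLIP_LEFT_RIGHT)
    arr = np.asarray(img, dtype=np.float32)
    if input_dropout > 0.0:
        # Inverted dropout on raw pixels, scaled as in Caffe's Dropout layer.
        keep = np.random.random(arr.shape) >= input_dropout
        arr = arr * keep / (1.0 - input_dropout)
    return arr
```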

## Train augmentation

| Name | Accuracy | LogLoss | Comments |
|------|----------|---------|----------|
| Default | 0.471 | 2.36 | Random flip, random 128x128 crop from 144xN, N > 144 |
| Drop 0.1 | 0.306 | 3.56 | + input dropout 10%; not finished, result at 186K iterations |
| Multiscale | 0.462 | 2.40 | Random flip, random 128x128 crop from 144xN (50%), 188xN (20%), 256xN (20%), 130xN (10%) |
| 5 deg rot | 0.448 | 2.47 | Random rotation in [0..5] degrees |

Prototxt, logs

[Figures: CaffeNet128 test accuracy, test loss, and train loss]