PyramidNet-caffe

Caffe implementation of the paper "Deep Pyramidal Residual Networks" (https://arxiv.org/abs/1610.02915).

This repository contains the code for the paper:

Dongyoon Han*, Jiwhan Kim*, and Junmo Kim, "Deep Pyramidal Residual Networks", CVPR 2017 (* equal contribution).

Abstract

Deep convolutional neural networks (DCNNs) have shown remarkable performance in image classification tasks in recent years. Generally, deep neural network architectures are stacks consisting of a large number of convolution layers, and they perform downsampling along the spatial dimension via pooling to reduce memory usage. At the same time, the feature map dimension (i.e., the number of channels) is sharply increased at downsampling locations, which is essential to ensure effective performance because it increases the capability of high-level attributes. Moreover, this also applies to residual networks and is very closely related to their performance. In this research, instead of using downsampling to achieve a sharp increase at each residual unit, we gradually increase the feature map dimension at all the units to involve as many locations as possible. This is discussed in depth together with our new insights as it has proven to be an effective design to improve the generalization ability. Furthermore, we propose a novel residual unit capable of further improving the classification accuracy with our new network architecture. Experiments on benchmark CIFAR datasets have shown that our network architecture has a superior generalization ability compared to the original residual networks.

Figure 1: Schematic illustration of (a) basic residual units, (b) bottleneck, (c) wide residual units, and (d) our pyramidal residual units.

Figure 2: Visual illustrations of (a) additive PyramidNet, (b) multiplicative PyramidNet, and (c) comparison of (a) and (b).

Results

ImageNet

Top-1 and Top-5 error rates of single-model, single-crop (224*224) on ImageNet dataset. We use the additive PyramidNet for our results.

Network	# of parameters	Output feat. dimension	Top-1 error	Top-5 error
ResNet-101	44.5M	2048	23.6	7.1
PyramidNet-101, alpha=250	23.5M	1256	23.24	6.59
ResNet-152	60.0M	2048	23.0	6.7
PyramidNet-152, alpha=200	26.0M	1056	22.44	6.14
PyramidNet-200, alpha=300	62.1M	1456	20.41	5.16
PyramidNet-200, alpha=450, Dropout (0.5)	116.4M	2056	20.27	5.49

Model files download: link

Notes

The ImageNet results are obtained using the uploaded codes.
When testing with our model, please do not forget to use a scale factor (0.017352).

Contact

Jiwhan Kim (jhkim89@kaist.ac.kr), Dongyoon Han (dyhan@kaist.ac.kr), Junmo Kim (junmo.kim@kaist.ac.kr)

Name		Name	Last commit message	Last commit date
Latest commit History 22 Commits
PyramidNet-101		PyramidNet-101
PyramidNet-152		PyramidNet-152
PyramidNet-200		PyramidNet-200
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

PyramidNet-101

PyramidNet-101

PyramidNet-152

PyramidNet-152

PyramidNet-200

PyramidNet-200

README.md

README.md

Repository files navigation

PyramidNet-caffe

Abstract

Results

ImageNet

Notes

Contact

About

Releases

Packages

jhkim89/PyramidNet-caffe

Folders and files

Latest commit

History

Repository files navigation

PyramidNet-caffe

Abstract

Results

ImageNet

Notes

Contact

About

Topics

Resources

Stars

Watchers

Forks