Skip to content

ercanserteli/dl2019

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

35 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Net2Net in Knet

This is the repo for my 2019 Deep Learning term project. This project consists of a replication of the paper Net2Net: Accelerating Learning via Knowledge Transfer using Julia and Knet.

Net2Net is a set of methods for growing a wider and/or deeper 'student' network from a trained 'teacher' network while preserving its function. The student network performs immediately as well as the teacher while possessing greater capacity that can be utilized by further training. Training a Net2Net initialized network is shown to converge faster than training from scratch.

While the original paper ran experiments using the Inception-BN network on the ImageNet dataset, this work uses a modified, smaller version of the network on the CIFAR-10 dataset. However, the original Inception-BN architecture is available in inception.jl, and can be used for experiments.

Check the following links for additional information:

Usage

To run the tests and experiments, simply run source.jl.

To use the Net2WiderNet or Net2DeeperNet operations, call the wider_* and deeper_* functions defined in wider.jl and deeper.jl. Refer to the docstrings for the usage of those functions.

models.jl and inception.jl files define strucures and functions for building models that can be used with the Net2WiderNet and Net2DeeperNet functions.

About

Net2Net in Julia, Knet

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages