NasNet 2018

This code is a reimplementation of "Learning Transferable Architectures for Scalable Image Recognition", including the training process of the controller. It implements three algorithms for architecture search: Random Search, Policy Gradient, and PPO.
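
For context, below is a minimal, hypothetical sketch of a NASNet-style controller: an LSTM samples a sequence of discrete architecture decisions and is updated with a policy-gradient (REINFORCE) step, using the child model's validation accuracy as the reward. All class and variable names here are illustrative assumptions, not this repository's actual API.

# Hypothetical sketch of an LSTM controller trained with REINFORCE.
# Sizes, names, and the reward value are assumptions for illustration only.
import torch
import torch.nn as nn

class Controller(nn.Module):
    def __init__(self, num_ops=5, num_decisions=10, hidden=64):
        super().__init__()
        self.hidden = hidden
        self.num_decisions = num_decisions
        self.embed = nn.Embedding(num_ops, hidden)
        self.lstm = nn.LSTMCell(hidden, hidden)
        self.fc = nn.Linear(hidden, num_ops)

    def sample(self):
        h = torch.zeros(1, self.hidden)
        c = torch.zeros(1, self.hidden)
        x = torch.zeros(1, self.hidden)        # start token for the first step
        actions, log_probs = [], []
        for _ in range(self.num_decisions):
            h, c = self.lstm(x, (h, c))
            dist = torch.distributions.Categorical(logits=self.fc(h))
            a = dist.sample()
            actions.append(a.item())
            log_probs.append(dist.log_prob(a))
            x = self.embed(a)                  # feed the chosen op back in
        return actions, torch.stack(log_probs).sum()

controller = Controller()
optimizer = torch.optim.Adam(controller.parameters(), lr=3e-4)
actions, log_prob = controller.sample()
reward = 0.97                                  # placeholder: child model's validation accuracy
loss = -log_prob * reward                      # REINFORCE: maximize expected reward
optimizer.zero_grad()
loss.backward()
optimizer.step()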

Requirements

Python >= 3.6.7, PyTorch == 0.4.0

Architecture Search

python train_search.py --cutout --algorithm RS  #use random search
python train_search.py --cutout --algorithm PG  #use policy gradient
python train_search.py --cutout --algorithm PPO #use PPO

Note that the validation performance in this step does not indicate the final performance of an architecture; the obtained genotype/architecture must be trained from scratch using a full-sized model. Also, the default setting trains with 20 processes on 3 GPUs. To change the number of processes to 10, run:

python train_search.py --cutout --episodes 10

or modify the code in random_search.py, policy_gradient.py, and PPO.py. A sketch of how episodes could map onto worker processes is shown below.
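
For illustration only, the following sketch shows one plausible way to spread episodes across worker processes and GPUs. The function names and structure are assumptions, not the repository's actual implementation.

# Hypothetical sketch: one worker process per episode, GPUs assigned round-robin.
import torch.multiprocessing as mp

def evaluate_arch(arch, gpu_id, results, idx):
    # Placeholder: train the sampled child model briefly on GPU `gpu_id`
    # and report its validation accuracy; here we just record a dummy value.
    results[idx] = 0.0

def run_episodes(archs, num_gpus=3):
    results = mp.Manager().dict()
    procs = []
    for i, arch in enumerate(archs):            # one process per episode
        p = mp.Process(target=evaluate_arch, args=(arch, i % num_gpus, results, i))
        p.start()
        procs.append(p)
    for p in procs:
        p.join()
    return [results[i] for i in range(len(archs))]

if __name__ == "__main__":
    rewards = run_episodes([None] * 10)         # e.g. 10 episodes, as with --episodes 10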

Architecture Evaluation

Due to time and compute constraints, I did not train the candidate genotypes/architectures from scratch.

Results

python draw.py

(Figure: search_process — search progress of the three algorithms.)

The plot shows that RL-based search outperforms random search, and that PPO is more stable and converges faster than vanilla policy gradient.
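
As a rough illustration, the snippet below shows the kind of comparison plot draw.py could produce. The file names and log format are assumptions, not the repository's actual data layout.

# Hypothetical sketch: plot one reward curve per search algorithm.
import numpy as np
import matplotlib.pyplot as plt

curves = {"RS": "rs_rewards.npy", "PG": "pg_rewards.npy", "PPO": "ppo_rewards.npy"}
for label, path in curves.items():
    rewards = np.load(path)                     # assumed: one validation accuracy per episode
    plt.plot(np.arange(len(rewards)), rewards, label=label)
plt.xlabel("episode")
plt.ylabel("validation accuracy")
plt.legend()
plt.savefig("search_process.png")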
