
NAS-Bench-360

This codebase reproduces the empirical evaluations reported in the associated paper for NAS-Bench-360, a benchmark for evaluating neural architecture search on diverse tasks.

Resources

Oct 2022: Please use this link for all dataset and precompute downloads as we are transitioning our AWS resources to the Open Data Sponsorship program:

Shared Drive

Datasets in the benchmark with download links (old links; use the shared drive above!):

Precomputed evaluation benchmark files on the NB201 search space (following NATS-Bench):

For the full outputs (including training logs and all weights and checkpoints), which total roughly 40 GB, please contact the administrators.
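The precompute files store per-architecture training and evaluation statistics in a NATS-Bench-style format. The sketch below simply inspects one downloaded file; the file name and the exact record layout are assumptions, so check the precompute directory and the paper for the actual format.

```python
# Minimal sketch: inspect a downloaded precompute file. The file name and the
# record layout are assumptions -- see the precompute/ directory and the paper
# for the actual format used by NAS-Bench-360.
import pickle

with open("nb201_ninapro_precompute.pkl", "rb") as f:  # hypothetical file name
    records = pickle.load(f)

print(type(records), len(records))
# Peek at one entry to see which per-architecture metrics are stored.
sample = next(iter(records.items())) if isinstance(records, dict) else records[0]
print(sample)
```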

Prerequisites for main NAS experiments

We use the open-source Determined platform to implement the experiment code.

Install Determined: pip install determined

A master instance is required:

  • for local deployment (requires Docker):

    • to start the master: det deploy local cluster-up
    • access the WebUI at http://localhost:8080
    • to shut down: det deploy local cluster-down
  • for AWS deployment (preferred):

    • install the AWS CLI
    • run aws configure and note your AWS EC2 key pair name
    • to start the master: det deploy aws up --cluster-id CLUSTER_ID --keypair KEYPAIR_NAME
    • access the WebUI at {ec2-instance-uri}:8080
    • to shut down: det deploy aws down --cluster-id CLUSTER_ID

For an end-to-end example of running experiments with Determined, refer to this video.

When running experiments, a Docker image containing all required Python packages is automatically pulled from Docker Hub, so you don't need to install them yourself, and it ensures reproducibility.
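Once a master is running, individual experiments are submitted to it, e.g. with the CLI command det experiment create <config.yaml> <model_dir>. The snippet below is a minimal sketch of the equivalent submission through Determined's Python SDK; the master address and the config path are illustrative placeholders, not files guaranteed to exist in this repository.

```python
# Minimal sketch: submitting an experiment via Determined's Python SDK.
# The master address and config path below are placeholder assumptions --
# point them at your running master and at one of the experiment .yaml
# configs shipped in darts/, densenas/, etc.
from determined.experimental import client

client.login(master="http://localhost:8080")  # or your EC2 master URI

experiment = client.create_experiment(
    config="darts/cifar100.yaml",  # hypothetical config path
    model_dir="darts",             # directory containing the model code
)
print("submitted experiment", experiment.id)
```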

Main NAS Experiments Reproduction

We provide PyTorch implementations of two state-of-the-art NAS algorithms: GAEA PC-DARTS (paper link) and DenseNAS (paper link). Each lives in the folder with the corresponding name: darts/ for GAEA PC-DARTS and densenas/ for DenseNAS.

To run these algorithms on 1D tasks, we adapted their search spaces; the corresponding experiments are in darts_1d/ for GAEA PC-DARTS (1D) and densenas_1d/ for DenseNAS (1D).

Two task-specific NAS methods are implemented: Auto-DeepLab for dense prediction tasks in autodeeplab/ and AMBER for 1D prediction tasks in AMBER/.

We also implement procedures for running and tuning hyperparameters of the Wide ResNet backbone architecture (paper link) in backbone/. The 1D-customized Wide ResNet is in backbone_1d/.

To change the random seed for an experiment, edit the value under reproducibility: experiment_seed: in that experiment's configuration script.
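The seed can also be overridden programmatically before submission; in the sketch below the config file name is a hypothetical placeholder.

```python
# Minimal sketch: override reproducibility.experiment_seed in an experiment
# config before submitting it. "darts/cifar100.yaml" is a hypothetical path --
# substitute the configuration script of the experiment you want to run.
import yaml  # PyYAML

with open("darts/cifar100.yaml") as f:
    cfg = yaml.safe_load(f)

cfg.setdefault("reproducibility", {})["experiment_seed"] = 1  # new seed value

with open("darts/cifar100_seed1.yaml", "w") as f:
    yaml.safe_dump(cfg, f)
```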

Additional Baseline Experiments

We also evaluate the performance of non-NAS baselines for comparison:

  • Expert architectures for each dataset: see expert.
  • Perceiver-IO: see perceiver-io.
  • XGBoost: see xgboost (a minimal sketch follows this list).
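As a rough illustration of the XGBoost baseline style, the sketch below trains a gradient-boosted tree classifier on flattened inputs. It is not the exact script in xgboost/, and the synthetic data stands in for a real NAS-Bench-360 task.

```python
# Illustrative sketch of an XGBoost point-prediction baseline (not the exact
# script in xgboost/). Synthetic data stands in for a real task; inputs are
# flattened to feature vectors before fitting the gradient-boosted trees.
import numpy as np
from xgboost import XGBClassifier

x_train = np.random.rand(512, 1, 28, 28)
y_train = np.random.randint(0, 10, size=512)
x_test = np.random.rand(128, 1, 28, 28)
y_test = np.random.randint(0, 10, size=128)

model = XGBClassifier(n_estimators=200, max_depth=6, tree_method="hist")
model.fit(x_train.reshape(len(x_train), -1), y_train)
print("test accuracy:", model.score(x_test.reshape(len(x_test), -1), y_test))
```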

Precomputed results on NinaPro and DarcyFlow

  • See the precompute directory for the NAS algorithms from NATS-Bench and for reproducing the precomputed benchmark.

Baselines

Performance of NAS methods and baselines across NAS-Bench-360. Methods are divided into efficient methods (e.g., DenseNAS and fixed WRN) that take 1-10 GPU-hours, more expensive methods (e.g., DARTS and tuned WRN) that take 10-100+ GPU-hours, and specialized methods (Auto-DL and AMBER). All results are averages over three random seeds, and lower is better for all metrics.

Citation

If you find this project helpful, please consider citing our paper:

@inproceedings{
  tu2022nasbench,
  title={{NAS}-Bench-360: Benchmarking Neural Architecture Search on Diverse Tasks},
  author={Renbo Tu and Nicholas Roberts and Mikhail Khodak and Junhong Shen and Frederic Sala and Ameet Talwalkar},
  booktitle={Thirty-sixth Conference on Neural Information Processing Systems Datasets and Benchmarks Track},
  year={2022},
  url={https://openreview.net/forum?id=xUXTbq6gWsB}
}

Thanks!