About Search Time #7

Open
zkjst110 opened this issue Jul 15, 2019 · 0 comments

zkjst110 commented Jul 15, 2019

The paper states:

We set the number of phases np to three and the number of nodes in each phase no to six. During architecture search, we limit the number of filters (channels) in any node to 16 for each one of the generated network architecture. We then train them on our training set using standard stochastic gradient descent (SGD) back-propagation algorithm and a cosine annealing learning rate schedule. Our initial learning rate is 0.025 and we train for 25 epochs, which takes about 9 minutes on a NVIDIA 1080Ti GPU implementation in PyTorch.
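To make the quoted training schedule concrete, here is a minimal sketch of a cosine annealing learning rate over 25 epochs starting from 0.025. It assumes a minimum learning rate of 0 and one update per epoch, neither of which is stated in the excerpt, and the function name `cosine_annealing_lr` is hypothetical rather than taken from the repository.

```python
import math

# Values quoted from the paper excerpt.
BASE_LR = 0.025
TOTAL_EPOCHS = 25

def cosine_annealing_lr(epoch, total_epochs=TOTAL_EPOCHS, base_lr=BASE_LR, eta_min=0.0):
    """Closed-form cosine annealing learning rate for a 0-indexed epoch.

    eta_min=0.0 is an assumption; the excerpt does not give a floor value.
    """
    cosine = math.cos(math.pi * epoch / total_epochs)
    return eta_min + 0.5 * (base_lr - eta_min) * (1.0 + cosine)

# Learning rate used in each of the 25 training epochs.
schedule = [cosine_annealing_lr(e) for e in range(TOTAL_EPOCHS)]

if __name__ == "__main__":
    for epoch, lr in enumerate(schedule):
        print(f"epoch {epoch:2d}: lr = {lr:.6f}")
```

The rate starts at the initial value and decays smoothly toward the assumed floor, so most of the 25 epochs run at a rate well below 0.025.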

However, in the code, the default value of --n_nodes (the number of nodes per phase) is four.
I set the number of channels to 16 and --n_nodes to six, but the search is still slow. Could you share the exact configuration that gives the roughly 9-minute training time?
Also, I find that the search code runs slower on multiple GPUs than on a single GPU. Can you explain why?
Thank you very much!
