rationale behind learning rate #113

simin75simin · 2021-12-10T06:24:08Z

I was able to train your model on my own machine and get robust webcam estimations.
What is the rationale behind disabling training (setting the learning rate to 0) for the first conv and bn layers of your ResNet backbone, and giving a 5x learning rate to the three fc layers?
Also, would you suggest more epochs for a smaller model? I need to make this work on a peripheral device for work.
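
For concreteness, here is a minimal sketch of the setup I am asking about. It is not copied from the repo: `build_param_groups` and the head names `fc_a`, `fc_b`, `fc_c` are hypothetical placeholders, and `conv1`/`bn1` assume torchvision-style ResNet parameter names. It only sorts named parameters into three groups with different learning rates.

```python
def build_param_groups(
    named_params,
    base_lr,
    frozen_prefixes=("conv1.", "bn1."),            # assumed: torchvision-style ResNet stem names
    head_prefixes=("fc_a.", "fc_b.", "fc_c."),     # hypothetical names for the three fc heads
    head_mult=5.0,
):
    """Split (name, param) pairs into three optimizer parameter groups.

    Returns a list of dicts with "params" and "lr" keys, the per-group
    format that torch.optim optimizers accept.
    """
    frozen, head, rest = [], [], []
    for name, param in named_params:
        # Match on the top-level module prefix only, so that
        # "layer1.0.conv1.weight" is NOT treated as the stem conv.
        if name.startswith(frozen_prefixes):
            frozen.append(param)
        elif name.startswith(head_prefixes):
            head.append(param)
        else:
            rest.append(param)
    return [
        {"params": frozen, "lr": 0.0},               # stem: learning rate 0
        {"params": rest, "lr": base_lr},             # rest of the backbone: base rate
        {"params": head, "lr": base_lr * head_mult}, # fc heads: 5x base rate
    ]
```

With a real model, the result of `build_param_groups(model.named_parameters(), base_lr)` could be passed as the first argument to an optimizer such as `torch.optim.SGD`.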

Thanks very much. Great work.
