
train_op #67

Closed
aaalgo opened this issue Jun 7, 2018 · 1 comment

aaalgo commented Jun 7, 2018

The following is what I found at line 206 of main.py:

opt = tf.train.AdamOptimizer(0.0001).minimize(loss, var_list=[var for var in tf.trainable_variables()])

However, this doesn't properly handle batch_norm, and it is recommended that opt be created by

op = slim.learning.create_train_op(loss, optimizer, global_step=global_step)

Please refer to this file and search for UPDATE_OPS:
https://github.com/tensorflow/tensorflow/blob/master/tensorflow/contrib/slim/python/slim/learning.py

Basically, batch_norm needs UPDATE_OPS to function, but optimizer.minimize doesn't handle that.
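For reference, the usual TF1 workaround is to make the train op depend on the UPDATE_OPS collection explicitly. A minimal sketch (loss here stands for whatever loss tensor main.py builds):

import tensorflow as tf

# Collect the batch_norm moving-average update ops and run them
# together with every optimizer step.
update_ops = tf.get_collection(tf.GraphKeys.UPDATE_OPS)
with tf.control_dependencies(update_ops):
    opt = tf.train.AdamOptimizer(0.0001).minimize(
        loss, var_list=tf.trainable_variables())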

Please correct me if I'm wrong.

@GeorgeSeif
Owner

Hmm, this is very interesting. I also found this issue, which references the same thing. It seems to have been closed and resolved a while ago though, so I'm not too sure.

However, I haven't had any issues training the networks, and many others were able to train just fine. Basically, I would have assumed the results would be much worse if batch norm wasn't working.

Perhaps the best thing to do is a quick test: train with each setup and see if there is a big difference in results. Maybe just a few epochs to get some quick results. Even after such a short time, if batch norm was indeed "off" before, we should see a difference.
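(As an even quicker sanity check, before running any training one could simply look at whether the graph has pending update ops after the model is built. This is just a sketch and assumes the graph from main.py has already been constructed:)

import tensorflow as tf

# After the model graph has been built:
update_ops = tf.get_collection(tf.GraphKeys.UPDATE_OPS)
print("Pending UPDATE_OPS:", len(update_ops))
# If this is non-zero and the train op has no dependency on these ops,
# the batch_norm moving averages are never updated during training.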

Thanks for letting me know!
