GitHub - jtkim-kaist/ram_modified: "Recurrent Models of Visual Attention" in TensorFlow

Tensorflow Implementation of Recurrent Attention Model (RAM)

Author

Requirement

tensorflow rc 1.1.0-rc0

Description

code: 'ram_modified.py'

This project is modified version of https://github.com/jlindsey15/RAM. The critical problem of last implemetnation is that the location network cannot learn because of tf.stop_gradient implementation so that they got just '94% accuracy'. It seems relatively bad compared to the result of paper. If 'tf.stop_gradient' was commented, the classification result was very bad. The reason I think is that the problem is originated from sharing the gradient flow through location, core, glimpse network. Through gradient sharing, gradients of classification part are corrupted by gradients of reinforcement part so that classification result become very bad. (If someone want to share gradient, the weighted loss should be needed. please refer https://arxiv.org/pdf/1412.7755.pdf) According to their post research, 'Multiple Object Recognition with Visual Attention' (https://arxiv.org/pdf/1412.7755.pdf) they softly separate location network and others through multi-layer RNN. From this, I assume that sharing the gradient through whole network is not a good idea so separate them, and finally got a good result. In summary, the learning stretegy is as follow.

location network, baseline network : learn with gradients of reinforcement learning only.
glimpse network, core network : learn with gradients of supervised learning only.

Thank you!

Result

After 600,000 epoch, I got about 98% accuracy.

Reference

Recurrent Models of Visual Attention

http://papers.nips.cc/paper/5542-recurrent-models-of-visual-attention.pdf

https://arxiv.org/pdf/1412.7755.pdf

Name		Name	Last commit message	Last commit date
Latest commit History 24 Commits
__pycache__		__pycache__
.gitignore		.gitignore
README.md		README.md
ram.py		ram.py
ram.py.bak		ram.py.bak
ram_modified.py		ram_modified.py
ram_up.py		ram_up.py
report.txt		report.txt
tf_mnist_loader.py		tf_mnist_loader.py
tf_mnist_loader.pyc		tf_mnist_loader.pyc
tf_upgrade.py		tf_upgrade.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

pycache

pycache

.gitignore

.gitignore

README.md

README.md

ram.py

ram.py

ram.py.bak

ram.py.bak

ram_modified.py

ram_modified.py

ram_up.py

ram_up.py

report.txt

report.txt

tf_mnist_loader.py

tf_mnist_loader.py

tf_mnist_loader.pyc

tf_mnist_loader.pyc

tf_upgrade.py

tf_upgrade.py

Repository files navigation

Tensorflow Implementation of Recurrent Attention Model (RAM)

Author

Requirement

Description

Result

Reference

About

Releases

Packages

Languages

jtkim-kaist/ram_modified

Folders and files

Latest commit

History

Repository files navigation

Tensorflow Implementation of Recurrent Attention Model (RAM)

Author

Requirement

Description

Result

Reference

About

Topics

Resources

Stars

Watchers

Forks

Languages