Minimal Neural Machine Translation

Resources

NEURAL MACHINE TRANSLATION BY JOINTLY LEARNING TO ALIGN AND TRANSLATE https://arxiv.org/pdf/1409.0473.pdf

Effective Approaches to Attention-based Neural Machine Translation https://arxiv.org/pdf/1508.04025.pdf

Massive Exploration of Neural Machine Translation Architectures https://arxiv.org/pdf/1703.03906.pdf

conda install pytorch -c pytorch=0.4.1

pip install -r requirements.txt

Training with a batch size of 32 takes ~3gb GPU ram. If this is too much, lower the batch size or reduce network dimensionality in hyperparams.py.

python train.py

view logs in Tensorboard decent alignments should be seen after 2-3 epochs.

tensorboard --logdir runs

(partially trained attention heatmap)

Name		Name	Last commit message	Last commit date
Latest commit History 7 Commits
assets		assets
tests		tests
.gitignore		.gitignore
LICENSE.md		LICENSE.md
README.md		README.md
attention.py		attention.py
beamsearch.ipynb		beamsearch.ipynb
beamsearch.py		beamsearch.py
datasets.py		datasets.py
decoding_helpers.py		decoding_helpers.py
hyperparams.py		hyperparams.py
models.py		models.py
nmt_tutorial.ipynb		nmt_tutorial.ipynb
requirements.txt		requirements.txt
sgdr.py		sgdr.py
train.py		train.py
utils.py		utils.py