
Neural Machine Translation

A full report of the task can be found in NMT_report.pdf

This code implements a sequence-to-sequence neural machine translation model based on Luong et al. (2015).

  • The NMT model uses global attention with dropout and input feeding.

  • The model is also extended with the lexical model of Nguyen and Chiang (2017); a minimal sketch of both components appears after the usage list below.

  • The training data is a Japanese–English parallel corpus, with Japanese as the source language and English as the target language.

  • The model trains until convergence. The last epoch is saved as a checkpoint in the specified directory, and the epoch with the best validation loss is saved as the best checkpoint.

  • Usage:

    • To train (with default settings): python train.py
    • To translate (with default settings): python translate.py
    • To calculate BLEU score: perl multi-bleu.perl -lc raw_data/test.en < model_translations.txt

    Note: test.en contains the ground-truth English reference translations for the test set.
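
For orientation, the sketch below shows roughly what one decoder step with Luong-style global ("general") attention and the lexical-model bias of Nguyen and Chiang (2017) can look like in PyTorch. It is a minimal illustration under assumed shapes and layer names (GlobalAttentionDecoderStep, W_a, W_c, W_o, W_l, W_lo are hypothetical), not the code in this repository.

    # Hypothetical sketch: one decoder step with Luong "general" global attention
    # plus the lexical-model bias of Nguyen & Chiang (2017). Names are illustrative.
    import torch
    import torch.nn as nn
    import torch.nn.functional as F

    class GlobalAttentionDecoderStep(nn.Module):
        def __init__(self, hidden_size, embed_size, vocab_size, dropout=0.2):
            super().__init__()
            self.W_a = nn.Linear(hidden_size, hidden_size, bias=False)      # score(h_t, h_s) = h_t^T W_a h_s
            self.W_c = nn.Linear(2 * hidden_size, hidden_size, bias=False)  # attentional hidden state
            self.W_o = nn.Linear(hidden_size, vocab_size)                   # main output projection
            self.W_l = nn.Linear(embed_size, embed_size, bias=False)        # lexical hidden layer
            self.W_lo = nn.Linear(embed_size, vocab_size)                   # lexical output projection
            self.dropout = nn.Dropout(dropout)

        def forward(self, dec_hidden, enc_outputs, src_embeds):
            # dec_hidden:  (batch, hidden)          current decoder hidden state h_t
            # enc_outputs: (batch, src_len, hidden) encoder states h_s
            # src_embeds:  (batch, src_len, embed)  source word embeddings
            scores = torch.bmm(enc_outputs, self.W_a(dec_hidden).unsqueeze(2)).squeeze(2)
            align = F.softmax(scores, dim=1)                                  # attention weights a_t(s)
            context = torch.bmm(align.unsqueeze(1), enc_outputs).squeeze(1)   # context vector c_t
            attn_hidden = torch.tanh(self.W_c(torch.cat([context, dec_hidden], dim=1)))
            attn_hidden = self.dropout(attn_hidden)

            # Lexical model: attention-weighted average of source embeddings,
            # passed through a small feed-forward layer and added to the logits.
            lex = torch.bmm(align.unsqueeze(1), src_embeds).squeeze(1)
            lex_hidden = torch.tanh(self.W_l(torch.tanh(lex)))

            logits = self.W_o(attn_hidden) + self.W_lo(lex_hidden)
            return F.log_softmax(logits, dim=1), attn_hidden, align

With input feeding, the attentional state attn_hidden returned here would be concatenated with the next target embedding before the next decoder step.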

References

  • Minh-Thang Luong, Hieu Pham, and Christopher D. Manning. Effective approaches to attention-based neural machine translation. arXiv preprint arXiv:1508.04025, 2015.
  • Toan Q. Nguyen and David Chiang. Improving lexical choice in neural machine translation. arXiv preprint arXiv:1710.01329, 2017.
