Machine_Translation_Seq2Seq

Machine translation (jp-en) using LSTM-based encoder-decoder model (Pytorch). This is the implementation of several models:

adapted to JP-EN translation.

Data: https://nlp.stanford.edu/projects/jesc/, official split. The xls data is converted into csv with panda (prepro.py). Japanese is tokenized using sentencepiece (https://github.com/google/sentencepiece/), English is tokenized using space (sorry, too lazy).

Name		Name	Last commit message	Last commit date
Latest commit History 17 Commits
README.md		README.md
main.py		main.py
model.py		model.py
model2.py		model2.py
prepro.py		prepro.py

Provide feedback