littleflow3r/seq2seq-learning-for-machine-translation

Machine_Translation_Seq2Seq

Machine translation (JP-EN) using an LSTM-based encoder-decoder model (PyTorch). This is an implementation of several models:

adapted to JP-EN translation.
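For orientation, here is a minimal sketch of what an LSTM-based encoder-decoder can look like in PyTorch. It is not the code from this repository: the class names (`Encoder`, `Decoder`, `Seq2Seq`), the layer sizes, and the single-layer, attention-free design are all assumptions made for illustration. The decoder is fed the target sequence as input (teacher forcing).

```python
import torch
import torch.nn as nn


class Encoder(nn.Module):
    """Embeds source token ids and runs them through an LSTM (hypothetical sketch)."""

    def __init__(self, src_vocab_size, emb_dim, hid_dim):
        super().__init__()
        self.embedding = nn.Embedding(src_vocab_size, emb_dim)
        self.lstm = nn.LSTM(emb_dim, hid_dim, batch_first=True)

    def forward(self, src):
        # src: (batch, src_len) tensor of token ids
        embedded = self.embedding(src)
        _, (hidden, cell) = self.lstm(embedded)
        return hidden, cell  # final states summarize the source sentence


class Decoder(nn.Module):
    """Predicts target tokens, starting from the encoder's final states."""

    def __init__(self, tgt_vocab_size, emb_dim, hid_dim):
        super().__init__()
        self.embedding = nn.Embedding(tgt_vocab_size, emb_dim)
        self.lstm = nn.LSTM(emb_dim, hid_dim, batch_first=True)
        self.out = nn.Linear(hid_dim, tgt_vocab_size)

    def forward(self, tgt_in, hidden, cell):
        # tgt_in: (batch, tgt_len) tensor of target-side input token ids
        embedded = self.embedding(tgt_in)
        output, (hidden, cell) = self.lstm(embedded, (hidden, cell))
        return self.out(output), hidden, cell


class Seq2Seq(nn.Module):
    """Wires the encoder and decoder together for training with teacher forcing."""

    def __init__(self, encoder, decoder):
        super().__init__()
        self.encoder = encoder
        self.decoder = decoder

    def forward(self, src, tgt_in):
        hidden, cell = self.encoder(src)
        logits, _, _ = self.decoder(tgt_in, hidden, cell)
        return logits  # (batch, tgt_len, tgt_vocab_size)


# Toy usage with random token ids; the vocabulary sizes are made up.
model = Seq2Seq(Encoder(100, 16, 32), Decoder(120, 16, 32))
src = torch.randint(0, 100, (4, 7))
tgt_in = torch.randint(0, 120, (4, 5))
logits = model(src, tgt_in)
```

For training, the logits would typically be compared against the target sequence shifted by one position using `nn.CrossEntropyLoss`.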

Data: https://nlp.stanford.edu/projects/jesc/, official split. The xls data is converted to CSV with pandas (prepro.py). Japanese is tokenized with SentencePiece (https://github.com/google/sentencepiece/); English is tokenized on whitespace (sorry, too lazy).
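The English side of the preprocessing can be sketched roughly as follows. This is not taken from prepro.py: the helper names (`tokenize_en`, `build_vocab`, `encode`), the special tokens, and the `min_freq` parameter are assumptions for illustration. The Japanese side would go through a trained SentencePiece model instead, which is not shown here.

```python
from collections import Counter

# Special tokens and their order are assumptions, not the repository's choices.
PAD, UNK, BOS, EOS = "<pad>", "<unk>", "<s>", "</s>"


def tokenize_en(sentence):
    """Whitespace tokenization: split on runs of spaces, tabs, or newlines."""
    return sentence.strip().split()


def build_vocab(sentences, min_freq=1):
    """Map each token seen at least min_freq times to an integer id."""
    counts = Counter(tok for s in sentences for tok in tokenize_en(s))
    kept = sorted(tok for tok, c in counts.items() if c >= min_freq)
    itos = [PAD, UNK, BOS, EOS] + kept
    return {tok: i for i, tok in enumerate(itos)}


def encode(sentence, vocab):
    """Convert a sentence to ids, wrapped in BOS/EOS; unknown tokens map to UNK."""
    ids = [vocab.get(tok, vocab[UNK]) for tok in tokenize_en(sentence)]
    return [vocab[BOS]] + ids + [vocab[EOS]]


vocab = build_vocab(["i like sushi", "i like tea"])
ids = encode("i like ramen", vocab)  # "ramen" is out of vocabulary
```

Whitespace splitting leaves punctuation attached to words ("tea." and "tea" become different tokens), which is the main cost of skipping a proper English tokenizer.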

About

(minimal implementation) Seq2Seq models (Machine translation task, Japanese-English)
