Skip to content

duyvuleo/JSALT17-NMT-Lab

Repository files navigation

Repository for JSALT'17 NMT Lab

Comprehensive documentation for this lab here: https://tinyurl.com/yalmjyk2

Set up your environment

source ENV.sh

This will allow your run to find the necessary executables. You do not need to modify the values in this file if you are on the PSC grid. If not, replace them with values for your grid.

Get the parallel data

  1. Run data.sh. This will fetch data from the internet and create train, test and validation datasets.
sh data.sh
  1. This will also create a smaller toy training dataset that we will use during this session to make training faster.

Pre-process

Pre-process the input files; tokenize etc. The following command will do this for you.

sh preprocess.sh

Run training

sh train-toy.sh de en
sh train-toy.sh en de

Gaurav's baselines to beat

De-En

18.04 BLEU.

En-De

14.52 BLEU

About

Repository for JSAL'17 NMT Lab

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Contributors 4

  •  
  •  
  •  
  •