Introduction

Requirements and Installation

A PyTorch installation
For training new models, you'll also need an NVIDIA GPU and NCCL
Python version 3.7+

Currently NMTG requires PyTorch version >= 1.8.0. Best is 1.10.0 Please follow the instructions here: https://github.com/pytorch/pytorch#installation.

After PyTorch is installed, you can install the requirements with:

pip install -r requirements.txt

C++/CUDA module installation

NMTG supports a couple of modules written using custom Pytorch/C++/CUDA modules to utilize GPU better and reduce overheads, including:

Self-attention and encoder-decoder attention with CUBLASLT
Multi-layer Perceptrons with CUBLASLT and fused dropout-relu/gelu/silu where inplace is implemented whenever possible
Highly optimized layer norm and multi-head attention (only available with sm80 (NVIDIA A100)) from Apex
Fused Logsoftmax/Cross-entropy loss to save memory for large output layer, from Apex
Fused inplaced Dropout Add for residual Transformers

Installation requires CUDA and nvcc with the same version with PyTorch. Its possible to install CUDA from conda via:

conda install -c nvidia/label/cuda-11.3.1 cuda-toolkit

or if using a custom version with CUDA 11.5

conda install -c nvidia/label/cuda-11.5.2 cuda-toolkit

(depending on the CUDA version that comes with your PyTorch)

And then navigate to the extension modules and install nmtgminor-cuda via

cd onmt/modules/extension
python setup.py install

Without this step, all modules backoff to PyTorch versions.

IWSLT 2022 Speech Translation models

Interspeech 2022 Multilingual ASR models

Name		Name	Last commit message	Last commit date
Latest commit History 811 Commits
.idea		.idea
ae		ae
fairseq_utils		fairseq_utils
onmt		onmt
pretrain_module		pretrain_module
recipes		recipes
test		test
tools		tools
.gitignore		.gitignore
.gitmodules		.gitmodules
LICENSE.md		LICENSE.md
README.md		README.md
autoencoder.py		autoencoder.py
average_checkpoints_auto.py		average_checkpoints_auto.py
classify.py		classify.py
eval_autoencoder.py		eval_autoencoder.py
extend_weight.py		extend_weight.py
extract_vocab.py		extract_vocab.py
extract_wav2vec2_codebook.py		extract_wav2vec2_codebook.py
extract_wav2vec2_tdnn.py		extract_wav2vec2_tdnn.py
flask_mt.py		flask_mt.py
flask_online.py		flask_online.py
learn_kmeans.py		learn_kmeans.py
neural_feature_reader.py		neural_feature_reader.py
normalize_text.py		normalize_text.py
online.py		online.py
options.py		options.py
predict_language.py		predict_language.py
preprocess.py		preprocess.py
preprocess_classify.py		preprocess_classify.py
preprocess_triangle.py		preprocess_triangle.py
quantize_kmeans.py		quantize_kmeans.py
quantize_pretrained_hubert_kmeans.py		quantize_pretrained_hubert_kmeans.py
rematch_language_embedding.py		rematch_language_embedding.py
requirement.txt		requirement.txt
rescore.py		rescore.py
sample_lm.py		sample_lm.py
setup.py		setup.py
test_nllb.py		test_nllb.py
train.py		train.py
train_classify.py		train_classify.py
train_distributed.py		train_distributed.py
train_language_model.py		train_language_model.py
translate.py		translate.py
translate_distributed.py		translate_distributed.py
verify_wav2vec2_feat.py		verify_wav2vec2_feat.py

License

quanpn90/NMTGMinor

Folders and files

Latest commit

History

Repository files navigation

Introduction

Requirements and Installation

C++/CUDA module installation

IWSLT 2022 Speech Translation models

Interspeech 2022 Multilingual ASR models

About

Resources

License

Stars

Watchers

Forks

Languages