GitHub - StatNLP/ada4asr


Implementation of our paper: On-the-Fly Aligned Data Augmentation for Sequence-to-Sequence ASR

Requirements

Python 3.6
fairseq v1.0

Scripts added/modified for ADA

fairseq/criterions/s2t_xent_ctc_loss.py
fairseq/criterions/label_smoothed_cross_entropy.py
fairseq/models/speech_to_text/__init__.py
fairseq/models/speech_to_text/s2t_transformer.py
fairseq/models/speech_to_text/s2t_ctc_transformer.py
fairseq/models/roberta/hub_interface.py
fairseq/models/transformer.py
fairseq/data/audio/speech_to_text_dataset.py
fairseq/data/audio/audio_dict_dataset.py
fairseq/data/audio/feature_transforms/alignAugment.py
fairseq/data/audio/feature_transforms/samp_fbank.py
fairseq/data/audio/feature_transforms/specaugment.py
fairseq/tasks/speech_to_text.py
fairseq/sequence_generator.py

Sample data input (.tsv)

Script for audio dictionary
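
As a rough illustration of what a tab-separated input manifest might look like when loaded, here is a minimal sketch using only the Python standard library. The column names (`id`, `audio`, `n_frames`, `tgt_text`) follow the upstream fairseq speech-to-text convention and are an assumption here; the sample file in this repository may carry different or additional columns. The manifest rows and the helper `read_manifest` are hypothetical and are not part of this codebase.

```python
import csv
import io

# Hypothetical manifest contents. The column names follow the upstream
# fairseq speech-to-text convention; whether this repository's sample
# .tsv uses exactly these columns is an assumption.
SAMPLE_TSV = (
    "id\taudio\tn_frames\ttgt_text\n"
    "utt1\tfbank.zip:0:1000\t520\thello world\n"
    "utt2\tfbank.zip:1000:2000\t731\tgood morning\n"
)


def read_manifest(handle):
    """Parse a tab-separated manifest into a list of dicts, one per utterance."""
    # QUOTE_NONE keeps quote characters in transcripts as literal text.
    reader = csv.DictReader(handle, delimiter="\t", quoting=csv.QUOTE_NONE)
    rows = []
    for row in reader:
        row["n_frames"] = int(row["n_frames"])  # frame counts are integers
        rows.append(row)
    return rows


rows = read_manifest(io.StringIO(SAMPLE_TSV))
for row in rows:
    print(row["id"], row["n_frames"], row["tgt_text"])
```

To read a file on disk instead of the inline string, pass an open file handle, for example `open(path, newline="", encoding="utf-8")`, to `read_manifest`.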
