ASR project

The project is made for educational purposes, as the homework of the course deep learning for audio processing.

Installation guide

It is recommended to use python 3.8 or 3.9

You need to clone the repository and install the libraries:

git clone https://github.com/maximkm/DLA_ASR_HW.git
cd DLA_ASR_HW
pip install -r requirements.txt

Description of the work done

Wandb report

The final score received

Dataset	Type predict	CER	WER
LibriSpeech: test-clean	beam search	0.06742	0.12988
LibriSpeech: test-other	beam search	0.17529	0.27248
LibriSpeech: test-clean	argmax	0.07794	0.21284
LibriSpeech: test-other	argmax	0.17529	0.38656

Independent code testing

You need to download:

The final checkpoint of the model and put the save folder in the main directory
LM and place the file in the hw_asr/lm directory

You can run this script:

gdown https://drive.google.com/uc?id=10Ubmu6-w415A2jiUXobJL4ZzMy7A5fxW
unzip saved.zip
gdown https://drive.google.com/uc?id=1WGFJgzrh850BSXkaCb-dzsWqK894Dmd0
mv 5_full_gram.arpa hw_asr/lm

Now you can run the code:

You need to run the model with the following command:

python test.py -c hw_asr/configs/test_ctc_big_clean.json -r saved/models/baseline/1013_154403/model_best.pth -o test-clean.json

This command loads the prepared test_ctc_big_clean.json config inside of which contains the description of the model and dataset.

After processing all the data will save the predictions in test-clean.json.

Similarly, the test_ctc_big_other.json config was created. Also at test.py there is a -t argument to specify a folder with a dataset.

The last step is to run a script to calculate the WER and CER metrics

python calc_wer_cer.py -t test-clean.json

Credits

This repository is based on a heavily modified fork of pytorch-template repository.

The CTC transformer architecture is based on Transformers with convolutional context for ASR.

Name		Name	Last commit message	Last commit date
Latest commit History 20 Commits
hw_asr		hw_asr
test_data		test_data
.gitignore		.gitignore
ColabTrain.ipynb		ColabTrain.ipynb
Dockerfile		Dockerfile
LICENSE		LICENSE
README.md		README.md
calc_wer_cer.py		calc_wer_cer.py
requirements.txt		requirements.txt
test.py		test.py
train.py		train.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

hw_asr

hw_asr

test_data

test_data

.gitignore

.gitignore

ColabTrain.ipynb

ColabTrain.ipynb

Dockerfile

Dockerfile

LICENSE

LICENSE

README.md

README.md

calc_wer_cer.py

calc_wer_cer.py

requirements.txt

requirements.txt

test.py

test.py

train.py

train.py

Repository files navigation

ASR project

Installation guide

Description of the work done

The final score received

Independent code testing

Credits

About

Releases

Packages

Languages

License

maximkm/DLA_ASR_HW

Folders and files

Latest commit

History

Repository files navigation

ASR project

Installation guide

Description of the work done

The final score received

Independent code testing

Credits

About

Topics

Resources

License

Stars

Watchers

Forks

Languages