Dual Voice Mixer for TIMIT Dataset

This project is intended to be used in conjunction with the Pytorch implementation of LAS and test the LAS model's recognition ability when background noise is introduced. The TIMIT dataset is modified to generate voices with background noise that can be tested with a trained LAS model.

Read the paper that used this project here.

Setup

TIMIT

Move the files in mix_timit to the LAS Pytorch directory.
TIMIT dataset folder must be in the same directory as timit_preprocess.sh and mixer.py
Run timit_preprocess.sh (should convert NIST .WAV to RIFF .wav)
Run mixer.py
- TIR and gender mixing can be adjusted by editing their respective lists
Run timit_preproccess_mixed.py
- Adjust TIR and gender list accordingly
Run test_timit_mixed.py to generate phoneme error rate results

Requirements

pysox: Mixes audio files
SoX: Converts NIST to RIFF and a requirement for pysox
NumPy: Calculates target-to-interference ratio
pandas: Saves testing data in .csv format

Name		Name	Last commit message	Last commit date
Latest commit History 20 Commits
mix_timit		mix_timit
test_tir		test_tir
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

mix_timit

mix_timit

test_tir

test_tir

.gitignore

.gitignore

LICENSE

LICENSE

README.md

README.md

Repository files navigation

Dual Voice Mixer for TIMIT Dataset

Setup

TIMIT

Requirements

About

Releases

Packages

Languages

License

BradleyHe/TIMIT-Voice-Mixer

Folders and files

Latest commit

History

Repository files navigation

Dual Voice Mixer for TIMIT Dataset

Setup

TIMIT

Requirements

About

Topics

Resources

License

Stars

Watchers

Forks

Languages