Audio augmentation

(README.md)

Audio augment is a tool/script for batch audio data augmenation through speed, volume, reverb, noise based on kaldi and sox.

Installation

kaldi
sox
cd audio_augment
cd tools; make KALDI=/your_kaldi_path

Usage

cd audio_augment
vim run_aug.sh to change your input_path and out_path and save
bash run_aug.sh, so easy!

Workflow(run_aug.sh)

Stage 1: Data Preparation contain data and text
Stage 2: speed 0.9/1.1
Stage 3: volume +-db
Stage 4: reverberation(RIRS)
Stage 5: MUSAN(noise/music/babble)
Stage 6: combine above data and select a subset of the augmend data list about twice the origin data
Stage 7: data and label generation

generated Examples

cd data/wav/train_aug listen a few enhanced aishell1 audio example through speed/volume/RIRS/MUSAN.

Name		Name	Last commit message	Last commit date
Latest commit History 20 Commits
data		data
local		local
steps		steps
utils		utils
README.md		README.md
cmd.sh		cmd.sh
path.sh		path.sh
prepare_data.py		prepare_data.py
requirment.txt		requirment.txt
run_aug.sh		run_aug.sh

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

data

data

local

local

steps

steps

utils

utils

README.md

README.md

cmd.sh

cmd.sh

path.sh

path.sh

prepare_data.py

prepare_data.py

requirment.txt

requirment.txt

run_aug.sh

run_aug.sh

Repository files navigation

Audio augmentation

Installation

Usage

Workflow(run_aug.sh)

generated Examples

Reference

About

Releases

Packages

Languages

zhaoyi2/audio_augment

Folders and files

Latest commit

History

Repository files navigation

Audio augmentation

Installation

Usage

Workflow(run_aug.sh)

generated Examples

Reference

About

Topics

Resources

Stars

Watchers

Forks

Languages