VIB audio classification

Audio classification with variational information bottleneck (VIB)

Dependencies

- python 3.7.4
- torch 1.7.1

Feature set information

For this task, we use three datasets: emotiontoronto, urbansound8k, and audioMNIST.

The emotiontoronto dataset is built using 5252 samples from:

The model predicts one of eight classes: 0 = neutral, 1 = calm, 2 = happy, 3 = sad, 4 = angry, 5 = fearful, 6 = disgust, 7 = surprised. The dataset is imbalanced: TESS has no calm class, so that class has fewer samples than the others, which is evident in the classification report.
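To make the imbalance concrete, the label mapping above can be written as a dictionary and used to count samples per class. This is a minimal sketch: `EMOTION_LABELS` and `class_distribution` are illustrative names that do not come from the repository, and the toy label list is made up rather than taken from the dataset.

```python
from collections import Counter

# Label ids and names exactly as listed in this README.
EMOTION_LABELS = {
    0: "neutral",
    1: "calm",
    2: "happy",
    3: "sad",
    4: "angry",
    5: "fearful",
    6: "disgust",
    7: "surprised",
}


def class_distribution(labels):
    """Return {class name: sample count}, including classes with zero samples."""
    counts = Counter(labels)
    return {name: counts.get(idx, 0) for idx, name in EMOTION_LABELS.items()}


# Made-up toy labels for illustration only; these are not real dataset counts.
toy_labels = [0, 2, 2, 3, 4, 4, 5, 6, 7, 1]
distribution = class_distribution(toy_labels)
print(distribution)
```

Running the same count on the real training labels shows how far the calm class falls below the others before you read the classification report.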

urbansound8k: around 8000 samples of sounds from natural environments
audioMNIST: 3000 samples from 6 speakers pronouncing the digits 0-9 in English, with 50 samples of each digit per speaker

Usage

In an environment with PyTorch>=1.7.0, you can either run the notebook (VIB_audio_classifier.ipynb) or enter the following lines directly in a terminal:

Train on the Emotion Toronto dataset: python main.py --mode train --beta 1e-3 --data emotiontoronto --epoch 50 --lr 1.e-3 --K=64 --batch_size=32

(All parameters take their default values unless specified.)

Test on the Emotion Toronto dataset: python main.py --mode test --beta 1e-3 --data emotiontoronto --epoch 50 --lr 1.e-3 --K=64 --batch_size=32 --load_ckpt best_acc.tar
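The training command mixes two flag styles (`--lr 1.e-3` and `--K=64`). The sketch below shows how a standard `argparse` parser would accept both. It is a hypothetical stand-in, not the repository's main.py: the flag names are taken from the command above, but the types are assumptions and the defaults are left as `None` placeholders because this README does not state the real ones.

```python
import argparse


def build_parser():
    """Hypothetical parser mirroring the flags shown in this README."""
    parser = argparse.ArgumentParser(description="VIB audio classification (sketch)")
    parser.add_argument("--mode", type=str, default=None)        # e.g. train or test
    parser.add_argument("--data", type=str, default=None)        # dataset name
    parser.add_argument("--beta", type=float, default=None)
    parser.add_argument("--lr", type=float, default=None)
    parser.add_argument("--epoch", type=int, default=None)
    parser.add_argument("--K", type=int, default=None)
    parser.add_argument("--batch_size", type=int, default=None)
    parser.add_argument("--load_ckpt", type=str, default=None)   # checkpoint file name
    return parser


command = (
    "--mode train --beta 1e-3 --data emotiontoronto "
    "--epoch 50 --lr 1.e-3 --K=64 --batch_size=32"
)
args = build_parser().parse_args(command.split())
print(args)
```

Both `--K=64` and `--K 64` parse to the same integer, and `1.e-3` is a valid float literal, so the two styles can be used interchangeably.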
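This README does not define `--beta` and `--K`. In the usual variational information bottleneck formulation, a classifier is trained on a cross-entropy term plus `beta` times the KL divergence between a K-dimensional Gaussian encoding and a standard normal prior. The sketch below assumes the flags have that meaning and uses plain Python instead of PyTorch so the arithmetic is easy to follow. The function names are illustrative and are not taken from model.py or solver.py.

```python
import math


def kl_to_standard_normal(mu, log_var):
    """KL( N(mu, diag(exp(log_var))) || N(0, I) ), summed over the K dimensions."""
    return 0.5 * sum(m * m + math.exp(lv) - 1.0 - lv for m, lv in zip(mu, log_var))


def cross_entropy(logits, target):
    """Negative log-softmax probability of the target class (numerically stable)."""
    max_logit = max(logits)
    log_sum_exp = max_logit + math.log(sum(math.exp(x - max_logit) for x in logits))
    return log_sum_exp - logits[target]


def vib_loss(logits, target, mu, log_var, beta=1e-3):
    """Assumed VIB objective for one sample: cross-entropy + beta * KL."""
    return cross_entropy(logits, target) + beta * kl_to_standard_normal(mu, log_var)


# Toy example: K = 4 bottleneck dimensions, 8 emotion classes, made-up values.
mu = [0.5, -0.2, 0.0, 1.0]
log_var = [0.0, -0.5, 0.1, 0.0]
logits = [0.1, 0.0, 2.0, 0.3, -1.0, 0.0, 0.2, 0.1]
loss = vib_loss(logits, target=2, mu=mu, log_var=log_var, beta=1e-3)
print(loss)
```

Under this reading, a larger `beta` pushes the encoding harder toward the prior (more compression), while `beta = 0` reduces the objective to ordinary cross-entropy.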

References

If you find this repository helpful, please cite our paper: Variational Information Bottleneck for Effective Low-resource Audio Classification, Shijing Si, et al. 2021, https://www.isca-speech.org/archive/pdfs/interspeech_2021/si21_interspeech.pdf

