Dense Recurrent Net For Speech Command Classification

This was a course project for Audio Content Analysis taught by Prof. Bello. It is based on a Kaggle competition to classify one-second long English speech commands.

Three models were implemented in Keras:

a ConvNet
a DenseNet
a Dense-Recurrent Net

The Kaggle dataset was transformed with simple Gaussian noise to make a noisy variant of the clean dataset. The following table lists the multi-class accuracies of the three models on both clean and noisy datasets:

Architecture	Architecture	ConvNet	DenseNet	Dense-Recurrent Net
Clean dataset	Training accuracy	85.68%	90.28%	99.71%
	Validation accuracy	80.47%	84.74%	83.45%
	Testing accuracy	81.49%	85.04%	83.39%
Noisy dataset	Training accuracy	88.02%	88.67%	99.59%
	Validation accuracy	83.64%	81.95%	82.74%
	Testing accuracy	84.14%	83.20%	82.60%

A copy of the report is available for reference.

Name		Name	Last commit message	Last commit date
Latest commit History 85 Commits
Report		Report
models		models
src		src
.gitignore		.gitignore
.gitmodules		.gitmodules
Exploring Data.ipynb		Exploring Data.ipynb
LICENSE		LICENSE
README.md		README.md
Speaker_predictions.ipynb		Speaker_predictions.ipynb
evaluate.ipynb		evaluate.ipynb
requirements-gpu.txt		requirements-gpu.txt
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Report

Report

models

models

src

src

.gitignore

.gitignore

.gitmodules

.gitmodules

Exploring Data.ipynb

Exploring Data.ipynb

LICENSE

LICENSE

README.md

README.md

Speaker_predictions.ipynb

Speaker_predictions.ipynb

evaluate.ipynb

evaluate.ipynb

requirements-gpu.txt

requirements-gpu.txt

requirements.txt

requirements.txt

Repository files navigation

Dense Recurrent Net For Speech Command Classification

About

Releases

Packages

Contributors 2

Languages

License

aakaashjois/Dense-Recurrent-Net-For-Speech-Command-Classification

Folders and files

Latest commit

History

Repository files navigation

Dense Recurrent Net For Speech Command Classification

About

Topics

Resources

License

Stars

Watchers

Forks

Languages