Skip to content

Audio Classification with AlexNet and Speech Commands dataset

Notifications You must be signed in to change notification settings

tuanio/audio-classification

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

36 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Audio Classification

Datasets

  • Speech Commands from torchaudio, with 35 class.

Model Architecture

  • Using AlexNet with Mel Spectrogram 1 channel.
  • Output is a softmax with 35 nodes (35 class).

alexnet

How to use this respository?

  • Edit your configuration in conf/configs.yaml
  • Train model using python main.py -cp conf -cn configs

Additional

  • You guys should install PySoundFile on windows or sox on linux, for torchaudio I/O backend.

Releases

No releases published

Packages

No packages published

Languages