SpeechEmoRec

Introduction

This project aims to implement speech emotion recognition strategy proposed in Speech Emotion Recognition Using Deep Convolutional Neural Network and Discriminant Temporal Pyramid Matching

Runtime enviorment

CPU Host :

ubuntu16.04
python3.5
tensorflow1.7.0

GPU Server :

tensorflow-gpu1.7.0
NVIDIA driver version:390
cuda9.0
cudnn7.0

Instructions

Preprocessing Data

Update path of dataset which you want to save from path.py
Downloading Berlin Database of Emotional Speech!
1. Berlin Dataset
  $ python load_emodb.py
2. eNTERFACE Dataset
  Downloading the eNTERFACE05 Dataset and update the dataset root
Starting preprocessing

$ python melSpec.py

Feature Extracting

Finetune AlexNet with Tensorflow

$ python finetune.py

Discriminant Temporal Pyramid Matching

$ python dtpm.py -s  
$ python dtpm.py -n

Classfier

Support Vector Machine

$ python svm.py

Refrences:

Refrence Model:

Alexnet
SVM

Refrence Papers:

ImageNet Classification with Deep Convolutional Neural Networks
Speech Emotion Recognition Using Deep Convolutional Neural Network and Discriminant Temporal Pyramid Matching
Geometric ℓp-norm feature pooling for image classification

Name		Name	Last commit message	Last commit date
Latest commit History 36 Commits
.gitignore		.gitignore
.travis.yml		.travis.yml
README.md		README.md
__init__.py		__init__.py
alexnet.py		alexnet.py
datagenerator.py		datagenerator.py
dtpm.py		dtpm.py
finetune.py		finetune.py
get_fc7.py		get_fc7.py
load_emodb.py		load_emodb.py
melSpec.py		melSpec.py
model_test.py		model_test.py
path.py		path.py
plot_confusion_matrix.py		plot_confusion_matrix.py
requires.sh		requires.sh
svm.py		svm.py
utils.py		utils.py

tzaiyang/SpeechEmoRec

Folders and files

Latest commit

History

Repository files navigation

SpeechEmoRec

Introduction

Runtime enviorment

Instructions

Preprocessing Data

Feature Extracting

Classfier

Refrences:

About

Topics

Resources

Stars

Watchers

Forks

Languages