DeepTalk Emotions

PyTorch implementation of the DeepTalk model described in DeepTalk: Vocal Style Encoding for Speaker Recognition and Speech Synthesis by A. Chowdhury, A. Ross, and P. David in IEEE International Conference on Acoustics, Speech and Signal Processing 2021 (ICASSP-2021). The code is applied to the domain of emotion recognition to investigate correlations between speaker identity and emotion state.

Research Article

Morgan Sandler and Arun Ross, Is Style All You Need? Dependencies Between Emotion and GST-based Speaker Recognition, 2022.

arXiv: https://arxiv.org/

Name		Name	Last commit message	Last commit date
Latest commit History 5 Commits
.ipynb_checkpoints		.ipynb_checkpoints
.vscode		.vscode
DT_EmotionRecog		DT_EmotionRecog
__pycache__		__pycache__
assets		assets
encoder		encoder
images		images
montreal_forced_aligner_linux		montreal_forced_aligner_linux
montreal_forced_aligner_mac		montreal_forced_aligner_mac
static		static
synthesizer		synthesizer
target_text_dir		target_text_dir
templates		templates
toolbox		toolbox
utils		utils
vocoder		vocoder
.gitignore		.gitignore
DeepTalkSER.tar.gz		DeepTalkSER.tar.gz
DeepTalk_demo.py		DeepTalk_demo.py
README.md		README.md
Readme.txt		Readme.txt
_config.yml		_config.yml
analysis.py		analysis.py
app.py		app.py
commons.py		commons.py
convert_text_to_lab.py		convert_text_to_lab.py
convert_to_wav.py		convert_to_wav.py
demo_config.py		demo_config.py
demo_functions.py		demo_functions.py
encoder_preprocess.py		encoder_preprocess.py
encoder_train.py		encoder_train.py
example.py		example.py
inference.py		inference.py
install_MFA_linux.sh		install_MFA_linux.sh
parse_textgrid.py		parse_textgrid.py
preprocess_audio.py		preprocess_audio.py
requirements.txt		requirements.txt
ser-lstm-training-script-subjectdisjoint-DEEPTALK.ipynb		ser-lstm-training-script-subjectdisjoint-DEEPTALK.ipynb
spec-file.txt		spec-file.txt
split_audio.py		split_audio.py
split_audio.pyc		split_audio.pyc
synthesizer_preprocess_audio.py		synthesizer_preprocess_audio.py
synthesizer_preprocess_embeds.py		synthesizer_preprocess_embeds.py
synthesizer_train.py		synthesizer_train.py
train_DeepTalk_step1.py		train_DeepTalk_step1.py
train_DeepTalk_step2.py		train_DeepTalk_step2.py
vocoder_preprocess.py		vocoder_preprocess.py
vocoder_train.py		vocoder_train.py

morganlee123/DeepTalkEmotions

Folders and files

Latest commit

History

Repository files navigation

DeepTalk Emotions

Research Article

About

Resources

Stars

Watchers

Forks

Languages