E-Vector a.k.a Emotion-Scenario Vector for Speaker Recognition

Morgan Sandler

This is mostly my thesis work! Computed embeddings for all test sets are included in this repository within the ExperimentData/ folders. In addition, the code and figures from the papers may be found in their respective folders. If you would like pre-trained models, please reach out to sandle20@msu.edu. I hope to soon upload them for inference on HuggingFace :-)

Training and Evaluation data can be requested from the MSP lab at UTDallas. Thank you to Prof. Carlos Busso for granting permission of the data. The dataset details are here: MSP-Podcast

Architecture of E-Vector (Full Paper)

Here is a brief description of each relevant file:

encoder_preprocess.py preprocess the raw data from the MSP-Podcast root directory and stores preprocessed it in Data/
encoder_train.py will train models from scratch or could be refined to fine-tune models if you so desire.
speaker_verification_MSP.py will test SV on the MSP-Podcast testing sets.
Ignore speaker_verification_mass.py. It is vestigial and mostly trash. I did not use this file in the end, but I am double-checking the code to make sure it isn't useful in some capacity.

Of course, you may have many questions about the code since it is vast and complicated. If you have any questions, feel free to reach out and I will do my best to answer them! - Morgan

Name		Name	Last commit message	Last commit date
Latest commit History 33 Commits
ExperimentData		ExperimentData
Figures		Figures
analysis		analysis
baseline_comp_roc_det		baseline_comp_roc_det
baseline_experiment		baseline_experiment
encoder		encoder
evec_tokensize_rocdet		evec_tokensize_rocdet
old_rocs		old_rocs
utils		utils
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
d_prime_calculator.py		d_prime_calculator.py
det_curves.py		det_curves.py
encoder_preprocess.py		encoder_preprocess.py
encoder_train.py		encoder_train.py
get_parameter_count.py		get_parameter_count.py
roc_curves.py		roc_curves.py
speaker_verification.py		speaker_verification.py
speaker_verification_MSP.py		speaker_verification_MSP.py
speaker_verification_mass.py		speaker_verification_mass.py
test1_det.jpg		test1_det.jpg
test1_roc.jpg		test1_roc.jpg
test1dprimes.txt		test1dprimes.txt
test2_det.jpg		test2_det.jpg
test2_roc.jpg		test2_roc.jpg
test2dprimes.txt		test2dprimes.txt
val_det.jpg		val_det.jpg
val_roc.jpg		val_roc.jpg
valdprimes.txt		valdprimes.txt

License

morganlee123/evector

Folders and files

Latest commit

History

Repository files navigation

E-Vector a.k.a Emotion-Scenario Vector for Speaker Recognition

Morgan Sandler

Resources/References

About

Topics

Resources

License

Stars

Watchers

Forks

Languages