Usage

Voiceprint maker with SincNet architecture and CosFace head implemented using TensorFlow v2+.

Voiceprint is represented as unit vector of floating-point numbers. Voiceprints generated by this maker have feature. The more similar two voices, the closer its voiceprints by cosine metric.

Usage

Download dataset with voice records and speaker labels.
Split dataset into parts by creating three files containing record paths. These files would be used for training, validating and testing respectively. Record path should be relative to the dataset root.
Create .npy file which contains dictionary mapping record path to speaker label. Speaker label should be integer from range [0, C - 1] where C is number of distinct speakers.
Create configuration file like cfg/SincNet_TIMIT.cfg.
Train: python train.py --cfg=<your configuration file path>.
Test: python test_print_maker --cfg=<your configuration file path>.
Make voiceprints: python wav_to_voiceprint.py --cfg=<your configuration file path>.

If you work with TIMIT dataset then you can skip 2-4 steps and use cfg/SincNet_TIMIT.cfg as configuration file.

References

[1] SincNet original code written in PyTorch by the autor (https://github.com/mravanelli/SincNet)

[2] Mirco Ravanelli, Yoshua Bengio, “Speaker Recognition from raw waveform with SincNet” Arxiv

[3] Hao Wang, Yitong Wang and others, "CosFace: Large Margin Cosine Loss for Deep Face Recognition" Arxiv

[4] CosFace repository (https://github.com/4uiiurz1/keras-arcface)

Name		Name	Last commit message	Last commit date
Latest commit History 263 Commits
cfg		cfg
data_lists		data_lists
utils		utils
.flake8		.flake8
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
SincNet.png		SincNet.png
config.py		config.py
data_loader.py		data_loader.py
requirements.txt		requirements.txt
sincnet.py		sincnet.py
test.py		test.py
test_print_maker.py		test_print_maker.py
train.py		train.py
tune_hyperparams.py		tune_hyperparams.py
wav_to_voiceprint.py		wav_to_voiceprint.py

License

AntonDemchenko/voiceprint_maker

Folders and files

Latest commit

History

Repository files navigation

Usage

References

About

Topics

Resources

License

Stars

Watchers

Forks

Languages