voxceleb_enrichment_age_gender/notebooks at main · hechmik/voxceleb_enrichment_age_gender

src
01-Enrich_VoxCeleb_Dataset.ipynb
01.1-Overall Age stats.ipynb
02.1-Gender recognition-Train_test-ivec.ipynb
02.2-Gender recognition-Train_test-xvec.ipynb
03.1-Age regression-Train_test-ivec.ipynb
03.2-Age regressio-Train_test-xvec.ipynb
03.3-Age regression-Train_test-MFCC.ipynb
03.4-Age regression-Train_test-MelSpect.ipynb
README.md
environment_asvtorch.yml
environment_enrichment.yml
environment_tensorflow.yml
settings.json

README.md

This folder contains the scripts and notebooks used in the project. A general overview follows.

Disclaimer

From a software engineering perspective there is clearly plenty of room for improvement, as various functions are duplicated. The main goal is to let the reader understand how things were done and to make all the steps as reproducible as possible. If some imports are broken or you don't understand some steps, please open an issue and let me know.

00 Conda environment

Conda was used for managing the various libraries needed by scripts and notebooks. In particular:

environment_enrichment.yml
environment_asvtorch.yml: Environment used for computing x- and i-Vectors + training gender recognition models
environment_tensorflow.yml: Environment used for training Keras and scikit-learn models + all the other activities

The general instruction for installing any of these environments is the following, where environment.yml stands for one of the files listed above:

conda env create -f environment.yml

01 Enrichment

The enrichment activity was performed using the "01-Enrich_VoxCeleb_Dataset.ipynb" Jupyter notebook. This code takes advantage of some.

IMPORTANT: You need specific credentials to query the Google Knowledge Graph. Detailed instructions can be found on this webpage: https://developers.google.com/knowledge-graph/prereqs
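As a rough illustration of what such a query looks like once you have an API key, here is a minimal sketch that uses only the Python standard library. It targets the public Knowledge Graph Search API endpoint. The function names (build_kg_url, top_entity, query_knowledge_graph) and the choice of restricting results to the Person type are assumptions made for this example, and the actual logic in the enrichment notebook may differ.

```python
import json
from urllib.parse import urlencode
from urllib.request import urlopen

# Public endpoint of the Knowledge Graph Search API.
KG_ENDPOINT = "https://kgsearch.googleapis.com/v1/entities:search"


def build_kg_url(name, api_key, limit=1):
    """Build a request URL that searches for a person by name (hypothetical helper)."""
    params = {"query": name, "key": api_key, "limit": limit, "types": "Person"}
    return f"{KG_ENDPOINT}?{urlencode(params)}"


def top_entity(response):
    """Return (name, description) of the highest-scoring entity, or None if empty."""
    items = response.get("itemListElement", [])
    if not items:
        return None
    best = max(items, key=lambda item: item.get("resultScore", 0.0))
    result = best.get("result", {})
    return result.get("name"), result.get("description")


def query_knowledge_graph(name, api_key):
    """Send the request; needs network access and a valid API key."""
    with urlopen(build_kg_url(name, api_key)) as resp:
        return top_entity(json.load(resp))
```

Keeping the URL construction and the response parsing separate from the network call makes it possible to check both without credentials, for example by passing a hand-written response dictionary to top_entity.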

02 Computation of MFCC, i-Vectors and x-Vectors

This activity was done using the ASVTorch code written by Ville Vestman, who supported me well during this crucial activity.

The repo was basically used as is, apart from the following modifications made in ivector/run.py and xvector/run.py:

All the training parts are computed on VoxCeleb1, while the trial data is the whole VoxCeleb2 corpus. To do this, modify all the strings passed as arguments to the choose_all() calls.
All steps except the last one are performed, in their original order. More info about how to call the run.py scripts can be found in the i-Vector and x-Vector README files.

x-Vector

For the x-Vector training part, the following parameters in the run_configs.py file were changed (because of time constraints):

network.utts_per_speaker_in_epoch = 200
network.max_epochs = 500

The data-loading part of stage 5 (the actual training of the model) was also modified as follows:

training_data = UtteranceSelector().choose_all('voxceleb1_combined') # combined = augmented version
training_data.remove_short_utterances(550)  # Remove utts with fewer than 550 frames
training_data.remove_speakers_with_few_utterances(10)  # Remove spks with less than 10 utts

print('Selecting PLDA training data...')
plda_data = UtteranceSelector().choose_all('voxceleb1_combined')
plda_data.select_random_speakers(400)

trial_data = UtteranceSelector().choose_all('voxceleb1')
trial_data.select_random_speakers(100)

03 Model training and evaluation

Notebooks starting with "02" and "03" were used for training and evaluating all the reported models. In the src folder you can find the methods that actually build the predictive models according to the specified parameters, plus some methods that were tried during the experimentation phase (done using K-Fold CV on the train set) but weren't used in the final training because of their lower results.
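To make the K-Fold CV step more concrete, here is a dependency-free sketch of how candidate age-regression methods could be compared on the train set. Everything in it is an assumption made for illustration: the function names, the use of mean absolute error as the metric, and the toy "predict the mean age" baseline. The real model-building code is the one in the src folder.

```python
import random
from statistics import mean


def k_fold_indices(n_samples, k, seed=0):
    """Shuffle the sample indices and split them into k nearly equal folds."""
    indices = list(range(n_samples))
    random.Random(seed).shuffle(indices)
    return [indices[i::k] for i in range(k)]


def cross_validate_mae(features, ages, fit, predict, k=5, seed=0):
    """Return the mean absolute error averaged over the k held-out folds."""
    fold_errors = []
    for held_out in k_fold_indices(len(ages), k, seed):
        held_out_set = set(held_out)
        train = [i for i in range(len(ages)) if i not in held_out_set]
        # Fit on k-1 folds, then score on the fold that was left out.
        model = fit([features[i] for i in train], [ages[i] for i in train])
        errors = [abs(predict(model, features[i]) - ages[i]) for i in held_out]
        fold_errors.append(mean(errors))
    return mean(fold_errors)


# Toy baseline (hypothetical): always predict the mean age of the training folds.
def fit_mean(train_features, train_ages):
    return mean(train_ages)


def predict_mean(model, feature):
    return model
```

Each candidate method is passed in as a fit/predict pair, for example cross_validate_mae(features, ages, fit_mean, predict_mean, k=5), and the candidates with the lowest cross-validated error are the ones worth keeping for the final training.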
