Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

input dimension #2

Open
zhangshaohu opened this issue May 2, 2022 · 2 comments
Open

input dimension #2

zhangshaohu opened this issue May 2, 2022 · 2 comments

Comments

@zhangshaohu
Copy link

Hello!

ASVTorch generates 24 MFCCs, so the MFCCS are (n, 24) shape. Your input is (200, 30). Where is the 30 from? Can you please provide some test samples?

@hechmik
Copy link
Owner

hechmik commented May 2, 2022

Hi!
The 30 comes from the number of Mel bins and ceps specified in the MFCC.conf file used by Kaldi https://gitlab.com/ville.vestman/asvtorch/-/blob/master/asvtorch/recipes/voxceleb/xvector/configs/mfcc.conf.

Regarding the test sample ,unfortunately the answer is no. The reason why is that the original dataset comes from YT videos and there are various copyright issues that may arise (also, the original VoxCeleb team should be, imo, the one to provide the raw tracks and devise appropriate sharing rules in their licence).
We have, however, provided the list of recordings we used for train and test, therefore it should be possible to replicate it by following all the steps described in the paper and in the various notebooks

@zhangshaohu
Copy link
Author

Thank you for your immediate response. I experienced some errors using ASVtorch so I used Kaldi. The original Kaldi for vector was num-ceps=24
https://github.com/kaldi-asr/kaldi/blob/master/egs/voxceleb/v1/conf/mfcc.conf
I will update the value of num-ceps. Yes, Vox data can be requested. I think it is okay if you put several computed features for testing. In this case, somebody would like to replicate your code who only use a simple test example.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants