Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

About implement of Normalized Maximum Eigengap Spectral Clustering(NME-SC) for Speaker Diarizaton #287

Closed
Zhubisong opened this issue Mar 11, 2024 · 2 comments

Comments

@Zhubisong
Copy link

Thank you for uploading pre-trained ECAPA-TDNN model.

For speaker diarization, the spectral clustering algorithm used by wespeaker uses the p-neighbor binarization scheme, and "p" should be choosed by people. I want to know how to choose "p" for different dataset(such as AMI, DIHARD, MagicData, Callhome or AISHELL4), 0.01 is ok?

In "Auto-Tuning Spectral Clustering for Speaker Diarization Using Normalized Maximum Eigengap", author proposed NME-SC, the algorithm free us for choosing "p". I want to know if wespeaker can Implement the algorithm?

@JiJiJiang
Copy link
Collaborator

  1. I think there isn't a fixed "p" can perform well in all datasets as you mention, which is exactly why the NME-SC algorithm is proposed ans works. In my experience, "p" in [0.01, 0.05] would get a modest result. Also, you can refer to our setup in our diarization recipe.
  2. This algorithm is essentially enumerating the "p" value and find the best in the dev set, which is costly in computation. You can easily implement it from our diarization codes by adding a for loop of "p". Maybe you can contribute the codes when you finish it!

@JiJiJiang
Copy link
Collaborator

This git repo may also help: Auto-Tuning-Spectral-Clustering

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

3 participants