Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

suggestion: tSNE map instead of PCA #282

Open
SaraMati opened this issue Sep 2, 2020 · 2 comments
Open

suggestion: tSNE map instead of PCA #282

SaraMati opened this issue Sep 2, 2020 · 2 comments

Comments

@SaraMati
Copy link
Contributor

SaraMati commented Sep 2, 2020

can we have the option to choose the clustering algorithm to be applied on the tSNE space of the waveforms?

@yger
Copy link
Member

yger commented Sep 8, 2020

SC does not use tSNE, mostly because I'm afraid that it might be very slow, and requires fine hand parameter tuning, depending on the dataset. But this is something we could think of: to launch tSNE before the clustering, instead of a plain and naive PCA. Do you have in mind a special library/implementation?

@SaraMati
Copy link
Contributor Author

SaraMati commented Sep 8, 2020

True, but the hyper parameters are a problem if the number of spikes is really low (~ 100) with anything on the scale of thousands, the performance is robust to tuning of the parameters (e.g. perplexity between 5 to 50). Anyways, a good choice can be perplexity of 30 and learning rate of 500.

true regarding the speed. This implementation is fast though: https://github.com/DmitryUlyanov/Multicore-TSNE
(or this GPU implementation https://github.com/rapidsai/cuml)

it is nice if SC provides the users with the option to choose tSNE instead of PCA (through a setting in the parameter file perhaps), at least for experimentation. The default can still be PCA.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants