We consider various similarity measures between different clusterings of a dataset, parametrized by the number of clusters. Parallelizing our code, we graph the similarity between 20 implementations of kmeans for 1 up to 15 clusters. Studying these histograms, we can make a better-informed choice about how many clusters best segregate our linguistic dataset.
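The idea can be sketched in a few lines of plain Python. This is only an illustration under stated assumptions, not the code in this repository: the Rand index stands in for the "various similarity measures", the k-means routine is a bare-bones Lloyd's iteration, and the names `kmeans`, `rand_index`, and `stability` are hypothetical. The runs are independent of each other, so they could be distributed across workers, but the sketch keeps them sequential.

```python
import random
from itertools import combinations
from math import comb


def sqdist(p, q):
    """Squared Euclidean distance between two equal-length tuples."""
    return sum((a - b) ** 2 for a, b in zip(p, q))


def kmeans(points, k, seed, iters=50):
    """Bare-bones Lloyd's k-means; returns one cluster label per point."""
    rng = random.Random(seed)
    centers = rng.sample(points, k)  # random initial centers drawn from the data
    labels = None
    for _ in range(iters):
        # Assignment step: each point goes to its nearest center.
        new_labels = [min(range(k), key=lambda c: sqdist(p, centers[c])) for p in points]
        if new_labels == labels:
            break  # assignments stopped changing
        labels = new_labels
        # Update step: move each center to the mean of its members.
        for c in range(k):
            members = [p for p, lab in zip(points, labels) if lab == c]
            if members:  # an empty cluster keeps its previous center
                centers[c] = tuple(sum(col) / len(members) for col in zip(*members))
    return labels


def rand_index(a, b):
    """Fraction of point pairs on which two labelings agree (same/different cluster)."""
    n = len(a)
    agree = sum((a[i] == a[j]) == (b[i] == b[j]) for i, j in combinations(range(n), 2))
    return agree / comb(n, 2)


def stability(points, ks, n_runs):
    """For each k, return the pairwise Rand indices between n_runs seeded k-means runs."""
    scores = {}
    for k in ks:
        runs = [kmeans(points, k, seed) for seed in range(n_runs)]
        scores[k] = [rand_index(x, y) for x, y in combinations(runs, 2)]
    return scores


if __name__ == "__main__":
    # Synthetic stand-in data (an assumption): two Gaussian blobs in the plane.
    rng = random.Random(0)
    data = [(rng.gauss(0, 1), rng.gauss(0, 1)) for _ in range(20)]
    data += [(rng.gauss(10, 1), rng.gauss(10, 1)) for _ in range(20)]
    for k, vals in stability(data, ks=range(1, 5), n_runs=5).items():
        print(k, round(sum(vals) / len(vals), 3))
```

Each list in the returned dictionary holds the values one would plot as a histogram for that number of clusters. Scores concentrated near 1 indicate that repeated runs keep producing nearly the same partition.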
lishali/Stability_of_Kmeans
About
How stable are the kmeans clusters in the classifying_linguistic_geograph repo?