Why is kmeans using flat index faster than scikit learn #2589

  1. It depends on the number of iterations that k-means runs. As far as I remember, Faiss runs 25 iterations by default, whereas scikit-learn's KMeans allows up to 300 iterations (max_iter) by default.
  2. By default, Faiss trains on only a subset of the samples: at most 256 * nclusters points, if I recall correctly (the max_points_per_centroid setting). Turning on verbose output shows how many points Faiss actually uses.
  3. Faiss uses an optimized BLAS library (Intel MKL, for example) and some optimized SIMD kernels for the distance computations.
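
To get a feel for how much points 1 and 2 matter, here is a rough back-of-the-envelope sketch in plain Python. It is not Faiss code: the helper names and the workload (10 million vectors, 1000 clusters) are made up for illustration, and the 256 and 25 defaults are simply the values recalled above, so check them against the verbose output of your own build.

```python
def effective_training_points(n_samples, n_clusters, max_points_per_centroid=256):
    """Points actually used for training if the input is subsampled to at most
    max_points_per_centroid * n_clusters points (256 is the value recalled above)."""
    return min(n_samples, max_points_per_centroid * n_clusters)


def distance_evaluations(n_points, n_clusters, n_iterations):
    """Rough count of point-to-centroid distance computations for Lloyd-style
    k-means: every point is compared to every centroid on every iteration."""
    return n_points * n_clusters * n_iterations


# Hypothetical workload, chosen only for illustration.
n_samples, n_clusters = 10_000_000, 1000

subsampled = effective_training_points(n_samples, n_clusters)
faiss_like = distance_evaluations(subsampled, n_clusters, n_iterations=25)
full_data = distance_evaluations(n_samples, n_clusters, n_iterations=25)

print(subsampled)              # 256000
print(full_data / faiss_like)  # 39.0625
```

So with the same number of iterations, subsampling alone would cut the distance computations by roughly 39x for this made-up workload, before counting any difference in iteration count or the BLAS/SIMD kernels from point 3. For a fair timing comparison, it should be possible to pass `niter`, `max_points_per_centroid` and `verbose=True` to `faiss.Kmeans` so that both libraries do a comparable amount of work.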

Answer selected by alexanderguzhva