Q&A: Scale effects #661

Answered by alexklibisz
ezorita asked this question in Q&A
Mar 26, 2024 · 2 comments · 1 reply

Hi @ezorita, these are some good questions. I'll try to answer below.

> However, we are experiencing scaling issues. In a small corpus of 500k documents it's blazing fast, when we go up to 150M it takes about a minute to run an elastiknn query (even without further filtering).

150M is more than I've ever tested with. It's not surprising that it takes longer, but 60s sounds like the cluster might just lack the resources for that amount of data. I'm assuming that these are LSH (approximate) queries. As a sanity check, how long does it take to run a standard term query on 150M documents with your current infrastructure? Any Elastiknn vector query is basically matching a bunch of terms, so the vector quer…
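The sanity check suggested above could be sketched roughly as follows. This is only an illustration, not part of the original answer: the helper names (`term_query_body`, `post_search`, `time_search`) are hypothetical, and it assumes a cluster reachable over plain HTTP without authentication. It builds a standard term query, sends it to the `_search` endpoint, and records both the wall-clock time and the `took` value (milliseconds) that Elasticsearch reports in the search response.

```python
import json
import time
import urllib.request


def term_query_body(field, value, size=10):
    """Build a standard Elasticsearch term query body."""
    return {"size": size, "query": {"term": {field: {"value": value}}}}


def post_search(base_url, index, body):
    """POST a search body to <base_url>/<index>/_search and return the parsed JSON.

    Assumes no authentication or TLS setup is needed (an assumption for this sketch).
    """
    request = urllib.request.Request(
        f"{base_url}/{index}/_search",
        data=json.dumps(body).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )
    with urllib.request.urlopen(request) as response:
        return json.load(response)


def time_search(search_fn, body, runs=5):
    """Run the same search `runs` times.

    Returns a list of (wall_clock_seconds, server_took_ms) tuples, one per run.
    `search_fn` is any callable that takes a query body and returns the
    parsed search response as a dict.
    """
    timings = []
    for _ in range(runs):
        start = time.perf_counter()
        response = search_fn(body)
        elapsed = time.perf_counter() - start
        timings.append((elapsed, response.get("took")))
    return timings


# Example usage (hypothetical URL, index, and field names; needs a live cluster):
#   body = term_query_body("some_keyword_field", "some_value")
#   search = lambda b: post_search("http://localhost:9200", "my-index", b)
#   for seconds, took_ms in time_search(search, body):
#       print(f"wall clock: {seconds:.3f}s, took: {took_ms}ms")
```

Repeating the query several times helps separate cold-cache runs from warm ones. If a plain term query over the same 150M documents is already slow, the bottleneck is more likely the infrastructure than the vector query itself.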

2 participants
Converted from issue

This discussion was converted from issue #659 on March 27, 2024 19:03.