Background operations after the index has been built #31909
Unanswered
ronbarhen
asked this question in
Q&A and General discussion
Replies: 1 comment 4 replies
-
segment size is too large. usually what we recommend is 2GB - 4GB. otherwise index build will be very slow. |
Beta Was this translation helpful? Give feedback.
4 replies
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
-
Hello,
I have a database of 500 millions 96 float32 features records.
I use a script which is very close to the script "example_bulkinsert_numpy.py" in pymilvus in order to create the index and insert the data into it.
After about 4 hours when the index is built and after "utility.wait_for_index_building_complete()" returns and "collection.load(_refresh = True)" returns, the index is searchable but the search speed is very slow.
Moreover, if I look at htop, after the index is built and when no search is performed, I can see that the CPU is heavily used in the background.
The situation is even worse if I define a segment size of 128GB, then, you have to wait 3-4 hours in order for the search speed to be OK.
My questions are:
Thanks!
Ron
Beta Was this translation helpful? Give feedback.
All reactions