Skip to content
This repository has been archived by the owner on Aug 2, 2022. It is now read-only.

Use with frequent updates #357

Open
z0mb1ek opened this issue Jul 29, 2021 · 4 comments
Open

Use with frequent updates #357

z0mb1ek opened this issue Jul 29, 2021 · 4 comments
Assignees

Comments

@z0mb1ek
Copy link

z0mb1ek commented Jul 29, 2021

Hi! Big thanks for this project, it is great!
Please tell me if I understand correctly: If I often add vectors or often update vectors, I will have many segments and search performance will be slow?

@jmazanec15
Copy link
Member

Hi @z0mb1ek ,

Yes that is correct. How much slower depends on how many segments there are. The hnsw search complexity scales O(log(n)), where n is the number of vectors (1).

Lucene will search the segments sequentially. So searching 5 segments with 100,000 vectors would take 5 *log(100,000) ~= 83. Searching 1 segment with 500,000 vectors would take log(500,000) ~= 19.

@z0mb1ek
Copy link
Author

z0mb1ek commented Aug 5, 2021

thx @jmazanec15

Are there any options for update at least once every 2-3 hours? Without downtime

@jmazanec15
Copy link
Member

What do you mean by update?

Regardless, you won't get downtime.

@z0mb1ek
Copy link
Author

z0mb1ek commented Aug 27, 2021

add portion of vectors, delete them

Sign up for free to subscribe to this conversation on GitHub. Already have an account? Sign in.
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants