Benchmark against competition #136

mre · 2020-10-13T11:24:31Z

People are interested in a size-comparison between

tinysearch
lunr.js
elasticlunr
flexsearch
fuse.js

If anyone wants to do a comparison, post a comment here.
Some ideas:

Use an open dataset, e.g. Shakespeare texts https://shakespeare.folger.edu/download/
Create a size histogram of the results that we can include into the README.
Add the code to the repository under a bench folder and create a pull request. This will make the benchmark reproducible in the future.

Feel free to ask questions here before starting.

The text was updated successfully, but these errors were encountered:

Jieiku · 2022-05-01T21:19:32Z

Awesome idea. I use Zola for my blog which comes with elasticlunr. I am evaluating different search solutions.

Might want to add Stork to this list as well.

Jieiku · 2023-04-13T04:50:01Z

I found that somebody actually made one that tests a lot of them!

https://nextapps-de.github.io/flexsearch/bench/

Unfortunately tinysearch is not in their benchmark list.

(neither is pagefind)

mre · 2023-04-13T08:59:21Z

Interesting. We should add it there; also Stork and pagefind.

Jieiku · 2023-04-13T17:16:08Z

one thing I dont see in their benchmark is bandwidth used... so they really should have a second benchmark for that.

(that is just as important to me as performance, if they are not interested in making one then I might.)

mre · 2023-04-13T22:42:38Z

How exactly do you define bandwidth? Queries per second?
You could create an issue for the project to discuss that.

Jieiku · 2023-04-14T00:07:04Z

no, what i mean is hit the website. perform a search, how many KB had to be downloaded for the search javascript + wasm + index to perform that search.

For instance Stork is around 700kb whereas tinysearch is around 100kb for abridge.

Obviously the amount of content will affect the index size. In their test they appear to be using Gulliver's Travels

they could also benchmark say: 1search, 10searches, etc. to see if that affects the amount of data downloaded to perform the searches, obviously this would affect pagesearch and possibly others because they chunk the index into multiple pieces.

mre · 2023-04-14T09:33:49Z

Gotcha. Yeah, that would indeed be a great comparison.

mre added enhancement New feature or request good first issue Good for newcomers hacktoberfest help wanted Extra attention is needed labels Oct 13, 2020

Jieiku mentioned this issue Aug 7, 2022

Is there a way to return the page description or body in the results? #159

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Benchmark against competition #136

Benchmark against competition #136

mre commented Oct 13, 2020

Jieiku commented May 1, 2022 •

edited

Jieiku commented Apr 13, 2023 •

edited

mre commented Apr 13, 2023

Jieiku commented Apr 13, 2023 •

edited

mre commented Apr 13, 2023

Jieiku commented Apr 14, 2023 •

edited

mre commented Apr 14, 2023

Benchmark against competition #136

Benchmark against competition #136

Comments

mre commented Oct 13, 2020

Jieiku commented May 1, 2022 • edited

Jieiku commented Apr 13, 2023 • edited

mre commented Apr 13, 2023

Jieiku commented Apr 13, 2023 • edited

mre commented Apr 13, 2023

Jieiku commented Apr 14, 2023 • edited

mre commented Apr 14, 2023

Jieiku commented May 1, 2022 •

edited

Jieiku commented Apr 13, 2023 •

edited

Jieiku commented Apr 13, 2023 •

edited

Jieiku commented Apr 14, 2023 •

edited