More comprehensive benchmarks #128

Morwenn · 2018-03-02T14:16:09Z

The benchmarks page on the wiki is seldom updated, and only compares the sorting of integer and floating point values. We need better and more comprehensive benchmarks, with the abillity to generate distributions for more kind of types (pairs, arrays, strings, etc...) so that we can better compare the different algorithms and where they are better than others.

A first easier step would be to group sorting algorithms by families to make some benchmarks more relevant since the different families of algorithms generally don't have the same goals.

Some other interesting results could be highlighted: for example, is it stably sorting by applying stable_adapter to pdq_sorter faster in many cases than using an inherently stable sorting algorithm? When is indirect sorting faster? When is using a Schwartz transform faster? I think that some pieces of the library can answer many use cases, but we need many more benchmarks to highlight which tool might be the best for a specific scenario. With a comprehensive benchmark suite, the dedicated wiki page could even be turned in a whole section with several sub-articles.

Another problem: I remember an oldish benchmark that showed how sorting strings with the same algorithm could have greatly different results depending on which compiler was used, which might make the benchmarks more complicated once again...

Ideally we would need a tool where we can select the different parameters before running the benchmarks:

Algorithm
Adapter if any
Type of data
Distribution of data
Compiler
Compiler options

Unfortunately that would require a full tool, and probably a dedicated website too. While this may be of value, I most probably don't have the time nor the skills required to do that. The benchmarks on C++ Hut are a good example of what I would like to achieve.

The text was updated successfully, but these errors were encountered:

Still far from addressing #128, but still a nice improvement to the status quo. [ci skip]

Morwenn · 2021-04-07T17:35:04Z

We are still far from comprehensive benchmarks, but there have been some improvements recently:

A new benchmark for stable sorts that don't allocate heap memory was added to the wiki as part of issue Improve the state of mergesorts #175.
A new dist::to_long_string projection was added to the benchmark suite by 6b5862c. It makes it possible to reuse the existing distributions to generate collections of long strings with a long common prefix. This is a good start to show expensive comparisons.

Morwenn added help wanted tooling labels Mar 2, 2018

Morwenn added a commit that referenced this issue Sep 19, 2020

Add new benchmark for more complete results

2a1ca17

Still far from addressing #128, but still a nice improvement to the status quo. [ci skip]

Morwenn mentioned this issue Feb 21, 2022

Benchmark idea: number of comparisons #202

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

More comprehensive benchmarks #128

More comprehensive benchmarks #128

Morwenn commented Mar 2, 2018 •

edited

Morwenn commented Apr 7, 2021

More comprehensive benchmarks #128

More comprehensive benchmarks #128

Comments

Morwenn commented Mar 2, 2018 • edited

Morwenn commented Apr 7, 2021

Morwenn commented Mar 2, 2018 •

edited