Add specialized binary search skipping boundary check #4026

multitalentloes · 2024-04-23T12:42:20Z

Small adjustment that skips checking if the indices are within the range because this is already checked in the outer function.
This function is very often called, so might give a small improvement.

bska

Could we not use std::upper_bound() for this instead?

multitalentloes · 2024-04-23T13:45:37Z

std::upper_bound() seems to be slower on my machine benchmarking a couple of runs on norne...
Edit:
To be a bit more precise, the property evaluation on single core norne on my machine took 75 sec with the original code, 72sec with this small change, and 77 sec using std::upper_bound to binary search and std::distance to get the index

akva2 · 2024-04-24T07:06:21Z

benchmark please

multitalentloes · 2024-04-24T08:08:46Z

The measurements are probably within what can be explained by noise, so hopefully then benchmark can provide some consistent results

ytelses · 2024-04-24T22:47:07Z

Benchmark result overview:

Test	Configuration	Relative
opm-git	OPM Benchmark: drogon - Threads: 1 - FOPT (Total Oil Production At End Of Run)	1
opm-git	OPM Benchmark: drogon - Threads: 8 - FOPT (Total Oil Production At End Of Run)	1
opm-git	OPM Benchmark: punqs3 - Threads: 1 - FOPT (Total Oil Production At End Of Run)	1
opm-git	OPM Benchmark: punqs3 - Threads: 8 - FOPT (Total Oil Production At End Of Run)	1
opm-git	OPM Benchmark: smeaheia - Threads: 1 - FOPT (Total Oil Production At End Of Run)	1
opm-git	OPM Benchmark: smeaheia - Threads: 8 - FOPT (Total Oil Production At End Of Run)	1
opm-git	OPM Benchmark: spe10_model_1 - Threads: 1 - FOPT (Total Oil Production At End Of Run)	1
opm-git	OPM Benchmark: spe10_model_1 - Threads: 8 - FOPT (Total Oil Production At End Of Run)	1
opm-git	OPM Benchmark: flow_mpi_extra - Threads: 1 - FOIT (Total Oil Injection At End Of Run)	1
opm-git	OPM Benchmark: flow_mpi_extra - Threads: 8 - FOIT (Total Oil Injection At End Of Run)	1
opm-git	OPM Benchmark: flow_mpi_norne - Threads: 1 - FOPT (Total Oil Production At End Of Run)	1
opm-git	OPM Benchmark: flow_mpi_norne - Threads: 8 - FOPT (Total Oil Production At End Of Run)	1
opm-git	OPM Benchmark: flow_mpi_norne_4c_msw - Threads: 1 - FOPT (Total Oil Production At End Of Run)	1
opm-git	OPM Benchmark: flow_mpi_norne_4c_msw - Threads: 8 - FOPT (Total Oil Production At End Of Run)	1

Speed-up = Total time master / Total time pull request. Above 1.0 is an improvement. *

View result details @ https://www.ytelses.com/opm/?page=result&id=2448

bska · 2024-04-25T07:09:55Z

Benchmark result overview:

FOPT (Total Oil Production At End Of Run) = 1

That's good to know, but alas not particularly useful from a CPU performance point of view. @blattms: Is anything up with the benchmark test rig at the moment?

multitalentloes · 2024-04-25T07:54:29Z

That was a bit underwhelming, but how are these times measured since there is absolutely no deviation on any case?

akva2 · 2024-04-25T07:55:52Z

notice that those are not even times. benchmarks either didn't execute properly, or it only reported fluid statistics back.

bska · 2024-04-25T08:32:59Z

but how are these times measured since there is absolutely no deviation on any case?

notice that those are not even times.

@akva2 is correct–the FOPT measure isn't a time at all. Rather it's a (crude) measure of solution accuracy. If FOPT differs from one here, then we didn't compute the same solution as the reference case and it in some sense doesn't really matter what performance improvement we get from the PR under test.

multitalentloes · 2024-04-29T11:38:49Z

Can we get some clarifications as of what happened to the performance benchmark @blattms

akva2 · 2024-05-14T08:19:58Z

benchmark please

ytelses · 2024-05-14T15:51:55Z

Benchmark result overview:

Test	Configuration	Relative
opm-git	OPM Benchmark: drogon - Threads: 1	0.997
opm-git	OPM Benchmark: drogon - Threads: 8	0.993
opm-git	OPM Benchmark: punqs3 - Threads: 1	0.978
opm-git	OPM Benchmark: punqs3 - Threads: 8	1.009
opm-git	OPM Benchmark: smeaheia - Threads: 1	0.973
opm-git	OPM Benchmark: smeaheia - Threads: 8	1
opm-git	OPM Benchmark: spe10_model_1 - Threads: 1	1.019
opm-git	OPM Benchmark: spe10_model_1 - Threads: 8	0.997
opm-git	OPM Benchmark: flow_mpi_extra - Threads: 1	0.989
opm-git	OPM Benchmark: flow_mpi_extra - Threads: 8	0.985
opm-git	OPM Benchmark: flow_mpi_norne - Threads: 1	0.984
opm-git	OPM Benchmark: flow_mpi_norne - Threads: 8	0.995
opm-git	OPM Benchmark: flow_mpi_norne_4c_msw - Threads: 1	1.008
opm-git	OPM Benchmark: flow_mpi_norne_4c_msw - Threads: 8	1.001

Speed-up = Total time master / Total time pull request. Above 1.0 is an improvement. *

View result details @ https://www.ytelses.com/opm/?page=result&id=2470

Add specialized binary search skipping boundary check

433c03f

bska reviewed Apr 23, 2024

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add specialized binary search skipping boundary check #4026

Add specialized binary search skipping boundary check #4026

multitalentloes commented Apr 23, 2024

bska left a comment

multitalentloes commented Apr 23, 2024 •

edited

akva2 commented Apr 24, 2024

multitalentloes commented Apr 24, 2024

ytelses commented Apr 24, 2024

bska commented Apr 25, 2024

multitalentloes commented Apr 25, 2024

akva2 commented Apr 25, 2024

bska commented Apr 25, 2024

multitalentloes commented Apr 29, 2024

akva2 commented May 14, 2024

ytelses commented May 14, 2024

Add specialized binary search skipping boundary check #4026

Are you sure you want to change the base?

Add specialized binary search skipping boundary check #4026

Conversation

multitalentloes commented Apr 23, 2024

bska left a comment

Choose a reason for hiding this comment

multitalentloes commented Apr 23, 2024 • edited

akva2 commented Apr 24, 2024

multitalentloes commented Apr 24, 2024

ytelses commented Apr 24, 2024

bska commented Apr 25, 2024

multitalentloes commented Apr 25, 2024

akva2 commented Apr 25, 2024

bska commented Apr 25, 2024

multitalentloes commented Apr 29, 2024

akva2 commented May 14, 2024

ytelses commented May 14, 2024

multitalentloes commented Apr 23, 2024 •

edited