
@BeforeMethod() does not get called before each call? #896

Open
Ocramius opened this issue Jul 24, 2021 · 16 comments

Comments

@Ocramius
Contributor

I was working on laminas/laminas-servicemanager#93 today, and noticed that we still have the codebase sprinkled with blocks referencing #304

In fact, #304 has been closed as "old", but still applies today.

My assumption was that @BeforeMethods would be called before each call to the bench method: that doesn't seem to be the case, and it leads to benchmarking warmed-up caches too, which is a problem (especially if we're benchmarking said warmup).

In the following example, I would expect 100 calls to mySetup(), but only 10 occur:

/**
 * @BeforeMethods({"mySetup"})
 * @Revs(10)
 * @Iterations(10)
 */
class MyBench
{
    public function mySetup(): void
    {
        reset_all_the_things_here();
    }

    public function benchSomething(): void
    {
        // irrelevant
    }
}

What's the best way forward here? Is a change in @BeforeMethods viable? It would massively change the benchmark results for the bench suites I've worked on so far, but it would lead to more honest results.

@dantleech
Member

dantleech commented Jul 24, 2021

The problem (as you know) is that it introduces the call to a setUp from within the sample loop, which contaminates the time measurement. How would you offset it? You could run another benchmark for the clone and deduct it, but that benchmark also includes the overhead of microtime start/stop, X method calls to "setUp", and the overhead of the for loop. The error margin could be such that you end up with a negative result.

@BeforeMethods is called once per iteration (i.e. sample); the code is then repeated X times consecutively to determine the net time, and from that the average. Most of the time this is probably ok.
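That per-iteration structure can be sketched roughly as follows (a minimal illustration only, not PHPBench's actual execution template; runIteration and its signature are assumptions made for this sketch):

```php
<?php

// Minimal sketch (not PHPBench's real template) of how one iteration is
// sampled: before-methods run once, then the subject is repeated $revs
// times inside a single timed window.
function runIteration(object $benchmark, array $beforeMethods, string $subject, int $revs): float
{
    foreach ($beforeMethods as $beforeMethod) {
        $benchmark->$beforeMethod(); // once per iteration, NOT once per rev
    }

    $start = microtime(true);
    for ($i = 0; $i < $revs; $i++) {
        $benchmark->$subject();
    }
    $elapsed = microtime(true) - $start;

    return $elapsed / $revs; // net time averaged over the revolutions
}
```

With Revs(10) and Iterations(10), this is why the setup method runs 10 times rather than 100: it sits outside the revolution loop.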

Resetting the state in a way that doesn't reduce the percentage of time spent executing the code you are benchmarking is hard.

Possible solutions though:

  1. Don't care about it: include the call within the subject and accept that it's a constant overhead
  2. Benchmark the "overhead" in another benchmark and deduct it in a report (the suggestion in "Provide a way to reset state/caches at every loop" #304 is now possible on master - something like mode(partition['result_time_avg']) - mode(frame[benchmark_name='Foobar' and subject_name="setupSomething"]["result_time_avg"]) as time)
  3. Create a custom executor that runs benchmarks with calls to setup within the loop or similar (would need to base it on the remote executor with a custom template)

@Ocramius
Contributor Author

within the sample loop,

Shouldn't it be before measurements start?

In pseudo code:

loop {
    setup();
    start_measuring();
    method();
    collect_measurement();
}

@dantleech
Member

dantleech commented Jul 24, 2021

I don't think so - it further contaminates the loop. Think of setUp booting a DI container, loading fixtures, etc. Even with micro "setup" calls it introduces 3 new method calls, and microtime can only report integers at microsecond resolution (setting aside that we don't use hrtime).

Your example is similar to running PHPBench with revs=1 (plus the avoidable overhead of the for loop).

@Ocramius
Contributor Author

So the correct workaround would be to run everything with revs=1? 🤔

What about warmup laps? Does @BeforeMethods get called once a warmup is complete?

@dantleech
Member

dantleech commented Jul 26, 2021

So the correct workaround would be to run everything with revs=1?

Not really - the resolution would be too low. I assume you are measuring at microsecond resolution, and that is thrown off by the calls to microtime and the for loop. The idea of the loop is to fill the "sampling space" as much as possible with the work you want to benchmark, to reduce the cost of the measurement itself.

It would be interesting to see whether hrtime helps solve this problem to any extent. Otherwise the solution is probably, as best as possible, to benchmark the per-revolution setup cost and deduct it (a new executor).

Would be interesting to try both.
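As a rough illustration of why hrtime is interesting here: hrtime() (PHP >= 7.3) returns integer nanoseconds, whereas microtime(true) is a float with roughly microsecond grain. The surrounding code below is illustrative only, not PHPBench internals:

```php
<?php

// Compare the two clocks around a stand-in for the benchmarked code.
// hrtime(true) yields integer nanoseconds, so even a single revolution
// can produce a usable reading - the idea behind sampling inside the loop.
$start = hrtime(true);
$subjectResult = array_sum(range(1, 100)); // stand-in for the subject
$elapsedNs = hrtime(true) - $start;        // integer nanoseconds

$startMicro = microtime(true);
$subjectResult = array_sum(range(1, 100));
$elapsedSec = microtime(true) - $startMicro; // float seconds, ~microsecond grain
```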

What about warmup laps? Does @BeforeMethods get called once a warmup is complete?

BeforeMethods is called at the beginning of the iteration, before any call to the subject method:

// run before methods
foreach ($beforeMethods as $beforeMethod) {
    $benchmark->$beforeMethod($parameters);
}

It's worth considering whether you want to do heavy things like booting a DI container or preparing fixtures before every revolution. I think what we are talking about could be a new feature rather than a change to @BeforeMethods.

@scorgn

scorgn commented Nov 19, 2021

PHPUnit has the setUp method, which runs before every test. When you run a specific test, you want everything to be freshly set up each time it's run. Even if it's running through the same test multiple times with different data, you still want to make sure you're re-setting up the test before each run. If you couldn't undo changes from the last test run before each run, then there would be certain things you couldn't test properly, unless you were to include the setup logic inside each test itself.

PHPUnit also lets you run bootstrapping logic before any tests are run, so you can set up the application the way you need only once at the beginning of the tests.

It seems like PHPBench is missing that feature: being able to set up a benchmark before each run. Except in this case, including the setup logic inside the benchmark methods themselves also has additional drawbacks that wouldn't apply with PHPUnit.

Although maybe I'm misunderstanding the difference between revs and iterations and the solution really is to just have one rev with many iterations.

@Ocramius
Contributor Author

Yeah, the reason why I haven't closed this issue yet is that:

  1. we really need a way to "reset" the SUT across runs
  2. we need to exclude anything that is considered "warmup" from the measurement

Right now, I think most bench suites that I have (which use phpbench/phpbench) are flawed, because they assumed that @BeforeMethods would reset the SUT, which isn't the case. In fact, those tests would have radically different output if phpbench behaved differently (closer to my expectation in the OP).

If @BeforeMethods is not the solution (and changing it would be a BC break) we probably need a different concept that represents what we need, which is:

foreach ($repetition as $i) {
    reset_stuff();

    start_measuring();
    run_benchmark_method();
    stop_measuring();
}
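That pseudocode could be sketched in PHP as follows, keeping the reset outside the measured window by timing only the subject call with hrtime (names like samplePerRev are illustrative, not a PHPBench API):

```php
<?php

// Hedged sketch of the desired loop: reset the SUT before every
// revolution, but measure only the subject call itself.
function samplePerRev(callable $reset, callable $subject, int $revs): array
{
    $samples = [];
    for ($i = 0; $i < $revs; $i++) {
        $reset(); // deliberately NOT inside the timed window

        $start = hrtime(true);
        $subject();
        $samples[] = hrtime(true) - $start; // nanoseconds for this rev only
    }

    return $samples;
}
```

The open question from the discussion above is whether the per-rev timer reads themselves add enough noise to swamp the measurement.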

@dantleech
Member

dantleech commented Nov 19, 2021

My main concern (which may or may not be valid) is that it affects the information content of the sample.

start_measuring is non-zero and stop_measuring is non-zero; their margin of error may overlap with the thing you are benchmarking. The reason for the for loop is to try to ensure that the time measured is your thing and not the instrumentation. Also (and I need to check) the grain is a microsecond, so the longer the loop takes the better the accuracy.

The subject is reset via BeforeMethods before it starts the revolutions (the for loop), and the result is the Iteration.

You can run the loop with Revs=1 to achieve the result above (although the template could probably be improved for that case). It would also be a bit slower (a new process for each sample of 1 rev).

But in general, need to do some research to see what can be done or how correct my assumptions are.

@Ocramius
Contributor Author

Ocramius commented Nov 19, 2021

start_measuring is non-zero stop_measuring is non-zero,

Agreed, but right now, we're not measuring anything valuable either :D

(new process for each sample of 1 rev)

That would then lead to things like autoloading overhead (and accidental measurement thereof), no?

@dantleech
Member

dantleech commented Nov 19, 2021

Agreed, but right now, we're not measuring anything valuable either :D

Depends if you have state or not ?

That would then lead to things like autoloading overhead (and accidental measurement thereof), no?

@WarmUp can be employed for that

@Ocramius
Contributor Author

Depends if you have state or not ?

There's always some degree of state, so re-creating (or re-using, which is where I would design two separate benchmarks to see the difference) the SUT is vital.

@Ocramius
Contributor Author

Specifically, if you believe it is possible to achieve precise measurements as per current state, I would need some guidance on boesing/laminas-servicemanager@d6e7fe4 - perhaps this can be reduced to a documentation issue then?

@dantleech
Member

To be honest, I don't know. The original reasoning is that the grain is a microsecond, which is pretty useless at Revs=1 (it's 0 or 1 microsecond, compared with doing the same thing 1000 times to give you 0.655 microseconds). I don't know if that's correct now or was then, or how accurate we can be with hrtime at this level.
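The averaging argument in numbers (the 0.655 µs figure is illustrative, taken from the sentence above):

```php
<?php

// A 1 µs-grain timer cannot distinguish a 0.655 µs operation at revs=1
// (the reading is 0 or 1 µs), but 1000 consecutive revolutions sum to
// ~655 µs, and dividing back recovers ~0.655 µs per call.
$trueCostMicros = 0.655;
$revs = 1000;

$totalMicros = round($trueCostMicros * $revs); // what a µs-grain timer reports: 655
$perRev = $totalMicros / $revs;                // averaging recovers 0.655
```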

The current execution template won't remove the for loop at revs=1, but that might not make much difference.

I'll need to investigate and experiment (and this is pretty much the next priority).

@Ocramius
Contributor Author

You don't need to make this a priority: at this point, we're just pillaging this issue :D

It's just a thing to be aware of if you are tinkering with the design of phpbench/ in the near future. For now, clone $this->something is a sufficient workaround for me, although potentially unreliable and polluting the results.

@dantleech
Member

Well, it fits into general improvements around execution etc., and I genuinely want to be able to answer these questions 😅

@dantleech
Member

So I did an experiment:

#944

The outcome is that hrtime seems to remove the necessity of the loop at smaller scales. So we could add a new hrtime executor and provide an option to sample inside/outside the loop.

At that point, running with Revs=1 and a before method would work for you, I think. We could also add an option to decide where the "before/after" methods get called, so that Revs=1000 could call the setup methods for each sampling.

@dantleech dantleech added this to To do in 1.3 Nov 21, 2021
@dantleech dantleech removed this from To do in 1.3 Nov 21, 2021