Fix race condition in OpenCL kernel #3535

frasercrmck · 2024-02-21T11:17:15Z

Description

Without the barrier at the end of barrierOR, it is possible for work-item 0 to start the next loop iteration and update predicates[0] while other work-items are still inside barrierOR reading predicates, meaning they read the next loop iteration's exit condition. This results in a divergent loop, where not all work-items reach the same barriers.

A previous fix identified this as a problem only on NVIDIA platforms, but strictly speaking a barrier is required in all cases to avoid a spec violation and undefined behaviour.

Changes to Users

The kernel should produce correct results on more OpenCL implementations.

Locally I tested both Intel(R) FPGA Emulation Device and various oneAPI Construction Kit devices, which all previously failed the confidence_connected_opencl --gtest_filter="SingleSeed/ConfidenceConnectedDataTest.SegmentARegion/_prefix_background_radius_0_multiplier_1_iterations_5_replace_255" unit test.

I'm unable to test other OpenCL implementations, sorry.

Checklist

Rebased on latest master
Code compiles
Tests pass
~~[ ] Functions added to unified API~~
~~[ ] Functions documented~~

Without the barrier at the end of barrierOR, it is possible for work-item 0 to start the next loop iteration and update predicates[0] while other work-items are still inside barrierOR reading `predicates`, meaning they read the next loop iteration's exit condition. This results in a divergent loop, where not all work-items reach the same barriers. A previous fix identified this as a problem only on NVIDIA platforms, but strictly speaking a barrier is required in all cases to avoid a spec violation and undefined behaviour.

umar456 · 2024-02-21T21:57:12Z

Took me a bit to figure out the problem but I see the issue now. The we can ignore the errors in the CI because they are not related. I will test it on a couple of other systems before merge this PR. Thank you for your contribution!

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fix race condition in OpenCL kernel #3535

Fix race condition in OpenCL kernel #3535

frasercrmck commented Feb 21, 2024

umar456 commented Feb 21, 2024

Fix race condition in OpenCL kernel #3535

Are you sure you want to change the base?

Fix race condition in OpenCL kernel #3535

Conversation

frasercrmck commented Feb 21, 2024

Description

Changes to Users

Checklist

umar456 commented Feb 21, 2024