You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
The CPU atomic implementation using std::atomic_ref use a sequentially consistent memory ordering, which is a stronger guarantee than their CUDA counterparts, which are weakly ordered and always require explicit fences. Therefore, the CPU atomics should also be weakened to a relaxed memory order, potentially improving performance on CPUs.
The text was updated successfully, but these errors were encountered:
The CPU atomic implementation using
std::atomic_ref
use a sequentially consistent memory ordering, which is a stronger guarantee than their CUDA counterparts, which are weakly ordered and always require explicit fences. Therefore, the CPU atomics should also be weakened to a relaxed memory order, potentially improving performance on CPUs.The text was updated successfully, but these errors were encountered: