Draft: Desul ordered atomic policies + litmus tests #1616
Draft
publixsubfan wants to merge 14 commits into LLNL:develop from publixsubfan:feature/yang39/desul-memory-order
Commits on Mar 18, 2024
Commit 89e38c8
Commit b71c42b
Commit 4be5c8c
Increase observability of weak memory behaviors (e73d52a)
Fiddling around with some parameters for the litmus test driver:
- It seems that having only a subset of the running blocks participate in the Message Passing litmus test increases the rate at which weak memory behaviors are observed.
- Pre-stressing memory doesn't seem to help on NVIDIA V100s.
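For readers unfamiliar with the Message Passing litmus test mentioned in this commit, the following is a minimal host-side sketch of its shape. It uses `std::atomic` and `std::thread` rather than the RAJA/desul atomics and GPU blocks this PR targets, and the function name `count_mp_violations` is hypothetical. With the release/acquire pairing shown, the outcome "flag seen as 1 but data seen as 0" is forbidden; weakening both orderings to relaxed is what would make the weak behavior observable on some hardware.

```cpp
#include <atomic>
#include <cassert>
#include <thread>

// Hypothetical helper (not part of the PR): runs the message-passing litmus
// test `iterations` times on the host and returns how often the forbidden
// outcome (flag == 1 but data == 0) was observed.
int count_mp_violations(int iterations) {
  assert(iterations > 0);
  int violations = 0;
  for (int i = 0; i < iterations; ++i) {
    std::atomic<int> data{0};
    std::atomic<int> flag{0};
    int r_flag = 0;
    int r_data = 0;

    // Writer: publish the payload, then raise the flag with release ordering.
    std::thread writer([&] {
      data.store(1, std::memory_order_relaxed);
      flag.store(1, std::memory_order_release);
    });
    // Reader: read the flag with acquire ordering, then read the payload.
    std::thread reader([&] {
      r_flag = flag.load(std::memory_order_acquire);
      r_data = data.load(std::memory_order_relaxed);
    });
    writer.join();
    reader.join();

    // If the flag was seen, the payload must have been seen as well.
    if (r_flag == 1 && r_data == 0) ++violations;
  }
  return violations;
}
```

Under the C++ memory model, `count_mp_violations` returns 0 for any iteration count, because the acquire load that observes the release store also makes the earlier write to `data` visible.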
Add a store buffer litmus test (e0dea11)
Store buffering is an observable behavior where a store may be reordered after a subsequent load to a different location. This test exercises MemoryOrderSeqCst.
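As an illustration of the store buffer pattern this commit describes, here is a minimal host-side sketch. It assumes plain `std::atomic` with `std::memory_order_seq_cst` in place of the `MemoryOrderSeqCst` policy under test, and `count_sb_violations` is a hypothetical name. Each thread stores to one variable and then loads the other; under sequential consistency at least one thread must see the other's store, so both loads returning 0 is forbidden.

```cpp
#include <atomic>
#include <cassert>
#include <thread>

// Hypothetical helper (not part of the PR): runs the store-buffer litmus
// test `iterations` times and returns how often both threads read 0.
int count_sb_violations(int iterations) {
  assert(iterations > 0);
  int violations = 0;
  for (int i = 0; i < iterations; ++i) {
    std::atomic<int> x{0};
    std::atomic<int> y{0};
    int r0 = -1;
    int r1 = -1;

    // Thread 0: store to x, then load y.
    std::thread t0([&] {
      x.store(1, std::memory_order_seq_cst);
      r0 = y.load(std::memory_order_seq_cst);
    });
    // Thread 1: store to y, then load x.
    std::thread t1([&] {
      y.store(1, std::memory_order_seq_cst);
      r1 = x.load(std::memory_order_seq_cst);
    });
    t0.join();
    t1.join();

    // With seq_cst on all four accesses, r0 == 0 && r1 == 0 cannot occur.
    if (r0 == 0 && r1 == 0) ++violations;
  }
  return violations;
}
```

With weaker orderings (for example release stores and acquire loads), the `r0 == 0 && r1 == 0` outcome becomes permitted, which is why this pattern distinguishes sequential consistency from acquire/release.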
Commit 35a7a11
Commits on Mar 19, 2024
Litmus tests: various modifications (d32bf98)
- Use a forall device kernel to check results
- Interleave order of operations between testing threads
- Only warn on a lack of observed relaxed behaviors
Litmus tests: changes to increase relaxed observation rates (822d8dd)
Correctly use the stress testing formulation from the paper "Foundations of Empirical Memory Consistency Testing" (OOPSLA 2020).
Instead of having all stressing blocks scatter their accesses across the "stressing" array, select a small-ish subset of 64-word lines and stripe them across the stressing blocks. This increases the stress on the contention hardware in a GPU.
Synchronize testing blocks and stressing blocks together on each iteration.
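The striping described in this commit can be sketched as a small index computation. This is an assumption-laden illustration, not the PR's code: `stress_index`, `num_stress_lines`, and `line_stride` are hypothetical names, and the round-robin assignment is one plausible reading of "stripe them across the stressing blocks". Only the 64-word line size comes from the commit message.

```cpp
#include <cassert>
#include <cstddef>

// Line size in words, as stated in the commit message.
constexpr std::size_t kWordsPerLine = 64;

// Hypothetical helper: maps a stressing block to the first word of its
// assigned stress line. Only `num_stress_lines` lines are ever touched;
// they are spaced `line_stride` lines apart in the stressing array, and
// blocks are assigned to them round-robin.
std::size_t stress_index(std::size_t block_id,
                         std::size_t num_stress_lines,
                         std::size_t line_stride) {
  assert(num_stress_lines > 0 && line_stride > 0);
  const std::size_t line = block_id % num_stress_lines;
  return line * line_stride * kWordsPerLine;
}
```

For example, with 4 stress lines spaced 8 lines apart, blocks 1 and 5 both map to word offset 512, so many blocks contend on the same few lines instead of spreading across the whole array.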
Commit 2784605
Commit 1fea7cf
Commit 94d45b5
Commit a90f93a
Commit a86a176
Commit 3f4dc30