Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Swap ordering of thread configuration in Sycl #1628

Merged
merged 9 commits into from Apr 30, 2024

Conversation

artv3
Copy link
Member

@artv3 artv3 commented Apr 17, 2024

Sycl uses the outermost index as the fastest, this change enables consistency when using RAJA::launch with other device back ends.

For reference this page has a good description on thread indexing using sycl:
https://www.intel.com/content/www/us/en/docs/dpcpp-compatibility-tool/developer-guide-reference/2023-2/cuda-and-sycl-programming-model-comparison.html

See the Thread Indexing section.

@artv3 artv3 marked this pull request as ready for review April 17, 2024 17:54
@artv3
Copy link
Member Author

artv3 commented Apr 29, 2024

@rhornung67 , did you want to take a look at this PR? Or should I just merge?

Copy link
Member

@rhornung67 rhornung67 left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Please add some notes about this in the user guide. Also, did you notice any performance differences with this change?

@artv3
Copy link
Member Author

artv3 commented Apr 30, 2024

Please add some notes about this in the user guide. Also, did you notice any performance differences with this change?

@rhornung67 I just pushed up a note on the docs. Please let me know if any adjustments are needed. I held off on commenting about kernel as we need to double check and we can follow up after. I also don't have a sense on performance changes as I had not looked at timings and the kernels in the raja perf suite were generating incorrect answers prior to this change.

Copy link
Member

@rhornung67 rhornung67 left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Minor doc changes.

artv3 and others added 2 commits April 30, 2024 10:56
Co-authored-by: Rich Hornung <hornung1@llnl.gov>
Co-authored-by: Rich Hornung <hornung1@llnl.gov>
@artv3 artv3 enabled auto-merge April 30, 2024 18:44
@artv3 artv3 merged commit c315ddd into develop Apr 30, 2024
24 checks passed
@rhornung67
Copy link
Member

@artv3 if you don't want to keep the branch, please delete it.

@artv3 artv3 deleted the artv3/sycl-thread-grid-layout-fix branch April 30, 2024 20:18
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

None yet

5 participants