Skip to content
This repository has been archived by the owner on Mar 21, 2024. It is now read-only.

Can an object with thread_scope be used by distinct thread groups? #325

Closed Answered by jrhemstad
jrhemstad asked this question in Q&A
Discussion options

You must be logged in to vote

For posterity, we decided to place a specific limitation on cuda::barrier<cuda::thread_scope_block> in __shared__ memory such that it can only be used by the threads in the CTA of the thread that constructed it. This was necessary compromise in order to leverage certain hardware acceleration features.

For all other data structures and state spaces, it remains valid to reuse those objects across different thread groups.

See: NVIDIA/cccl#75

Replies: 3 comments 9 replies

Comment options

You must be logged in to vote
1 reply
@jrhemstad
Comment options

jrhemstad Oct 19, 2022
Maintainer Author

Comment options

You must be logged in to vote
8 replies
@hcedwar
Comment options

@griwes
Comment options

@hcedwar
Comment options

@griwes
Comment options

@hcedwar
Comment options

Comment options

jrhemstad
Aug 21, 2023
Maintainer Author

You must be logged in to vote
0 replies
Answer selected by jrhemstad
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Category
Q&A
Labels
None yet
4 participants