
Princeton GPU hackathon 2022 wrap-up #325

Open · 2 tasks

FlorianReiss opened this issue Jun 9, 2022 · 1 comment

FlorianReiss (Member) commented Jun 9, 2022
During the hackathon, several issues related to running on GPUs were understood:

Stack overflow

Some functions, in particular the kMatrix, allocate a lot of memory on the stack, which can exceed the default limit. Possible solutions are more memory-friendly implementations, using malloc (usually slower), or increasing the stack size with cuda_error_check(cudaDeviceSetLimit(cudaLimitStackSize, 1024*50));

related issues #287 #242
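The stack-size workaround above can be sketched as follows. This is a minimal standalone illustration, not the actual GooFit code; the error-checking macro is a stand-in for GooFit's cuda_error_check, and 1024*50 is the value quoted above:

```cuda
#include <cstdio>
#include <cuda_runtime.h>

// Stand-in for GooFit's cuda_error_check: report any CUDA runtime error.
#define cuda_error_check(call)                                              \
    do {                                                                    \
        cudaError_t err = (call);                                           \
        if(err != cudaSuccess)                                              \
            fprintf(stderr, "CUDA error: %s\n", cudaGetErrorString(err));   \
    } while(0)

int main() {
    // The default per-thread stack is small (typically 1 KiB), so kernels
    // with large local arrays (e.g. the kMatrix amplitude) can overflow it.
    // Raise the limit to 50 KiB before any such kernel launch.
    cuda_error_check(cudaDeviceSetLimit(cudaLimitStackSize, 1024 * 50));

    // Read the limit back to confirm the setting took effect.
    size_t limit = 0;
    cuda_error_check(cudaDeviceGetLimit(&limit, cudaLimitStackSize));
    printf("stack limit per thread: %zu bytes\n", limit);
    return 0;
}
```

Note that cudaDeviceSetLimit must be called before the kernels that need the larger stack; raising the stack size costs device memory per resident thread, which is why the more memory-friendly implementations remain preferable.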

Using RO_CACHE on stack memory

It is not allowed to use RO_CACHE/__ldg on stack memory.

related issues #216 #217

see MR #328
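The constraint can be sketched as below. This is an illustrative fragment, not GooFit code; RO_CACHE is assumed here to expand to __ldg, which only accepts addresses in global memory:

```cuda
// Assumed definition: RO_CACHE reads through the read-only (texture) cache
// via __ldg. __ldg requires a pointer into global memory.
#define RO_CACHE(x) __ldg(&(x))

__global__ void kernel(const double *global_data, double *out) {
    double local[4]; // lives on the thread's stack (local memory)
    local[0] = global_data[0];

    out[0] = RO_CACHE(global_data[1]); // OK: global-memory address
    // out[1] = RO_CACHE(local[0]);    // NOT allowed: stack/local address;
    //                                 // reading it through __ldg is invalid
    out[1] = local[0];                 // plain access is fine for stack data
}
```

In short, the fix is to apply RO_CACHE only to data that genuinely resides in global memory and to use plain loads for stack-allocated values.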

To do

  • finalize and merge fixes
  • document everything learned
FlorianReiss (Member, Author) commented Jun 9, 2022

@thboettc @henryiii @JuanBSLeite just a little summary to keep track of the things we fixed or should be able to fix after the hackathon. Feel free to edit
