[FEA] Add NVTX ranges to pool allocation/deallocation #495

rongou · 2020-08-13T23:09:54Z

Is your feature request related to a problem? Please describe.
It would be helpful to be able to see in an Nsight profile how much time is spent allocating/deallocating memory in the pool memory resource, especially in a multi-threaded environment with per-thread default stream.

Describe the solution you'd like
Add NVTX ranges to memory allocation/deallocation in the pool.

Additional context
cuDF has a macro defined: https://github.com/rapidsai/cudf/blob/branch-0.15/cpp/include/cudf/detail/nvtx/ranges.hpp

jrhemstad · 2020-08-14T00:24:02Z

@harrism looked into this in the past. We ended up not going through with it because most allocation events are faster than the recommended 1us minimum duration for events annotated with NVTX. That said, I don't see anything wrong with adding it as an option that is disabled by default.

I think we can take a simpler, coarser-grained approach than #336 and just annotate the device_memory_resource base class allocate/deallocate calls. That way we can see the annotations no matter what resource is being used.
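To make the suggestion above concrete, here is a minimal sketch of annotating a base class's public allocate/deallocate entry points so that every derived resource is covered, with the ranges compiled out unless explicitly enabled. Everything in it is an assumption for illustration: `scoped_range`, `range_log`, `SKETCH_ENABLE_RANGES`, `SKETCH_FUNC_RANGE`, `memory_resource`, and `malloc_resource` are hypothetical names, and `scoped_range` is a stand-in that records events in a vector instead of calling NVTX, so the example runs without CUDA or a profiler. It is not RMM's actual implementation.

```cpp
#include <cassert>
#include <cstddef>
#include <cstdlib>
#include <string>
#include <vector>

// Event log used by the stand-in range type (hypothetical helper).
inline std::vector<std::string>& range_log() {
  static std::vector<std::string> log;
  return log;
}

// Stand-in for an NVTX RAII range: "push" on construction, "pop" on
// destruction. A real build would use the NVTX C++ header instead.
class scoped_range {
 public:
  explicit scoped_range(const char* name) : name_{name} {
    assert(name != nullptr);
    range_log().push_back(std::string{"push:"} + name_);
  }
  ~scoped_range() { range_log().push_back(std::string{"pop:"} + name_); }
  scoped_range(const scoped_range&) = delete;
  scoped_range& operator=(const scoped_range&) = delete;

 private:
  const char* name_;
};

// Opt-in switch (hypothetical name): disabled by default, so the macro
// expands to a no-op unless SKETCH_ENABLE_RANGES is defined at build time.
#ifdef SKETCH_ENABLE_RANGES
#define SKETCH_FUNC_RANGE() scoped_range sketch_range_{__func__}
#else
#define SKETCH_FUNC_RANGE() ((void)0)
#endif

// Base class: the public, non-virtual entry points carry the annotation,
// so derived resources get it without any changes of their own.
class memory_resource {
 public:
  virtual ~memory_resource() = default;

  void* allocate(std::size_t bytes) {
    SKETCH_FUNC_RANGE();
    return do_allocate(bytes);
  }

  void deallocate(void* ptr, std::size_t bytes) {
    SKETCH_FUNC_RANGE();
    do_deallocate(ptr, bytes);
  }

 private:
  virtual void* do_allocate(std::size_t bytes) = 0;
  virtual void do_deallocate(void* ptr, std::size_t bytes) = 0;
};

// A trivial host-memory resource used only to exercise the base class.
class malloc_resource final : public memory_resource {
  void* do_allocate(std::size_t bytes) override { return std::malloc(bytes); }
  void do_deallocate(void* ptr, std::size_t) override { std::free(ptr); }
};
```

Built with `-DSKETCH_ENABLE_RANGES`, each call to `allocate` or `deallocate` would record one push/pop pair named after the function. Built without it, the calls carry no annotation overhead, which matches the "disabled by default" suggestion given how short most allocations are.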

jrhemstad · 2020-08-14T00:25:41Z

We're likely going to run into some difficulty/conflicts with having the nvtx3.hpp header in both RMM and libcudf until NVIDIA/NVTX#2 is merged and we can pull the header from there to ensure both have the same version of the header.

rongou · 2020-08-14T00:53:01Z

There is some concern about contention between different threads/streams around the CUDA events used in the pool. It'd be nice to have better insight into that scenario.

harrism · 2020-08-14T03:02:29Z

I have an open PR, #336, but I am waiting (as Jake points out) on NVIDIA/NVTX#2. I will just add NVTX regions at the top level rather than at low levels within pool_memory_resource.

github-actions · 2021-02-16T17:29:27Z

This issue has been marked rotten due to no recent activity in the past 90d. Please close this issue if no further response or action is needed. Otherwise, please respond with a comment indicating any updates or changes to the original issue and/or confirm this issue still needs to be addressed.

github-actions · 2021-02-16T17:29:47Z

This issue has been marked stale due to no recent activity in the past 30d. Please close this issue if no further response or action is needed. Otherwise, please respond with a comment indicating any updates or changes to the original issue and/or confirm this issue still needs to be addressed. This issue will be marked rotten if there is no activity in the next 60d.

harrism · 2021-02-16T23:16:10Z

Still waiting on NVIDIA/NVTX#2 to be put in a release.

github-actions · 2021-03-18T23:23:14Z

This issue has been labeled inactive-30d due to no recent activity in the past 30 days. Please close this issue if no further response or action is needed. Otherwise, please respond with a comment indicating any updates or changes to the original issue and/or confirm this issue still needs to be addressed. This issue will be labeled inactive-90d if there is no activity in the next 60 days.

github-actions · 2021-11-18T18:01:25Z

This issue has been labeled inactive-90d due to no recent activity in the past 90 days. Please close this issue if no further response or action is needed. Otherwise, please respond with a comment indicating any updates or changes to the original issue and/or confirm this issue still needs to be addressed.

Let's get RMM allocate/deallocates showing up in profiler timelines.

Closes #495

Authors:
- Mark Harris (https://github.com/harrism)
- Bradley Dice (https://github.com/bdice)

Approvers:
- Lawrence Mitchell (https://github.com/wence-)
- Vyas Ramasubramani (https://github.com/vyasr)

URL: #1558

rongou added the "? - Needs Triage" and "feature request" labels Aug 13, 2020

github-actions bot added this to Needs prioritizing in Feature Planning Aug 13, 2020

github-actions bot added the inactive-90d label Feb 16, 2021

github-actions bot added the inactive-30d label Feb 16, 2021

github-actions bot removed inactive-30d inactive-90d labels Feb 16, 2021

github-actions bot added the inactive-30d label Mar 18, 2021

github-actions bot added the inactive-90d label Nov 18, 2021

harrism mentioned this issue May 9, 2024

Add NVTX support and RMM_FUNC_RANGE() macro #1558

Merged

3 tasks

rapids-bot bot closed this as completed in #1558 May 23, 2024
