You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
@jinz2014 for cudaStreamAttachMemAsync, attached devPtr has three kind of memory:
managed memory declared using the managed keyword or allocated with cudaMallocManaged, which is mapped into shared memory in SYCL side, the memory is accessible by all the queues in the same context as default.
host-accessible region of system-allocated pageable memory, which has two scenarios.
2.1. the memory is allocated by cudaHostAlloc(), which is mapped to sycl::malloc_host(), the memory is accessible by all the queues in the same context as default..
2.2. the memory is allocated by cudaHostRegister(), which is not support in SYCL side.
In cuda side, the memory should be attached before used on device, while in SYCL side, which is not needed, or not supported.
It is a feature request due to the SYCL support.
The call to cudaStreamAttachMemAsync was replaced with 0 because SYCL currently does not support associating USM with a specific queue.
I am not sure if the CUDA support is required to run the example successfully. Thanks.
Reference
https://github.com/NVIDIA/cuda-samples/tree/master/Samples/0_Introduction/UnifiedMemoryStreams
The text was updated successfully, but these errors were encountered: