SYCL currently does not support associating USM with a specific queue #1546

jinz2014 · 2023-12-20T21:26:54Z

It is a feature request due to the SYCL support.

The call to cudaStreamAttachMemAsync was replaced with 0 because SYCL currently does not support associating USM with a specific queue.

I am not sure if the CUDA support is required to run the example successfully. Thanks.

tomflinda · 2023-12-26T09:00:12Z

@jinz2014 for cudaStreamAttachMemAsync, attached devPtr has three kind of memory:

managed memory declared using the managed keyword or allocated with cudaMallocManaged, which is mapped into shared memory in SYCL side, the memory is accessible by all the queues in the same context as default.
host-accessible region of system-allocated pageable memory, which has two scenarios.
2.1. the memory is allocated by cudaHostAlloc(), which is mapped to sycl::malloc_host(), the memory is accessible by all the queues in the same context as default..
2.2. the memory is allocated by cudaHostRegister(), which is not support in SYCL side.

In cuda side, the memory should be attached before used on device, while in SYCL side, which is not needed, or not supported.

jinz2014 added the enhancement New feature or request label Dec 20, 2023

Provide feedback