CUDA: Error handling variables not added to the `@llvm.used` list #9526

gmarkall · 2024-04-08T16:13:32Z

#9267 fixed dropping of kernels by pynvjitlink by adding kernels to the @llvm.used list.

We also add global variables for representing an error handling state:

Lines 219 to 224 in 03f2722

    def define_error_gv(postfix):
        name = wrapfn.name + postfix
        gv = cgutils.add_global_variable(wrapper_module, ir.IntType(32),
                                         name)
        gv.initializer = ir.Constant(gv.type.pointee, None)
        return gv

The variables seem to get optimized away when LTO is used with pynvjitlink. I suspect they should also be added to the @llvm.used list to prevent this: from the perspective of device code they are only ever written to, so they look unneeded. Only the host reads their values, after kernel execution.


Previous commits added support for compiling Python functions to CUDA LTO-IR via the compilation interfaces. This commit adds stub code for supporting compilation of `@cuda.jit`-decorated functions to LTO-IR.

The only functional change, unused in Numba at present, is that if the linker has LTO enabled, the CUDA codegen uses NVVM to generate LTO-IR instead of PTX, and passes that to the linker.

The `lto` attribute is added to the linker classes in `numba.cuda.cudadrv.driver`. It is always `False` for the built-in linkers, but a linker from pynvjitlink (or any other external linker, in theory) could set it to `True` to signal that LTO is enabled.

Some tests must be skipped if LTO is enabled, because the functionality they test becomes difficult to use:

- Some inspect the PTX, which is difficult to do when LTO-IR is generated instead.
- Others check for exceptions, but the exception flags get optimized away by LTO because Numba fails to add them to the used list (see numba#9526).

- Wording edits to docs on CUDA compilation.
- Check for `if cc is not None` rather than just `if cc`, etc., in the codegen, for greater robustness.
- Add a test that checks the error reported when specifying an illegal output kind.
- Cross-reference numba#9526 in the comment in `TestUserUxc`.

gmarkall added the CUDA and bug - miscompile labels Apr 8, 2024

stuartarchibald mentioned this issue Apr 24, 2024

CUDA: Add support for compilation to LTO-IR #9274

Merged
