
Add ability to link CUDA functions with in-memory PTX. #9470

Open · wants to merge 1 commit into base: main
Conversation


@ed-o-saurus commented Feb 27, 2024

This commit adds the PTXCode class. It is a simple wrapper around a string. This allows the user to link CUDA functions with dynamically generated PTX code without having to write data to a file.
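The idea can be illustrated with a minimal sketch. The `PTXCode` name is the one proposed in this PR; the body below is a hypothetical illustration of "a simple wrapper around a string", not the actual implementation, and `is_in_memory` is an invented helper showing how a linker could distinguish wrapped in-memory PTX from a plain file path:

```python
class PTXCode:
    """Sketch of the proposed wrapper: it simply holds PTX assembly text
    so the linker can tell in-memory code apart from a file path
    (which remains a plain str)."""

    def __init__(self, ptx):
        self.ptx = ptx


def is_in_memory(item):
    # Hypothetical dispatch helper: paths stay strings, so wrapping the
    # PTX in a class is enough to preserve backwards compatibility.
    return isinstance(item, PTXCode)


assert is_in_memory(PTXCode(".version 7.0"))
assert not is_in_memory("kernels.ptx")
```

Because existing code passes paths as strings, type-based dispatch like this keeps the current `link=` behaviour unchanged.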

@gmarkall added the CUDA (CUDA related issue/PR) label Feb 28, 2024
@gmarkall (Member) left a comment

Many thanks for the PR, @ed-o-saurus - this is an often-requested feature so I'm happy to see this PR!

I have a few thoughts to add on the design and scope of the changes, but I like the idea of using an object to hold the code while strings continue to be used for paths, maintaining backwards compatibility.

I'll post a follow-up once I've had more time to get my thoughts down.

isVoid pushed a commit to rapidsai/pynvjitlink that referenced this pull request Mar 19, 2024
Adds support for linking code from memory to Numba's `@cuda.jit`
decorator (in addition to the already-supported linking files from
disk). New classes are added to Numba's top-level `cuda` module:

* `Archive`: An archive of objects
* `CUSource`: A CUDA C/C++ source
* `Cubin`: A cubin ELF
* `Fatbin`: A fatbin ELF
* `Object`: An object file
* `PTXSource`: PTX assembly source code

These are all used by constructing them with a single argument, the code
in memory to use. Once created, they can then be passed to the `link=`
kwarg of the `@cuda.jit` decorator. An example showing a use case with a
CUDA C/C++ source is added.

This implementation is inspired by the approach outlined by @ed-o-saurus
in numba/numba#9470.
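The class names and the construction pattern above can be sketched as plain Python. This is a hedged illustration only: the class names and the single-argument constructor come from this commit message, but the `LinkableCode` base class and `kind` attribute are assumptions made here for the sketch, not the actual pynvjitlink implementation:

```python
class LinkableCode:
    """Assumed common base: each class wraps the in-memory code to pass
    to the link= kwarg of @cuda.jit."""

    def __init__(self, data):
        self.data = data


class Archive(LinkableCode):
    kind = "a"        # an archive of objects


class CUSource(LinkableCode):
    kind = "cu"       # CUDA C/C++ source


class Cubin(LinkableCode):
    kind = "cubin"    # cubin ELF


class Fatbin(LinkableCode):
    kind = "fatbin"   # fatbin ELF


class Object(LinkableCode):
    kind = "o"        # object file


class PTXSource(LinkableCode):
    kind = "ptx"      # PTX assembly source


# Construction mirrors the commit message: a single argument, the code
# in memory. The resulting object would then be passed via link= to
# the @cuda.jit decorator (not shown; requires a CUDA GPU).
src = CUSource('extern "C" __device__ int f(int *out) { *out = 1; return 0; }')
assert src.kind == "cu"
```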

Notes on changes:
* Various tests now run on the GPU, so test binaries need to be
generated using the relevant compute capability - this change is applied
in the `Makefile`, along with some refactoring to tidy it up a little.
* Tests for the new functionality are added. In addition, existing tests that
used the test binaries are modified to use a relevant compute capability
(usually that of the test machine), because the test binaries are now built
for the CC of the current GPU; it is no longer sufficient to hard-code CCs
like 7.5 or 7.0 in tests. Since fixtures are needed across test files, I've
started moving them into `conftest.py`.

---------

Co-authored-by: Bradley Dice <bdice@bradleydice.com>
Co-authored-by: jakirkham <jakirkham@gmail.com>