Suggestion: infer plan type from `x` type #100

JeffFessler · 2022-06-22T03:02:35Z

If I understand correctly, the current approach is that user decides between cpu and gpu versions by either

CPU: p = plan_nfft(x, N) or (currently equivalently I think) p = plan_nfft(Array, x, N)
GPU: p = plan_nfft(CuArray, x, N)

I'd prefer that the default plan type come from the type of x by adding a method something like
plan_nfft(x, N) = plan_nfft(typeof(x), x, N)

So if x is a CuArray then by default the plan will be a GPU plan.

The reason is that then I can embed plan_nfft into downstream packages without forcing them to depend on CUDA.jl.
This might also "future proof" it so that someday when we have a OpenCLArray version then again it can inherit from x.
But I might be missing something?
And I have not really been able to test it out yet because I think the recent CUDA fixes are waiting for release updates.

I realize that if x is an Adjoint type (or a Range) then this would need additional work to get at the "base" of such types.

The text was updated successfully, but these errors were encountered:

tknopp · 2022-06-22T05:58:31Z

But I might be missing something?

No, your proposal is clever. One argument against it would be that in practice the nodes will not neccecary need to be copied to the GPU. Our current implementation in CuNFFT actually precomputes the (sparse) convolution matrix on the CPU and then copies it to the GPU.

And I have not really been able to test it out yet because I think the recent CUDA fixes are waiting for release updates.

Thanks for the trigger, will do later. The real issue is that we don't have CI for that and I therefore need to manually run the tests on a dedicated computer.

I realize that if x is an Adjoint type (or a Range) then this would need additional work to get at the "base" of such types.

That is fixable by having a method that collects the nodes first and afterwards makes the GPU/CPU dispatch. We currently do this here https://github.com/JuliaMath/NFFT.jl/blob/master/AbstractNFFTs/src/derived.jl#L28. But the order of the methods would need to change. Will have a look at that.

In general: This GPU/CPU dispatching has not seen any real world usage. For instance in MRIReco.jl it is not yet possible to use GPUs without hacking the source code. So any real-world testing (e.g. in MIRT) would be appreciated.

tknopp · 2022-06-22T05:59:17Z

ping @migrosser

JeffFessler · 2022-06-23T22:25:17Z

The real issue is that we don't have CI for that

Bummer. I would offer to do the test on my GPU machine if this were a single package, but I don't really know how to do such test properly for a repo with multiple nested packages. I used ]add with the #master branch of CuNFFT and NFFT and then did test NFFT. I got one package dependency error but still the tests ran and everything seemed to run except for one ERROR: LoadError: UndefVarError: libfinufft not defined which I suspect is unrelated to #97. So it seems OK, but I do not have full confidence of my test given the package nesting...

any real-world testing (e.g. in MIRT) would be appreciated.

Yep, I am working on it. In fact this suggestion originated from a user reported issue where the user clearly thought that having the nodes on the GPU would suffice to invoke CUDA (and so did I initially until I looked into it more): JeffFessler/mirt-demo#5

tknopp · 2022-06-24T14:10:44Z

One needs to dev NFFT and CuNFFT (and probably also AbstractNFFTs) and then do the testing. But they were commented out. I re-enabled them
https://github.com/JuliaMath/NFFT.jl/blob/master/test/runtests.jl#L15
And I also tested on my GPU system (works!) and made a release. So right now CuNFFT should work.

tknopp · 2022-06-24T14:21:08Z

By the way, CuNFFT is not of the same quality as its CPU implementation. We use the sparse matrix trick since it is so simple to bring everything on the GPU. As far as I have seen, this is that fastest GPU implementation:
https://github.com/flatironinstitute/cufinufft
It would be very interesting how far one can get with a pure Julia code. I have no experience in that direction until now. If there is interest in a competitive CuNFFT.jl implementation we should probably create a dedicated issue and discuss there. Other idea would be to just create wrapper around cufinufft for the moment.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Suggestion: infer plan type from `x` type #100

Suggestion: infer plan type from `x` type #100

JeffFessler commented Jun 22, 2022

tknopp commented Jun 22, 2022

tknopp commented Jun 22, 2022

JeffFessler commented Jun 23, 2022

tknopp commented Jun 24, 2022

tknopp commented Jun 24, 2022

Suggestion: infer plan type from x type #100

Suggestion: infer plan type from x type #100

Comments

JeffFessler commented Jun 22, 2022

tknopp commented Jun 22, 2022

tknopp commented Jun 22, 2022

JeffFessler commented Jun 23, 2022

tknopp commented Jun 24, 2022

tknopp commented Jun 24, 2022

Suggestion: infer plan type from `x` type #100

Suggestion: infer plan type from `x` type #100