feat(wip): cupy histograms #1095
(cf. https://docs.cupy.dev/en/stable/reference/generated/cupy.histogramdd.html, which is the standard regular-array API conforming to the numpy version - no boost or anything)
@martindurant - cool, it appears to use the same technique (fill by atomics), so it'll be subject to the same scaling limitations I'm seeing. However, the number of calls to fill is a bit leaner, so maybe it's worth backing an implementation with it. I'll have to try some benchmarks. Otherwise, there's significant functionality missing from the cupy hists that we'll still have to add on top, if it turns out to run faster in the first place.
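For reference, the API under discussion is a drop-in match for numpy's `histogramdd`, so in principle swapping `import numpy` for `import cupy` is enough to move the fill onto the GPU. A minimal sketch using numpy as a stand-in (the data and binning here are illustrative, not from this PR):

```python
import numpy as np  # cupy mirrors this API: `import cupy as np` runs the fill on-device

# Fill a 2-D histogram of 1M points with fixed bin edges.
rng = np.random.default_rng(42)
data = rng.normal(size=(1_000_000, 2))
edges = [np.linspace(-5, 5, 51), np.linspace(-5, 5, 51)]

counts, out_edges = np.histogramdd(data, bins=edges)
print(counts.shape)  # (50, 50) - one count array over the full bin grid
```

With cupy, `counts` stays in GPU memory, which is the point of deferring the device-to-host copy to the last moment.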
Yes, I expect this is the case. dask-histogram also uses boost and adds a numpy-compatible API on top for those that expect it; and of course, it's designed to work with awkward. I expect there's an amount of sharing and refactoring that can eventually be done.
Yeah - I think it is all possible. Right now it's really about getting all the pipe-fittings in place. I'll let you know if there's any clear win in the benchmarks.
@Saransh-cpp: @jpivarski and I talked last Friday and it came up that you might be interested in taking this "pilot project" and turning it into a full-blown UHI-compatible histogramming interface (a la scikit-hep/hist), but for cupy/cuda histograms. What's in this PR has the necessary functionality for HEP and we can convert to scikit-hep/hist, but it would be nice to have a cohesive ecosystem and only convert to CPU memory-space at the last moment. This would grant us more or less infinite scaling.

We also have some ideas towards warpwise-distributed histograms where a (collection of) warps would tend a sub-range of bins so that filling can be done more in parallel. This old implementation description demonstrates that if you stick to a warp (i.e. 32 bins) and replicate histograms to fill in parallel, you can reach 10 GB/s filling rates, because there's no use of atomics.

This also has interesting parallels to cluster-distributed histograms, where a (relatively enormous) histogram could be distributed across a whole dask cluster and scale to 100s of GBs in size or more. This would effectively remove scaling limitations for histograms for the foreseeable future and is probably important for achieving precision HL-LHC analyses.

Anyway - please let us know if you are interested in turning this into a more mature package and possibly adding features to it! We're happy to answer any questions you may have.
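Both the warpwise and cluster-distributed ideas rest on the same property: histogram filling is an associative reduction, so each worker (warp, GPU, or dask partition) can fill a private partial histogram over its own slice of the data, and the partials are then summed. A hedged numpy sketch of that reduction pattern (the `fill_distributed` helper and the chunking are hypothetical stand-ins for dask partitions and a boost/cupy backend):

```python
import numpy as np

def fill_distributed(chunks, edges):
    # Each "worker" fills a partial histogram over its own data chunk;
    # the partials are combined by elementwise summation.
    partials = [np.histogramdd(chunk, bins=edges)[0] for chunk in chunks]
    return sum(partials)

rng = np.random.default_rng(0)
data = rng.normal(size=(100_000, 2))
edges = [np.linspace(-5, 5, 21)] * 2       # 20 bins per dimension

chunks = np.array_split(data, 4)           # stand-in for dask partitions
dist = fill_distributed(chunks, edges)

ref, _ = np.histogramdd(data, bins=edges)  # single-pass reference fill
assert np.array_equal(dist, ref)           # summed partials match exactly
```

Because the partials are independent, the same pattern scales from 4 local chunks to a cluster-sized histogram sharded across workers, with bin sub-ranges assigned per worker if the histogram itself is too large for one node.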
Work in progress on using cupy in the old coffea-hist package as a demonstrator.