Improve KLL Sketch perf for non-grouped queries #22619

ZacBlanco · 2024-04-26T17:21:41Z

Description

Defers calculations of memory sizes for Kll sketches unless using a grouped query.

Motivation and Context

The current method for calculating memory usage has a hidden cost. Within getEstimatedKllInMemorySize we call getSerializedSizeBytes. The code for the serialized bytes size actually serializes the entire internal state to a byte array first before returning the length. This is expensive and should be avoided.

I am working on a PR to the upstream library to add a less-costly method but until released, I would like to fix this as non-grouped execution doesn't need the memory accounting for every sketch input.

Impact

N/A

Test Plan

N/A

Contributor checklist

Please make sure your submission complies with our development, formatting, commit message, and attribution guidelines.
PR description addresses the issue accurately and concisely. If the change is non-trivial, a GitHub Issue is referenced.
Documented new properties (with its default value), SQL syntax, functions, or other functionality.
If release notes are required, they follow the release notes guidelines.
Adequate tests were added if applicable.
CI passed.

Release Notes

== NO RELEASE NOTE ==

The current method for calculating memory usage has a hidden cost. Within getEstimatedKllInMemorySize we call getSerializedSizeBytes. The code for the serialized bytes size actually serializes the entire internal state to a byte array first before returning the length. This is expensive and should be avoided. I am working on a PR to the upstream library to add a less-costly method but until released, I would like to fix this as non-grouped execution doesn't need the memory accounting for every sketch input.

aaneja · 2024-04-30T03:36:31Z

Can you paste the call stack or method in datasketches-java that ends up serializing the internal state for size calculation ?

ZacBlanco · 2024-04-30T12:08:15Z

You can find the relevant section here

However, for the KllDoublesSketch, the copying doesn't occur. The call chain eventually gets to here where getNumRetained() is just a few array index lookups. I have a local branch where I'm trying to replace the implementation for doubles with this version of the sketch.

They don't have a sketch implementation for raw longs yet, but I am planning on contributing that to the library so that for the native numeric types we can get better performance creating the sketches. You can see some of the perf numbers I was getting in this comment

ZacBlanco force-pushed the upstream-kll-perf branch from 2e08340 to 9afd5b0 Compare April 26, 2024 17:37

ZacBlanco marked this pull request as ready for review April 29, 2024 13:24

ZacBlanco requested a review from a team as a code owner April 29, 2024 13:24

ZacBlanco requested review from presto-oss, aaneja and ClarenceThreepwood April 29, 2024 13:24

tdcmeehan self-assigned this Apr 30, 2024

tdcmeehan approved these changes Apr 30, 2024

View reviewed changes

ZacBlanco merged commit c209f50 into prestodb:master May 1, 2024
57 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Improve KLL Sketch perf for non-grouped queries #22619

Improve KLL Sketch perf for non-grouped queries #22619

ZacBlanco commented Apr 26, 2024

aaneja commented Apr 30, 2024

ZacBlanco commented Apr 30, 2024

Improve KLL Sketch perf for non-grouped queries #22619

Improve KLL Sketch perf for non-grouped queries #22619

Conversation

ZacBlanco commented Apr 26, 2024

Description

Motivation and Context

Impact

Test Plan

Contributor checklist

Release Notes

aaneja commented Apr 30, 2024

ZacBlanco commented Apr 30, 2024