
[Draft] Add support for custom allocators #366

Draft: wants to merge 4 commits into master
Conversation

@richardartoul (Contributor) commented Sep 8, 2022

TODO: It's error-prone to require the allocator to be non-nil. We could just do a nil check on each usage so that users instantiating a bitmap like &Bitmap{} aren't broken. I will address this if we decide to move forward with this approach.

Benchmark Before

goos: darwin
goarch: amd64
pkg: github.com/RoaringBitmap/roaring
cpu: Intel(R) Core(TM) i7-8569U CPU @ 2.80GHz
BenchmarkRepeatedSparseSerialization-8   	 3732640	       316.4 ns/op	      96 B/op	       5 allocs/op
PASS
ok  	github.com/RoaringBitmap/roaring	1.849s

Benchmark After

goos: darwin
goarch: amd64
pkg: github.com/RoaringBitmap/roaring
cpu: Intel(R) Core(TM) i7-8569U CPU @ 2.80GHz
BenchmarkRepeatedSparseSerialization-8   	 5236890	       220.4 ns/op	      24 B/op	       1 allocs/op
PASS
ok  	github.com/RoaringBitmap/roaring	1.575s

@lemire (Member) commented Sep 8, 2022

You go from 316.4 ns/op to 220.4 ns/op. Can you provide a rationale as to why your code (which defers to the Go make function) is so much faster?

@richardartoul (Contributor, Author)

@lemire I'm pretty sure it's not deferring to the Go make function (I wrote this quickly, so it's totally possible I made a mistake). See the difference in allocations:

Before

 5 allocs/op

After

1 allocs/op

The reason for this is that the following branches are always true:

if size <= cap(a.buf) && capacity <= cap(a.buf) {

and

if size <= cap(a.uint16s) && capacity <= cap(a.uint16s) {

and I only have a single allocator, so it just reuses the single instance of buf and uint16s over and over again instead of creating new ones constantly. Once l.WriteTo() is done, all the data has been captured in the output buffer and it's safe to reuse the same instances for the next iteration. That's the idea, anyway!
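The reuse pattern described above can be sketched roughly as follows. This is a hypothetical, simplified allocator, not the PR's actual code; the type and method names (reusingAllocator, GetBuf) are illustrative assumptions:

```go
package main

import "fmt"

// reusingAllocator is a hypothetical single-threaded allocator that hands
// out the same backing slice on every call, growing it only when the
// requested capacity exceeds what it already holds. Names are illustrative.
type reusingAllocator struct {
	buf []byte
}

// GetBuf returns a slice of the requested size. On the hot path, where the
// cached slice is already big enough, it reslices and reuses it with zero
// allocations; otherwise it allocates a larger one and caches it.
func (a *reusingAllocator) GetBuf(size, capacity int) []byte {
	if size <= cap(a.buf) && capacity <= cap(a.buf) {
		return a.buf[:size]
	}
	a.buf = make([]byte, size, capacity)
	return a.buf
}

func main() {
	a := &reusingAllocator{}
	first := a.GetBuf(8, 16) // allocates once
	second := a.GetBuf(4, 8) // reuses the same backing array
	fmt.Println(&first[0] == &second[0]) // same backing storage: true
}
```

In a tight serialization loop, every iteration after the first hits the fast path, which is consistent with the drop from 5 allocs/op to 1 alloc/op in the benchmark.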

@richardartoul (Contributor, Author)

(The benchmark is using a custom allocator that basically always returns the same slices, which is an overly simplified version of what we would do in our production code as well.)

@richardartoul (Contributor, Author) commented Sep 8, 2022

@lemire Ah, ok, I think I see the source of the confusion. My proposal is not to implement a "default" thread-safe allocator for everyone to use by default. My proposal is to support users injecting their own allocator, and users will write whatever type of allocator makes sense for them. They could write an allocator that uses manual refcounting and is globally shared with locks, or they could do what I plan to do, which is to spawn a dedicated allocator in critical-path single-threaded loops so that each iteration of a loop can reuse data structures over and over again.
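The injection idea could look roughly like the sketch below, including a nil-safe fallback to plain make so that a zero-value &Bitmap{} still works. The interface, type names, and fallback are assumptions for illustration, not the PR's actual API:

```go
package main

import "fmt"

// Allocator is a hypothetical injection point: the bitmap asks the
// allocator for slices instead of calling make directly. Users supply
// whatever implementation suits them (locked and shared, per-loop, etc.).
type Allocator interface {
	GetUint16s(size, capacity int) []uint16
}

// defaultAllocator just defers to make, preserving today's behavior.
type defaultAllocator struct{}

func (defaultAllocator) GetUint16s(size, capacity int) []uint16 {
	return make([]uint16, size, capacity)
}

// Bitmap carries an optional user-injected allocator.
type Bitmap struct {
	alloc Allocator
}

// allocator nil-checks on each use so &Bitmap{} is not broken,
// per the TODO in the PR description.
func (b *Bitmap) allocator() Allocator {
	if b.alloc == nil {
		return defaultAllocator{}
	}
	return b.alloc
}

func main() {
	b := &Bitmap{} // no allocator injected; still safe
	s := b.allocator().GetUint16s(4, 8)
	fmt.Println(len(s), cap(s)) // 4 8
}
```

The nil check trades a branch per allocation for safety of the zero value, which is the cheap end of the trade-off the PR description's TODO mentions.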
