Avoid allocation in `BigDecimal` serializer with alternative way to write bytes. #1016

gdela · 2023-10-14T17:41:41Z

This is a continuation of #1014 and alternative to #1015. Same as #1015 it avoids allocation, but additionally new methods were added to the Input/Output:

Output.writeBytesFromLong(long bytes, int count)
Input.readBytesAsLong(int count)
Those methods write/read specified number of bytes using a long value as storage for bytes instead of byte[] array, to avoid allocation of the array on the heap. Additionally they properly require() necessary number of bytes in the buffer.

Thanks to those changes, the BigDecimalSerializer is even faster (mostly) than the version from #1015:

Of course it's thus also faster (completely) than the code in the master branch:

Any suggestions for the method names (and the code) are welcome, and if this change is too intrusive I'm also happy with only the #1015 being merged.

…Input/Output .

NathanSweet · 2023-10-16T02:29:34Z

src/com/esotericsoftware/kryo/io/Output.java

+		int p = position;
+		position = p + count;
+		for (int i = count - 1; i >= 0; i--) {
+			buffer[p++] = (byte) (bytes >> (i << 3));


Writing could be one less shift per iteration (untested):

for (int i = (count - 1) << 3; i >= 0; i -= 8) buffer[p++] = (byte)(bytes >> i);

Same for ByteBufferOutput.

Reading could match, like:

for (int i = (count - 1) << 3; i > 0; i -= 8) bytes |= (buffer[p++] & 0xFF) << i;

However, for reading what you have may be better: simpler initializer and similar operations per iteration.

I tried to do that, but the read side got a bit more complex:

long bytes = buffer[p] >= 0 ? 0 : ~(-1L >>> ((8-count) << 3)); for (int i = (count - 1) << 3; i >= 0; i -= 8) { bytes |= (buffer[p++] & (long) 0xFF) << i; }

And also much slower (while the write side stayed the same when it comes to speed):

So in 813ae4b I did something different - I unrolled the loops manually replacing it with some switches and repeated code. This got the reading/writing a little bit faster:

NathanSweet · 2023-10-16T02:42:11Z

src/com/esotericsoftware/kryo/io/ByteBufferInput.java

@@ -352,6 +352,16 @@ public void readBytes (byte[] bytes, int offset, int count) throws KryoException
 		}
 	}

+	public long readBytesAsLong (int count) {


This is better than my initial idea of readBytes2, readBytes3, etc. Nice!

I don't hate the name readBytesAsLong, but what if it was readLong(int count)? Should we have a readInt(int count) for 2-3 bytes?

The first line of the method should be:

if (count < 0 || count > 8) throw new IllegalArgumentException("count must be >= 0 and <= 8: " + count);

Same with other methods taking a count parameter.

Done in 029190b, and the readInt(int count) in 9698899.

NathanSweet · 2023-10-16T02:44:34Z

src/com/esotericsoftware/kryo/io/ByteBufferOutput.java

@@ -276,6 +276,14 @@ public void writeBytes (byte[] bytes, int offset, int count) throws KryoExceptio
 		}
 	}

+	public void writeBytesFromLong (long bytes, int count) {


writeLong(long bytes, int count)? Should we also have writeInt(int bytes, int count)?

I have renamed to writeLong(long bytes, int count) and added writeInt(int bytes, int count). I wonder if those methods should stay where they are in the Output class, i.e. in the section marked as // byte:, or should I move them to sections // long: and // int: correspondingly (and to the same in Input class)?

NathanSweet · 2023-10-20T16:03:12Z

DefaultSerializers.java is showing as the whole file modified?

I like the switches, kudos!

Last one: can I bother you to check if read/writeLong is better with it's own implementation? Calling the int method twice is probably slower, since there are more method calls and multiple require.

gdela · 2023-10-26T13:07:02Z

I've tried versions wher read/writeLong has its own implementation, but the results puzzle me. The 10- and 19-digits long value, which in serialized form uses 5 and 8 bytes correspondigly, got a bit better, but at the same time the 1-2 digits values, which uses one byte got worse, even though it shouldn't.

Version 1:

Version 2:

So given that the results are inconclusive, and the code gets longer, maybe it's not worth it?

gdela · 2023-10-26T13:40:13Z

DefaultSerializers.java is showing as the whole file modified?

Yes, seems I spoiled line endings. To avoid making mess in git history, I've created a fresh pull request with the same changes as the ones in this pull request, but with proper line endings - it's pull request #1018, so I'll close this one.

NathanSweet · 2023-11-05T18:11:41Z

Sorry, I got swamped for a while there.

Interesting results for the int/long methods. Maybe the JVM is optimizing when the number of the method bytecodes is below some threshold and the 8 count switch exceeds that. I suppose it's not worth doing then.

We can squash commits when a PR is merged, so it's fine to make lots of a commits in the PR rather than open separate issues to keep the PR history clean. Not a big deal either way, we'll carry on in #1018.

Avoid allocation in BigDecimal serializer, new method for bytes in …

62bcd2d

…Input/Output .

gdela marked this pull request as ready for review October 14, 2023 17:45

gdela mentioned this pull request Oct 14, 2023

BigDecimal serializer memory and throughput optimizations #1014

Merged

NathanSweet reviewed Oct 16, 2023

View reviewed changes

Wojtek Gdela added 3 commits October 17, 2023 14:38

Improve names, tests, checks for writing and reading bytes of a long.

029190b

Add to Input/Output methods for writing and reading bytes of an int.

9698899

Optimize writing and reading bytes of a long/int by unrolling loop.

813ae4b

gdela mentioned this pull request Oct 26, 2023

Avoid allocation in BigDecimal serializer with alternative way to write bytes [crlf corrected]. #1018

Merged

gdela closed this Oct 26, 2023

gdela mentioned this pull request Oct 26, 2023

Avoid allocation in BigDecimal serializer. #1015

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Avoid allocation in `BigDecimal` serializer with alternative way to write bytes. #1016

Avoid allocation in `BigDecimal` serializer with alternative way to write bytes. #1016

gdela commented Oct 14, 2023

NathanSweet Oct 16, 2023

gdela Oct 17, 2023

NathanSweet Oct 16, 2023 •

edited

gdela Oct 17, 2023

NathanSweet Oct 16, 2023

gdela Oct 17, 2023

NathanSweet commented Oct 20, 2023

gdela commented Oct 26, 2023

gdela commented Oct 26, 2023

NathanSweet commented Nov 5, 2023

Avoid allocation in BigDecimal serializer with alternative way to write bytes. #1016

Avoid allocation in BigDecimal serializer with alternative way to write bytes. #1016

Conversation

gdela commented Oct 14, 2023

NathanSweet Oct 16, 2023

Choose a reason for hiding this comment

gdela Oct 17, 2023

Choose a reason for hiding this comment

NathanSweet Oct 16, 2023 • edited

Choose a reason for hiding this comment

gdela Oct 17, 2023

Choose a reason for hiding this comment

NathanSweet Oct 16, 2023

Choose a reason for hiding this comment

gdela Oct 17, 2023

Choose a reason for hiding this comment

NathanSweet commented Oct 20, 2023

gdela commented Oct 26, 2023

gdela commented Oct 26, 2023

NathanSweet commented Nov 5, 2023

Avoid allocation in `BigDecimal` serializer with alternative way to write bytes. #1016

Avoid allocation in `BigDecimal` serializer with alternative way to write bytes. #1016

NathanSweet Oct 16, 2023 •

edited