
Fix: remove extraneous content encoding #650

Closed
wants to merge 15 commits

Conversation

chingor13
Collaborator

@chingor13 chingor13 commented May 24, 2019

Fixes #648

In order to calculate the content length, we must first write the encoded
data. The encoded data is written to a buffer using a
ByteArrayOutputStream implementation, and we use that buffer to figure out
how many bytes of data are to be sent.

Previously, this data was thrown away and the content was re-encoded
when it was actually time to send the data.

Instead, we now replace the content with a ByteArrayContent which
contains the buffer we wrote to when calculating the size.

We implemented a new CachingByteArrayOutputStream so that we can access
the byte buffer directly rather than copying it into a new byte array (for
memory performance).
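The CachingByteArrayOutputStream described above is not shown in this thread; a minimal sketch of the idea is below. The class name matches the PR description, but the method name `getBuffer` and the demo harness are assumptions for illustration: `ByteArrayOutputStream.toByteArray()` copies the internal buffer, so a subclass can expose the buffer directly to avoid that copy.

```java
import java.io.ByteArrayOutputStream;

// Hypothetical sketch of the CachingByteArrayOutputStream idea: expose the
// internal buffer without the defensive copy that toByteArray() performs.
class CachingByteArrayOutputStream extends ByteArrayOutputStream {
  /** Returns the internal buffer without copying. Only the first
   *  size() bytes contain valid data; the rest is spare capacity. */
  public byte[] getBuffer() {
    return buf; // protected field inherited from ByteArrayOutputStream
  }
}

public class Demo {
  public static void main(String[] args) {
    CachingByteArrayOutputStream out = new CachingByteArrayOutputStream();
    byte[] data = "hello".getBytes();
    out.write(data, 0, data.length);
    // size() gives the content length for the Content-Length header;
    // the backing buffer may be larger than the data written.
    System.out.println(out.size());
    System.out.println(out.getBuffer().length >= out.size());
  }
}
```

The same buffer can then be handed to a ByteArrayContent so the encoded bytes are written once and sent as-is, instead of being re-encoded at send time.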

@googlebot googlebot added the cla: yes This human has signed the Contributor License Agreement. label May 24, 2019
@chingor13 chingor13 added the kokoro:force-run Add this label to force Kokoro to re-run the tests. label May 24, 2019
@kokoro-team kokoro-team removed the kokoro:force-run Add this label to force Kokoro to re-run the tests. label May 24, 2019
@chingor13 chingor13 marked this pull request as ready for review May 24, 2019 17:53
@chingor13 chingor13 requested a review from a team as a code owner May 24, 2019 17:53
We really only care that the content was encoded and is correctly
marked as such.
* Fix javadoc param name

* Group serializeHeaders() overloads
* Remove deprecated google-http-client-jackson artifact.

Jackson 1.x has been unsupported for a long time. Users should be using
Jackson 2.x. the google-http-client-jackson artifact was deprecated in
1.28.0.

* Fix assembly references to jackson
* Add Base64Test case for some base64 decoding edge cases

* Preserve decoding behavior for null decodeBase64(null)

* Handle encoding with null inputs
@@ -932,7 +932,14 @@ public HttpResponse execute() throws IOException {
       } else {
         contentEncoding = encoding.getName();
         streamingContent = new HttpEncodingStreamingContent(streamingContent, encoding);
-        contentLength = contentRetrySupported ? IOUtils.computeLength(streamingContent) : -1;
+        if (contentRetrySupported) {
+          CachingByteArrayOutputStream outputStream = new CachingByteArrayOutputStream();
Contributor

IOUtils checks the size via streaming and doesn't actually store the data in memory. The same applies when sending the data. A large object currently never has to be fully in memory.

This change may cause an unexpected memory spike for users with large objects
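The streaming size check the reviewer refers to can be sketched as below. This is not the actual IOUtils.computeLength code; the CountingOutputStream class and demo are illustrative assumptions showing how a length can be computed by streaming into a counting sink that discards the bytes, so nothing is buffered.

```java
import java.io.OutputStream;

// Illustrative sketch (not the real IOUtils implementation): count bytes
// as they stream through, discarding them, so the full payload is never
// held in memory just to learn its encoded length.
class CountingOutputStream extends OutputStream {
  long count;

  @Override
  public void write(int b) {
    count++;
  }

  @Override
  public void write(byte[] b, int off, int len) {
    count += len;
  }
}

public class Demo {
  public static void main(String[] args) {
    CountingOutputStream counter = new CountingOutputStream();
    byte[] payload = new byte[1 << 20]; // 1 MiB of zeros stands in for encoded content
    counter.write(payload, 0, payload.length);
    System.out.println(counter.count);
  }
}
```

The trade-off the reviewer raises is exactly this: counting discards the bytes, so the content must be encoded again at send time, whereas buffering avoids the second encoding pass at the cost of holding the whole encoded payload in memory.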


ludoch commented May 29, 2019

Since this bug was filed against the GAE standard product, memory management is critical, as we do not have large-memory instance class machines...
A test in this environment would be welcome (the request size limit is 32 MB, compressed or not, IIUC).

@chingor13
Collaborator Author

As is, this implementation won't be good enough.

@chingor13 chingor13 added the do not merge Indicates a pull request not ready for merge, due to either quality or timing. label May 29, 2019
renovate-bot and others added 8 commits May 31, 2019 10:55
@googlebot

So there's good news and bad news.

👍 The good news is that everyone who needs to sign a CLA (the pull request submitter and all commit authors) has done so. Everything is all good there.

😕 The bad news is that it appears that one or more commits were authored or co-authored by someone other than the pull request submitter. We need to confirm that all authors are ok with their commits being contributed to this project. Please have them confirm that here in the pull request.

Note to project maintainer: This is a terminal state, meaning the cla/google commit status will not change from this state. It's up to you to confirm consent of all the commit author(s), set the cla label to yes (if enabled on your project), and then merge this pull request when appropriate.

ℹ️ Googlers: Go here for more info.

@googlebot googlebot added cla: no This human has *not* signed the Contributor License Agreement. and removed cla: yes This human has signed the Contributor License Agreement. labels Jun 3, 2019
@chingor13 chingor13 closed this Jun 3, 2019
clundin25 pushed a commit to clundin25/google-http-java-client that referenced this pull request Aug 11, 2022
IdtokenProvider implementation for UserCredentials with unit tests for idtoken
clundin25 pushed a commit to clundin25/google-http-java-client that referenced this pull request Aug 11, 2022
Labels
cla: no This human has *not* signed the Contributor License Agreement. do not merge Indicates a pull request not ready for merge, due to either quality or timing.
Development

Successfully merging this pull request may close these issues.

HttpContent is encoded two times when posting data
6 participants