
feat: add caching to GapicCallable #527

Merged: 28 commits into main from optimize_gapic_callable on May 3, 2024

Conversation

@daniel-sanche (Contributor) commented Sep 6, 2023

_GapicCallable currently does a lot of work on each call, re-building the wrapped function through a series of helper-function calls. This cost is added to every single RPC, so it can really add up.

This PR makes the following optimizations:

  • removes _apply_decorators and _is_not_none_or_false, building the wrapped call more directly.
    • This seems to make it ~10% faster.
  • adds a new helper that builds the wrapped call from a timeout and retry object, and caches the result with @lru_cache (a rough sketch of the pattern follows this list).
    • This seems to make it 50% faster.
    • In practice, I think it's safe to assume most calls will re-use the same timeout and retry values.
    • The cache size is currently 4, but this can be changed.
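
A rough sketch of the caching pattern, for illustration only: the helper name _build_wrapped_call and its signature are invented here (the real change lives inside _GapicCallable), and retry/timeout are assumed to be the callable decorator objects from google.api_core.

import functools

@functools.lru_cache(maxsize=4)
def _build_wrapped_call(target, retry, timeout):
    # Compose the retry and timeout decorators around the target once.
    # lru_cache keys on the (target, retry, timeout) arguments, so repeated
    # calls that pass the same retry/timeout objects reuse the cached wrapper
    # instead of rebuilding it on every RPC.
    wrapped = target
    if retry is not None:
        wrapped = retry(wrapped)    # Retry objects act as callable decorators
    if timeout is not None:
        wrapped = timeout(wrapped)  # as do google.api_core timeout objects
    return wrapped

Because most RPCs reuse the same retry and timeout values, a tiny cache (maxsize=4) already covers the common case.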

Benchmark:

from google.api_core.gapic_v1.method import _GapicCallable
from google.api_core.retry import Retry

gapic_callable = _GapicCallable(lambda *a, **k: 1, retry=Retry(), timeout=1010, compression=False)

from timeit import timeit
timeit(lambda: gapic_callable())

Before: 20.43s
After: 9.48s
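
(timeit defaults to number=1000000, so these figures are the total time for one million calls: roughly 20 µs per call before the change and about 9.5 µs after.)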

product-auto-label bot added the size: s (Pull request size is small.) label on Sep 6, 2023
product-auto-label bot added the size: m (Pull request size is medium.) label and removed size: s on Feb 10, 2024
daniel-sanche marked this pull request as ready for review on February 10, 2024 01:35
daniel-sanche requested review from a team as code owners on February 10, 2024 01:35
daniel-sanche changed the title from "[DRAFT] feat: optimize GapicCallable" to "feat: add caching to GapicCallable" on Feb 10, 2024
vchudnov-g self-assigned this on Feb 12, 2024
@parthea (Collaborator) left a comment

> This seems to make it 50% faster

Thanks for fixing this!

Please could you add a simple benchmarking presubmit, similar to the test that you ran manually, to avoid future regressions in performance?

@daniel-sanche (Contributor, Author)

Sure, I just added a unit test. Let me know if that works

@parthea (Collaborator) left a comment


LGTM, but please wait for @vchudnov-g to review

@parthea (Collaborator) commented Feb 13, 2024

We may need a larger buffer to ensure that the test is not flaky but still captures regressions. Perhaps double the value?

https://github.com/googleapis/python-api-core/actions/runs/7893470370/job/21542170780?pr=527

>       assert avg_time < 0.15  # expect ~0.1, but allow for some variance
E       assert 0.17590151499999251 < 0.15

@parthea (Collaborator) commented Feb 14, 2024

Assigning back to @daniel-sanche to resolve the presubmit failure

@daniel-sanche (Contributor, Author)

Hmm, good point: the benchmark result will be machine-specific, and I was doing my tests locally rather than on the CI workers.

I guess I'll have to find an assertion value that works well for the CI nodes, and I'll add a comment explaining that it may flake on slower hardware. Or let me know if you have other suggestions for how to approach this.

@parthea (Collaborator) commented Feb 14, 2024

Can you set it high enough that we don't get flaky results, but low enough that we can still detect performance regressions?

Perhaps set the threshold to 0.4 for now and create an issue in https://github.com/googleapis/python-api-core/issues to add a proper benchmarking test? I believe @ohmayr started looking into a benchmarking presubmit, so please tag him on the issue.

@daniel-sanche (Contributor, Author)

Sure, I opened an issue to track this here: #616

I adjusted the value to 0.4. Feel free to merge it with that number, but I suspect we can find a lower value that still avoids flakiness. Let me know if you want me to do some investigation.

parthea assigned vchudnov-g and unassigned daniel-sanche on Feb 27, 2024
@parthea (Collaborator) commented Feb 27, 2024

@vchudnov-g Please could you review?

@vchudnov-g (Contributor) left a comment


Minor code comment, and an idea about tightening benchmarks.

(Quoting the comment added to the new benchmark test:)

Note: The threshold has been tuned for the CI workers. Test may flake on slower hardware
https://github.com/googleapis/python-api-core/pull/527
Contributor

Do you mean to self-reference this PR?

@daniel-sanche (Contributor, Author)

It was intentional, to give the context on this test. But on second thought, git blame should be enough. Removed.

gapic_callable = _GapicCallable(
    lambda *a, **k: 1, retry=Retry(), timeout=1010, compression=False
)
avg_time = timeit(lambda: gapic_callable(), number=10_000)
assert avg_time < 0.4
Contributor

Idea: If the assertion fails, print both the actual time it took and enough platform information so that in the future we can add the right threshold for the platform. The latter would be something like this:

platform_threshold = { "foo": 0.2, "bar": 0.6 }
current_platform = ...
...
assert avg_time < platform_threshold.get(current_platform, 0.4)

In fact, you could implement platform_threshold now, and start with whatever your current machine is.
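
A minimal sketch of how that could look, purely illustrative: the platform key below (OS, architecture, CPU count) is an assumption, and choosing what to capture is exactly what the follow-up question raises.

import os
import platform

# Hypothetical platform key; the discussion never settles on these fields.
current_platform = f"{platform.system()}-{platform.machine()}-{os.cpu_count()}cpu"

# Per-platform thresholds, seeded with whatever machine the test was tuned on.
platform_threshold = {
    "Linux-x86_64-2cpu": 0.4,  # illustrative CI-worker entry
}

# Placeholder measurement; in the real test this would be
# timeit(lambda: gapic_callable(), number=10_000).
avg_time = 0.12

# Fall back to a generous default on unknown machines, and print enough
# detail on failure to pick a threshold for a new platform later.
threshold = platform_threshold.get(current_platform, 0.4)
assert avg_time < threshold, f"{avg_time=} on {current_platform}"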

@daniel-sanche (Contributor, Author)

That's an interesting idea, but it's not completely clear to me what we'd need to capture for the platform. Number of CPUs? Architecture? OS? Let me know if you have thoughts.

We already have #616 to track improving this, though, so if it's alright with you, I'll merge this as-is and we can discuss follow-ups there.

daniel-sanche merged commit d96eb5c into main on May 3, 2024
27 checks passed
daniel-sanche deleted the optimize_gapic_callable branch on May 3, 2024
Labels: size: m (Pull request size is medium.)

3 participants