transient in memory cache #4922

Geal · 2024-04-05T13:45:51Z

This adds another level of in memory cache, to mitigate issues with unique or infrequent queries pushing frequently used queries out of the in memory cache.
If a query has only been seen once, then it is stored inthe transient cache (short term, small number of entries, LRU).
If that query is requested again, and is still present in the transient cache, then it is added to the long term, larger in memory cache, to the redis cache as well, and removed from the transient cache. If that query is requested again but not present in the transient cache, then we test the larger in memory cache and redis

Description here

Fixes #issue_number

Checklist

Complete the checklist (and note appropriate exceptions) before the PR is marked ready-for-review.

Exceptions

Note any exceptions here

Notes

It may be appropriate to bring upcoming changes to the attention of other (impacted) groups. Please endeavour to do this before seeking PR approval. The mechanism for doing this will vary considerably, so use your judgement as to how and when to do this. ↩
Configuration is an important part of many changes. Where applicable please try to document configuration examples. ↩
Tick whichever testing boxes are applicable. If you are adding Manual Tests, please document the manual testing (extensively) in the Exceptions. ↩

This adds another level of in memory cache, to mitigate issues with unique or infrequent queries pushing frequently used queries out of the in memory cache. If a query has only been seen once, then it is stored inthe transient cache (short term, small number of entries, LRU). If that query is requested again, and is still present in the transient cache, then it is added to the long term, larger in memory cache, to the redis cache as well, and removed from the transient cache. If that query is requested again but not present in the transient cache, then we test the larger in memory cache and redis

github-actions · 2024-04-05T13:46:05Z

@Geal, please consider creating a changeset entry in /.changesets/. These instructions describe the process and tooling.

router-perf · 2024-04-05T13:46:22Z

apollo-router/src/cache/mod.rs

garypen · 2024-04-10T10:37:09Z

We need to do something in this space, but I feel we can do something better when #4796 lands.

I imagine that with the new rust support we could do something like:

(query from client) -> new functionality in 4796 -> (normalised form)

Then use (normalised form) as the key to the query plan.

If the conversion to (normalised form) is expensive we could introduce a smaller cache to hold (query from client) -> (normalised form), but I imagine that shouldn't be required.

Note: Some details (extra items in the key such as metadata) are omitted for brevity, but wouldn't this basic approach work?

Geal · 2024-04-10T13:02:22Z

That's definitely something we could do. And we already have the right cache for that, in query analysis #4796 (comment)

bonnici · 2024-04-11T01:17:52Z

Note that right now the normalised form is stripping out some important information like alias names and input objects, but I'm planning to add support for including those things as part of PULSR-695.

Geal · 2024-04-11T07:18:26Z

@bonnici for some context, we recently merged #4883 which uses a hashing scheme for the query that keeps the same hash across schema updates, if the update does not affect the query. Unfortunately, we needed to add back the operation name to the query plan cache key (#4921), because the query planner was returning the usage reporting structure as well, and the operation signature contains the operation name, so with the operation name in the cache key, we would have not reported operations properly.
But now, with the usage reporting and operation signature generation happening on the rust side, we will be able to move that task entirely to the query analysis layer (ase we're doing with validation in #4551), which executes much earlier. And in that layer, we could also have a transformation layer to normalize the operation name, extract hardcoded variables, etc, so that we reduce further the number of queries sent to planning. And the client side would see no difference because the responnse formatting is done according to the original query

abernix · 2024-05-08T09:12:37Z

I've converted this to a draft so we can discuss it further, but it won't be on our review backlog for now.

Geal · 2024-05-13T08:00:51Z

@abernix it was in the review queue because I expected some reviews :/

garypen · 2024-05-13T08:41:18Z

Before looking too much into the implementation of this, do we still think this is the right solution for this problem?

My comment above still seems applicable. More so, since various changes have landed that make "normalised form" a more realistic prospect.

In general, this feels like a "bigger" decision than code review and could use some more substantial design reviewing before proceeding to code review. Perhaps we could discuss this in the next router architectural working group meeting? One thing we definitely want to discuss is whether we should expose configuration of this feature.

Geal · 2024-05-13T10:44:36Z

normalization is a much larger issue than this, and not that much related. I agree that it needs further discussion though, I added it to the topics of the meeting

apollo-bot2 assigned Geal Apr 5, 2024

Geal added 4 commits April 5, 2024 18:09

Merge branch 'dev' into geal/transient-cache

a9cc626

snapshot

a65edd2

snapshot

45f5d6b

fix tests

b9d1b08

Geal commented Apr 8, 2024

View reviewed changes

apollo-router/src/cache/mod.rs Show resolved Hide resolved

Update apollo-router/src/cache/mod.rs

9bd3aa7

Geal added 3 commits April 12, 2024 17:31

Merge branch 'dev' into geal/transient-cache

32f1903

fix config defaults

6a48b8b

Merge branch 'dev' into geal/transient-cache

60da416

Geal requested a review from a team April 23, 2024 10:08

Merge branch 'dev' into geal/transient-cache

a823eca

abernix marked this pull request as draft May 8, 2024 09:12

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

transient in memory cache #4922

transient in memory cache #4922

Geal commented Apr 5, 2024

github-actions bot commented Apr 5, 2024

router-perf bot commented Apr 5, 2024

garypen commented Apr 10, 2024

Geal commented Apr 10, 2024

bonnici commented Apr 11, 2024

Geal commented Apr 11, 2024

abernix commented May 8, 2024 •

edited

Geal commented May 13, 2024

garypen commented May 13, 2024

Geal commented May 13, 2024

transient in memory cache #4922

Are you sure you want to change the base?

transient in memory cache #4922

Conversation

Geal commented Apr 5, 2024

Footnotes

github-actions bot commented Apr 5, 2024

router-perf bot commented Apr 5, 2024

garypen commented Apr 10, 2024

Geal commented Apr 10, 2024

bonnici commented Apr 11, 2024

Geal commented Apr 11, 2024

abernix commented May 8, 2024 • edited

Geal commented May 13, 2024

garypen commented May 13, 2024

Geal commented May 13, 2024

abernix commented May 8, 2024 •

edited