Pre-warmed cluster upgrades #82

Open
ecordell opened this issue Sep 7, 2022 · 0 comments
Labels
priority/3 low This would be nice to have state/needs discussion This can't be worked on yet

Comments


ecordell commented Sep 7, 2022

Right now, the operator rolls out new versions of SpiceDB by updating a deployment.

New pods become available as soon as they connect to the datastore and the dispatch ring, which means the in-memory cache is lost during an upgrade. Depending on the queries SpiceDB is serving, this can cause a significant increase in latency.

Some options worth exploring:

  1. Slowly introducing new pods so that only some percentage of a cluster loses its cache at a time. This is probably the simplest option, but it requires all dispatch API changes to be fully backwards-compatible.
  2. Traffic mirroring via external routing (similar to how flagger provides generic blue/green mirroring). Currently, spicedb-operator operates "below" the level that most of these tools work, so the scope would need to increase dramatically to include more networking/ingress concerns.
  3. Traffic mirroring via SpiceDB itself. We could introduce mirroring flags into SpiceDB itself, so that incoming traffic can be forwarded to a parallel set of nodes to fill their cache. This would require old and new clusters to be exposed under different service objects so that their hashrings don't collide.
  4. Saving and restoring the cache. Currently, SpiceDB caches exist only in memory. We could switch to a cache that syncs to the filesystem, or provide APIs for dumping the cache (either over the network or to disk), and the operator could ensure the caches are restored in the new pods. We would likely want to switch to a StatefulSet if we try this.
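For illustration, option 1 could be approximated with the Deployment's built-in rolling-update knobs. This is just a sketch with illustrative values, not what the operator currently sets:

```yaml
# Hypothetical rollout settings for a SpiceDB Deployment (values are examples).
# maxSurge/maxUnavailable bound how much of the cluster is replaced at once,
# so only a fraction of the dispatch cache is cold at any given moment.
spec:
  minReadySeconds: 60        # give each new pod time to warm up before the next swap
  strategy:
    type: RollingUpdate
    rollingUpdate:
      maxSurge: 1            # add at most one new (cold-cache) pod at a time
      maxUnavailable: 0      # never remove a warm pod before its replacement is ready
```

Note this only limits how much cache is lost at once; it doesn't address the backwards-compatibility requirement on the dispatch API, since old and new pods share the hashring during the rollout.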