
Flagger and Flux gitops workflow in the case of cluster rebuilds #1577

Open
spandan541 opened this issue Jan 9, 2024 · 3 comments
@spandan541

Hi Team,

I looked for an existing issue addressing this problem but could not find one.

Describe the bug

My team deploys all manifests with Flux and we prefer not to apply them through kubectl. We are keen on using Flagger for our canary releases alongside Flux, but we have hit a peculiar challenge because our clusters need to be rebuilt fairly often.

Steps

  1. The Deployment in the GitOps repo initially has image tag v1
  2. A Canary CR is initialised in the cluster with manual gating enabled (a sketch of such a Canary is shown after this list)
  3. The Deployment in the GitOps repo is updated to image tag v2
  4. Flux reconciles and updates the image tag v1 -> v2
  5. Flagger starts the promotion process; with manual gating enabled, say the promotion is paused at a canary weight of 20
  6. At this point, for some reason, the cluster needs to be rebuilt
  7. When the cluster is rebuilt, Flux applies the Deployment with image tag v2 even though the promotion was never completed (this is the problem!)
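
For reference, a minimal sketch of the setup in step 2 might look like the Canary below. The app name, namespace, port and the flagger-loadtester gate URL are illustrative, not taken from our actual manifests:

```yaml
# Minimal sketch of a Canary with a manual gate (illustrative names and values)
apiVersion: flagger.app/v1beta1
kind: Canary
metadata:
  name: myapp
  namespace: test
spec:
  # the Deployment whose image tag Flux bumps from v1 to v2
  targetRef:
    apiVersion: apps/v1
    kind: Deployment
    name: myapp
  service:
    port: 8080
  analysis:
    interval: 1m
    maxWeight: 50
    stepWeight: 10
    webhooks:
      # manual gate: Flagger holds the current weight (e.g. 20) while the gate is closed
      - name: traffic-gate
        type: confirm-traffic-increase
        url: http://flagger-loadtester.test/gate/check
```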

Expected behavior

The limitation is that neither Flagger nor Flux knows the promotion process was interrupted by the cluster rebuild, i.e. no state of the promotion is saved. Ideally, the rebuilt cluster would not send full traffic to v2 until the interrupted promotion has been completed.

Possible solutions:

  • Provide two target deployments (primary and canary) under Canary.spec, so that no change to the image tag of a single Deployment is necessary (see the hypothetical sketch below)
  • Flagger somehow persists the state of the promotion across rebuilds, so that even if Flux recreates the Deployment with v2, zero traffic is sent to it until the promotion completes
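
To make the first idea concrete, a purely hypothetical Canary spec could look like the sketch below. The canaryRef field does not exist in Flagger today and the names are made up; it is only meant to show the shape of what we are asking for:

```yaml
# Hypothetical only: Flagger's Canary spec currently accepts a single targetRef
apiVersion: flagger.app/v1beta1
kind: Canary
metadata:
  name: myapp
spec:
  targetRef:            # primary Deployment, pinned to v1 in Git
    apiVersion: apps/v1
    kind: Deployment
    name: myapp-primary
  canaryRef:            # hypothetical field: canary Deployment, bumped to v2 in Git
    apiVersion: apps/v1
    kind: Deployment
    name: myapp-canary
```

With something like this, a cluster rebuild would recreate both Deployments exactly as they are in Git, and Flagger would not have to infer the canary from an in-place image change.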

Please do let me know if there are better ways to solve this corner case. Any help would be greatly appreciated!
Thanks in advance.

Additional context

  • Flagger version: 1.31.0
  • Kubernetes version: 1.27
  • Service Mesh provider: Istio
@spandan541
Author

Any ideas anyone?
@stefanprodan @aryan9600

@LiZhenCheng9527
Contributor

You could try Flagger version 1.36.1 and see if it solves this problem.

@spandan541
Author

Unfortunately, that doesn't solve the problem.
