fix: replica cluster should restart after promotion #4399

litaocdl · 2024-04-29T13:11:54Z

When a replica cluster is promoted, the archive_mode is changed from
always to on. This change requires a restart because Postgres
does not reload the configuration during the promotion.

Closes: #4172

github-actions · 2024-04-29T13:12:07Z

❗ By default, the pull request is configured to backport to all release branches.

To stop backporting this pr, remove the label: backport-requested ◀️ or add the label 'do not backport'
To stop backporting this pr to a certain release branch, remove the specific branch label: release-x.y

jsilvela · 2024-05-09T08:38:17Z

/test limit=local

github-actions · 2024-05-09T08:38:29Z

@jsilvela, here's the link to the E2E on CNPG workflow run: https://github.com/cloudnative-pg/cloudnative-pg/actions/runs/9014692079

jsilvela · 2024-05-09T11:40:17Z

internal/management/controller/instance_controller.go

@@ -1114,7 +1113,7 @@ func (r *InstanceReconciler) reconcilePrimary(ctx context.Context, cluster *apiv
 		if err := r.handlePromotion(ctx, cluster); err != nil {
 			return false, err
 		}
-		restarted = true
+		promoted = true


This is not your PR but the existing code.
This is asymmetric. We decide that the instance is a primary with IsPrimary, which checks for the existence of "recovery.conf"

Let's say we:

successfully promote the instace

when trying to update the Status or drop the connections, we fail

Then the next time around we'll see the instance is a primary, and we won't retry.

If we don't update the status, the Status patch step is retried.
The drop connection is not vital. It is there only to prevent a long-running session through the read-only service from silently becoming a read-write one.

internal/management/controller/instance_controller.go

litaocdl · 2024-05-13T08:50:04Z

update the pr by removing the unused variable

fcanovai · 2024-05-21T11:25:53Z

/test level=4

github-actions · 2024-05-21T11:26:06Z

@fcanovai, here's the link to the E2E on CNPG workflow run: https://github.com/cloudnative-pg/cloudnative-pg/actions/runs/9173885815

Signed-off-by: Tao Li <tao.li@enterprisedb.com>

Signed-off-by: Jaime Silvela <jaime.silvela@enterprisedb.com>

Signed-off-by: Tao Li <tao.li@enterprisedb.com>

Co-authored-by: Jaime Silvela <jaime.silvela@enterprisedb.com> Signed-off-by: Tao Li <tao.li@enterprisedb.com>

When a replica cluster is promoted, the `archive_mode` is changed from `always` to `on`. This change requires a restart because Postgres does not reload the configuration during the promotion. Closes: #4172 Signed-off-by: Tao Li <tao.li@enterprisedb.com> Signed-off-by: Jaime Silvela <jaime.silvela@enterprisedb.com> Co-authored-by: Jaime Silvela <jaime.silvela@enterprisedb.com> (cherry picked from commit 33d7b65)

) When a replica cluster is promoted, the `archive_mode` is changed from `always` to `on`. This change requires a restart because Postgres does not reload the configuration during the promotion. Closes: cloudnative-pg#4172 Signed-off-by: Tao Li <tao.li@enterprisedb.com> Signed-off-by: Jaime Silvela <jaime.silvela@enterprisedb.com> Co-authored-by: Jaime Silvela <jaime.silvela@enterprisedb.com> Signed-off-by: Douglass Kirkley <dkirkley@eitccorp.com>

litaocdl requested review from fcanovai, gbartolini, leonardoce, mnencia, phisco, sxd and armru as code owners April 29, 2024 13:11

github-actions bot added backport-requested ◀️ This pull request should be backported to all supported releases release-1.21 release-1.22 release-1.23 labels Apr 29, 2024

mnencia force-pushed the dev/cnp-4172 branch from ad2810b to 1a567de Compare May 8, 2024 09:58

litaocdl requested a review from a team as a code owner May 8, 2024 09:58

jsilvela force-pushed the dev/cnp-4172 branch from 1a567de to 6b84c66 Compare May 9, 2024 08:15

jsilvela approved these changes May 9, 2024

View reviewed changes

jsilvela reviewed May 9, 2024

View reviewed changes

internal/management/controller/instance_controller.go Outdated Show resolved Hide resolved

jsilvela reviewed May 9, 2024

View reviewed changes

internal/management/controller/instance_controller.go Outdated Show resolved Hide resolved

litaocdl force-pushed the dev/cnp-4172 branch from f5cc8ad to 481a478 Compare May 13, 2024 08:29

fcanovai force-pushed the dev/cnp-4172 branch from 6a893d7 to 90547b0 Compare May 21, 2024 09:22

github-actions bot added the ok to merge 👌 This PR can be merged label May 21, 2024

fcanovai force-pushed the dev/cnp-4172 branch from 90547b0 to 6229afe Compare May 22, 2024 07:34

fcanovai approved these changes May 22, 2024

View reviewed changes

mnencia approved these changes May 22, 2024

View reviewed changes

fix: replica cluster should restart after promote

d7d835c

Signed-off-by: Tao Li <tao.li@enterprisedb.com>

jsilvela and others added 3 commits May 22, 2024 17:30

Update internal/management/controller/instance_controller.go

dbd6507

Signed-off-by: Jaime Silvela <jaime.silvela@enterprisedb.com>

remove unused variable

8dd23ca

Signed-off-by: Tao Li <tao.li@enterprisedb.com>

Update internal/management/controller/instance_controller.go

94865a1

Co-authored-by: Jaime Silvela <jaime.silvela@enterprisedb.com> Signed-off-by: Tao Li <tao.li@enterprisedb.com>

mnencia force-pushed the dev/cnp-4172 branch from 6229afe to 94865a1 Compare May 22, 2024 15:30

mnencia changed the title ~~fix: replica cluster should restart after promote~~ fix: replica cluster should restart after promotion May 22, 2024

mnencia merged commit 33d7b65 into main May 22, 2024
30 checks passed

mnencia deleted the dev/cnp-4172 branch May 22, 2024 16:16

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

fix: replica cluster should restart after promotion #4399

fix: replica cluster should restart after promotion #4399

litaocdl commented Apr 29, 2024 •

edited by mnencia

github-actions bot commented Apr 29, 2024

jsilvela commented May 9, 2024

github-actions bot commented May 9, 2024

jsilvela May 9, 2024

mnencia May 22, 2024

litaocdl commented May 13, 2024

fcanovai commented May 21, 2024

github-actions bot commented May 21, 2024

fix: replica cluster should restart after promotion #4399

fix: replica cluster should restart after promotion #4399

Conversation

litaocdl commented Apr 29, 2024 • edited by mnencia

github-actions bot commented Apr 29, 2024

jsilvela commented May 9, 2024

github-actions bot commented May 9, 2024

jsilvela May 9, 2024

Choose a reason for hiding this comment

mnencia May 22, 2024

Choose a reason for hiding this comment

litaocdl commented May 13, 2024

fcanovai commented May 21, 2024

github-actions bot commented May 21, 2024

litaocdl commented Apr 29, 2024 •

edited by mnencia