master node fails to automatically rejoin the cluster after recovery from failure #850

nuowei2543 · 2024-04-11T03:54:54Z

Hello, during my simulation of host failover, I stopped the master host's PostgreSQL instance, and the standby node successfully switched to become the new master node. However, when I restarted the original master node, it did not automatically rejoin the cluster as a standby node.
version:
ubuntu:20.4
postgresql:16.2
repmgrd:5.4.1

2. On node1, run:
supervisorctl stop postgresql

4. On node1, run:
supervisorctl start postgresql

WARNING: following issues were detected

node "node1" (ID: 1) is running but the repmgr node record is inactive

So, I don't know why node1 is still the primary.

stephan-hahn · 2024-04-25T06:23:12Z

Hi, there is no built-in automatic rejoin. Simply starting the old master again creates a split-brain scenario, because it comes back up as a primary. However, it is no problem to script this so that the old master rejoins automatically after the new one has been promoted.
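
A minimal sketch of what such a script could look like, built around `repmgr node rejoin`. Several things here are assumptions rather than details from this thread: the new primary is reachable as `node2`, the repmgr user and database are both named `repmgr`, and the config file is at `/etc/repmgr.conf`. `--force-rewind` relies on `pg_rewind`, which in turn needs `wal_log_hints = on` or data checksums, so treat this as a starting point rather than a drop-in solution. The helper name `build_rejoin_cmd` is hypothetical.

```shell
#!/bin/sh
# Sketch: build the command that rejoins a former primary as a standby.
# Assumed values (not from the thread): host "node2", user/dbname "repmgr",
# config path /etc/repmgr.conf. Adjust all of them to your setup.

REPMGR_CONF="${REPMGR_CONF:-/etc/repmgr.conf}"

# Print the repmgr command that would rejoin this node to the given primary.
#   $1 = hostname of the new primary
#   $2 = optional extra flag, e.g. --dry-run
build_rejoin_cmd() {
    printf "repmgr node rejoin -f %s -d 'host=%s user=repmgr dbname=repmgr' --force-rewind%s\n" \
        "$REPMGR_CONF" "$1" "${2:+ $2}"
}

# PostgreSQL on the old primary must be stopped before it can rejoin.
# A cautious sequence on node1 would be:
#   supervisorctl stop postgresql
#   eval "$(build_rejoin_cmd node2 --dry-run)"   # check prerequisites only
#   eval "$(build_rejoin_cmd node2)"             # perform the rejoin
# Here we only print the dry-run command so nothing is changed by accident.
build_rejoin_cmd node2 --dry-run
```

Running the dry-run variant first lets repmgr report whether the rejoin prerequisites are met before any data on the old primary is rewound. Since PostgreSQL is managed by supervisor here, repmgr's service start/stop commands in `repmgr.conf` would probably need to point at `supervisorctl` as well.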
