New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Remediation of MISSING_LOST
segments.
#1590
Comments
Hi! You can archive empty file as 0000007200000CBF00000017, 0000007300000CBF00000067 and 0000007400000CBF000000AB. |
Meanwhile I'd appreciate if you describe how you lost WALs in more details. We should consider if WAL-G could prevent this. |
Hi @x4m! What happened was we made some adjustments to a Patroni managed cluster in order to band-aid some application problems. Namely we increased What then followed was the k8s pods reached a near OOM state. PostgreSQL then failed to archive segments with this message:
Followed by one of the backends geting OOM Killed and then postmaster shutting down and killing the archive task. After failover I thought these segments would get uploaded by they did not. It does not appear that PostgreSQL even tried to but I am not that familiar with what happens after WAL archiving fails or if the status of not-archived segments is persisted to the secondary. There was a trace from wal-g but it was because it got a SIGQUIT from postgres, likely during the ungraceful shutdown, other than that nothing else to indicate what went wrong. As far as I can tell wal-g wasn't killed by the OOM killer as I have no logs of it being killed. |
Hi,
We are using wal-g with PostgreSQL and it's been great thus far.
Unfortunately we ran into some issues where we had repeated OOM events that caused unclean shutdowns that had also prevented WAL segments being archived. These segments are now long-lost unfortunately.
We know these segments are gone and aren't recoverable at this stage. What is the best way to return
wal-verify integrity
to reportingSUCCESS
?For posterity here is the output of the integrity job:
Thanks in advance!
The text was updated successfully, but these errors were encountered: