Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Automatic agent recovery full resync #1445

Open
mmourafiq opened this issue Nov 16, 2021 · 0 comments
Open

Automatic agent recovery full resync #1445

mmourafiq opened this issue Nov 16, 2021 · 0 comments

Comments

@mmourafiq
Copy link
Contributor

mmourafiq commented Nov 16, 2021

Current behavior

When there's an issue with the control plane and an agent is unable to reconcile it states, the platform does not recover the lost events. This happens for short lived jobs, which forces the user to either stop the jobs or to delete the operator so it can reconcile them.

Enhancement

Perform a full recheck when the agent goes to the warning status and back to running to reconcile all operations our-of-sync.

@mmourafiq mmourafiq changed the title Automatic recovery full resync Automatic agent recovery full resync Nov 16, 2021
@mmourafiq mmourafiq added this to In progress in Roadmap Jan 23, 2022
@mmourafiq mmourafiq moved this from In progress to Priority in Roadmap Jan 23, 2022
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
Roadmap
Priority
Development

No branches or pull requests

1 participant