Do not retry permanent backend failures #4803

Merged May 18, 2024 (14 commits)

@MichaelEischer (Member) commented May 11, 2024

What does this PR change? What problem does it solve?

Restic currently retries attempts to access a nonexistent file as well as out-of-bounds reads on truncated files. These errors are permanent, however, and retrying cannot fix them. This PR adds the infrastructure that lets the RetryBackend detect these cases and stop the useless retries.

In addition, the RetryBackend now implements a circuit breaker for inaccessible files. If accessing a file exhausts its retries, the file is considered broken and is not accessed again for one hour. This also benefits #4800, as the fallback paths fail immediately for such broken files.

Was the change previously discussed in an issue or on the forum?

Not really, but this is another prerequisite (hopefully the last one) for #4784.

Checklist

  • I have read the contribution guidelines.
  • I have enabled maintainer edits.
  • I have added tests for all code changes.
  • [ ] I have added documentation for relevant changes (in the manual).
  • There's a new file in changelog/unreleased/ that describes the changes for our users (see template).
  • I have run gofmt on the code in all commits.
  • All commit messages are formatted in the same style as the other commits in the repo.
  • I'm done! This pull request is ready for review.

@MichaelEischer force-pushed the permanent-retry-failure branch 3 times, most recently from 2a05742 to 4426255 (May 12, 2024 10:36)
@MichaelEischer changed the title from "WIP: Do not retry permanent backend failures" to "Do not retry permanent backend failures" (May 12, 2024)
@MichaelEischer marked this pull request as ready for review (May 12, 2024 13:26)

This is currently gated behind a feature flag as some unexpected interactions might show up in the wild.

These backends tend to use a large variety of server implementations. Some of those implementations might prove problematic with the new checks.

If a file exhausts its retry attempts, then it is likely not accessible the next time. Thus, immediately fail all load calls for that file to avoid useless retries.

Member Author

@MichaelEischer MichaelEischer left a comment

LGTM.
