[BUG] system restore stuck because of the volume/PV/PVC restoration #8601

Closed
ChanYiLin opened this issue May 20, 2024 · 4 comments
Labels
kind/bug
priority/0: Must be fixed in this release (managed by PO)
require/auto-e2e-test: Require adding/updating auto e2e test cases if they can be automated
require/backport: Require backport. Only used when the specific versions to backport have not been defined.
require/qa-review-coverage: Require QA to review coverage

Comments

@ChanYiLin
Contributor

Describe the bug

In our code, https://github.com/longhorn/longhorn-manager/blob/ebace7b56389b017c97876f15b5298d07095b745/controller/system_rollout_controller.go#L1264-L1290
PV restoration tries to find the Volume first, and
PVC restoration tries to find the PV first.
However, I have moved the Volume restoration out of the parallel restoration.
As a result, none of the 3 resources can be restored, because of the resulting deadlock.
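
To make the dependency chain concrete, here is a minimal sketch of the ordering problem. It is a simplified model, not the longhorn-manager code: the names `DEPENDS_ON`, `run_parallel_phase`, and `restore` are hypothetical, and it assumes the Volume step only runs once the parallel phase has finished.

```python
# Hypothetical model of the restore ordering; not Longhorn's actual API.
# Each kind waits for the kind it depends on (None means no dependency).
DEPENDS_ON = {"Volume": None, "PV": "Volume", "PVC": "PV"}


def run_parallel_phase(kinds, restored, max_rounds=10):
    """Restore every kind whose dependency is already restored.

    Mutates `restored` in place and returns the kinds that are still
    pending once no further progress is possible.
    """
    pending = set(kinds)
    for _ in range(max_rounds):
        ready = {
            k for k in pending
            if DEPENDS_ON[k] is None or DEPENDS_ON[k] in restored
        }
        if not ready:
            break  # nothing can make progress any more
        restored |= ready
        pending -= ready
    return pending


def restore(parallel_kinds, after_kinds):
    """Run the parallel phase, then the kinds scheduled after it.

    Returns (restored, stuck). Assumption: the later kinds are only
    restored if the parallel phase completes.
    """
    restored = set()
    stuck = run_parallel_phase(parallel_kinds, restored)
    if stuck:
        return restored, stuck  # the later steps never run
    for kind in after_kinds:
        restored.add(kind)
    return restored, set()


# Volume moved out of the parallel phase: PV and PVC wait forever.
print(restore(["PV", "PVC"], ["Volume"]))
# All three in the same phase: the chain resolves round by round.
print(restore(["Volume", "PV", "PVC"], []))
```

In this model, PV and PVC can never become ready while the Volume sits behind the phase they block, which matches the stuck restore described above.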

To Reproduce

Create a Volume/PV/PVC, then perform a system backup and restore.

Expected behavior

All 3 resources should be restored successfully

@longhorn-io-github-bot

longhorn-io-github-bot commented May 24, 2024

Pre Ready-For-Testing Checklist

  • Where are the reproduce steps/test steps documented?
    The reproduce steps/test steps are at:
  1. Create a BackingImage vol-0 with Created From set to Download From URL, and create a volume that uses it
  2. Create the PV/PVC for the volume
  3. Create a system-backup
  4. Run kubectl get lhbbi,lhb -n longhorn-system to see the backups of the Volume and the BackingImage
  5. Delete the Volume, the BackingImage, the PV, and the PVC
  6. Restore the system-backup
  7. The BackingImage and the BackingImage source type will be restored, and the Volume/PV/PVC will be restored correctly
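
Step 7 can be checked by polling until all three resources exist again. The sketch below is one possible shape for that check, not the actual longhorn-tests code: `wait_for_restored` and its `exists` callback are hypothetical names, and the caller is assumed to supply a lookup (for example, one backed by the Kubernetes or Longhorn client) that reports whether a given kind has been restored.

```python
import time


def wait_for_restored(exists, kinds=("Volume", "PV", "PVC"),
                      timeout=60.0, interval=1.0, sleep=time.sleep):
    """Poll until exists(kind) is true for every kind or the timeout elapses.

    `exists` is a caller-supplied callable (hypothetical here).
    Returns the set of kinds still missing; an empty set means success.
    """
    waited = 0.0
    missing = {k for k in kinds if not exists(k)}
    while missing and waited < timeout:
        sleep(interval)
        waited += interval
        # Only re-check the kinds that were still missing last time.
        missing = {k for k in missing if not exists(k)}
    return missing
```

Before the fix, a check like this would be expected to time out with all three kinds still missing; after the fix it should return an empty set.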

PR: longhorn/longhorn-manager#2819

@c3y1huang
Contributor

@ChanYiLin , should we have an automation test case to cover this?

@ChanYiLin
Contributor Author

ChanYiLin commented May 24, 2024

Hi @c3y1huang, yes, I will modify the original test case to include PV and PVC after the fix is merged.
It is here: https://github.com/longhorn/longhorn-tests/blob/master/manager/integration/tests/test_system_backup_restore.py#L121

@roger-ryao

Verified on master-head 20240530

The test steps
#8601 (comment)

Result: Passed
