
feat(lep): auto-balance pressured disks #7576

Open · wants to merge 1 commit into master from lep-feat-auto-balance-node-disks

Conversation

@c3y1huang c3y1huang (Contributor) commented Jan 8, 2024

Which issue(s) this PR fixes:

Issue #4105

What this PR does / why we need it:

Proposes to enhance replica auto-balance (best-effort) with a new replica-auto-balance-disk-pressure-percentage setting, so that a replica is automatically rebuilt on another disk when the defined disk pressure threshold is reached.
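
As a rough illustration of the intended threshold semantics, the check amounts to comparing a disk's used-space percentage against the new setting. The sketch below is only illustrative; the function and field names are placeholders, not the actual scheduler code.

```go
package main

import "fmt"

// pressureExceeded reports whether a disk's space usage crosses the configured
// replica-auto-balance-disk-pressure-percentage threshold. Illustrative only;
// the names here are assumptions, not Longhorn's scheduler implementation.
func pressureExceeded(storageMaximum, storageAvailable, pressurePercentage int64) bool {
	if storageMaximum <= 0 {
		return false
	}
	usedPercentage := (storageMaximum - storageAvailable) * 100 / storageMaximum
	return usedPercentage >= pressurePercentage
}

func main() {
	// Example: a 100 GiB disk with 15 GiB free and an 80% threshold is under pressure.
	fmt.Println(pressureExceeded(100<<30, 15<<30, 80)) // true
}
```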

Special notes for your reviewer:

None

Additional documentation or context

None

@c3y1huang c3y1huang self-assigned this Jan 8, 2024
@c3y1huang c3y1huang force-pushed the lep-feat-auto-balance-node-disks branch 2 times, most recently from 101af10 to 66b6aed on January 30, 2024 05:27
longhorn-4105

Signed-off-by: Chin-Ya Huang <chin-ya.huang@suse.com>
@c3y1huang c3y1huang force-pushed the lep-feat-auto-balance-node-disks branch from 66b6aed to 3e6c271 on January 30, 2024 05:28
@c3y1huang c3y1huang marked this pull request as ready for review January 30, 2024 05:29
@c3y1huang c3y1huang requested a review from a team as a code owner January 30, 2024 05:29
@ejweber ejweber (Contributor) left a comment

IIUC, there is no mechanism to select which replica on a disk would be best (whatever that means) to auto-balance. Instead, it is essentially random: whichever volume happens to reconcile first when a disk is under pressure is balanced to a different disk.

Do you think there is a significant advantage to choosing one replica versus another? If so, how difficult do you think it would be to add that kind of choice to the implementation? (My initial thought is that it would add significant complexity, since right now all replica scheduling decisions are made with only the replica to be scheduled and the aggregate available space on nodes and disks in mind.)
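
For the sake of discussion, one simple selection policy would be to rank the replicas on the pressured disk, e.g. by how much space moving each one would free. The sketch below is purely hypothetical and not part of this proposal; the type and function names are made up for illustration.

```go
package main

import (
	"fmt"
	"sort"
)

// replicaOnDisk is a stand-in for the information the scheduler would have
// about each replica on a pressured disk (hypothetical, for illustration).
type replicaOnDisk struct {
	Name       string
	ActualSize int64
}

// pickReplicaToMove illustrates one possible policy: move the replica that
// frees the most space. The proposal as written instead moves a replica of
// whichever volume happens to reconcile first.
func pickReplicaToMove(replicas []replicaOnDisk) replicaOnDisk {
	if len(replicas) == 0 {
		return replicaOnDisk{}
	}
	sort.Slice(replicas, func(i, j int) bool {
		return replicas[i].ActualSize > replicas[j].ActualSize
	})
	return replicas[0]
}

func main() {
	rs := []replicaOnDisk{{"r-1", 10 << 30}, {"r-2", 42 << 30}}
	fmt.Println(pickReplicaToMove(rs).Name) // r-2
}
```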

```
string target = 2;
}
```
1. Then the sync-agent server is responsible for copying the files listed in `req.SyncFileInfoList` from the source to the target directory when `req.FilesSyncRequest.LocalSync` is provided.

How exactly are the files copied? I have experienced some strange behavior when experimenting with copying replica sparse files before, even when using options meant to preserve sparseness. The checksums always matched (and IIRC matched even if sparseness was not preserved), but the actual space usage differed slightly between the source and target file. I think the exact behavior varied between copy utilities (e.g. rsync vs. cp).
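
For reference, preserving sparseness in a local copy generally means walking the data extents with SEEK_DATA/SEEK_HOLE rather than reading the whole file. The Go sketch below only illustrates that approach; it is not the sync-agent's actual copy path, and `copySparse` is a hypothetical helper.

```go
package main

import (
	"errors"
	"fmt"
	"io"
	"os"

	"golang.org/x/sys/unix"
)

// copySparse copies only the data extents of srcPath into dstPath, so holes
// in the source stay holes in the target. Linux-only sketch for illustration.
func copySparse(srcPath, dstPath string) error {
	src, err := os.Open(srcPath)
	if err != nil {
		return err
	}
	defer src.Close()

	info, err := src.Stat()
	if err != nil {
		return err
	}

	dst, err := os.Create(dstPath)
	if err != nil {
		return err
	}
	defer dst.Close()

	// Pre-size the target so trailing holes are preserved.
	if err := dst.Truncate(info.Size()); err != nil {
		return err
	}

	var offset int64
	for offset < info.Size() {
		// Find the next data extent; ENXIO means there is no more data.
		dataStart, err := src.Seek(offset, unix.SEEK_DATA)
		if err != nil {
			if errors.Is(err, unix.ENXIO) {
				break
			}
			return err
		}
		dataEnd, err := src.Seek(dataStart, unix.SEEK_HOLE)
		if err != nil {
			return err
		}
		if _, err := src.Seek(dataStart, io.SeekStart); err != nil {
			return err
		}
		if _, err := dst.Seek(dataStart, io.SeekStart); err != nil {
			return err
		}
		if _, err := io.CopyN(dst, src, dataEnd-dataStart); err != nil {
			return err
		}
		offset = dataEnd
	}
	return nil
}

func main() {
	if err := copySparse(os.Args[1], os.Args[2]); err != nil {
		fmt.Fprintln(os.Stderr, err)
		os.Exit(1)
	}
}
```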

@derekbit derekbit (Member) commented Apr 25, 2024

Some questions before diving into the code:

  1. There is no lock mechanism in the replica scheduler. Is it possible that all volumes are triggered to rebuild on the same disk when the user-defined threshold is reached?

  2. Setting replica-soft-anti-affinity to enabled is a requirement for this feature. Will it introduce a side effect where multiple replicas of a volume are created on the same node?

  3. Snapshot files are sparse. Will the copy method handle holes?
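
On question 1, the kind of accounting that would keep concurrent reconciliations from piling rebuilds onto the same disk might look roughly like the sketch below. This is a minimal illustration under assumed names, not Longhorn's actual scheduler state.

```go
package main

import (
	"fmt"
	"sync"
)

// disk is a stand-in for the scheduler's view of a disk; the field names are
// assumptions for this sketch, not Longhorn's actual data structures.
type disk struct {
	mu               sync.Mutex
	storageMaximum   int64
	storageReserved  int64
	storageScheduled int64 // space already promised to pending rebuilds
}

// tryReserve atomically checks the schedulable space and records the new
// replica, so a second volume reconciling a moment later sees the disk as
// already (partially) claimed instead of piling onto it.
func (d *disk) tryReserve(replicaSize int64) bool {
	d.mu.Lock()
	defer d.mu.Unlock()
	available := d.storageMaximum - d.storageReserved - d.storageScheduled
	if available < replicaSize {
		return false
	}
	d.storageScheduled += replicaSize
	return true
}

func main() {
	d := &disk{storageMaximum: 100 << 30, storageReserved: 30 << 30}
	fmt.Println(d.tryReserve(40 << 30)) // true: 70 GiB schedulable
	fmt.Println(d.tryReserve(40 << 30)) // false: only 30 GiB left after the first reservation
}
```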
