
CephFS: prevent hanging NodeGetVolumeStats on stat() syscall when an MDS is slow #4200

Merged
merged 4 commits from bug/2244562 into ceph:devel on Nov 3, 2023

Conversation

@nixpanic (Member) commented Oct 17, 2023

When an MDS is slow, the NodeGetVolumeStats CSI procedure may hang for a while. In case the MDS does not respond at all (or the reply is lost), the stat() syscall may hang indefinitely.

Because Kubelet retries the NodeGetVolumeStats CSI procedure regularly, the number of hanging stat() syscalls can pile up. Each hanging call consumes a thread (goroutine), and this may eventually starve the processing capabilities of the CephFS CSI-nodeplugin container.

The commits in this PR come from #4125, which is planned to include a few more features (like a separate data/ directory for the contents of the volume).
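
To illustrate the problem and the general mitigation, here is a minimal, hedged sketch (not the code from this PR): the potentially hanging stat() runs in its own goroutine and the wait is bounded with a timeout, so the handler stays responsive even when the MDS never replies. The function name and the timeout value below are illustrative only.

package main

import (
	"fmt"
	"os"
	"time"
)

// statWithTimeout runs stat() in a separate goroutine so that a hanging
// syscall (for example when the MDS does not respond) cannot block the
// caller indefinitely.
func statWithTimeout(path string, timeout time.Duration) error {
	done := make(chan error, 1)

	go func() {
		_, err := os.Stat(path) // may hang on an unresponsive MDS
		done <- err
	}()

	select {
	case err := <-done:
		return err
	case <-time.After(timeout):
		// the goroutine may still be stuck in the syscall, but the
		// caller can report the volume as unhealthy and move on
		return fmt.Errorf("stat() on %q did not respond within %v", path, timeout)
	}
}

func main() {
	if err := statWithTimeout("/var/lib/kubelet", 10*time.Second); err != nil {
		fmt.Println("unhealthy:", err)
	}
}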

Manual Testing

  1. create a PVC and start a Pod that uses it
  2. let it run for a few minutes, confirm that NodeGetVolumeStats calls are in the CephFS node-plugin logs
  3. scale down the MDS(s), to simulate a failure
  4. check the CephFS node-plugin logs and see if there are unhealthy messages for NodeGetVolumeStats calls

Results (see GRPC response):

I1024 12:31:13.033160       1 utils.go:195] ID: 11 GRPC call: /csi.v1.Node/NodeGetCapabilities
I1024 12:31:13.033195       1 utils.go:206] ID: 11 GRPC request: {}
I1024 12:31:13.033274       1 utils.go:212] ID: 11 GRPC response: {"capabilities":[{"Type":{"Rpc":{"type":1}}},{"Type":{"Rpc":{"type":2}}},{"Type":{"Rpc":{"type":4}}},{"Type":{"Rpc":{"type":5}}}]}
I1024 12:31:13.034119       1 utils.go:195] ID: 12 GRPC call: /csi.v1.Node/NodeGetVolumeStats
I1024 12:31:13.034261       1 utils.go:206] ID: 12 GRPC request: {"volume_id":"0001-0011-openshift-storage-0000000000000001-af951451-f063-4a67-bf7c-8e788bdc587c","volume_path":"/var/lib/kubelet/pods/b2a6ae50-16a9-4326-83d5-58aafcb70f90/volumes/kubernetes.io~csi/pvc-66c3bfe6-1046-438b-8070-60b070b71149/mount"}
I1024 12:31:13.035323       1 utils.go:212] ID: 12 GRPC response: {"usage":[{"available":1073741824,"total":1073741824,"unit":1}],"volume_condition":{"message":"volume is in a healthy condition"}}

I1024 12:32:33.878002       1 utils.go:195] ID: 13 GRPC call: /csi.v1.Node/NodeGetCapabilities
I1024 12:32:33.878281       1 utils.go:206] ID: 13 GRPC request: {}
I1024 12:32:33.878379       1 utils.go:212] ID: 13 GRPC response: {"capabilities":[{"Type":{"Rpc":{"type":1}}},{"Type":{"Rpc":{"type":2}}},{"Type":{"Rpc":{"type":4}}},{"Type":{"Rpc":{"type":5}}}]}
I1024 12:32:33.879443       1 utils.go:195] ID: 14 GRPC call: /csi.v1.Node/NodeGetVolumeStats
I1024 12:32:33.879502       1 utils.go:206] ID: 14 GRPC request: {"volume_id":"0001-0011-openshift-storage-0000000000000001-af951451-f063-4a67-bf7c-8e788bdc587c","volume_path":"/var/lib/kubelet/pods/b2a6ae50-16a9-4326-83d5-58aafcb70f90/volumes/kubernetes.io~csi/pvc-66c3bfe6-1046-438b-8070-60b070b71149/mount"}
I1024 12:32:33.879549       1 utils.go:212] ID: 14 GRPC response: {"volume_condition":{"abnormal":true,"message":"health-check has not responded for 119.215949 seconds"}}
I1024 12:34:17.335423       1 utils.go:195] ID: 15 GRPC call: /csi.v1.Node/NodeGetCapabilities
I1024 12:34:17.335461       1 utils.go:206] ID: 15 GRPC request: {}
I1024 12:34:17.335542       1 utils.go:212] ID: 15 GRPC response: {"capabilities":[{"Type":{"Rpc":{"type":1}}},{"Type":{"Rpc":{"type":2}}},{"Type":{"Rpc":{"type":4}}},{"Type":{"Rpc":{"type":5}}}]}
I1024 12:34:17.336271       1 utils.go:195] ID: 16 GRPC call: /csi.v1.Node/NodeGetVolumeStats
I1024 12:34:17.336350       1 utils.go:206] ID: 16 GRPC request: {"volume_id":"0001-0011-openshift-storage-0000000000000001-af951451-f063-4a67-bf7c-8e788bdc587c","volume_path":"/var/lib/kubelet/pods/b2a6ae50-16a9-4326-83d5-58aafcb70f90/volumes/kubernetes.io~csi/pvc-66c3bfe6-1046-438b-8070-60b070b71149/mount"}
I1024 12:34:17.336442       1 utils.go:212] ID: 16 GRPC response: {"volume_condition":{"abnormal":true,"message":"health-check has not responded for 222.672824 seconds"}}
I1024 12:35:55.149537       1 utils.go:195] ID: 17 GRPC call: /csi.v1.Node/NodeGetCapabilities
I1024 12:35:55.149583       1 utils.go:206] ID: 17 GRPC request: {}
I1024 12:35:55.149667       1 utils.go:212] ID: 17 GRPC response: {"capabilities":[{"Type":{"Rpc":{"type":1}}},{"Type":{"Rpc":{"type":2}}},{"Type":{"Rpc":{"type":4}}},{"Type":{"Rpc":{"type":5}}}]}
I1024 12:35:55.150294       1 utils.go:195] ID: 18 GRPC call: /csi.v1.Node/NodeGetVolumeStats
I1024 12:35:55.150331       1 utils.go:206] ID: 18 GRPC request: {"volume_id":"0001-0011-openshift-storage-0000000000000001-af951451-f063-4a67-bf7c-8e788bdc587c","volume_path":"/var/lib/kubelet/pods/b2a6ae50-16a9-4326-83d5-58aafcb70f90/volumes/kubernetes.io~csi/pvc-66c3bfe6-1046-438b-8070-60b070b71149/mount"}
I1024 12:35:55.150431       1 utils.go:212] ID: 18 GRPC response: {"volume_condition":{"abnormal":true,"message":"health-check has not responded for 320.486780 seconds"}}

Scale up the MDS(s) again, see that the volume becomes healthy again:

I1024 12:39:03.241203       1 utils.go:195] ID: 21 GRPC call: /csi.v1.Node/NodeGetCapabilities
I1024 12:39:03.241236       1 utils.go:206] ID: 21 GRPC request: {}
I1024 12:39:03.241308       1 utils.go:212] ID: 21 GRPC response: {"capabilities":[{"Type":{"Rpc":{"type":1}}},{"Type":{"Rpc":{"type":2}}},{"Type":{"Rpc":{"type":4}}},{"Type":{"Rpc":{"type":5}}}]}
I1024 12:39:03.242094       1 utils.go:195] ID: 22 GRPC call: /csi.v1.Node/NodeGetVolumeStats
I1024 12:39:03.242132       1 utils.go:206] ID: 22 GRPC request: {"volume_id":"0001-0011-openshift-storage-0000000000000001-af951451-f063-4a67-bf7c-8e788bdc587c","volume_path":"/var/lib/kubelet/pods/b2a6ae50-16a9-4326-83d5-58aafcb70f90/volumes/kubernetes.io~csi/pvc-66c3bfe6-1046-438b-8070-60b070b71149/mount"}
I1024 12:39:03.242220       1 utils.go:212] ID: 22 GRPC response: {"volume_condition":{"abnormal":true,"message":"health-check has not responded for 448.545782 seconds"}}
I1024 12:40:40.223909       1 utils.go:195] ID: 23 GRPC call: /csi.v1.Node/NodeGetCapabilities
I1024 12:40:40.223962       1 utils.go:206] ID: 23 GRPC request: {}
I1024 12:40:40.224044       1 utils.go:212] ID: 23 GRPC response: {"capabilities":[{"Type":{"Rpc":{"type":1}}},{"Type":{"Rpc":{"type":2}}},{"Type":{"Rpc":{"type":4}}},{"Type":{"Rpc":{"type":5}}}]}
I1024 12:40:40.224870       1 utils.go:195] ID: 24 GRPC call: /csi.v1.Node/NodeGetVolumeStats
I1024 12:40:40.224989       1 utils.go:206] ID: 24 GRPC request: {"volume_id":"0001-0011-openshift-storage-0000000000000001-af951451-f063-4a67-bf7c-8e788bdc587c","volume_path":"/var/lib/kubelet/pods/b2a6ae50-16a9-4326-83d5-58aafcb70f90/volumes/kubernetes.io~csi/pvc-66c3bfe6-1046-438b-8070-60b070b71149/mount"}
I1024 12:40:40.225807       1 utils.go:212] ID: 24 GRPC response: {"usage":[{"available":1073741824,"total":1073741824,"unit":1}],"volume_condition":{"message":"volume is in a healthy condition"}}

Available bot commands

These commands are normally not required, but in case of issues, leave any of
the following bot commands in an otherwise empty comment in this PR:

  • /retest ci/centos/<job-name>: retest the <job-name> after unrelated
    failure (please report the failure too!)

@nixpanic nixpanic force-pushed the bug/2244562 branch 2 times, most recently from bbeced9 to 4e8007a on October 20, 2023 08:02
@nixpanic nixpanic marked this pull request as draft October 20, 2023 08:02
@nixpanic nixpanic added the component/cephfs label (Issues related to CephFS) Oct 24, 2023
@nixpanic nixpanic marked this pull request as ready for review October 24, 2023 14:25
@nixpanic nixpanic requested a review from a team October 24, 2023 14:25
Comment on lines 275 to 278
err = ns.healthChecker.StartChecker(req.GetVolumeId(), stagingTargetPath)
log.WarningLog(ctx, "failed to start healtchecker: %v", err)

Collaborator:

What if the volume is already mounted (case where csi plugin just restarted)

Member Author:

In that case, there is no health-checker.

Possibly NodeGetVolumeStats can be extended to start the checker to catch this.

Collaborator:

I think we need this. Can we have a tracker for this one?

Member Author:

Yes, makes sense. #4219 has been created for this.

@@ -599,6 +606,9 @@ func (ns *NodeServer) NodeUnstageVolume(
}

volID := req.GetVolumeId()

ns.healthChecker.StopChecker(volID)
Collaborator:

This should happen right before the umount but after all other operations, right?

Member Author:

It should be done early, yes. Unmounting may not be possible if the checker writes/reads from the mountpoint at the same time.

@@ -694,6 +711,18 @@ func (ns *NodeServer) NodeGetVolumeStats(
return nil, status.Error(codes.InvalidArgument, err.Error())
}

// health check first, return without stats if unhealthy
healthy, msg := ns.healthChecker.IsHealthy(req.GetVolumeId())
Collaborator:

What happens if NodeGetVolumeStats is called right after NodeStageVolume?

Member Author:

Before the first check has run, the volume is regarded as healthy. The assumption is that when mounting works, the volume is still healthy shortly after.
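
For context, the shape of that early return looks roughly like the fragment below. This is a hedged sketch assembled from identifiers visible in this review (ns.healthChecker.IsHealthy, csicommon.FilesystemNodeGetVolumeStats) and the standard CSI VolumeCondition message; it is not the exact ceph-csi code, and the handling of msg is assumed.

// inside NodeGetVolumeStats, after validating the request:
healthy, msg := ns.healthChecker.IsHealthy(req.GetVolumeId())
if !healthy {
	// report the abnormal condition without calling the stats helper,
	// which could hang on a slow or unresponsive MDS
	return &csi.NodeGetVolumeStatsResponse{
		VolumeCondition: &csi.VolumeCondition{
			Abnormal: true,
			Message:  fmt.Sprintf("%v", msg), // msg assumed to carry the failure reason
		},
	}, nil
}

// only a (presumably) healthy volume reaches the potentially hanging stats call
return csicommon.FilesystemNodeGetVolumeStats(ctx, ns.Mounter, targetPath, false)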

Comment on lines 744 to 748
if err != nil {
return nil, err
}

return res, nil
Collaborator:

We don't need this check and can have `return csicommon.FilesystemNodeGetVolumeStats(ctx, ns.Mounter, targetPath, false)` as it was before.

Member Author:

Ah, yes. It was modified in an earlier version to set res.VolumeCondition = ... after getting the stats. That wasn't my smartest move: getting the stats could hang, so the VolumeCondition was never set and returned 🤦‍♂️

Member Author:

Done.

Comment on lines 27 to 34
// command is what is sent through the channel to terminate the go routine.
type command string

const (
// stopCommand is sent through the channel to stop checking.
stopCommand = command("STOP")
)
Collaborator:

Can we make use of an empty struct to signal between goroutines?

Member Author:

That is possible, but what is the advantage?

I've chosen this way to make it extendable if needed; it does not matter at the moment what is sent over the channel.

Collaborator:

In most cases empty structs are preferred for signalling between goroutines; that's why the suggestion.

Member Author:

Is that a recommendation that is documented anywhere? I have seen several examples with integers and booleans, or structs that contain one or more members...
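
For reference, the pattern the suggestion refers to usually looks like the standalone sketch below (a generic illustration, not code from this PR): a chan struct{} carries no data, and closing it signals every receiver to stop.

package main

import (
	"fmt"
	"time"
)

// runChecker loops until the stop channel is closed; the channel carries
// no values, closing it is the only signal.
func runChecker(stop <-chan struct{}) {
	ticker := time.NewTicker(100 * time.Millisecond)
	defer ticker.Stop()

	for {
		select {
		case <-stop:
			fmt.Println("checker stopped")
			return
		case <-ticker.C:
			fmt.Println("checking...")
		}
	}
}

func main() {
	stop := make(chan struct{})
	go runChecker(stop)

	time.Sleep(350 * time.Millisecond)
	close(stop) // signal without sending any value
	time.Sleep(50 * time.Millisecond)
}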

Comment on lines 156 to 150
fc.healthy = false
fc.err = fmt.Errorf("health-check has not responded for %f seconds", delay.Seconds())
Collaborator:

Will there be any problem with multiple goroutines trying to update the same variables?

Member Author:

Not really. There is a very small race in case one goroutine does not enter this if-statement, and another does and updates fc.err before the first goroutine exits the function. I don't think it is a realistic failure scenario to account for.

The same can happen between this function and a concurrent runChecker(), but that also looks like quite an edge case.

If you insist, I can add some locking around updating/reading fc.healthy and fc.err.

Member Author:

Locking has been added.
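
A hedged sketch of what such locking can look like (field and method names are illustrative, not the exact ceph-csi ones): a sync.RWMutex guards the shared healthy/err state so concurrent goroutines cannot race while updating or reading it.

package main

import (
	"fmt"
	"sync"
	"time"
)

type fileChecker struct {
	mutex   sync.RWMutex
	healthy bool
	err     error
}

// markUnhealthy records that the health-check has not responded in time.
func (fc *fileChecker) markUnhealthy(delay time.Duration) {
	fc.mutex.Lock()
	defer fc.mutex.Unlock()

	fc.healthy = false
	fc.err = fmt.Errorf("health-check has not responded for %f seconds", delay.Seconds())
}

// isHealthy returns the current state under a read-lock.
func (fc *fileChecker) isHealthy() (bool, error) {
	fc.mutex.RLock()
	defer fc.mutex.RUnlock()

	return fc.healthy, fc.err
}

func main() {
	fc := &fileChecker{healthy: true}
	fc.markUnhealthy(119 * time.Second)
	fmt.Println(fc.isHealthy())
}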

return fmt.Errorf("failed to created workdir %q for health-checker: %w", workdir, err)
}

cc := newFileChecker(workdir)
Collaborator:

This always starts a FileChecker; what about a BlockChecker in the future?

Member Author:

When a BlockChecker is introduced, the Manager interface will need some modifications in any case.

Comment on lines +83 to +155
} else {
// 'cc' was stored, start it only once
cc.start()
}
Collaborator:

Can this happen only for a static PVC? Because NodeStageVolume will be called only once per volumeID.

Member Author:

NodeStageVolume might have failed/timed out after the checker was started. Calling NodeStageVolume again should be a no-op in that case; this just ensures some idempotency.


cc := newFileChecker(workdir)

// load the 'old' ConditionChecker if it exists, otherwuse store 'cc'
Collaborator:

otherwuse to otherwise

internal/health-checker/manager.go (outdated, resolved)
@nixpanic (Member Author):

@Madhu-1 and @riya-singhal31, I have manually tested this again (by scaling the MDSs down/up) and it still works as before. Could you review again and mark the addressed comments as resolved? Many thanks!

@@ -270,6 +272,9 @@ func (ns *NodeServer) NodeStageVolume(
}
}

err = ns.healthChecker.StartChecker(req.GetVolumeId(), stagingTargetPath)
log.WarningLog(ctx, "failed to start healtchecker: %v", err)
Contributor:

Suggested change
log.WarningLog(ctx, "failed to start healtchecker: %v", err)
log.WarningLog(ctx, "failed to start healthchecker: %v", err)

Member Author:

Done.

@Madhu-1 (Collaborator) left a comment

Left some questions, mostly LGTM.

}

func (hcm *healthCheckManager) StartChecker(volumeID, path string) error {
workdir := filepath.Join(path, ".csi")
Collaborator:

What if the PVC already has this path, or the application doesn't expect any folders inside the mounted directory?

Member Author:

Other filesystems also maintain their own metadata in the root, like lost+found on ext4. Applications that (do not) expect certain filesystem internal metadata should be quite rare, as they are not very portable.

Would you prefer to have an option per volume to use a different directory, or disable it?

Collaborator:

Yes, I agree on that one. This is something we are introducing, but not all application owners/developers will be aware of it. IMHO we should not write or touch anything in the path that is provided to the application pods. I would prefer the below:

  • Mount the subvolume to the stagingPath and create a new directory as in the design doc and mount that directory to the targetPath
  • Do health check only inside the stagingPath which is not available to the application pod

Member Author:

For legacy volumes, we could also do a minimal stat() on the root of the volume, instead of writing/reading a timestamp. It might not be able to detect all failure scenarios, but it would be a non-intrusive check.

Newly created volumes can have their own data subdirectory that is mapped for use inside of the container, leaving the root of the volume for contents that are not visible to applications. That can be done in PR #4125 once this one is accepted.

Collaborator:

For legacy volumes, we could also do a minimal stat() on the root of the volume, instead of writing/reading a timestamp. It might not be able to detect all failure scenarios, but it would be a non-intrusive check.

It's a temporary problem; once the node gets rebooted or the application gets restarted we will move to the new format. We can live with it.

Newly created volumes can have their own data subdirectory that is mapped for use inside of the container, leaving the root of the volume for contents that are not visible to applications. That can be done in PR #4125 once this one is accepted.

Sounds good 👍🏻

Member Author:

Added statChecker and will use that in the nodeserver for now. fileChecker is still included so that the refactoring with a base checker makes sense.
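
A hedged sketch of what a stat-based checker can look like (names, intervals, and structure are illustrative, not the exact ceph-csi implementation): a goroutine stat()s the volume path on an interval and records the last successful run; if that timestamp stops advancing, for example because stat() itself is hung on an unresponsive MDS, isHealthy reports the volume as unhealthy.

package main

import (
	"fmt"
	"os"
	"sync"
	"time"
)

type statChecker struct {
	path     string
	interval time.Duration
	timeout  time.Duration

	mutex     sync.RWMutex
	lastCheck time.Time

	stop chan struct{}
}

func newStatChecker(path string) *statChecker {
	return &statChecker{
		path:      path,
		interval:  time.Minute,
		timeout:   2 * time.Minute,
		lastCheck: time.Now(),
		stop:      make(chan struct{}),
	}
}

func (sc *statChecker) start() {
	go func() {
		ticker := time.NewTicker(sc.interval)
		defer ticker.Stop()

		for {
			select {
			case <-sc.stop:
				return
			case <-ticker.C:
				// this can hang when the MDS does not respond;
				// lastCheck then simply stops advancing
				if _, err := os.Stat(sc.path); err == nil {
					sc.mutex.Lock()
					sc.lastCheck = time.Now()
					sc.mutex.Unlock()
				}
			}
		}
	}()
}

func (sc *statChecker) isHealthy() (bool, error) {
	sc.mutex.RLock()
	defer sc.mutex.RUnlock()

	delay := time.Since(sc.lastCheck)
	if delay > sc.timeout {
		return false, fmt.Errorf("health-check has not responded for %f seconds", delay.Seconds())
	}

	return true, nil
}

func main() {
	sc := newStatChecker("/tmp")
	sc.start()
	defer close(sc.stop)

	fmt.Println(sc.isHealthy())
}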

@@ -270,6 +272,9 @@ func (ns *NodeServer) NodeStageVolume(
}
}

err = ns.healthChecker.StartChecker(req.GetVolumeId(), stagingTargetPath)
Collaborator:

The same check needs to be done at line 233 where we are returning success if the directory is already mounted


@@ -270,6 +275,9 @@ func (ns *NodeServer) NodeStageVolume(
}
}

err = ns.healthChecker.StartChecker(req.GetVolumeId(), stagingTargetPath)
log.WarningLog(ctx, "failed to start healthchecker: %v", err)
Collaborator:

This should be done inside an `if err != nil` check.

Member Author:

Done!

@@ -228,6 +230,9 @@ func (ns *NodeServer) NodeStageVolume(
return nil, status.Error(codes.Internal, err.Error())
}

err = ns.healthChecker.StartChecker(req.GetVolumeId(), stagingTargetPath)
log.WarningLog(ctx, "failed to start healthchecker: %v", err)
Collaborator:

This should be done inside an `if err != nil` check.

Member Author:

Done!

// load the 'old' ConditionChecker if it exists
old, ok := hcm.checkers.Load(volumeID)
if !ok {
return true, fmt.Errorf("no ConditionChecker for volume-id: %s", volumeID)
Collaborator:

This error will not be checked, as we are returning the volume as healthy and also the error message.

Member Author:

Added a check for that now, will start a checker in that case too.

// 'old' was loaded, cast it to ConditionChecker
cc, ok := old.(ConditionChecker)
if !ok {
return true, fmt.Errorf("failed to cast cc to ConditionChecker for volume-id %q", volumeID)
Collaborator:

This error will not be checked, as we are returning the volume as healthy and also the error message.

Member Author:

If there is an error in the internals of the health-checker, it is assumed that the volume is healthy. Only when there really is a problem with the volume is the status reported as unhealthy.

Collaborator:

Can we log and also document this one? We might need better logging to check whether the health checker is running and what the latest run details were (by looking at the logs); maybe we can do it as an enhancement later.

Member Author:

I've added more docs about it now.

@nixpanic (Member Author) commented Nov 2, 2023

Lots of manual testing done, including the following combinations:

A pod uses a PVC and:

  1. is healthy
  2. csi-cephfsplugin pods restarted, volume stays healthy
  3. MDSs are stopped, volume becomes unhealthy
  4. MDSs restarted, volume becomes healthy again
  5. csi-cephfsplugin pods restarted while the MDSs are unavailable, volume stays unhealthy
  6. csi-cephfsplugin restarted while unhealthy, MDSs restarted, volume becomes healthy again

riya-singhal31 previously approved these changes Nov 3, 2023

@riya-singhal31 (Contributor) left a comment

Thanks Niels, LGTM

@Madhu-1 (Collaborator) left a comment

LGTM, left a question and a small nit.

// volume is already staged and published.
if healthy && msg != nil {
// Start a StatChecker for the mounted targetPath, yhis prevents
// writing a file in the user-visible location. Ideally a (shared)
Collaborator:

yhis to this

// writing a file in the user-visible location. Ideally a (shared)
// FileChecker is started with the stagingTargetPath, but we can't
// get the stagingPath from the request.
err = ns.healthChecker.StartChecker(req.GetVolumeId(), targetPath, hc.StatCheckerType)
Collaborator:

Can we use something like the below to get the stagingPath from the targetPath?

func (ns *NodeServer) getStagingPath(volPath string) (string, error) {
	mounts, err := ns.Mounter.GetMountRefs(volPath)
	if err != nil {
		return "", err
	}
	for _, mount := range mounts {
		// strip the last directory from the staging path
		stp := strings.Split(mount, "/")
		stagingTargetPath := strings.Join(stp[:len(stp)-1], "/")
		if checkRBDImageMetadataStashExists(stagingTargetPath) {
			return stagingTargetPath, nil
		}
	}
	return "", fmt.Errorf("failed to get staging path for volume %s", volPath)
}

Member Author:

Something like this could be an enhancement. I am not sure if it is worth the effort and additional complexity though.

When a FileChecker is used, it would make more sense as that could detect the existence of the file holding the timestamp.

Member Author:

Adding a TODO comment with a reference to rbd.getStagingPath().

// writing a file in the user-visible location. Ideally a (shared)
// FileChecker is started with the stagingTargetPath, but we can't
// get the stagingPath from the request.
err = ns.healthChecker.StartChecker(req.GetVolumeId(), targetPath, hc.StatCheckerType)
Collaborator:

I also see a very small race for the below case:

  • During NodeUnpublish cephcsi stopped the health checker
  • Before the umount operation we got a NodeGetVolumeStats RPC call, which in turn started a health checker
  • Now we have a stale health checker which will be cleaned up only during NodeUnstage (I think)

Is it possible to have such a case?

Member Author:

I do not expect the CO to request NodeUnpublish and then still use the same publishTargetPath in a later NodeGetVolumeStats. If a CO does that, then yes, there would be a stale health checker (if it was started in a NodeGetVolumeStats call, and not in NodeStageVolume).

@nixpanic (Member Author) commented Nov 3, 2023

@Madhu-1 I have addressed your latest concerns as well. Please check again, thanks!

@mergify mergify bot dismissed riya-singhal31’s stale review November 3, 2023 08:35

Pull request has been modified.

@Madhu-1 (Collaborator) left a comment

LGTM, Thanks @nixpanic

@nixpanic (Member Author) commented Nov 3, 2023

@Mergifyio rebase

Signed-off-by: Niels de Vos <ndevos@ibm.com>
Signed-off-by: Niels de Vos <ndevos@ibm.com>
The HealthChecker is configured to use the Staging path of the volume,
with a `.csi/` subdirectory. In the future this directory could be a
directory that is not under the Published directory.

Fixes: ceph#4219
Signed-off-by: Niels de Vos <ndevos@ibm.com>
When FilesystemNodeGetVolumeStats() succeeds, the volume must be
healthy. This can be included in the VolumeCondition CSI message by
default.

Checks that detect an abnormal VolumeCondition should prevent calling
FilesystemNodeGetVolumeStats() as it is possible that the function will
hang.

Signed-off-by: Niels de Vos <ndevos@ibm.com>
mergify bot (Contributor) commented Nov 3, 2023

rebase

✅ Branch has been successfully rebased

@nixpanic (Member Author) commented Nov 3, 2023

@Mergifyio queue

mergify bot (Contributor) commented Nov 3, 2023

queue

✅ The pull request has been merged automatically

The pull request has been merged automatically at 4d3b1fc

@mergify mergify bot added the ok-to-test label (to trigger E2E tests) Nov 3, 2023
@ceph-csi-bot (Collaborator) commented:

/test ci/centos/upgrade-tests-cephfs
/test ci/centos/upgrade-tests-rbd
/test ci/centos/k8s-e2e-external-storage/1.27
/test ci/centos/k8s-e2e-external-storage/1.26
/test ci/centos/mini-e2e-helm/k8s-1.27
/test ci/centos/k8s-e2e-external-storage/1.28
/test ci/centos/mini-e2e/k8s-1.27
/test ci/centos/mini-e2e-helm/k8s-1.26
/test ci/centos/mini-e2e-helm/k8s-1.28
/test ci/centos/mini-e2e/k8s-1.28
/test ci/centos/mini-e2e/k8s-1.26

@ceph-csi-bot ceph-csi-bot removed the ok-to-test label (to trigger E2E tests) Nov 3, 2023
@mergify mergify bot merged commit 4d3b1fc into ceph:devel Nov 3, 2023
34 checks passed
Labels
component/cephfs Issues related to CephFS
4 participants