Add a /health endpoint which does not require authentication #147

anguslees · 2021-03-28T00:22:36Z

Trying to deploy restic/rest-server:0.10.0 in k8s. I want a URL endpoint that I can use for "livenessProbe" (GET, should return 200).

As far as I can see from some testing, guided by mux.go - everything returns 401 Unauthorized. At the moment, I'm using a naive tcp 'liveness' probe, which is not nearly as expressive/accurate.

It would be nice if there was a /health endpoint (or similar) which did not require authentication, and returned 200 (assuming the server was ok).

Enrico204 · 2021-08-23T10:33:44Z

I created a WIP pull request with a proposal for a /health endpoint. For now, the endpoint checks that at least 8 MB of space is free (*) and that the repository path is writable. Should we check for more things?

Maybe we can add some other checks when/if external auth backends are implemented (see #111 and #70).

(*) I picked this value because it is the size of one pack; however, I don't know whether it's correct. Feel free to propose a different value.

MichaelEischer · 2021-09-24T20:02:41Z

Which checks do we really need for a /health endpoint? Checking whether there is enough free space to store at least one pack file is mostly of academic interest. After all, it takes only a very short time to fill up the remaining free space completely, at which point the server will stop accepting new uploads. And I don't think that there's a right answer to what the limit should be; probably every admin will want to use a different limit.

If I understand liveness probes correctly, they are used to restart stuck containers. That is, a failed health probe would cause a container restart. However, restarting the rest-server container once the disk is full is highly problematic, as that would prevent (read) access to the backup repositories.

And I guess a similar reasoning applies to whether the repository path is writeable (although that might be less of a problem).

anguslees · 2021-09-24T22:58:38Z

Agreed, the focus should be on whether killing+restarting this container would help. This isn't a substitute for more comprehensive monitoring.

A quite reasonable first step is to add no extra logic and just respond with 200 OK immediately from your main HTTP event handler. Even that trivial check still confirms that the program is running, is listening on the correct port, has completed any startup steps, isn't deadlocked or thrashing on OOM, etc.

Just for completeness, don't make 'healthiness' depend on reachability/health of some other remote service. This is a common error and leads to cascading failures.

wojas · 2021-10-18T09:44:33Z

It would make sense to just add a handler for /health that always returns 200.

I can think of the following additional things to check:

Check if the .htpasswd file is readable if htpasswd auth is enabled.
Check if the repo root directory exists.

Restarting rest-server will not resolve any of these, but I can imagine that the failure state of the container/pod is useful to administrators. But I think that rest-server will actually fail to start in those cases anyway, in which case adding these checks is not that useful.
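If someone did want those two checks, a rough Go sketch could look like this. The function name and parameters are made up for illustration; an empty htpasswd path is taken to mean that htpasswd auth is disabled.

```go
package main

import (
	"fmt"
	"os"
)

// checkPrereqs verifies that the htpasswd file can be opened for reading
// (skipped when htpasswdPath is empty) and that repoRoot is an existing
// directory.
func checkPrereqs(htpasswdPath, repoRoot string) error {
	if htpasswdPath != "" {
		f, err := os.Open(htpasswdPath)
		if err != nil {
			return fmt.Errorf("htpasswd file not readable: %w", err)
		}
		f.Close()
	}
	info, err := os.Stat(repoRoot)
	if err != nil {
		return fmt.Errorf("repo root: %w", err)
	}
	if !info.IsDir() {
		return fmt.Errorf("repo root %s is not a directory", repoRoot)
	}
	return nil
}
```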

As discussed, free disk space is something that can and should be monitored outside of rest-server.

Perhaps we could add a Prometheus metric for write errors?
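As a rough idea of what such a counter involves, here is a standard-library-only Go sketch that writes the Prometheus text exposition format by hand. The metric name rest_server_write_errors_total is hypothetical, and a real change would presumably register a counter with whatever metrics library the server already uses for /metrics rather than formatting the output itself.

```go
package main

import (
	"fmt"
	"io"
	"strings"
	"sync/atomic"
)

// writeErrors counts failed writes since process start.
var writeErrors uint64

// recordWriteError would be called wherever a write to the repository fails.
func recordWriteError() {
	atomic.AddUint64(&writeErrors, 1)
}

// writeMetrics emits the counter in Prometheus text exposition format.
// The metric name is hypothetical.
func writeMetrics(w io.Writer) error {
	_, err := fmt.Fprintf(w,
		"# HELP rest_server_write_errors_total Failed repository writes.\n"+
			"# TYPE rest_server_write_errors_total counter\n"+
			"rest_server_write_errors_total %d\n",
		atomic.LoadUint64(&writeErrors))
	return err
}

// renderMetrics returns the exposition text as a string.
func renderMetrics() string {
	var sb strings.Builder
	_ = writeMetrics(&sb) // strings.Builder never returns a write error
	return sb.String()
}
```

An alert on the rate of this counter would cover the "disk full" and "path not writable" cases without a failing health probe restarting the container.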

wojas · 2021-10-18T09:48:24Z

As a workaround, you could set the -prometheus-no-auth flag to disable auth on the /metrics endpoint, if you do not mind exposing the metrics or have a reverse proxy that can restrict access to that path in front of the service.

queeup · 2024-02-21T12:01:13Z

I would love to have this for checking whether my rest-server is up before triggering a remote backup with systemd.

Right now I am using this as a check:

curl --silent --fail --head -L http://192.168.1.100:8000/myrestic-backup-repo-name/config
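For anyone who wants the same check from Go instead of curl, a rough equivalent might look like the sketch below. checkUp sends a HEAD request, follows redirects (Go's default client behaviour), and treats any status of 400 or above as a failure, similar to --head, -L and --fail. The 5-second timeout is an assumption, and demo only runs the check against a throwaway in-process server.

```go
package main

import (
	"fmt"
	"net/http"
	"net/http/httptest"
	"time"
)

// checkUp sends a HEAD request to url and returns an error on transport
// failures or on any HTTP status >= 400.
func checkUp(url string) error {
	client := &http.Client{Timeout: 5 * time.Second} // timeout is an assumption
	resp, err := client.Head(url)
	if err != nil {
		return err
	}
	resp.Body.Close()
	if resp.StatusCode >= 400 {
		return fmt.Errorf("unexpected status %s", resp.Status)
	}
	return nil
}

// demo runs checkUp against a local test server that only knows one path,
// returning the result for that path and for a missing one.
func demo() (okErr, failErr error) {
	const cfg = "/myrestic-backup-repo-name/config"
	srv := httptest.NewServer(http.HandlerFunc(func(w http.ResponseWriter, r *http.Request) {
		if r.URL.Path != cfg {
			http.NotFound(w, r)
			return
		}
		w.WriteHeader(http.StatusOK)
	}))
	defer srv.Close()
	return checkUp(srv.URL + cfg), checkUp(srv.URL + "/missing")
}
```

Like the curl command, this only succeeds if the config path is reachable without credentials, or if credentials are supplied some other way.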

Enrico204 mentioned this issue Aug 23, 2021

WIP: Add health endpoint #159


DtxdF mentioned this issue Dec 6, 2023

Improve check of services, Fix #19 erohtar/Dasherr#23
