You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Based on today's AHM discussion:
It would nice to have something like a /healthcheck endpoint that could report substantial problems and could be monitored.
So for example,
When /healthcheck is called, it could make sure that the KP info cache is less than 30 minutes old
There aren't a bunch of stale active processes
Relay any major errors*
It can start small, but ideally be flexible so that we could add more health checks, too.
Footnote* I have often mused about somehow have a response.error() option that is something like tell_a_human=True or something, where not only did the processing end in error, but this condition really ought to be relayed to an administrator rather than buried in a log file that no one is likely to read.
The text was updated successfully, but these errors were encountered:
Based on today's AHM discussion:
It would nice to have something like a /healthcheck endpoint that could report substantial problems and could be monitored.
So for example,
It can start small, but ideally be flexible so that we could add more health checks, too.
Footnote* I have often mused about somehow have a response.error() option that is something like tell_a_human=True or something, where not only did the processing end in error, but this condition really ought to be relayed to an administrator rather than buried in a log file that no one is likely to read.
The text was updated successfully, but these errors were encountered: