Is your feature request related to a problem? Please describe.
I'm looking into implementing some basic monitoring to ensure the overall system is healthy. I occasionally observe problems such as networking troubles. I'd like to have an API endpoint that I can integrate with our monitoring system.
Describe the solution you'd like
The most basic and meaningful information would be an endpoint reporting the number of outstanding and queued submissions from the Dispatcher and Ingester modules. Those numbers give the best overall sense of whether the system is working or not.
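To make the request more concrete, here is a rough sketch of how a monitoring check could consume such an endpoint. The payload shape (`ingester.queued`, `dispatcher.outstanding`), the `QueueLimits` class, and the threshold values are all hypothetical; they only illustrate the kind of information I have in mind and do not describe any existing API.

```python
from dataclasses import dataclass
from typing import List, Tuple


@dataclass(frozen=True)
class QueueLimits:
    # Hypothetical thresholds; real values depend on the deployment's normal load.
    ingester_queued: int = 1000
    dispatcher_outstanding: int = 500


def check_queues(payload: dict, limits: QueueLimits = QueueLimits()) -> Tuple[str, List[str]]:
    """Map a (hypothetical) queue-status payload to a monitoring state.

    Assumed payload shape, not an existing API response:
        {"ingester": {"queued": 12}, "dispatcher": {"outstanding": 34}}
    """
    ingester_queued = payload.get("ingester", {}).get("queued")
    dispatcher_outstanding = payload.get("dispatcher", {}).get("outstanding")

    # If the counters are missing, the check cannot say anything useful.
    if not isinstance(ingester_queued, int) or not isinstance(dispatcher_outstanding, int):
        return "UNKNOWN", ["expected counters missing from response"]

    reasons = []
    if ingester_queued > limits.ingester_queued:
        reasons.append(f"ingester queued submissions: {ingester_queued}")
    if dispatcher_outstanding > limits.dispatcher_outstanding:
        reasons.append(f"dispatcher outstanding submissions: {dispatcher_outstanding}")

    return ("WARNING" if reasons else "OK"), reasons
```

With two plain counters like these, the monitoring side stays trivial: it fetches the JSON, passes it to something like `check_queues`, and alerts on anything other than `OK`.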
Describe alternatives you've considered
The current /system/status/ALL/ API is a great start, but it reports only whether those modules are enabled, not how well they are working.
The Dashboard tab shows this information, but it requires manual observation.
I assume this information could be extracted from Elasticsearch metrics, but digging into deeply internal data doesn't look like the best way to get just the basic info.
More information, such as error counts and per-service stats, could of course be useful, but there is a reason why it is delivered over websockets rather than a typical API. Limited, basic info should be enough to indicate whether there is a general problem or not.
Additional context
The Dashboard tab is great, but it's always nice to connect to a centralized monitoring system :)