[feature request] OpenTelemetry endpoint in the backend #13

erdnaxe · 2022-10-13T14:51:11Z

This needs discussion as we mostly don't want to bloat Tulip.
It would be nice to have a /metrics endpoint in the backend following the OpenTelemetry format.
This could allow teams to monitor their instance and be alerted when something very wrong is happening (before the scoreboard).

Metrics wishlist:

(Counter per service) Total count of TCP flows in the MongoDB
(Counter per service) Total size in bytes of all payloads in the MongoDB
(Counter per service) Total amount of FLAG OUT / IN
(Counter) Total amount of backend API requests
(Gauge) Average duration of backend API response time

I don't believe we should expose per-TCP flow information as the Tulip frontend is already made for that.

The text was updated successfully, but these errors were encountered:

ItsShadowCone · 2022-10-17T10:11:54Z

massive +1

i would go further than just "health" checks, i agree with no per-TCP flow information, but we should also group by:

relevant data per-tick, probably also per-service, maybe rolling counters so it can be properly ingested into time-series database
relevant data per-tag, per-service and optionally also per-tick. not just flag in and out.

Simplify data format, remove hex data from flow item. The old format is terribly inefficient, and moving it to gridFS is slow and clunky, even if you were to manually handle appends. Data is capped at 15 MB, to stay well under the document limit of 16MB. Any data beyond that is discarded, but the start of the session will remain searchable.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[feature request] OpenTelemetry endpoint in the backend #13

[feature request] OpenTelemetry endpoint in the backend #13

erdnaxe commented Oct 13, 2022 •

edited

ItsShadowCone commented Oct 17, 2022

[feature request] OpenTelemetry endpoint in the backend #13

[feature request] OpenTelemetry endpoint in the backend #13

Comments

erdnaxe commented Oct 13, 2022 • edited

ItsShadowCone commented Oct 17, 2022

erdnaxe commented Oct 13, 2022 •

edited