
Add guidance for adding new metrics #5116

Open · wants to merge 6 commits into base: dev
Conversation

BrynCooke (Contributor)

Until now there has been little guidance on adding new metrics to the router. This PR expands the dev doc to include this.


Checklist

Complete the checklist (and note appropriate exceptions) before the PR is marked ready-for-review.

  • Changes are compatible¹
  • Documentation² completed
  • Performance impact assessed and acceptable
  • Tests added and passing³
    • Unit Tests
    • Integration Tests
    • Manual Tests

Exceptions

Note any exceptions here

Notes

Footnotes

  1. It may be appropriate to bring upcoming changes to the attention of other (impacted) groups. Please endeavour to do this before seeking PR approval. The mechanism for doing this will vary considerably, so use your judgement as to how and when to do this.

  2. Configuration is an important part of many changes. Where applicable please try to document configuration examples.

  3. Tick whichever testing boxes are applicable. If you are adding Manual Tests, please document the manual testing (extensively) in the Exceptions.


github-actions bot commented May 8, 2024

@BrynCooke, please consider creating a changeset entry in /.changesets/. These instructions describe the process and tooling.


router-perf bot commented May 8, 2024

CI performance tests

  • step - Basic stress test that steps up the number of users over time
  • events_big_cap_high_rate_callback - Stress test for events with a lot of users, deduplication enabled and high rate event with a big queue capacity using callback mode
  • large-request - Stress test with a 1 MB request payload
  • events - Stress test for events with a lot of users and deduplication ENABLED
  • xxlarge-request - Stress test with a 100 MB request payload
  • events_without_dedup - Stress test for events with a lot of users and deduplication DISABLED
  • xlarge-request - Stress test with a 10 MB request payload
  • step-jemalloc-tuning - Clone of the basic stress test for jemalloc tuning
  • events_callback - Stress test for events with a lot of users and deduplication ENABLED in callback mode
  • no-graphos - Basic stress test, no GraphOS.
  • reload - Reload test over a long period of time at a constant rate of users
  • events_big_cap_high_rate - Stress test for events with a lot of users, deduplication enabled and high rate event with a big queue capacity
  • events_without_dedup_callback - Stress test for events with a lot of users and deduplication DISABLED using callback mode
  • const - Basic stress test that runs with a constant number of users

@BrynCooke BrynCooke changed the title from "Add guidance for adding mew metrics" to "Add guidance for adding new metrics" May 8, 2024
@BrynCooke BrynCooke requested review from Geal and bnjjj May 8, 2024 09:18
Review thread on dev-docs/metrics.md (outdated, resolved)
@BrynCooke BrynCooke requested a review from abernix May 13, 2024 12:33
@Geal (Contributor) left a comment:

a lot of what is encoded in this document is unclear to me, I think we should discuss it a bit more

## Adding new metrics
There are different types of metrics.

* Static - Used by us to monitor feature usage.
Suggested change:
- * Static - Used by us to monitor feature usage.
+ * Static - Used by Router developers to monitor feature usage.


let's assume router users will end up looking at the dev docs

Comment on lines +191 to +193
> Why are static metrics no longer recommended for users to use directly?
>
> They can, but usually it'll be only a starting point for them. We can't predict the things that users will want to monitor, and if we tried we would blow up the cardinality of our metrics resulting in high costs for our users via their APMs.

that is not clear to me. What do we mean by "users using static metrics directly?" Is it when they would add that in their custom plugin? (which would not increase cardinality for all users) Or asking us to add a new metric to the router?


### Static metrics
When adding a new feature to the Router you must also add static metrics to monitor that feature's usage. These metrics are always on: users cannot disable or change them.
They must be low cardinality and must not leak any sensitive information. They exist primarily so that we can see how our features are used and inform future development.

a lot of static metrics actually monitor standard router operations and are not for us to collect data, but for users to observe the router.
If we want this to be the defining point, let's maybe not call them static vs dynamic metrics, but internal vs monitoring or user metrics, something like that?
I'd prefer we keep the distinction between static metrics as defined directly with tracing, and dynamic metrics as the ones defined by custom instruments that can be activated with runtime conditions, and have another clear separation between the metrics used for internal reporting (as with the `apollo.router.operations` and `apollo.router.config` prefixes) and the user-facing ones.

* Look at the [OTel semantic conventions](https://opentelemetry.io/docs/specs/semconv/general/metrics/)
* Notify `#proj-router-analytics` channel in Slack.
* Add the metrics to the spreadsheet linked in the `#proj-router-analytics` channel in Slack.
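To make the distinction concrete, a static metric is typically recorded inline at the call site. The sketch below assumes a `u64_counter!`-style metrics macro like the ones used in the router codebase; the metric name, description, and attribute are hypothetical examples, not taken from this PR:

```rust
// Illustrative sketch only: assumes a `u64_counter!` metrics macro as used
// elsewhere in the apollo-router codebase. The metric name and the
// `my_feature.mode` attribute are hypothetical.
u64_counter!(
    // Static, low-cardinality name following the
    // `apollo.router.operations.<feature>` convention.
    "apollo.router.operations.my_feature",
    // Description recorded with the instrument.
    "Number of requests that used my_feature",
    // Increment by one for this occurrence.
    1,
    // Keep attributes low cardinality: a small fixed set of values,
    // never user-supplied strings such as operation names.
    my_feature.mode = "enabled"
);
```

A dynamic metric, by contrast, is not hard-coded at a call site: the user defines it through custom instruments in telemetry configuration, with runtime conditions controlling when it is recorded.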


code example of what is a static metric, to be sure which is which between static and dynamic?


When defining new operation metrics, use the following conventions:

**Name:** `apollo.router.operations.<feature>` - (counter)

some of the apollo.router.operations metrics are actually monitored by users. What is the strategy here? Do we keep them available for users?
