feat(generic-metrics): Add dropped percentiles to aggregation options #70824

ayirr7 · 2024-05-13T22:45:50Z

Refactoring the way we used an older option for enabling 10s granularity so its purpose is clearer.

Enable org-level and use case-level of disabling percentiles using the newly created Sentry options.

codecov · 2024-05-13T23:18:25Z

Codecov Report

All modified and coverable lines are covered by tests ✅

Project coverage is 77.92%. Comparing base (fea4348) to head (5626223).
Report is 173 commits behind head on master.

Additional details and impacted files

@@            Coverage Diff             @@
##           master   #70824      +/-   ##
==========================================
+ Coverage   77.87%   77.92%   +0.04%     
==========================================
  Files        6529     6525       -4     
  Lines      290909   290480     -429     
  Branches    50338    50254      -84     
==========================================
- Hits       226548   226348     -200     
+ Misses      58122    57874     -248     
- Partials     6239     6258      +19

Files	Coverage Δ
src/sentry/options/defaults.py	`100.00% <100.00%> (ø)`
...ntry/sentry_metrics/aggregation_option_registry.py	`100.00% <100.00%> (ø)`

... and 162 files with indirect coverage changes

nikhars · 2024-05-15T20:25:28Z

src/sentry/sentry_metrics/aggregation_option_registry.py

+        }
+
+
+def get_aggregation_options(mri: str, org_id: int) -> dict[AggregationOption, TimeWindow] | None:


Since we are only able to write 1 aggregation option at the moment, what if we added a test case which validated that assumption. I am not exactly sure whether that constraint is enforced somewhere in the code or not. But it would be good to have the constrain in place to avoid potential problems in the near future.

Good point. Added some logic to our existing unit tests where there are a variety of aggregation options set up, which cover all the use cases etc. The unit test asserts that only 1 exists per metric bucket payload, though.

Once we have multiple aggregation option support, we can remove and refactor these tests to communicate that change in expectations.

What do you think of these changes?

What if we change the signature of this function to -> (AggregationOption, TimeWindow) | None to enforce this in application code? Once we support multiple options, we can easily change the signature again.

The indexer already assumes a dict output from calling get_aggregation_options and basically chooses the first element. So in any case, we are always guaranteed to have 1 aggregation option per payload.

I can change the typing and indexer behavior in a subsequent PR, but for now I think I'd like to not have this PR touch too many files at once.

I added some unit tests which should hopefully make the behavior/sequence of operations in get_aggregation_options clearer.

jan-auer · 2024-05-17T07:58:39Z

src/sentry/sentry_metrics/aggregation_option_registry.py

+        }
+
+
+def get_aggregation_options(mri: str, org_id: int) -> dict[AggregationOption, TimeWindow] | None:


What if we change the signature of this function to -> (AggregationOption, TimeWindow) | None to enforce this in application code? Once we support multiple options, we can easily change the signature again.

jan-auer · 2024-05-17T08:01:00Z

src/sentry/sentry_metrics/aggregation_option_registry.py

+
+    # Set various aggregation options that
+    # are use case-wide aggregations
+    set_use_case_aggregation_options()


This function internally modifies a global, but this is done on every call of get_aggregation_options. The global isn't really needed therefore, it could be just a local variable. This would further allow you to simplify and inline the checks into a single if-chain below that would also more clearly show precedence of the options.

src/sentry/sentry_metrics/aggregation_option_registry.py

ayirr7 · 2024-05-21T14:53:20Z

Will add more unit tests to check that the logic in get_aggregation_options is correct

src/sentry/sentry_metrics/aggregation_option_registry.py

jan-auer · 2024-05-22T14:58:23Z

src/sentry/sentry_metrics/aggregation_option_registry.py

-        return USE_CASE_AGG_OPTION[use_case_id]
+    elif use_case_id in use_case_agg_options:
+        if org_id not in drop_uc_org_override.get(use_case_id.value, []):
+            return use_case_agg_options[use_case_id]


As an optional suggestion, this could be easier to read and follow without use_case_agg_options. This branch could check directly the options and then early return the aggregation option.

Pseudo code skeleton for the entire function:

if org_id in options.get("per-org", []): return DISABLE_PERCENTILES opt = options.get("with-override", {}) if use_case in opt and org_id not in opt[use_case]: return DISABLE_PERCENTILES if mri in METRIC_ID_AGG_OPTION: return METRIC_ID_AGG_OPTION[mri] if use_case == CUSTOM and options.get("10s"): return TEN_SECOND return {}

jan-auer · 2024-05-22T15:00:34Z

src/sentry/options/defaults.py

@@ -1260,6 +1260,15 @@
    flags=FLAG_AUTOMATOR_MODIFIABLE,
 )

+# Option to remove support for percentiles on a per-use case basis.
+# Add the use case to list to disable percentiles.


nit: This isn't a list, the use case must be an entry with a list value (otherwise causing a crash). Should we make this more clear?

Good callout, thanks

add no percentiles

e8fc76e

ayirr7 requested a review from a team as a code owner May 13, 2024 22:45

ayirr7 marked this pull request as draft May 13, 2024 22:45

github-actions bot added the Scope: Backend Automatically applied to PRs that change backend components label May 13, 2024

vercel bot deployed to Preview May 13, 2024 22:48 View deployment

refactor aggregation options a bit

8ce2cf4

vercel bot deployed to Preview May 14, 2024 21:01 View deployment

ayirr7 mentioned this pull request May 14, 2024

Use feature flags to select correct aggregation options in the indexer getsentry/snuba#5914

Open

ayirr7 added 3 commits May 15, 2024 10:46

Merge remote-tracking branch 'origin' into refactor-add-aggregations

5325df6

Merge remote-tracking branch 'origin' into refactor-add-aggregations

05f1627

fix comment

d0df302

ayirr7 marked this pull request as ready for review May 15, 2024 17:51

vercel bot deployed to Preview May 15, 2024 17:55 View deployment

fix typing

774ffb8

vercel bot deployed to Preview May 15, 2024 17:59 View deployment

nikhars reviewed May 15, 2024

View reviewed changes

add assert

130c006

vercel bot deployed to Preview May 16, 2024 20:19 View deployment

add aggregation options to test

fc4e5f3

vercel bot deployed to Preview May 16, 2024 21:19 View deployment

ayirr7 requested a review from nikhars May 16, 2024 22:31

ayirr7 added 2 commits May 16, 2024 18:33

rename the aggregation option for better consistency throughout pipeline

435b5cc

Merge remote-tracking branch 'origin' into refactor-add-aggregations

a59e8bd

vercel bot deployed to Preview May 16, 2024 22:38 View deployment

jan-auer reviewed May 17, 2024

View reviewed changes

ayirr7 added 2 commits May 21, 2024 10:50

refactor

6727b1f

Merge remote-tracking branch 'origin' into refactor-add-aggregations

d54c68b

vercel bot deployed to Preview May 21, 2024 14:56 View deployment

fix test

908202c

vercel bot deployed to Preview May 22, 2024 03:28 View deployment

fix typing

077d7ce

vercel bot deployed to Preview May 22, 2024 03:38 View deployment

add unit tests for aggregation logic

2e2b00c

vercel bot deployed to Preview May 22, 2024 04:56 View deployment

ayirr7 requested a review from jan-auer May 22, 2024 05:02

jan-auer reviewed May 22, 2024

View reviewed changes

improve tests and add comments

6862f68

vercel bot deployed to Preview May 22, 2024 18:43 View deployment

clean up formatting of the options comment

8dfb496

vercel bot deployed to Preview May 22, 2024 18:47 View deployment

make logic simpler

a76eef4

vercel bot deployed to Preview May 23, 2024 03:04 View deployment

jan-auer approved these changes May 23, 2024

View reviewed changes

ayirr7 added 2 commits May 23, 2024 12:27

simpler logic (no org-based overrides for disabling percentiles)

ef4da13

simplify test (make org id 1 again)

e36737b

vercel bot deployed to Preview May 23, 2024 16:30 View deployment

remove extra line that was randomly added

5626223

vercel bot deployed to Preview May 23, 2024 16:33 View deployment

vercel bot deployed to Preview May 23, 2024 16:36 View deployment

jan-auer approved these changes May 23, 2024

View reviewed changes

ayirr7 merged commit 7ae185f into master May 23, 2024
49 checks passed

ayirr7 deleted the refactor-add-aggregations branch May 23, 2024 19:19

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat(generic-metrics): Add dropped percentiles to aggregation options #70824

feat(generic-metrics): Add dropped percentiles to aggregation options #70824

ayirr7 commented May 13, 2024 •

edited

codecov bot commented May 13, 2024 •

edited

nikhars May 15, 2024

ayirr7 May 16, 2024 •

edited

jan-auer May 17, 2024

ayirr7 May 22, 2024 •

edited

jan-auer May 17, 2024

jan-auer May 17, 2024

ayirr7 May 21, 2024

ayirr7 commented May 21, 2024

jan-auer May 22, 2024 •

edited

jan-auer May 22, 2024

ayirr7 May 22, 2024

		}


		def get_aggregation_options(mri: str, org_id: int) -> dict[AggregationOption, TimeWindow] \| None:

feat(generic-metrics): Add dropped percentiles to aggregation options #70824

feat(generic-metrics): Add dropped percentiles to aggregation options #70824

Conversation

ayirr7 commented May 13, 2024 • edited

codecov bot commented May 13, 2024 • edited

Codecov Report

nikhars May 15, 2024

Choose a reason for hiding this comment

ayirr7 May 16, 2024 • edited

Choose a reason for hiding this comment

jan-auer May 17, 2024

Choose a reason for hiding this comment

ayirr7 May 22, 2024 • edited

Choose a reason for hiding this comment

jan-auer May 17, 2024

Choose a reason for hiding this comment

jan-auer May 17, 2024

Choose a reason for hiding this comment

ayirr7 May 21, 2024

Choose a reason for hiding this comment

ayirr7 commented May 21, 2024

jan-auer May 22, 2024 • edited

Choose a reason for hiding this comment

jan-auer May 22, 2024

Choose a reason for hiding this comment

ayirr7 May 22, 2024

Choose a reason for hiding this comment

ayirr7 commented May 13, 2024 •

edited

codecov bot commented May 13, 2024 •

edited

ayirr7 May 16, 2024 •

edited

ayirr7 May 22, 2024 •

edited

jan-auer May 22, 2024 •

edited