[FEATURE] Dimension tracking #342

Open
mateuszklimek opened this issue Aug 24, 2022 · 2 comments

Comments

@mateuszklimek
Member

Tell us about the problem you're trying to solve

We often want to detect anomalies in the row counts of specific (categorical) values in a column.
Anomaly detection on those counts would be really useful for some re_data users.
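
To make the request concrete, here is a minimal sketch in plain Python of what "anomalies on counts per categorical value" could look like. It is not re_data code: the function names, the `day` key, and the z-score threshold of 3.0 are all assumptions made for illustration.

```python
from collections import Counter, defaultdict
from statistics import mean, stdev


def counts_per_value(rows, column):
    """Count rows per (day, categorical value). Each row is a dict with a 'day' key."""
    counts = defaultdict(Counter)
    for row in rows:
        counts[row["day"]][row[column]] += 1
    return counts


def z_score(history, latest):
    """Z-score of `latest` against earlier observations; 0.0 when it cannot be computed."""
    if len(history) < 2:
        return 0.0
    sigma = stdev(history)
    return 0.0 if sigma == 0 else (latest - mean(history)) / sigma


def flag_count_anomalies(rows, column, threshold=3.0):
    """Return {value: z_score} for values whose latest daily count is anomalous."""
    counts = counts_per_value(rows, column)
    days = sorted(counts)
    values = {value for day in days for value in counts[day]}
    flagged = {}
    for value in sorted(values):
        # A value missing on a given day counts as 0 (Counter default).
        series = [counts[day][value] for day in days]
        score = z_score(series[:-1], series[-1])
        if abs(score) >= threshold:
            flagged[value] = score
    return flagged
```

Each distinct value of the column gets its own time series of counts, and the most recent day is compared against that value's own history instead of the table-wide total.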

@fgiroud

fgiroud commented Sep 15, 2022

That would be an awesome feature to have.
In many scenarios, we track anomalies and metrics on a per-dimension basis (like country, brand, or category).
I see a few benefits:

  • Anomaly detection would work better when the dimension values are not equally distributed. Without per-dimension tracking, an anomaly in an under-represented dimension value could be missed.
  • It would save a lot of debugging time, especially for tests, when a test fails only for a dimension value that doesn't really matter.
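
The first point can be illustrated with a small worked example. The numbers below are made up, and the z-score is only a stand-in for whatever anomaly model is actually used. A small dimension value ("PL") drops to zero while a large one ("US") stays normal, and the total barely moves.

```python
from statistics import mean, stdev


def z_score(history, latest):
    """Z-score of `latest` against `history`; 0.0 if the history has no variation."""
    sigma = stdev(history)
    return 0.0 if sigma == 0 else (latest - mean(history)) / sigma


# Hypothetical daily row counts per country (illustrative data only).
history = {
    "US": [1000, 1020, 980, 1010, 990],
    "PL": [10, 11, 9, 10, 10],
}
latest = {"US": 1005, "PL": 0}

# Aggregate view: sum all dimension values per day.
total_history = [sum(day_counts) for day_counts in zip(*history.values())]
total_z = z_score(total_history, sum(latest.values()))

# Per-dimension view: score each value against its own history.
per_dimension_z = {key: z_score(history[key], latest[key]) for key in history}

print(f"total: {total_z:.2f}")
for key, score in per_dimension_z.items():
    print(f"{key}: {score:.2f}")
```

In this example the total scores well inside a typical threshold of 3, while "PL" on its own scores far outside it, which is the case where an aggregate-only metric would miss the problem.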

From a UI perspective, we would need to be able to stack the metrics per dimension, with basically everything available "by dimension", so I'm guessing that's a big effort for re_data.

We tried to implement something similar and ran into the following issues:

  • When the dimension contains too many values, the UI becomes extremely difficult to read - on some occasions (20K dimensions) it simply breaks our testing UI, forcing us to disable the metrics by dimension
  • Need to "stack" all the charts
  • Performance bottlenecks
  • Having to nest basically every test configuration (level, alerts, config) per dimension
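
One possible mitigation for the high-cardinality problem, sketched here as an assumption and not as something we or re_data implemented, is to track only the most frequent values and fold everything else into a single bucket. The function name, `max_values`, and the `__other__` label are hypothetical.

```python
from collections import Counter


def cap_dimension_values(values, max_values=10, other_label="__other__"):
    """Keep the `max_values` most frequent values; map all others to `other_label`."""
    top = {value for value, _ in Counter(values).most_common(max_values)}
    return [value if value in top else other_label for value in values]


# Illustrative usage: only the two most common brands keep their own series.
brands = ["a"] * 5 + ["b"] * 3 + ["c", "d"]
capped = cap_dimension_values(brands, max_values=2)
print(Counter(capped))
```

This bounds the number of charts and metric series regardless of how many distinct values the column has, at the cost of hiding anomalies inside the catch-all bucket.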

@whanata

whanata commented Apr 19, 2023

+1

3 participants