Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Is there a better to way to render a visualization when no data exists? #402

Open
npatki opened this issue Jul 25, 2023 · 0 comments
Open
Labels
feature:reports Related to any of the generated reports feature request Request for a new feature

Comments

@npatki
Copy link
Contributor

npatki commented Jul 25, 2023

Environment details

  • SDMetrics version: Dev Branch (for upcoming 0.11.0)

Problem description

Some datasets may not have many statistical columns*, so there may not be enough data for certain types of evaluation.
*Statistical columns are the modeled ones: categorical, boolean, datetime, numerical.

For example, consider a multi-table dataset where I only have 1 numerical column in a table (the others are all ID or PII types). If I try to run the quality report, there is nothing to compute for Column Pair Trends -- as this property requires 2 or more statistical columns for computation.

Observed

The report is correctly identifying that there is nothing to be computed. As expected, the details are blank.

report.get_details('Column Pair Trends')
image

However, the visualization is also blank because there was nothing computed.
image

Expected

It is a bit odd to see this type of visualization. Perhaps we could render a single blank graph with a text overlay that says something like:

"No data to display. This property requires at least 2 or more statistical columns within a single table."

Additional Context

It may be worth auditing and revisiting other cases where this can trigger as well. For example:

  • Quality report, column pair trends property: If there are only categorical columns, then there is nothing to display in the heatmaps
  • Quality report, column shapes property: If there are no statistical columns
  • Diagnostic report, all properties: if there are no statistical columns
  • Diagnostic report, boundaries property: If there are only categorical columns, this is not computed
@npatki npatki added feature request Request for a new feature feature:reports Related to any of the generated reports labels Jul 25, 2023
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
feature:reports Related to any of the generated reports feature request Request for a new feature
Projects
None yet
Development

No branches or pull requests

1 participant