Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Precise histogram can miss rare values #401

Open
darabos opened this issue May 2, 2023 · 0 comments
Open

Precise histogram can miss rare values #401

darabos opened this issue May 2, 2023 · 0 comments
Labels
bug Something isn't working

Comments

@darabos
Copy link
Contributor

darabos commented May 2, 2023

The histograms in LynxKite are based on a random sample. But you have a "precise" checkbox in case you want to go through the whole dataset. But it looks like if there is a category that was not represented in the sample, it won't be counted in the precise counts either!

At least that's my guess. What I'm seeing is that a category with just 3 instances is not listed in the histogram in a graph with ~50,000 nodes. But I can see it in SQL. 💀

@darabos darabos added the bug Something isn't working label May 2, 2023
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
bug Something isn't working
Projects
None yet
Development

No branches or pull requests

1 participant