Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Replace misleading numeric values in covid_hosp_facility? #1366

Open
nmdefries opened this issue Jan 4, 2024 · 2 comments
Open

Replace misleading numeric values in covid_hosp_facility? #1366

nmdefries opened this issue Jan 4, 2024 · 2 comments
Labels
acquisition changes acquisition logic data quality

Comments

@nmdefries
Copy link
Contributor

nmdefries commented Jan 4, 2024

covid_hosp_facility total_adult_patients_hospitalized_confirmed_covid_7_day_avg contains lots of -999999, but also has NAs. The -999999 could easily lead to errors in analyses. Should we change these to NA before adding them to our system? Or do we want to re-report as-is?

@nmdefries nmdefries added acquisition changes acquisition logic data quality labels Jan 4, 2024
@melange396
Copy link
Collaborator

theres presumably some significance to the distinction between that value and a null. perhaps we should contact the dataset curators for clarification to help us decide.

@brookslogan
Copy link
Contributor

brookslogan commented Feb 9, 2024

From docs:

Suppression is applied to the file for sums and averages less than four (4). In these cases, the field will be replaced with “-999,999”.

But I have also seen 0s in the data. So maybe --- for the sum I was looking at --- this is just for [1..3], or maybe it's inconsistently for [1..3] or [0..3] depending on facility/time/mood/etc. For other sums or other averages, I don't know if it's consistent.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
acquisition changes acquisition logic data quality
Projects
None yet
Development

No branches or pull requests

3 participants