Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Refactor data in us_* datapackages #85

Open
repentsinner opened this issue Jul 3, 2020 · 0 comments
Open

Refactor data in us_* datapackages #85

repentsinner opened this issue Jul 3, 2020 · 0 comments

Comments

@repentsinner
Copy link

repentsinner commented Jul 3, 2020

As a user of the covid-19 datapackage, I want to be able to be able to more easily calculate incidence rates of confirmed cases.

Currently the us_*.csv files include lat/long info (raised in #1) as well as a variety of other identifiers that don't appear to ever change with the time series.

In addition, the us_deaths file contains a population field which can be helpful to calculate incidence rates rather than absolute counts, but the us_confirmed file is missing this population field.

It would be great if there was a us_counties file that used the same UID or FIPS/INCITS 38:2009 to provide this non-changing data in a more uniform way for further processing.

Note: it appears as though this issue is due to directly re-packaging the CSSE data as a datapackage, rather than being opinionated about how that data might usefully be presented/consumed.

Thanks for considering!

Gwardii pushed a commit to Gwardii/covid-19 that referenced this issue Mar 17, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant