Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Data for Massachusetts displaying weird #213

Open
stephen304 opened this issue Sep 5, 2020 · 7 comments
Open

Data for Massachusetts displaying weird #213

stephen304 opened this issue Sep 5, 2020 · 7 comments

Comments

@stephen304
Copy link

stephen304 commented Sep 5, 2020

Looking at the data, it seems normal (all around 120k), so maybe this is just a bug in the graph. It looks like the line is going to negative infinity (I can't find the end) @aatishb

image

@drlaurenwasson
Copy link

Came here to leave the same comment. Thanks for beating me to it!

@DesiOtaku
Copy link

I tried to see what could be causing the bug but I can't seem to find it.

Only lead I have so far is that it is downloading the data directly from https://raw.githubusercontent.com/nytimes/covid-19-data/master/us-states.csv and the Massachusetts data looks fine there.

@stephen304
Copy link
Author

The data looks extra weird today, it looks like the cumulative confirmed cases must have decreased in the data, as Massachusetts moved left on the x axis:

image
@aatishb

@aatishb
Copy link
Owner

aatishb commented Sep 10, 2020

Hi all, Thanks for filing and looking into this. It looks like Massachusettes changed the way their data is being reported which resulted in fewer case counts. It's a bit easier to see on the linear scale view.

nytimes/covid-19-data#447
https://www.nytimes.com/interactive/2020/us/massachusetts-coronavirus-cases.html#anomaly-notes
https://www.mass.gov/doc/covid-19-dashboard-september-2-2020/download

Since it's a data source / methodology issue we don't really have a great way of dealing with this on our end. We can wait and see if NYT decides to retroactively update their data to account for the methodology.

@stephen304
Copy link
Author

stephen304 commented Sep 10, 2020

@aatishb Does that also explain the graph not showing any points for 9/2-9/8? There should be data for those days but no points exist on the graph

Edit: Just realized that I was looking at the wrong number in the data - I guess the negative change in the data is what caused the missing points

@aatishb
Copy link
Owner

aatishb commented Sep 10, 2020

@stephen304 Exactly. It seems the large negative change in numbers on 9/2 led to the 'change in the previous week' as being negative for the entire next week (9/2 - 9/8).

A potential workaround might be to drop negative values before graphing, that way the graph would instead show a jump from 9/1 to 9/9. However if this is the only place this is occurring it might be worth waiting to see if they update the data source first.

@vjandrea
Copy link

Hello, the graph looks bizarre for France:

image

and Spain shows a drop:

image

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

5 participants