Add NCHS mortality geo aggregation at the HHS and nation levels #1243

alexcoda · 2021-09-14T02:48:22Z

Description

Add NCHS mortality geo aggregation at the HHS and nation levels.

Note that because one of the columns being aggregated is a percentage, we're doing a weighted sum with the geomapper

Also there are some pylint disables for too many branches, this is because of the slight difference in logic for handling the percentage vs. count variables. This could be refactored later for the entire run.py function, but is not in scope for this change.

Fixes

Closes Make NCHS data available at HHS, nation level #1041
Closes NCHS data available at HHS, nation level #1213 which this PR was based off of (because I'm on a fork, I couldn't push directly to that open PR)

alexcoda · 2021-09-14T02:58:04Z

@krivard - ready for review!

krivard

As a bonus, resolving the code-duplication questions may get the linter to hush.
I could be convinced otherwise though if I've misread what's going on, wow that is a monster.

krivard · 2021-09-14T13:57:47Z

nchs_mortality/delphi_nchs_mortality/run.py

@@ -18,7 +19,7 @@
 from .pull import pull_nchs_mortality_data


-def run_module(params: Dict[str, Any]):
+def run_module(params: Dict[str, Any]):  # pylint: disable=too-many-branches, too-many-statements


is this better than splitting it up into more-specific functions?

Definitely not, just pushed some changes to pull out some helper methods!

krivard · 2021-09-14T14:10:36Z

nchs_mortality/delphi_nchs_mortality/run.py

                df = df_pull.copy()
+                df["se"] = np.nan
+                df["sample_size"] = np.nan


this can be pulled out of the for loop

actually it can probably be pulled out of the if block as well

krivard · 2021-09-14T14:11:34Z

nchs_mortality/delphi_nchs_mortality/run.py

+                if geo in ["hhs", "nation"]:
+                    df = gmpr.replace_geocode(
+                        df, "state_id", "state_code", from_col="geo_id", date_col="timestamp")
+                    df = gmpr.replace_geocode(
+                        df, "state_code", geo, date_col="timestamp").rename(columns={geo: "geo_id"})


can we reduce some of the duplication with percent_of_expected_deaths here at all?

alexcoda · 2021-09-17T02:15:45Z

nchs_mortality/.pylintrc

@@ -4,6 +4,7 @@
 disable=logging-format-interpolation,
    too-many-locals,
    too-many-arguments,
+    fixme,


Added this for a TODO I left in the code. Let me know if this is something we'd want to keep consistent across all indicators and I can add it to others.

zhuoran-Cheng16 and others added 4 commits August 20, 2021 13:31

NCHS data available at HHS, nation level

59f5c22

Merge branch 'main' into nchs_geo_res

3392709

Aggregate nchs mortality data at the hhs and nation levels

70bd670

pylint disable

ed6e1a4

krivard self-requested a review September 14, 2021 13:55

krivard reviewed Sep 14, 2021

View reviewed changes

Pull out repeated functionality into helper methods

b418496

alexcoda requested a review from krivard September 17, 2021 02:10

alexcoda commented Sep 17, 2021

View reviewed changes

krivard mentioned this pull request Sep 20, 2021

Alternate: Add NCHS mortality geo aggregation at the HHS and nation levels #1258

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add NCHS mortality geo aggregation at the HHS and nation levels #1243

Add NCHS mortality geo aggregation at the HHS and nation levels #1243

alexcoda commented Sep 14, 2021 •

edited

alexcoda commented Sep 14, 2021

krivard left a comment

krivard Sep 14, 2021

alexcoda Sep 17, 2021

krivard Sep 14, 2021

krivard Sep 14, 2021

krivard Sep 14, 2021

alexcoda Sep 17, 2021

Add NCHS mortality geo aggregation at the HHS and nation levels #1243

Are you sure you want to change the base?

Add NCHS mortality geo aggregation at the HHS and nation levels #1243

Conversation

alexcoda commented Sep 14, 2021 • edited

Description

Fixes

alexcoda commented Sep 14, 2021

krivard left a comment

Choose a reason for hiding this comment

krivard Sep 14, 2021

Choose a reason for hiding this comment

alexcoda Sep 17, 2021

Choose a reason for hiding this comment

krivard Sep 14, 2021

Choose a reason for hiding this comment

krivard Sep 14, 2021

Choose a reason for hiding this comment

krivard Sep 14, 2021

Choose a reason for hiding this comment

alexcoda Sep 17, 2021

Choose a reason for hiding this comment

alexcoda commented Sep 14, 2021 •

edited