README

This Github repository contains code and data to reproduce results in the article:

Robert Kubinec, Luiz Max Carvalho, Joan Barceló, Cindy Cheng, Luca Messerschmidt and Matthew Sean Cottrell. "A Bayesian latent variable model for the optimal identification of disease incidence rates given information constraints." Journal of the Royal Statistical Society Series A: Statistics in Society. 2024. https://doi.org/10.1093/jrsssa/qnae040 .

A brief description of the files is found below. If you have any questions about the information in the repo, please contact Bob Kubinec at bobkubinec@gmail.com.

First, note that the paper relies on a fitted cmdstanr model to reproduce results. These model fits are too big to store on Github, but you can access them from this Google drive folder and place them in the data sub-folder to reproduce results without fitting models (may take up to a few days):

https://drive.google.com/drive/folders/1hVzD_qL1CnOkTkwI6VH1PEgC1LK44RS1?usp=sharing

Paper files:

kubinec_model_preprint.Rmd: This file contains the text and embedded R code to reproduce the figures and tables in the paper.
kubinec_model_SI.Rmd: This file contains the text and embedded R code to reproduce the supplementary information.

Code:

corona_tscs_betab_mix_prior_v2.stan This Stan file contains the code to fit the model described in the paper using Stan (specifically, cmdstan accessed via the cmdstanr package). See code in the kubinec_model_preprint.Rmd file to see how to fit the model from R.
estimate_beta_priors_v2.stan: This Stan file calculates the uncertainty of the empirical distributions of the estimates of the expert survey about COVID-19 incidence in the early pandemic period.

Data:

data/combined.rds: the combined dataset with COVID-19 cases, tests, Census data and expert and serology survey data
nyt_data.rds, goog_mobile.rds and tests.rds: New York Times (reported cases), Google mobility data and testing data for the time period described in the paper. Note that the paper code can download these from Github repositories, but these sources may no longer be available.
data/simulation/: contains masking and Civiqs polls about COVID-19 related fears and behaviors.
data/covid_amp_state_policy_data.xlsx: contains COVID-AMP state-level policy data as described in the paper
count_pol_covidamp.rds: Aggregated form of COVID-AMP data to the state level as a count of policies.
cdc_sample_sizes.csv: CDC serology surveys
data/consensusForecastsDB.csv": expert survey of epidemiologists during the early pandemic period
data/rhat_summaries*.rds Rhat summaries for different models as reported in the paper.

Name		Name	Last commit message	Last commit date
Latest commit History 69 Commits
data		data
prior_sensitivity_experiments		prior_sensitivity_experiments
.gitignore		.gitignore
BibTexDatabase.bib		BibTexDatabase.bib
Nationwide_Commercial_Laboratory_Seroprevalence_Survey.csv		Nationwide_Commercial_Laboratory_Seroprevalence_Survey.csv
README.md		README.md
all_cdc_sero.csv		all_cdc_sero.csv
author-info-blocks.lua		author-info-blocks.lua
basic_oval.tikzstyles		basic_oval.tikzstyles
cases_matrix.rds		cases_matrix.rds
cdc_sample_sizes.csv		cdc_sample_sizes.csv
cdc_sero.xlsx		cdc_sero.xlsx
combined.rds		combined.rds
corona_tscs_betab_mix_prior_v2.stan		corona_tscs_betab_mix_prior_v2.stan
coronanet_data.csv		coronanet_data.csv
count_pol.rds		count_pol.rds
count_pol_covidamp.rds		count_pol_covidamp.rds
estimate_beta_priors_v2.stan		estimate_beta_priors_v2.stan
goog_mobile.rds		goog_mobile.rds
just_us_paper.csv		just_us_paper.csv
kubinec_model_SI.Rmd		kubinec_model_SI.Rmd
kubinec_model_SI.pdf		kubinec_model_SI.pdf
kubinec_model_preprint.Rmd		kubinec_model_preprint.Rmd
kubinec_model_preprint.pdf		kubinec_model_preprint.pdf
luca_scraping_code.R		luca_scraping_code.R
nyt_data.rds		nyt_data.rds
percap.rds		percap.rds
policy_dag.tikz		policy_dag.tikz
policy_dag_mediate.tikz		policy_dag_mediate.tikz
preamble.tex		preamble.tex
preamble2.tex		preamble2.tex
preamble_SI.tex		preamble_SI.tex
real_data.rds		real_data.rds
scholarly-metadata.lua		scholarly-metadata.lua
science.csl		science.csl
serology.rds		serology.rds
serology_real.rds		serology_real.rds
solid_circle.tikzstyles		solid_circle.tikzstyles
test_data.rds		test_data.rds
tests.rds		tests.rds
tikzit.sty		tikzit.sty
world_data_estimate.R		world_data_estimate.R

CoronaNetDataScience/covid_model

Folders and files

Latest commit

History

Repository files navigation

README

Paper files:

Code:

Data:

About

Topics

Resources

Stars

Watchers

Forks

Languages