GitHub - dsquintana/synthpop-primer: A primer on using the 'synthpop' package for the biobehavioral sciences

Synthetic datasets: A non-technical primer for the biobehavioural sciences to promote reproducibility and hypothesis-generation

Synthetic datasets are an emerging method originally developed to permit the sharing of confidential census data. Synthetic datasets mimic real datasets by preserving their statistical properties and the relationships between variables. Importantly, this method also reduces disclosure risk to essentially nil as no record in the synthetic dataset represents a real individual. This is the accompanying R script for my primer manuscript, which enables scholars to create synthetic datasets and assess their utility via the synthpop R package. By sharing synthetic datasets that mimic original datasets that could not otherwise be made open, researchers can ensure the reproducibility of their results and facilitate data exploration while maintaining participant privacy.

Run the analysis in your web browser

To launch a RStudio server instance and run my analysis scripts online, click here or on the "Launch Binder" badge below.

Once the Rstudio server instance has loaded, run the commands in the "R_script.R" file.

Due to resource constraints of the RStudio server instance, the scripts that create Supplementary Figures 1-3 described in the primer manuscript could not be included. These scripts can be found on the manuscript's Open Science Framework page.

Run this analysis locally

To run the analysis locally in RStudio, download this repository as a zipped file. The R version and package versions are noted in the sessionInfo.txt file.

Name		Name	Last commit message	Last commit date
Latest commit History 30 Commits
.binder		.binder
.gitignore		.gitignore
DESCRIPTION		DESCRIPTION
LICENSE		LICENSE
README.html		README.html
README.md		README.md
R_script.R		R_script.R
ot_dat.csv		ot_dat.csv
sessionInfo.txt		sessionInfo.txt
socio.csv		socio.csv
synthpop-primer.Rproj		synthpop-primer.Rproj

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

.binder

.binder

.gitignore

.gitignore

DESCRIPTION

DESCRIPTION

LICENSE

LICENSE

README.html

README.html

README.md

README.md

R_script.R

R_script.R

ot_dat.csv

ot_dat.csv

sessionInfo.txt

sessionInfo.txt

socio.csv

socio.csv

synthpop-primer.Rproj

synthpop-primer.Rproj

Repository files navigation

Synthetic datasets: A non-technical primer for the biobehavioural sciences to promote reproducibility and hypothesis-generation

Run the analysis in your web browser

Run this analysis locally

About

Releases 2

Packages

Languages

License

dsquintana/synthpop-primer

Folders and files

Latest commit

History

Repository files navigation

Synthetic datasets: A non-technical primer for the biobehavioural sciences to promote reproducibility and hypothesis-generation

Run the analysis in your web browser

Run this analysis locally

About

Resources

License

Stars

Watchers

Forks

Languages