Example code and data for "Practical Data Science with R, 2nd Edition" by Nina Zumel and John Mount, Manning 2019.

The code and data in this directory support examples from:

  • Chapter 7: Using Linear and Logistic Regression
  • Chapter 9: Exploring Advanced Methods
  • Chapter 10: Documentation and Deployment

Data

Data originally downloaded 4-25-2013 from http://www.cdc.gov/nchs/data_access/Vitalstatsonline.htm

  • ftp://ftp.cdc.gov/pub/Health_Statistics/NCHS/Dataset_Documentation/DVS/natality/UserGuide2010.pdf
  • Shasum: 12520b8384defc63a93fad957936fd7ee9a42ef3 UserGuide2010.pdf
  • ftp://ftp.cdc.gov/pub/Health_Statistics/NCHS/Datasets/DVS/natality/Nat2010us.zip
  • Shasum: dad8b9fc9e8b4a4d250b7febf10d4b97350e2651 Nat2010us.zip
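
The downloaded files can be checked against the digests listed above. The following is a minimal sketch in Python, assuming the two files have been saved under the names shown and that the digests are SHA-1 (they are 40 hex characters long, and SHA-1 is the default of the `shasum` tool). The helper names `sha1_of_file` and `verify` are illustrative and are not part of this repository.

```python
import hashlib
from pathlib import Path

# Digests copied from the list above (assumed to be SHA-1).
EXPECTED_SHA1 = {
    "UserGuide2010.pdf": "12520b8384defc63a93fad957936fd7ee9a42ef3",
    "Nat2010us.zip": "dad8b9fc9e8b4a4d250b7febf10d4b97350e2651",
}


def sha1_of_file(path, chunk_size=1 << 20):
    """Return the hex SHA-1 digest of a file, reading it in chunks."""
    digest = hashlib.sha1()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def verify(directory="."):
    """Map each expected file name to True/False (match) or None (missing)."""
    results = {}
    for name, expected in EXPECTED_SHA1.items():
        path = Path(directory) / name
        if path.is_file():
            results[name] = sha1_of_file(path) == expected
        else:
            results[name] = None
    return results


if __name__ == "__main__":
    labels = {True: "OK", False: "MISMATCH", None: "not found"}
    for name, ok in verify().items():
        print(f"{name}: {labels[ok]}")
```

Run from the directory that holds the downloads, the script reports each file as matching, mismatching, or missing. Reading in chunks keeps memory use small for the larger zip archive.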

Files:

  • README.html : this document
  • README.md : this document
  • UserGuide2010.pdf : original CDC user guide
  • NatalBirthData.rData : prepared saved data in R format (produced by prepBirthWeightData.R)
  • NatalRiskData.rData : prepared saved data in R format (produced by PrepNatalRiskData.R)
  • prepBirthWeightData.R : R steps to prepare birth weight data from natal2010Sample.tsv.gz
  • PrepNatalRiskData.R : R steps to prepare natality risk data from natal2010Sample.tsv.gz
  • gamSplinePlots.R : some plotting examples
  • natal2010Sample.tsv.gz : a uniform sample of the Natality risk data in gzipped TSV format (produced by procedures in loadExample directory)
  • loadExample : detailed procedures used to prepare the data

License for our additional documentation, notes, code, and example data:

This work is licensed under a Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International License.

No guarantee, indemnification or claim of fitness is made regarding any of these items.

No claim of license on works of others or derived data.