Skip to content

coletl/geocode

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

50 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Approximate Geocoding

Social scientists have rapidly seized on the research potential of GIS data. However, the analyses that spatial data afford almost always require some amount of record linkage to join quantities of interest to the relevant spatial covariates. In locations with a nascent spatial data industry, researchers’ typical problems managing messy data are compounded with the difficulties of working with imprecise or unstandardized GIS files.

We outline methods of record linkage specialized to the context of spatial data. We use these techniques to assign geographic coordinates to all Kenyan polling stations from 1997 to 2013. Through a generalizable example of this automated geocoding process, we provide what we hope to be a helpful introduction to rigorous, systematic record linkage in the world of messy spatial data.

To retrieve the main document:

Please clone or download this repository to your local machine and open code/doc.html in a web browser. You may also read the guide online.

To reproduce our results, clone or download the repository. All the files you need are included. Opening geocode.Rproj in RStudio will automatically set your working directory, so you can run the code without editing any file paths.

Please direct correspondence to Cole Tanigawa-Lau at coletl@nyu.edu.