Skip to content

davebraze/i990-and-nih-exporter

Repository files navigation

README

The code in this repository is an exercise in working with public data from two main sources. The first source is IRS form 990, "Return of Organization Exempt from Income Tax." The 990 is an informational form that non-profit organizations (NPO) must file with the Internal Revenue Service (IRS) each year. The completed forms are publicly available records as a matter of federal law. The second source of information used here is details of individual grants from National Institutes of Health (NIH). The NIH Exporter gives access to information about all individual grants made by the NIH. That information includes the direct and indirect moneys paid to grantees each year for each grant.

NIH grant data can be downloaded from the NIH Exporter web tool. The data files are available for each fiscal year since 1985 in either CSV or XML format. I've chosen to do bulk downloads of NIH "Project" data files (in CSV format). See 01-load-data.R for details. Another option would have been to use the NIH REST API documented in a PDF entitled Reporter API Data Elements. But, I consider the bulk download approach I've used here to be simpler and more efficient. With the bulk data on my machine, I then do filtering and variable selection locally using regular dplyr operations. If you're more partial to API-based downloads, you might check out repoRter.nih, an R package by Michael Barr that was released in Feb. 2022. It's at an early stage of development and looks fairly bare-bones, but it will almost certainly save you some headaches.

A convenient place to get IRS 990s is from the ProPublica Nonprofit Explorer. It is necessary to search for the particular NPOs you are interested in, and individually download form 990s for each tax year of interest. The form for a specific NPO/tax-year will be available either as a scanned PDF, if it was filed as a paper document, or as an XML file, if it was filed electronically. All NPOs are required to file electronically from 2019 forward. Another option would be to use the IRS search and bulk download tools.

Possibly useful resources are IRS 990

Notes

Note that this repo does NOT contain the NIH or IRS data files themselves, since they are a bit large, and can be obtained elsewhere. Other public data referenced here includes:

About

Doodles with IRS i990 NPO data, together with NIH Exporter grant data.

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published