Daily refreshed data on representation certification and unfair labor cases from nlrb.gov. Data is updated nightly.
This repository contains code to build and update a database of representation certification and unfair labor cases published on the National Labor Relations Board's website. This diagram shows the different tables and variables contained in the database.
- The data starts around 2010. See https://github.com/labordata/nlrb-cats/ for data from the previous system
- The update process starts by downloading a CSV of all the cases within a specified time period from the nlrb.gov website. However, repeatingly requesting the same information from nlrb.gov results in spreadsheets with slightly different contents. We might be a bit out of date if the last CSV we downloaded missed a recent case.
System requirements:
- wget
- unzip
- sqlite3>=3.3
- python>=3
- jq
- libxml2
- libxsl
- grep
- cat
- tr
pip install -r requirements.txt
make