Quick Navigation

About

A scraper application that crawls press releases, hearings, markups, and bills from the US Congress, industry associations, and think tanks for analytical purposes.

Time Range: Content from the past week (for most sources) and all content published from then on.

Export Format: CSV, US Government, Think Tanks

Note: For easier navigation, think tank press content is located on a separate page from the US Government releases.

Contents

Think Tanks

US Congress

US Senate

US Senate Committees

US House

US House Committees

US Republican Committees

Industry

How to run/update

  1. Clone the repository.
  2. Run ./script.bash in the terminal.
  3. Use crontab (Mac/Linux) or Task Scheduler (Windows) to set up a schedule that runs the scraping job automatically.
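
For the scheduling step on Mac/Linux, a crontab entry might look like the sketch below. The clone location (`$HOME/scraper`), the daily 06:00 schedule, and the `scrape.log` file name are all assumptions chosen for illustration, not settings from this repository; adjust them to your setup.

```shell
#!/usr/bin/env bash
# Hypothetical clone location; replace with wherever the repository lives.
REPO_DIR="$HOME/scraper"

# Cron fields: minute hour day-of-month month day-of-week.
# "0 6 * * *" runs every day at 06:00 (an assumed schedule).
# Output is appended to a log file (hypothetical name) for later inspection.
CRON_LINE="0 6 * * * cd \"$REPO_DIR\" && ./script.bash >> \"$REPO_DIR/scrape.log\" 2>&1"

# Print the entry so it can be reviewed before installing it.
echo "$CRON_LINE"

# To install it (appends to the current user's crontab), uncomment:
# ( crontab -l 2>/dev/null; echo "$CRON_LINE" ) | crontab -
```

Changing into the repository directory first keeps any relative paths used by the script working, since cron jobs otherwise start in the user's home directory.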