Skip to content

TonyxSun/PressScraper

Repository files navigation

GitHub last commit Website

Quick Navigation

About

A scraper application for crawling US Congress, industry associations, and think tanks press releases, hearings, markups, and bills for analytical purposes.

Time Range: Past content within one week (for most sources) and all future content.

Export Format: CSV, US Government, Think Tanks

Note: For easier navigation, think tank press content are located on a seperate page from the US Government releases.

Contents

Think Tanks

US Congress

US Senate

US Senate Committees

US House

US House Committees

US Republican Committees

Industry

How to run/update

  1. Clone repository.
  2. Run ./script.bash in the terminal.
  3. Using Crontab(Mac/Linux) or Task Scheduler(Windows), set up execution schedule to automatically run scraping job.

About

A scraper application for crawling press releases of US government agencies and major think tanks for analytical purposes.

Topics

Resources

Stars

Watchers

Forks