Skip to content

GoC-Spending/goc-spending-mini

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

48 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

goc-spending-mini

Work-in-progress code to scrape and then parse contracting data from departments' Proactive Disclosure website.

Dependencies

  1. PHP 5.7+
  2. Composer, which can be downloaded from https://getcomposer.org/download/

Install instructions

  1. Clone the repository.
  2. In the folder, run composer to install the "Guzzle" dependency with, composer update

You're ready to go!

Scraping departments

The scrapers are located in contracts-scraper.php, which can be run with composer run-script scrape

By default, it will download 2 quarters and 2 contract files from each department that has a scraper function.

Parsing departments

Parsing data - to extract data from the HTML files downloaded with the scraper - are located in contracts-parser.php, which can be run with composer run-script parse

Contributing

To keep track of which departments are scraped/parsed, check out this spreadsheet.

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages