kjam/europarl_scraper

europarl_scraper

A Python Scrapy project for scraping data from the European Parliament's website.

Installation

pip install -r requirements.txt

Just give me the data!

It's on S3 in a public bucket!

To run:

  • First, grab the start URLs: run python get_urls.py

  • Then, run any of the scrapers:

scrapy crawl europarl_speeches -o data/speeches.csv
scrapy crawl europarl_debates -o data/debates.csv
scrapy crawl europarl_speakers -o data/speakers.csv
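Once a crawl has finished, a quick sanity check on the output can be useful. The sketch below is not part of this repo: the helper name summarize_csv is hypothetical, and it assumes only that the -o flag produced a CSV file with a header row. It makes no assumption about which columns the scrapers emit.

```python
import csv
from pathlib import Path


def summarize_csv(path):
    """Return (column_names, row_count) for a CSV file with a header row.

    Hypothetical helper for illustration; not part of europarl_scraper.
    """
    with Path(path).open(newline="", encoding="utf-8") as handle:
        reader = csv.DictReader(handle)
        # Count data rows; the header row is consumed by DictReader itself.
        row_count = sum(1 for _ in reader)
        # fieldnames is None for a completely empty file, so fall back to [].
        columns = list(reader.fieldnames or [])
    return columns, row_count


if __name__ == "__main__":
    # Paths taken from the crawl commands above; skipped if not yet created.
    for name in ("speeches", "debates", "speakers"):
        output = Path("data") / f"{name}.csv"
        if output.exists():
            columns, rows = summarize_csv(output)
            print(f"{output}: {rows} rows, columns: {', '.join(columns)}")
```

Run it from the project root after a crawl. Files that do not exist yet are skipped silently.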

Notes

There are still many TODOs for this project, so please be patient.

Questions?

Feel free to reach out on Twitter or Freenode (@kjam).
