ICA Conference Site Scraper

Author: Rodrigo Zamith
Version: 1.0

Usage

Just edit the 'start_url' variable in downloader.py and execute the scripts in the following order:

downloader.py (Downloads each page from the online conference program)
scraper.py (Scrapes the locally-stored pages)
longform.py (Converts the data into long-form (each author as a case), which facilitates certain analyses in R)

Requirements

This script requires Python, as well as the urllib2 and BeautifulSoup libraries.

License

This script is licensed under the Mozilla Public License Version 2.0 (see LICENSE file in root folder). TL;DR: feel free to use it commercially, modify it, and distribute it, provided you disclose both the source code and any moditations you make to it. Attribution, where appropriate, is appreciated.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

pages

pages

LICENSE

LICENSE

README.md

README.md

downloader.py

downloader.py

longform.py

longform.py

scraper.py

scraper.py

Repository files navigation

ICA Conference Site Scraper

Usage

Requirements

License

About

Releases

Packages

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 2 Commits
pages		pages
LICENSE		LICENSE
README.md		README.md
downloader.py		downloader.py
longform.py		longform.py
scraper.py		scraper.py

License

rodzam/ica_scraper

Folders and files

Latest commit

History

Repository files navigation

ICA Conference Site Scraper

Usage

Requirements

License

About

Resources

License

Stars

Watchers

Forks

Languages