ccxzhang / scrapers-and-parsers


The collection of scrapers and parsers I have written.


Repository contents (31 commits):
Bipartisan-Index
FBpages
Paraguay
Selenium-Tutorial
Zhihu
.DS_Store
README.md


Web-scrapers

The following scrapers were written either as part of projects I worked on or just for fun.

Bipartisan-Index: scraper and raw data for the Lugar Center’s Bipartisan Index for the 116th Congress. The Code folder contains code to scrape House and Senate bills and to extract Congress members' personal details. The Data folder contains two JSON files extracted from the Biographical Directory of the United States Congress.
Paraguay: scraper for Paraguay's Comptroller General (Contraloría General de la República).
FBpages: scraper built on Selenium to extract information from public Facebook pages.
Zhihu: scraper that collects answers from Zhihu (a Chinese Q&A site similar to Quora), which loads its content via Ajax.
Selenium-Tutorial: presentation slides on Selenium, covering its common uses.
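
For readers new to Ajax-driven sites like the one the Zhihu scraper targets, the general idea can be sketched as follows. This is a minimal illustration, not the code in this repository: the function names `iter_ajax_items` and `fetch_with_requests` are hypothetical, and the response shape (`data` plus a `paging` object with `is_end` and `next`) is an assumption about how such an endpoint might paginate.

```python
from typing import Callable, Dict, Iterator, Optional


def iter_ajax_items(
    fetch_json: Callable[[str], Dict],
    start_url: str,
    max_pages: int = 50,
) -> Iterator[Dict]:
    """Yield items from a paginated JSON endpoint, following "next" links.

    Assumed (hypothetical) response shape:
        {"data": [...], "paging": {"is_end": bool, "next": "<url>"}}
    """
    url: Optional[str] = start_url
    for _ in range(max_pages):  # hard cap so a bad "next" link cannot loop forever
        if not url:
            break
        payload = fetch_json(url)
        yield from payload.get("data", [])
        paging = payload.get("paging", {})
        # Stop when the server reports the last page (or omits paging info).
        if paging.get("is_end", True):
            break
        url = paging.get("next")


def fetch_with_requests(url: str) -> Dict:
    """Fetch one page over HTTP; needs the third-party `requests` package."""
    import requests  # imported lazily so the paging logic runs without it

    response = requests.get(
        url, headers={"User-Agent": "Mozilla/5.0"}, timeout=10
    )
    response.raise_for_status()
    return response.json()
```

A call such as `list(iter_ajax_items(fetch_with_requests, "https://example.com/api/answers?offset=0"))` would gather every item across pages; the URL here is a placeholder. Passing the fetch function in as an argument keeps the paging logic separate from the HTTP layer, so it can be exercised with canned responses.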


Topics: python, selenium, ajax, requests, pdfminer, pdfplumber
