Skip to content

A scraper built to pull information from the Senate Judiciary Website. This scrapper ultimately creates a map categorizing current federal court nominees based on their pro bono work experience.

Notifications You must be signed in to change notification settings

jessieblaeser/Federal-Court-Nominees-Scrape-and-Analysis

Repository files navigation

Analyzing Fed Court Nominees via Pro Bono Work

A scraper built to pull information from the Senate Judiciary Website.

This tool moves from the homepage of the Judiciary Committee to:

  • the meeting page
  • each past "Nominations" meeting
  • click and download each questionnaire form for each nominnee presented in each nomination meeting
  • read and parse each PDF, checking for inconsistencies
  • using regex, pull key information from each PDF including, name, school, nominated position, pro bono work experience and more
  • push this information into a Pandas dataframe
  • download that information into a CSV

Using textual analysis, I've created a categorization system for the type of (required) pro bono work completed by those appointed to the federal courts. Examples of categories include things like:

  • Criminal Justice
  • Child Protection
  • Discrimination and Human Rights

The final output of this scraping project is a GeoJSON choropleth map that displays pro bono work categories by district.

Notebooks:

  • Downloading nominee questionnaires: Using a combination of Beautiful Soup and Selenium, this notebook downloads over 50 PDFs from the Senate Judiciary Committee website. I use pdfminer to parse the PDFs and regex to pull information from each into a dataframe and CSV.
  • Creating pro bono categories and mapping: Using textual analysis, I categorize types of pro bono work, ultimately mapping the categories using GeoJSON.

Other uses:

If you have no interest at all in pro bono work, this code can still be used to:

  • Scrape the Senate Judiciary Committee's website
  • Download specific files
  • Pool information about nominees

About

A scraper built to pull information from the Senate Judiciary Website. This scrapper ultimately creates a map categorizing current federal court nominees based on their pro bono work experience.

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published