Skip to content

chinanu9a/PDF-Scrapping

Repository files navigation

PDF Scrapping

Goal: Creates a JSON file from data scrapped in a PDF file.

The routes.py script scraps a PDF file looking for the highway routes trucks are permitted to take in a state.

Getting Started

These instructions will get you a copy of the project up and running on your local machine for development and testing purposes. See deployment for notes on how to deploy the project on a live system.

Requirements

You need Python 3.4 or later. You can have multiple Python versions (2.x and 3.x) installed on the same system without problems.

In Ubuntu, Mint and Debian you can install Python 3 like this:

$ sudo apt-get install python3 python3-pip

For other Linux flavors, OS X and Windows, packages are available at

http://www.python.org/getit/

Quick start

You can always use a Python interpreter to run your statically typed programs on a command-line, even if they have type errors, using:

$ python3 PROGRAM

Prerequisites

  • StringIO Main Library

  • re Main Library

  • json Main Library

  • pdfminer.six Install package using:

      $ python3 -m pip install pdfminer.six
    

Authors

  • Chinanu Onyekachi

About

No description or website provided.

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages