Skip to content

Agropontos Regex is a simple Python program that extracts geolocation coordinates from PDF files, like rural property registration documents.

License

Notifications You must be signed in to change notification settings

guinacio/AgropontosRegex

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

12 Commits
 
 
 
 
 
 
 
 
 
 

Repository files navigation

AgropontosRegex

Agropontos Regex is a small Python program that extracts geolocation coordinates from PDF files, eg.: rural property registration documents.

It works for two types of coordinates, UTM and Lat-Long. And generates a CSV file that can be imported directly to GIS software, like QGIS.

The program interface can be used like a notepad to correct any errors or wrong characters brought by the OCR scanning. It also generates a new PDF file correcting the page tilt and rotation.

Screenshot

Installation

You need to install the following packages for Windows:

I recommend using the Chocolatey package manager to install some of the following: (Run in an Administrator command prompt)

  • Python 3.8 (64-bit) or later
    • choco install python3
  • Tesseract 4.1.1 (64-bit) or later
  • Ghostscript 9.50 (64-bit) or later
    • choco install ghostscript
  • OCRmyPDF 14.2.0 (64-bit) or later
    • pip install ocrmypdf
  • pypdf 3.9.0 (64-bit) or later
    • pip install pypdf

About

Agropontos Regex is a simple Python program that extracts geolocation coordinates from PDF files, like rural property registration documents.

Topics

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages