Skip to content

zejn/pypdf2xml

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

28 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

pypdf2xml

This project started as an alternative to poppler's pdftoxml, which didn't properly decode CID Type2 fonts in PDFs. This script requires pdfminer.

License

Public domain.

About

Convert text from PDF to XML.

Resources

Stars

Watchers

Forks

Packages

No packages published

Languages