From using xpdf, rvest, and quanteda on United Nations Digital Library search results to applying dictionaries to speeches in United Nations meeting records
Updated Apr 16, 2019 - R
Patches for the Xpdf toolkit (version 4.05) to enable TIFF output and more
"Documents Search Engine" based on Lucene for indexing and searching many types of documents
A tool for converting PDF files to TXT format using the xpdf tool.
Pdf2xNet is a .NET library for seamless integration with Xpdf tools, enabling easy conversion of PDF documents to text, images, and HTML formats within your .NET applications.
A collection of Java APIs for Xpdf - the open source library for operating on PDF files.
This is a highly efficient Python wrapper for tesseract-ocr.
Static library built from the www.xpdfreader.com sources, with most dependencies built in
🚀 Make your PDF press-ready PDF/X-1a.
Batch-convert PDFs to text and extract data from PDFs in Python
Fast and memory-efficient Python PDF Parser based on xpdf sources