# Install the gem dependencies (requires JRuby)
bundle install
# Run with the example schema
bundle exec jruby page_extractor.rb
A JRuby tool that extracts text and tables from PDFs using Mozilla's tabula-extract or plain OCR.
noahpryor/pdflib