Skip to content

1.2.0 Release (1.2.0.18022700)

Compare
Choose a tag to compare
@jazzido jazzido released this 27 Feb 20:26
· 602 commits to master since this release

tabula 1.2.0 / tabula-java 1.0.1

Tabula 1.2.0 includes an update of our processing backend (tabula-java 1.0.1), which improves table detection and extraction and fixes several bugs. Special thanks to Shuttleworth Foundation for funding @melisabok's port of tabula-java to PDFBox 2.0. This release includes two additional new features: templates and the Repeat to Next Page button.

This page contains technical release notes; please visit the Tabula official homepage for an overview of Tabula and quick download links:
http://tabula.technology/


If you have any issues with this version of Tabula, please let us know!

New features

  • Templates Now you can create selections, save them, and reload those same templates for another PDF. Templates are stored in a Template Library located below the Document Library. You can also export those selections to JSON and load them into Tabula on another computer (or use them as input for Tabula-Java). (#711)
  • Repeat to Next Page button: Now you can repeat a selection onto only the next page. (#715)

Bug fixes

  • Make Tabula work under a non-root path (#110, #547, #586)
  • Escape HTML (b29806a)
  • Use JBundler for managing Java dependencies.
  • Fixed a bug causing duplicate object IDs (cef4483)
  • Fixed: Windows throws when trying to mv a Tempfile (40e1d05)
  • Upgraded build to use jruby-9.1.15.0
  • Other fixes & improvements.

Known Issues & Caveats

  • Note: If you are using the .jar version (for Linux/etc), you now need to open your browser to the Tabula page (http://127.0.0.1:8080/) manually.
  • Caveat: Tabula only works on text-based PDFs (ones where you can select text). Scanned documents do not work, and we do not recommend OCR for large files unless you have a data cleaning plan since even state-of-the-art OCR software can have significant error rates.
  • _OS X Gatekeeper_: If you’re running Mac OS X 10.8 or later and get a message that says "Tabula can't be opened because it is from an unidentified developer" message, please let us know. (It shouldn't be happening anymore.) See this GateKeeper page for more assistance — your "Allow applications downloaded from" setting should be set to "Mac App Store and identified developers" or "Anywhere."

Downloads

Windows & Linux users need to have Java installed to use Tabula. You can download Java here. The Mac version contains an integrated copy of Java.

Verification:

SHA-256

$ shasum -a256 tabula*.zip
2dfaa7ce41f6a0b0113b5143619236e6dfbe18d11d05827ecaec15424150c969  tabula-jar-1.2.0.zip
14de2ac043fa6bb1180eb2217c3405e49c0f01cc2142a422e880522a30374e67  tabula-mac-1.2.0.zip
b3adcb71ec6224298609152e151cfe3ceaacdda7237c4dafd3c001fe382fd54c  tabula-win-1.2.0.zip