Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Bump vers.tika from 1.13 to 1.22 #494

Open
wants to merge 1 commit into
base: master
Choose a base branch
from

Conversation

dependabot[bot]
Copy link

@dependabot dependabot bot commented on behalf of github Oct 18, 2019

Bumps vers.tika from 1.13 to 1.22.

Updates tika-xmp from 1.13 to 1.22

Changelog

Sourced from tika-xmp's changelog.

Release 2.0.0 - ???
BREAKING CHANGES in 2.0.0

  • Remove deprecated Metadata keys/properties (TIKA-1974).

Other changes

Release 1.23

  • Upgrade to PDFBox 2.0.17

Release 1.22 - ???

  • NOTE: Known regression: PDFBOX-4587 -- PDF passwords with codepoints
    between 0xF000 and 0XF0000 will cause an exception.

  • Add parser for HWP v5 files via SooMyung Lee (soomyung) and
    JinSup Kim (ddoleye) (TIKA-2909).

  • Fix order of closing streams to avoid "Failed to close temporary resource"
    exception (TIKA-2908).

  • Improve AutoDetectReader performance by caching encoding
    detector (TIKA-1568).

  • Prevent RTFParser from outputting illegal tag combinations (TIKA-2889).

  • Fix RereadableInputStream to release all resources (TIKA-2903).

  • Implement custom language identifier in the tika-eval module based on
    OpenNLP's language detector; add 18 languages and add common words
    lists for all 121 languages (TIKA-2790).

  • Fix NPE in MimeTypesReader.releaseParser() via Eamonn Saunders (TIKA-2896).

  • Fix RTFParser to extract more content (TIKA-2883).

  • Add clientSubmitTime to the metadata extracted from PST files (TIKA-2898).

  • Improve StreamingZipContainerDetector for xltx, xltm and
    several other file formats (TIKA-2886).

Release 1.21 - 05/14/2019

  • Add optional AUTO mode to OCR'ing of PDFs. If tesseract is installed
    and on the path, and this option is selected programmatically
    or via TikaConfig(), the PDFParser will use heuristics to decide
    whether or not to run OCR per page on PDFs. (TIKA-2749)

  • The ZipContainerDetector's default behavior was changed to run
    streaming detection up to its markLimit. Users can get the

... (truncated)
Commits
  • aa2a385 [maven-release-plugin] prepare release 1.22-rc4
  • de0fca9 roll back for rc#4...update date
  • 4db132e roll back for rc#4
  • c5daaf4 Merge remote-tracking branch 'origin/branch_1x' into branch_1x
  • 357c163 include opennlp lang model in tika-eval during assembly
  • 0f3790e [maven-release-plugin] prepare for next development iteration
  • c23f47e [maven-release-plugin] prepare release 1.23-rc3
  • c25b81d Merge remote-tracking branch 'origin/branch_1x' into branch_1x
  • fd40040 roll back for rc#3, again...
  • 950ee35 [maven-release-plugin] prepare for next development iteration
  • Additional commits viewable in compare view

Updates tika-core from 1.13 to 1.22

Changelog

Sourced from tika-core's changelog.

Release 2.0.0 - ???
BREAKING CHANGES in 2.0.0

  • Remove deprecated Metadata keys/properties (TIKA-1974).

Other changes

Release 1.23

  • Upgrade to PDFBox 2.0.17

Release 1.22 - ???

  • NOTE: Known regression: PDFBOX-4587 -- PDF passwords with codepoints
    between 0xF000 and 0XF0000 will cause an exception.

  • Add parser for HWP v5 files via SooMyung Lee (soomyung) and
    JinSup Kim (ddoleye) (TIKA-2909).

  • Fix order of closing streams to avoid "Failed to close temporary resource"
    exception (TIKA-2908).

  • Improve AutoDetectReader performance by caching encoding
    detector (TIKA-1568).

  • Prevent RTFParser from outputting illegal tag combinations (TIKA-2889).

  • Fix RereadableInputStream to release all resources (TIKA-2903).

  • Implement custom language identifier in the tika-eval module based on
    OpenNLP's language detector; add 18 languages and add common words
    lists for all 121 languages (TIKA-2790).

  • Fix NPE in MimeTypesReader.releaseParser() via Eamonn Saunders (TIKA-2896).

  • Fix RTFParser to extract more content (TIKA-2883).

  • Add clientSubmitTime to the metadata extracted from PST files (TIKA-2898).

  • Improve StreamingZipContainerDetector for xltx, xltm and
    several other file formats (TIKA-2886).

Release 1.21 - 05/14/2019

  • Add optional AUTO mode to OCR'ing of PDFs. If tesseract is installed
    and on the path, and this option is selected programmatically
    or via TikaConfig(), the PDFParser will use heuristics to decide
    whether or not to run OCR per page on PDFs. (TIKA-2749)

  • The ZipContainerDetector's default behavior was changed to run
    streaming detection up to its markLimit. Users can get the

... (truncated)
Commits
  • aa2a385 [maven-release-plugin] prepare release 1.22-rc4
  • de0fca9 roll back for rc#4...update date
  • 4db132e roll back for rc#4
  • c5daaf4 Merge remote-tracking branch 'origin/branch_1x' into branch_1x
  • 357c163 include opennlp lang model in tika-eval during assembly
  • 0f3790e [maven-release-plugin] prepare for next development iteration
  • c23f47e [maven-release-plugin] prepare release 1.23-rc3
  • c25b81d Merge remote-tracking branch 'origin/branch_1x' into branch_1x
  • fd40040 roll back for rc#3, again...
  • 950ee35 [maven-release-plugin] prepare for next development iteration
  • Additional commits viewable in compare view

Dependabot will resolve any conflicts with this PR as long as you don't alter it yourself. You can also trigger a rebase manually by commenting @dependabot rebase.


Dependabot commands and options

You can trigger Dependabot actions by commenting on this PR:

  • @dependabot rebase will rebase this PR
  • @dependabot recreate will recreate this PR, overwriting any edits that have been made to it
  • @dependabot merge will merge this PR after your CI passes on it
  • @dependabot squash and merge will squash and merge this PR after your CI passes on it
  • @dependabot cancel merge will cancel a previously requested merge and block automerging
  • @dependabot reopen will reopen this PR if it is closed
  • @dependabot ignore this [patch|minor|major] version will close this PR and stop Dependabot creating any more for this minor/major version (unless you reopen the PR or upgrade to it yourself)
  • @dependabot ignore this dependency will close this PR and stop Dependabot creating any more for this dependency (unless you reopen the PR or upgrade to it yourself)
  • @dependabot use these labels will set the current labels as the default for future PRs for this repo and language
  • @dependabot use these reviewers will set the current reviewers as the default for future PRs for this repo and language
  • @dependabot use these assignees will set the current assignees as the default for future PRs for this repo and language
  • @dependabot use this milestone will set the current milestone as the default for future PRs for this repo and language

You can disable automated security fix PRs for this repo from the Security Alerts page.

Bumps `vers.tika` from 1.13 to 1.22.

Updates `tika-xmp` from 1.13 to 1.22
- [Release notes](https://github.com/apache/tika/releases)
- [Changelog](https://github.com/apache/tika/blob/master/CHANGES.txt)
- [Commits](apache/tika@1.13...1.22)

Updates `tika-core` from 1.13 to 1.22
- [Release notes](https://github.com/apache/tika/releases)
- [Changelog](https://github.com/apache/tika/blob/master/CHANGES.txt)
- [Commits](apache/tika@1.13...1.22)

Signed-off-by: dependabot[bot] <support@github.com>
@dependabot dependabot bot added the dependencies Pull requests that update a dependency file label Oct 18, 2019
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
dependencies Pull requests that update a dependency file
Projects
None yet
Development

Successfully merging this pull request may close these issues.

None yet

0 participants