Skip to content

open-hopin/mbFXWords

Repository files navigation

mbFXWords

divide plain text and PDF content in subject, predicate, object with OpenNLP

Version 1.04

NetBeans project with Ant build script.

Applies and builds upon Apache OpenNLP. For English, French and German files. JavaFX Application.

NLP extensions:

  • Divide sentences in subclauses: segmentation.
  • Divide plain text: subject, predicate, object.
  • Count words: stemming.
  • Search for similar content: PDF's.

Also on: https://sourceforge.net/projects/mbfxwords/