Skip to content

A lightweight Java framework for Natural Language Processing projects.

License

Notifications You must be signed in to change notification settings

txtData/txtDataNLP

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

26 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

txtDataNLP

txtDataNLP is a lightweight Java framework for Natural Language Processing (NLP) projects.

Typical use cases include:

  • Combining various third-party NLP and Machine Learning (ML) tools into one setup.
  • Implementing algorithms that combine output from various NLP or ML modules and return one coherent analysis—also if their output is contradictory.
  • Implementing dictionary-based, ontology-based or linguistically motivated approaches to NLP.
  • Extraction of features from low-level NLP tools (tokenizers, taggers) for downstream ML models.
  • Implementing white- or backlists that often sit on top of ML approaches in order to deal with specific mistakes of the ML models.

About

A lightweight Java framework for Natural Language Processing projects.

Topics

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published