Skip to content

MichaelPaulukonis/common-corpus

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

18 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

text-corpus

common texts used in NLG/NLP projects

Installation

npm i -s github:michaelpaulukonis/common-corpus

Roadmap

  • MOAR TEXTS
  • zipped content, as the current size is 75MB uncompressed, and 25MB compressed
  • tool consolidation
  • migrate from compromise instead of nlp_compromise
  • use of original texts with algos to remove boilerplate
  • retrieval of texts from gitenberg

About

common texts that I like to use in NLG/NLP projects

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published