spaCy backend

@thoppe thoppe released this 19 Mar 14:36
· 25 commits to master since this release
8654343
  • The backend NLP engine, pattern.en, has been replaced with spaCy v2.1.0. This is a major fix for several problems with pattern.en, including poor lemmatization (e.g. cytokine -> cytocow).
  • Support for Python 2 has been dropped
  • Support for custom dictionaries in replace_from_dictionary
  • Option for suffix to be used instead of prefix in replace_from_dictionary
  • URL replacement can now also remove email addresses
  • token_replacement can remove symbols
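
To illustrate what the custom dictionary and prefix/suffix options amount to, here is a minimal standalone sketch. It does not call the library: `replace_terms`, its parameters, and the `DICT` tag are hypothetical names chosen for illustration, and the actual `replace_from_dictionary` interface may differ.

```python
import re


def replace_terms(text, dictionary, affix="DICT", use_suffix=False):
    """Replace each dictionary phrase with its canonical term, tagged by an affix.

    Hypothetical helper for illustration only; not the library's API.
    """
    # Handle longer phrases first so multi-word entries win over their substrings.
    for phrase in sorted(dictionary, key=len, reverse=True):
        term = dictionary[phrase].replace(" ", "_")
        # The tag goes in front by default, or at the end when use_suffix is set.
        tagged = f"{term}_{affix}" if use_suffix else f"{affix}_{term}"
        pattern = r"\b" + re.escape(phrase) + r"\b"
        text = re.sub(pattern, lambda m: tagged, text, flags=re.IGNORECASE)
    return text


# A user-supplied (custom) dictionary mapping phrases to canonical terms.
custom = {"heart attack": "myocardial infarction"}

print(replace_terms("He had a heart attack.", custom))
# He had a DICT_myocardial_infarction.
print(replace_terms("He had a heart attack.", custom, use_suffix=True))
# He had a myocardial_infarction_DICT.
```

A suffix can be preferable when downstream tools sort or group tokens by their leading characters, since the canonical term then comes first.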