Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

LatinBackoffLemmatizer fails on words with -aris ending #1122

Open
TylerKirby opened this issue Aug 23, 2021 · 0 comments
Open

LatinBackoffLemmatizer fails on words with -aris ending #1122

TylerKirby opened this issue Aug 23, 2021 · 0 comments
Assignees
Labels

Comments

@TylerKirby
Copy link
Contributor

TylerKirby commented Aug 23, 2021

Describe the bug
The LatinBackoffLemmatizer seems to fail on words with -aris ending.

To Reproduce

from cltk.lemmatize.lat import LatinBackoffLemmatizer
lemmatizer = LatinBackoffLemmatizer()
lemmatizer.lemmatize(["abundaris"])
Out[7]: [('abundaris', 'abundaris')]
lemmatizer.lemmatize(["amaris"])
Out[8]: [('amaris', 'amaris')]
lemmatizer.lemmatize(["duraris"])
Out[9]: [('duraris', 'duraris')]
lemmatizer.lemmatize(["crearis"])
Out[10]: [('crearis', 'crearis')]

Expected behavior
E.g., amaris should lemmatize to amo.

Desktop (please complete the following information):

  • OS and version: MacOS 11.5.1
  • cltk version 1.0.18
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Projects
None yet
Development

No branches or pull requests

2 participants