Fix filtering of characters #1

Open: 2 commits to be merged into base: master

@smilli commented May 19, 2016

Right now, if BookNLP finds character names such as "Sakura Kinomoto" and "Kinomoto Sakura", it flags both of them, because each is a subset of the other, so the character ends up not being added at all. I made a fix that, in that case, adds the character name that comes first in lexicographic order.

I think the getVariants method still has to be modified to include both of them, though: "Kinomoto Sakura" is the name that will end up getting added, but "Sakura Kinomoto" isn't found in the getVariants part.
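
To make the tie-break concrete, here is a minimal sketch of the idea, not the actual BookNLP code. It assumes names are compared as sets of whitespace-separated tokens and that a name is flagged when another name contains all of its tokens. The class `NameFilterSketch` and the methods `tokens` and `filterNames` are hypothetical names used only for illustration.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.HashSet;
import java.util.List;
import java.util.Set;

// Hypothetical illustration; none of these names come from BookNLP.
class NameFilterSketch {

    // Split a name into its set of whitespace-separated tokens.
    static Set<String> tokens(String name) {
        return new HashSet<>(Arrays.asList(name.trim().split("\\s+")));
    }

    // Drop a name when another name strictly contains its tokens, or when
    // another name has exactly the same tokens and sorts earlier
    // lexicographically. Without the second condition, two names with the
    // same tokens would flag each other and both would be dropped.
    static List<String> filterNames(List<String> names) {
        List<String> kept = new ArrayList<>();
        for (String name : names) {
            Set<String> mine = tokens(name);
            boolean drop = false;
            for (String other : names) {
                if (other.equals(name)) {
                    continue; // never compare a name against itself
                }
                Set<String> theirs = tokens(other);
                if (!theirs.containsAll(mine)) {
                    continue; // name is not a subset of other
                }
                boolean sameTokens = mine.containsAll(theirs);
                if (!sameTokens || other.compareTo(name) < 0) {
                    drop = true;
                    break;
                }
            }
            if (!drop && !kept.contains(name)) {
                kept.add(name);
            }
        }
        return kept;
    }
}
```

With the two names from the example as input, this sketch keeps only "Kinomoto Sakura", which matches the behaviour described above. It does not address the getVariants concern: "Sakura Kinomoto" would still need to be mapped to the kept name somewhere else.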
