
Do not remove dots from utterance in entity recognition #1318

Open
alberchou opened this issue Jun 1, 2023 · 1 comment

alberchou commented Jun 1, 2023

Some of my entities contain dots (for example: aaaaa.bbbbb.ccccc). I need to set the accuracy to 1, but when I do, those entities are not recognized.

Is there an option to exclude certain characters from being used as token separators?
Also, is it possible to apply that only to entity recognition and not to intent recognition?

Thank you!
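
One possible workaround, sketched independently of the library: replace the dots inside known entity values with a placeholder before the text is processed, and restore them afterwards. The helper names `protectDots` and `restoreDots` and the placeholder `xdotx` are hypothetical and not part of any library API. This assumes the placeholder survives tokenization and that the entity values are registered in their protected form as well.

```javascript
// Hypothetical helpers, not part of any library.
// Placeholder made of lowercase letters only, on the assumption that a
// tokenizer will not split it or change its case.
const PLACEHOLDER = 'xdotx';

// Escape characters that have a special meaning in a regular expression.
function escapeRegExp(s) {
  return s.replace(/[.*+?^${}()|[\]\\]/g, '\\$&');
}

// Replace dots inside each known entity value; all other dots stay as they are.
function protectDots(utterance, entityValues) {
  // Longest values first, so a shorter value never rewrites part of a longer one.
  const sorted = [...entityValues].sort((a, b) => b.length - a.length);
  let result = utterance;
  for (const value of sorted) {
    const protectedValue = value.split('.').join(PLACEHOLDER);
    // A function replacer avoids special handling of "$" in the replacement.
    result = result.replace(new RegExp(escapeRegExp(value), 'g'), () => protectedValue);
  }
  return result;
}

// Turn the placeholder back into dots, e.g. in the recognized entity text.
function restoreDots(text) {
  return text.split(PLACEHOLDER).join('.');
}
```

Because only the listed values are rewritten, sentence-ending dots keep their normal role as separators.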

@alberchou
Author

One other thing, which may be a bug: when I pass a long list of values (all previously added with the nlp.addNamedEntityText function), fewer entities are recognized than the string contains. In my case, 22 entities were found for a text string of 29 values.
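
To narrow down which values are being dropped, a small comparison like the following may help. It is only a sketch: `findMissingValues` is a hypothetical helper, and it assumes the recognized entity texts have already been collected into an array of strings from whatever result object the library returns.

```javascript
// Hypothetical helper, not part of any library.
// Returns the expected values that do not appear among the recognized texts.
function findMissingValues(expectedValues, foundTexts) {
  // Compare case-insensitively, since recognized text may be normalized.
  const found = new Set(foundTexts.map((t) => t.toLowerCase()));
  return expectedValues.filter((v) => !found.has(v.toLowerCase()));
}
```

Running this with the 29 sent values and the 22 recognized ones would list the 7 that were missed, which may show whether they share a pattern such as dots or repeated tokens.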
