Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Come up with a way to handle extended grapheme clusters #24

Open
MnO2 opened this issue Jun 16, 2019 · 1 comment
Open

Come up with a way to handle extended grapheme clusters #24

MnO2 opened this issue Jun 16, 2019 · 1 comment
Assignees
Labels
bug Something isn't working

Comments

@MnO2
Copy link
Collaborator

MnO2 commented Jun 16, 2019

Examples here: https://developer.apple.com/swift/blog/?id=30

"abcde\u{0301}\u{1100}\u{1161}\u{AC00}" should not be segmented as "abcde" and "\u{0301}\u{1100}\u{1161}\u{AC00}". "e\u{0301}" should be together.

@MnO2 MnO2 self-assigned this Jun 16, 2019
@MnO2 MnO2 added the bug Something isn't working label Jun 16, 2019
@MnO2
Copy link
Collaborator Author

MnO2 commented Jun 17, 2019

https://unicode-rs.github.io/unicode-segmentation/unicode_segmentation/struct.Graphemes.html This could be considered. But only required if the behaviour of re_han results into incorrect segmentation by SplitMatch

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
bug Something isn't working
Projects
None yet
Development

No branches or pull requests

1 participant