Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

The algorithm to compute the derived property should use EAW more #765

Open
kojiishi opened this issue Apr 6, 2024 · 2 comments
Open

The algorithm to compute the derived property should use EAW more #765

kojiishi opened this issue Apr 6, 2024 · 2 comments

Comments

@kojiishi
Copy link

kojiishi commented Apr 6, 2024

This issue is a continuation of kojiishi/unicode-auto-spacing#16, originally raised by @asmusf.

@kojiishi
Copy link
Author

kojiishi commented Apr 6, 2024

@asmusf The algorithm to compute the property was adjusted to:

  1. Add by scripts and script_extensions.
  2. Remove by EAW.

In other words, the algorithm is an intersection of scripts and EAW.

Does it make clear that EAW plays one of the primary roles for this property?

@kojiishi
Copy link
Author

kojiishi commented Apr 21, 2024

With #768, ea=N|Na are exlucded from the W set, unless their script is one of CJK.

The current algorithm to compute the property is here. It's in Python with a library, but I hope it's readable without prior knowledge.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

1 participant