Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Tolong Siki #732

Draft
wants to merge 9 commits into
base: main
Choose a base branch
from
Draft

Tolong Siki #732

wants to merge 9 commits into from

Conversation

josh-hadley
Copy link
Collaborator

See UTC-174-C22: Provisionally assign 54 code points U+11DB0..U+11DE9 in a new Tolong Siki block at U+11DB0..U+11DEF, for 54 Tolong Siki characters as described in L2/23-024 and section 6 of L2/23-012.

11DD6;TOLONG SIKI LETTER X;Lo;0;L;;;;;N;;;;;
11DD7;TOLONG SIKI LETTER RR;Lo;0;L;;;;;N;;;;;
11DD8;TOLONG SIKI LETTER RRH;Lo;0;L;;;;;N;;;;;
11DD9;TOLONG SIKI SIGN SELA;Lo;0;L;;;;;N;;;;;
Copy link
Member

@eggrobin eggrobin Mar 11, 2024

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

This is a vowel length mark, see p. 5 and the chart on p. 13 of the proposal L2/23-024.

I think this should be gc=Lm, and in PropList.txt, should have the Diacritic and Extender properties (like U+02D0).

The proposal bundles U+11DDA with it. For that one I don’t know whether it is more appropriate to treat it as a letter that is a glottal stop or as modifier that does mysterious syllable coda things. @Ken-Whistler, what do you think?

Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I don't think it (hecakā) is a vowel length mark, as contrasted with selā, which is. The glottal stop seems to be a full segment, although perhaps in restricted distribution. See the discussion of Kurukh and Brahui phonology on Wikipedia, for instance, which shows length distinctions (and in some cases nasalization distinctions) for vowels, which when marked in the orthography can be considered vowel modifiers and/or extenders, etc. But I think it is safest to just leave the glottal stop as Lo and treat it as another letter.

Copy link
Collaborator Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Does this change also require changes to LineBreak.txt?

Based on feedback from @eggrobin and @Ken-Whistler :
- set gc=Lm (UnicodeData.txt)
- set Diacritic and Extender (PropsList.txt)
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

Successfully merging this pull request may close these issues.

None yet

3 participants