Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Flesh out grammatical categories #25

Open
macchiati opened this issue Mar 19, 2024 · 1 comment
Open

Flesh out grammatical categories #25

macchiati opened this issue Mar 19, 2024 · 1 comment
Labels
discuss Discussion item

Comments

@macchiati
Copy link
Member

macchiati commented Mar 19, 2024

We have a set of grammatical categories/features in CLDR, that are also used in ICU. It would be very useful to flesh out these categories so that we have a consistent set of identifiers for grammatical categories, and lists of which categories are applicable to which languages, and for which scopes.

Currently the data for this is limited:

  1. Nouns & noun clauses: gender, case, definiteness, plurals (cardinals), ordinals, plural ranges.
  2. Two scopes: general and units
  3. Limited locales
    1. gender, case, definiteness: (50) Amharic, Arabic, Armenian, Azerbaijani, Bangla, ... Turkish, Ukrainian, Urdu, Uzbek
    2. plurals: (300+) Afrikaans, Akan, Albanian, Amharic, Anii, Arabic, Aragonese, Armenian, Assamese, Asturian, Asu, Azerbaijani, ... Xhosa, Yakut, Yiddish, Yoruba, Zulu

https://www.unicode.org/cldr/charts/45/grammar/index.html

https://www.unicode.org/cldr/charts/45/supplemental/language_plural_rules.html

@macchiati macchiati added the discuss Discussion item label Mar 19, 2024
@nciric
Copy link
Contributor

nciric commented Mar 19, 2024

I think this would be beneficial to CLDR/ICU with improving data quality and maybe reducing the size. It could also help our effort in defining which categories we want to tackle.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
discuss Discussion item
Projects
Status: In Progress
Development

No branches or pull requests

2 participants