Grammatical agreement for quantities #5

Open
grhoten opened this issue Mar 7, 2024 · 4 comments
Labels
discuss Discussion item

Comments

@grhoten
Member

grhoten commented Mar 7, 2024

Getting quantities grammatically correct should be within the scope of this working group. The ability to take a unit and add a numerical value, like 1 or 2, to make a quantity is important. The scope should cover the adjectives and nouns that are part of the quantity. Anything involving number pronunciation should remain a part of RBNF in CLDR.

Here's an example for the word foot:

| Number | Grammeme | Resolved surface form |
| --- | --- | --- |
| 1 | singular | foot |
| 2 | plural | feet |

Here's an example with the word карандаш (pencil) in Russian:

| Number | Grammemes | Resolved surface form |
| --- | --- | --- |
| 1 | singular & nominative | карандаш |
| 2 | singular & genitive | карандаша |
| 5 | plural & genitive | карандашей |

While CLDR plural rules can be used to determine which form to use, the hope is that this project can define how to turn one surface form into another, like from карандаш to карандаша or карандашей.
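
As a rough illustration of that split, here is a minimal Python sketch. Every name in it is hypothetical and none of it is an API of CLDR or of this project. The category function covers non-negative integers only and follows the CLDR cardinal rules for Russian as I understand them. The tiny lexicon is hard-coded from the table above and stands in for whatever the inflection data would provide.

```python
# Hypothetical sketch: these names are illustrative, not part of CLDR or this project.

def russian_plural_category(n: int) -> str:
    """Plural category for a non-negative integer in Russian (integers only)."""
    if n % 10 == 1 and n % 100 != 11:
        return "one"
    if 2 <= n % 10 <= 4 and not 12 <= n % 100 <= 14:
        return "few"
    return "many"

# Which grammemes the counted noun takes for each category, assuming the
# quantity as a whole is in a nominative context (as in the table above).
CATEGORY_TO_GRAMMEMES = {
    "one": ("singular", "nominative"),
    "few": ("singular", "genitive"),
    "many": ("plural", "genitive"),
}

# Stand-in for the inflection data: surface forms of "карандаш" (pencil).
PENCIL_FORMS = {
    ("singular", "nominative"): "карандаш",
    ("singular", "genitive"): "карандаша",
    ("plural", "genitive"): "карандашей",
}

def format_quantity(n: int) -> str:
    """Pick the category, map it to grammemes, then look up the surface form."""
    grammemes = CATEGORY_TO_GRAMMEMES[russian_plural_category(n)]
    return f"{n} {PENCIL_FORMS[grammemes]}"

if __name__ == "__main__":
    for n in (1, 2, 5, 11, 21):
        print(format_quantity(n))
```

The plural rules only answer "which category"; the lookup in `PENCIL_FORMS` is the part this project would replace with real inflection.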

For a language like English, the rules to change a word from singular to plural are pretty regular. The rules can fit onto a single page, but you need the data to be able to handle the edge cases, like for goose, moose, foot, child, new and so forth.
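
To make the "rules plus data" point concrete, here is a small Python sketch, assuming a hypothetical `pluralize` function and a hand-written exception table. The suffix rules are deliberately incomplete (for example, words ending in -f or -o are not handled), so treat it as an illustration rather than a real rule set.

```python
# Hypothetical sketch: a few regular English suffix rules plus an exception table.

# Data for edge cases that the regular rules cannot derive.
IRREGULAR_PLURALS = {
    "goose": "geese",
    "moose": "moose",
    "foot": "feet",
    "child": "children",
}

VOWELS = "aeiou"

def pluralize(word: str) -> str:
    """Return a plural form: exceptions first, then regular suffix rules."""
    if word in IRREGULAR_PLURALS:
        return IRREGULAR_PLURALS[word]
    if word.endswith(("s", "x", "z", "ch", "sh")):
        return word + "es"
    # Consonant + y becomes -ies; vowel + y just takes -s.
    if word.endswith("y") and len(word) > 1 and word[-2] not in VOWELS:
        return word[:-1] + "ies"
    return word + "s"

if __name__ == "__main__":
    for w in ("pencil", "box", "city", "day", "foot", "moose"):
        print(w, "->", pluralize(w))
```

Without the exception table, the regular rules would produce "foots" and "mooses", which is why the data matters as much as the rules.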

If it's done well, you should be able to reinflect a word while maintaining the grammatical case of the word. For a language like German or Russian, this is an important topic.
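
Here is one way that could look, as a Python sketch under stated assumptions: the function name `reinflect_number` is hypothetical, and the paradigm is a hand-written partial table for карандаш that leaves out the accusative and prepositional cases so that every surface form maps to exactly one case. A real engine would need to deal with ambiguous forms.

```python
# Hypothetical sketch: change grammatical number while keeping the case.

# Partial paradigm of "карандаш" (pencil), keyed by (number, case).
PARADIGM = {
    ("singular", "nominative"): "карандаш",
    ("singular", "genitive"): "карандаша",
    ("singular", "dative"): "карандашу",
    ("singular", "instrumental"): "карандашом",
    ("plural", "nominative"): "карандаши",
    ("plural", "genitive"): "карандашей",
    ("plural", "dative"): "карандашам",
    ("plural", "instrumental"): "карандашами",
}

def reinflect_number(surface: str, target_number: str) -> str:
    """Find the case of `surface`, then return the form with the same case
    in `target_number`."""
    for (_number, case), form in PARADIGM.items():
        if form == surface:
            return PARADIGM[(target_number, case)]
    raise KeyError(f"unknown surface form: {surface}")

if __name__ == "__main__":
    # Dative singular becomes dative plural; the case is preserved.
    print(reinflect_number("карандашу", "plural"))
```

The point of the sketch is that the caller only states the change it wants (the number) and the case is recovered from the word itself.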

@grhoten grhoten added the discuss Discussion item label Mar 7, 2024
@nciric
Contributor

nciric commented Mar 15, 2024

To summarize, you are OK with the CLDR plural rules telling us "you need the form for 0, 1, few or many here, but I don't know how to change the word to match; use the inflection library to do so."

I agree with that - it would use previously collected plural rules for languages, and it falls perfectly into our domain to inflect the word to match the plurality.

@richgillam

> Anything involving number pronunciation should remain a part of RBNF in CLDR.

At some point we should explore changing this. It seems like we might be able to simplify the RBNF rules for at least some languages if RBNF could take advantage of the inflection engine to inflect individual words in a spelled-out number.

@richgillam

> To summarize, you are OK with the CLDR plural rules telling us "you need the form for 0, 1, few or many here, but I don't know how to change the word to match; use the inflection library to do so."

Can we replace the CLDR plural rules with something that lives in the inflection engine and knows which quantities are interesting, or does that basically amount to the same stuff we're doing now, but in a new location?

@macchiati
Member

macchiati commented Mar 18, 2024 via email

Projects
Status: In Progress