Non-ASCII thousands and decimal separators not processed correctly #14630

groenroos · 2021-05-11T17:10:01Z

In some languages (such as Finnish), thousands are usually separated by spaces. However, this means that in some cases the number can confusingly wrap onto multiple lines, as the space makes the parts of the single number behave like two separate words;

Numbers being injected into strings should be protected from being wrapped onto multiple lines.

duncanspumpkin · 2021-05-11T17:46:05Z

That would be very hard to pull off. The layout part of the code that decides where to place line breaks is after the token processing code. You would need to change it so that the space is actually a non breaking space.

ZxBiohazardZx · 2021-05-12T09:48:15Z

would it be an idea to use the UNICODE non-breaking space ( U+00A0) as opposite of the normal space (U+0020)

not sure you can add that as character in the localisation in an easy way (U00A0 as seperator)

ShimmerFairy · 2021-05-13T16:03:30Z

If Unicode is an option, then using NBSP as mentioned would be an easy way to specify spaces that should not be broken across lines. In fact, for reference, Unicode comes with a line breaking algorithm, specified in UAX#14. So even if you had to write your own Unicode-aware line breaker (as opposed to using some library that figures it out for you), you at least wouldn't have to figure out an algorithm by yourself.

groenroos · 2021-05-13T16:18:57Z

If the Unicode (or some other) non-breaking space character is supported, then it's a gotcha for translators, but at least the user-facing issue can be resolved. Are non-breaking spaces supported & respected in the game currently?

Gymnasiast · 2021-05-13T16:33:32Z

Yes, they are. I fixed that to allow for French-style punctuation (which similarly puts a non-breaking space before ! and ?).

If you change it in Localisation, could you also update the other language files that have a space for a thousands separator?

groenroos · 2021-05-15T21:21:28Z

I just tried giving it a go with a single U+00A0 for STR_5151, but the same behaviour still happens - the two segments of the number are still split onto two lines... 😕

I'm running 0.3.3 (3f65f282d) if that makes a difference.

groenroos · 2024-05-19T18:46:07Z

I noticed that my previous attempt was thwarted by my IDE trying to be "helpful" by converting my non-breaking spaces to regular spaces.

I now added a proper U+00A0 for STR_5151 in OpenRCT2/Localisation#2832 - however, I had to close that PR, because in-game this seems to corrupt any parent string that needs a thousands separator, by truncating the rest of the string:

I'm assuming the same character works fine for punctuation in fr-FR (albeit it looks like they also use a regular space for STR_5151?). Is there something extra that needs to be fixed or enabled to make non-breaking spaces work for fi-FI in this context?

Gymnasiast · 2024-05-19T19:25:58Z

If I had to hazard a guess: the symbol is represented by two bytes in UTF-8, and our code probably makes an assumption somewhere that the symbol is ASCII.

Gymnasiast · 2024-05-19T20:16:27Z

@groenroos A fix is pending, are you able to check that out?

groenroos · 2024-05-19T22:48:51Z

@Gymnasiast Thanks - built it locally, and that does indeed seem to fix the issue! 🎉

As for OpenRCT2/Localisation#2832, given that we probably wouldn't want to distribute those strings unless your patch was also merged, would it be safe to re-open that PR now, or wait for later?

Gymnasiast added the localisation Related to translations and localisation label May 13, 2021

Gymnasiast changed the title ~~Numbers should not wrap in languages that use space for thousands separators~~ Non-ASCII thousands and decimal separators not processed correctly May 19, 2024

Gymnasiast added a commit to Gymnasiast/OpenRCT2 that referenced this issue May 19, 2024

Fix OpenRCT2#14630: Number separators not processed correctly

dd64586

Gymnasiast added a commit to Gymnasiast/OpenRCT2 that referenced this issue May 19, 2024

Fix OpenRCT2#14630: Number separators not processed correctly

da6c3ec

Gymnasiast added a commit to Gymnasiast/OpenRCT2 that referenced this issue May 19, 2024

Fix OpenRCT2#14630: Number separators not processed correctly

49522b0

Gymnasiast linked a pull request May 19, 2024 that will close this issue

Fix #14630: Number separators not processed correctly #22064

Merged

groenroos mentioned this issue May 19, 2024

Fix #14630: Number separators not processed correctly #22064

Merged

Gymnasiast closed this as completed in #22064 May 23, 2024

Gymnasiast added a commit that referenced this issue May 23, 2024

Fix #14630: Number separators not processed correctly

308cc3c

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Non-ASCII thousands and decimal separators not processed correctly #14630

Non-ASCII thousands and decimal separators not processed correctly #14630

groenroos commented May 11, 2021

duncanspumpkin commented May 11, 2021

ZxBiohazardZx commented May 12, 2021

ShimmerFairy commented May 13, 2021

groenroos commented May 13, 2021

Gymnasiast commented May 13, 2021 •

edited

groenroos commented May 15, 2021

groenroos commented May 19, 2024

Gymnasiast commented May 19, 2024

Gymnasiast commented May 19, 2024

groenroos commented May 19, 2024

Non-ASCII thousands and decimal separators not processed correctly #14630

Non-ASCII thousands and decimal separators not processed correctly #14630

Comments

groenroos commented May 11, 2021

duncanspumpkin commented May 11, 2021

ZxBiohazardZx commented May 12, 2021

ShimmerFairy commented May 13, 2021

groenroos commented May 13, 2021

Gymnasiast commented May 13, 2021 • edited

groenroos commented May 15, 2021

groenroos commented May 19, 2024

Gymnasiast commented May 19, 2024

Gymnasiast commented May 19, 2024

groenroos commented May 19, 2024

Gymnasiast commented May 13, 2021 •

edited