Is the wikicreole parser slightly broken, or is it tyxml, or both? #133

hhugo opened this issue Nov 2, 2022 · 1 comment

hhugo commented Nov 2, 2022

The wikicreole parser currently emits too many B.phrasing elements, to the point of splitting words into pieces.

For example, with the input "the", the parser emits one B.phrasing for "t" and another for "he". The reason seems to be that there is a rule for parsing http:...: the parser stops after "t" in case the "h" is the start of http:....
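The splitting behaviour described above can be reproduced with a toy tokenizer. This is only a sketch and not the actual wikicreole lexer: `split_runs`, `starts_with_at`, `naive` and `with_lookahead` are hypothetical names made up for illustration. It contrasts ending a text run at every "h" with ending it only when the input really continues with "http:".

```ocaml
(* Hypothetical toy tokenizer; NOT the real wikicreole lexer. *)

(* True when [prefix] occurs in [s] starting at index [i]. *)
let starts_with_at s i prefix =
  let lp = String.length prefix in
  i + lp <= String.length s && String.sub s i lp = prefix

(* Cut [s] into text runs, starting a new run at every index [i]
   (other than the start of the current run) where [should_split s i] holds. *)
let split_runs should_split s =
  let n = String.length s in
  let rec go start i acc =
    if i >= n then
      List.rev (if i > start then String.sub s start (i - start) :: acc else acc)
    else if i > start && should_split s i then
      go i (i + 1) (String.sub s start (i - start) :: acc)
    else go start (i + 1) acc
  in
  go 0 0 []

(* Ends the run at any 'h', in case it begins "http:". *)
let naive = split_runs (fun s i -> s.[i] = 'h')

(* Ends the run only when "http:" is actually there. *)
let with_lookahead = split_runs (fun s i -> starts_with_at s i "http:")

let () =
  List.iter print_endline (naive "the");          (* prints "t" then "he" *)
  List.iter print_endline (with_lookahead "the")  (* prints "the" *)
```

Whether the real parser should use a longer lookahead or merge adjacent text nodes after the fact is a separate design question; the sketch only shows why a single-character check fragments ordinary words.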

The other aspect that seems odd to me is that tyxml can generate files that browsers render differently depending on whether indentation is enabled.
I think it boils down to the following: printing [ pcdata "a"; pcdata "b" ] inserts a break hint between "a" and "b". Format can then insert a newline at that hint if the line is too long, and since the browser collapses the newline into a space, it renders "a\nb" and "ab" differently.
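The Format behaviour can be illustrated with the standard library alone. This is a minimal sketch that does not use tyxml's printer: `render` is a hypothetical helper that prints a list of strings with a cut (a zero-width break hint) between consecutive items, which is an assumption about what the printer effectively does for adjacent pcdata nodes.

```ocaml
(* Hypothetical helper; mimics adjacent text nodes separated by break hints. *)
let render ~margin parts =
  let buf = Buffer.create 64 in
  let ppf = Format.formatter_of_buffer buf in
  Format.pp_set_margin ppf margin;
  Format.pp_open_box ppf 0;
  List.iteri
    (fun i s ->
      (* A cut prints nothing if the line fits, a newline otherwise. *)
      if i > 0 then Format.pp_print_cut ppf ();
      Format.pp_print_string ppf s)
    parts;
  Format.pp_close_box ppf ();
  Format.pp_print_flush ppf ();
  Buffer.contents buf

let () =
  let parts = [ "aaaaaa"; "bbbbbb" ] in
  (* Wide margin: both strings fit on one line, so no whitespace is added. *)
  print_endline (String.escaped (render ~margin:80 parts));
  (* Narrow margin: the strings no longer fit together, so the cut is
     expected to turn into a line break between them. *)
  print_endline (String.escaped (render ~margin:10 parts))
```

The same input therefore yields output with or without whitespace between the two strings depending only on the margin, which matches the rendering difference described above.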


hhugo commented Nov 2, 2022

ocsigen/tyxml#288
