Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

deformatters -o: maybe double newline without end-of-line period should give heading-symbol instead of period #198

Open
unhammer opened this issue Jan 25, 2024 · 1 comment

Comments

@unhammer
Copy link
Member

Headings and list items have different syntax, so apertium-deshtml -o can output a ❡ after them, which the disambiguator can use to e.g. relax rules like "always require a verb in a sentence":

$ echo 'Chase bank</h1>' | apertium-deshtml -o
Chase bank[]❡.[][<\/h1>
]

But for plain text, we often have simple headings like

Chase bank

This organisation has an ambiguous name.

Maybe the -o option could give a instead of just . before double newlines without final punctuation

@unhammer
Copy link
Member Author

(see also TinoDidriksen/Transfuse#15 , I guess such rules would have to go there when running with transfuse, though we should get the simple h1 case solved first there)

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

1 participant