Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Changes to Finnish verb tables break their parsing in really annoying ways #241

Open
kristian-clausal opened this issue Apr 28, 2023 · 1 comment

Comments

@kristian-clausal
Copy link
Collaborator

This is mostly a reminder for myself about this.

Module:fi-verbs has been going through changes that break our parser in spectacular ways, mainly by creating toggleable sub-tables inside bigger tables that rely on headers from the bigger table for data.

At the bottom of the inflection table in syödä (as an example) we have the section for infinitives (which was a messy and annoying section even before and required extra work with tags and tag-trigger stuff); now what used to be single cells with easy and direct access to headers we have sections of expandable subtables that are just a pain to work with.

If someone is arranging a coup d'etat on en.wiktionary for the purposes to stopping this kind of thing, let me know.

It really is like hitting your head against a wall, and finally when you can see through the hole that you've been digging with your forehead the walls just grows back.

@kekbur
Copy link

kekbur commented Dec 27, 2023

As a related note: Finnish noun declension table extraction does not capture possessive forms with all necessary tags.

See for example aalto, which yields for example:

{
  "form": "aaltomme",
  "source": "declension",
  "tags": [
	"nominative",
	"singular"
  ]
}

Which would be more accurately stated as:

{
  "form": "aaltomme",
  "source": "declension",
  "tags": [
	"nominative",
	"singular",
	"first-person plural possessor"
  ]
}

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants