Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Lack of vocabularies translation #188

Open
3 tasks done
Dieofu opened this issue Feb 21, 2024 · 6 comments
Open
3 tasks done

Lack of vocabularies translation #188

Dieofu opened this issue Feb 21, 2024 · 6 comments

Comments

@Dieofu
Copy link

Dieofu commented Feb 21, 2024

Checkboxes

  • I have read the document at xxyzz.github.io/WordDumb.
  • I have not found similar issue or disscussion at GitHub.
  • Reboot doesn't fix the problem.

Describe the bug

word wise creating function works, however, somehow, only very limited vocabularies have word wise translations. When I check customize kindle word wise, I try typing some vocabularies in Filter Lemma.
For example, "progressive", it only shows phrases
20240220203136;
For vocabulary "fly", it only shows animal fly in translation, but no verb form in translation.
20240220203748;
For vocabulary fry", the translation doesn't show up.
20240220203932
I would like to ask what I can do to correct translation database.

Operating System name and version

Windows 10

Python version

3.12

calibre version

7.5.1

WordDumb plugin version

3.31.0

Error message

No error message shown

Plugin settings and reproduce steps

20240220204402
20240220204440

Generated files, screenshots or videos

No response

@xxyzz
Copy link
Owner

xxyzz commented Feb 21, 2024

The gloss data are obtained from the Chinese Wiktionary, you won't find some words because Wiktionary doesn't have the definition or the page has a terrible wikitext layout. For example, the fly page only has the none gloss, and the fry page doesn't use the standard layout.

You could deselect the "Use Wiktionary definition" to use the Kindle Word Wise db.

If you want to improve the Wiktionary data quality, you could edit the Wiktionary page and take a look of these packages:

Word Wise db files are created from the "Proficiency" project, and "wiktextract" is for extracting data from Wiktionary pages.

@Dieofu
Copy link
Author

Dieofu commented Feb 21, 2024

For word: progressive and fry. Both of them have Chinese Wiktionary translations. But, none of them is in the worddumb.
Does it mean that both words have terrible wikitext layout?

When I deselect the "Use Wiktionary definition" to use the Kindle Word Wise db, it shows:
20240220203136
On kindle, Chinese translations won't match vocabularies . What I can do is using English to English wordwise. Do you have any idea what causes it? is something wrong with the Kindle Word Wise db?

Thanks

@xxyzz
Copy link
Owner

xxyzz commented Feb 21, 2024

Both "progressive" and "fry" in Chinese Wiktionary are low quality pages: no part of speech title, and gloss text is not inside a list. So they can't be extracted or can't be used without pos.

When "Use Wiktionary definition" option is selected, a new word wise db will be uploaded to Kindle to replace the Chinese word wise db. If you want to use Kindle's Chinese word wise file, you could delete the file at /system/kll and Kindle will redownload the file.

@Dieofu
Copy link
Author

Dieofu commented Feb 21, 2024

/system/kll
is this file in kindle? I don't see any system files on my kindle.
20240220221540

Would it cause the problem I had?

calibre, version 7.5.1
even if the device is connected,
it says: Device not found: Definition data will be added when Kindle or Android(requires adb) device is connected.

@xxyzz
Copy link
Owner

xxyzz commented Feb 21, 2024

It's a hidden folder. Are you using Scribe? The downloading word wise db file from kindle code might not work for Scribe, otherwise there might be a bug in the code.

@Dieofu
Copy link
Author

Dieofu commented Feb 21, 2024

It's a hidden folder. Are you using Scribe? The downloading word wise db file from kindle code might not work for Scribe, otherwise there might be a bug in the code.

I am using paperwhite 5. I found system file and just deleted kll files. Hope it could solve the problems

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants