Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

List of dictionary words #103

Open
silencesys opened this issue Apr 8, 2020 · 5 comments
Open

List of dictionary words #103

silencesys opened this issue Apr 8, 2020 · 5 comments

Comments

@silencesys
Copy link

silencesys commented Apr 8, 2020

Is there a way to get a list of all the grammar variations of all the words the dictionary knows? I would like to use regex to get list of possible words if I'm unsure of the word.

I also tried to compile this dictionary and it got stuck on this line.

@mk270
Copy link
Owner

mk270 commented Apr 8, 2020

Thanks for getting in touch. Please file the compile failure as a separate ticket

@ids1024
Copy link
Contributor

ids1024 commented Apr 8, 2020

For listing all forms the dictionary can generate, there's a LISTALL file mentioned on the old website.

Though, that doesn't account for some rules. I'm not sure how that's supposed to be generated.

@silencesys
Copy link
Author

silencesys commented Apr 8, 2020

Thank you for your responses! I'll do a separate ticket then, but I realized that it might be caused by the Docker container so I'll try it on a Linux machine and write the ticket if the issue persists.

Yes, I found that file, but there are some entries missing, for example acrem. So I was wondering if is possible to generate a new one. :)

@silencesys silencesys changed the title Dictionary words and makefile List of dictionary words Apr 8, 2020
@Aurorum
Copy link

Aurorum commented Jul 2, 2022

Hi all, just stumbled across this thread but still looking for a simple TXT file of all Latin words. If anybody has one which they could share, I'd be really grateful. Many thanks! :)

@spr93
Copy link
Contributor

spr93 commented Jul 7, 2022

Under src/tools there are programs that come close to what you're looking for:

DICTPAGE generates a single text file that lists most*** words in a paper-dictionary-style format (that is, the output lists the principal parts, part of speech, and gender in addition to the definition).

PAGE2HTM adds basic HTML formatting to the output.

***Some entries will need some manual fix ups, and the output won't include the UNIQUES or the forms that can be generated with ADDONS unless you do some additional work. For example, the dictionary and its input file have some hacks to conjugate common irregular words (especially the qui- words), and that will make the output a bit wonky.

/HOWTO.txt contains Whitaker's original documentation for DICTPAGE, including a list of steps to handle many, but not all, of these quirks. I

IMHO, you're better off using a PDF version of an out-of-copyright Lewis & Short edition than relying on DICTPAGE. The dictionary input files for Words rely heavily on the Words engine for conjugation and word formation; they're just not designed for manual reference like a paper dictionary. Despite its many flaws, L&S will still be more comprehensive and reliable than DICTPAGE's output.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

5 participants