Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Grammar-checked Markov chain #15

Open
IFcoltransG opened this issue Nov 2, 2023 · 0 comments
Open

Grammar-checked Markov chain #15

IFcoltransG opened this issue Nov 2, 2023 · 0 comments

Comments

@IFcoltransG
Copy link

IFcoltransG commented Nov 2, 2023

All these new LLMs generate coherent text, but it would take all the fun out of this to use one. So I'm considering making a Markov chain from some (probably quite large, public domain) corpora, and then run a grammar-fixer over top of it of my own design. Ideally it would fix up verb tenses and so on.
Maybe I'll impose some additional structure if I get time, like changing the corpora in different parts, to change the tone. I might also experiment with some more advanced (but still very simple) machine learning, like word2vec.
The goal is for the overarching structures and rules that the sentences follow to be made by me, even if the words and sentences chosen are generated randomly.
I might also consider tagging words during Markov chain construction, by how far through their source text they appeared. That way I can generate words weighted by their distance to a certain part of a book, and hopefully get more introductions at the start, and more climactic finale sentences at the end.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant