Skip to content

N-Gram language model that learns n-gram probabilities from a given corpus and generates new sentences from it based on the conditional probabilities from the generated words and phrases.

License

Notifications You must be signed in to change notification settings

levindoneto/lanGen

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

27 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

lanGen | Natural language generation using an n-gram language model

N-Gram language model that learns n-gram probabilities from a given corpus and generates new sentences from it based on the conditional probabilities from the generated words and phrases.

Author: Levindo Gabriel Taschetto Neto.

How to deploy

$ pip install docopt
$ pip install nltk

Requirements

  • Operating system: Windows.
  • Python: 3.0 or superior.

How to use

$ python lmgen.py InputFile

InputFile is optional. If no input is given, this default corpus is used.

License

MIT License. Click here for more information about this license.

About

N-Gram language model that learns n-gram probabilities from a given corpus and generates new sentences from it based on the conditional probabilities from the generated words and phrases.

Topics

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages