DavidK0/English-Phoneme-ngrams

English-Phoneme-ngrams

This repository lets you build a phoneme-level n-gram model of English. To use it, run build_PDF.py, then pass its output as the input to process_PFD.py. The first script downloads two text files, combines them, and saves the result. The second script builds an n-gram model from that combined file.

The two text files are a pronunciation dictionary and a frequency dictionary. Combining them gives a pronunciation-frequency dictionary (PFD), which maps each word to its phonemes and how often it occurs. From the PFD we can build an n-gram model and sample new words from it. Here is some example output:

wʌtɚmʌðɚ, sɛlʌm, paʊntɚnmʌnd, brɪkjʌŋ, sʌbʌt, dɪskul

And here is how I would probably spell those words:

Whatermother, Selum, Pounternmund, Brikyung, Suhbut, Diskul
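
The idea described above can be sketched roughly as follows. This is not the code from build_PDF.py or process_PFD.py, only a minimal illustration under stated assumptions: the toy dictionaries, the function names (`build_pfd`, `build_ngram_model`, `sample_word`), and the `<s>`/`</s>` boundary markers are all hypothetical, and each IPA character is treated as one phoneme, which would split diphthongs such as aʊ.

```python
import random
from collections import Counter, defaultdict

# Hypothetical toy data standing in for the two downloaded dictionaries.
pronunciations = {"water": "wɔtɚ", "mother": "mʌðɚ", "what": "wʌt", "but": "bʌt"}
frequencies = {"water": 120, "mother": 95, "what": 400, "but": 650, "the": 1000}


def build_pfd(pron, freq):
    """Keep words found in both dictionaries as word -> (phonemes, count).

    Assumption: one IPA character equals one phoneme.
    """
    return {w: (tuple(pron[w]), freq[w]) for w in pron if w in freq}


def build_ngram_model(pfd, n=2):
    """For each (n-1)-phoneme context, count which symbol follows it.

    Counts are weighted by word frequency, so common words shape the model more.
    """
    model = defaultdict(Counter)
    for phonemes, count in pfd.values():
        # Pad with start markers and one end marker so word edges are modeled too.
        padded = ("<s>",) * (n - 1) + phonemes + ("</s>",)
        for i in range(len(padded) - n + 1):
            context = padded[i:i + n - 1]
            following = padded[i + n - 1]
            model[context][following] += count
    return model


def sample_word(model, n=2, rng=random, max_len=20):
    """Generate one pseudo-word by sampling phonemes until the end marker."""
    context = ("<s>",) * (n - 1)
    out = []
    while len(out) < max_len:
        symbols, weights = zip(*model[context].items())
        following = rng.choices(symbols, weights=weights)[0]
        if following == "</s>":
            break
        out.append(following)
        # Slide the context window forward by one phoneme.
        context = (context + (following,))[1:]
    return "".join(out)


pfd = build_pfd(pronunciations, frequencies)
model = build_ngram_model(pfd, n=2)
print(sample_word(model, n=2))
```

With only four toy words the samples stay close to the training data. A larger PFD and a higher `n` should give output more like the examples above.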
