t-shoemaker/molecular_bigrams

Molecular Bigrams

These notebooks visualize words using letter bigram data. The thickness, or weight, of the edge connecting two nodes represents either the frequency of that pair of letters in a word (molecularBigrams) or the probability that the second letter of a pair follows the first (markovBigrams). At the moment, the frequency notebook registers only one instance of self-looping bigrams ("oo", "aa", etc.). Node layout for frequency values is handled automatically with the Fruchterman-Reingold algorithm; bigram probabilities are visualized with linear plots.
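As a rough illustration of the two edge weights described above, the following minimal sketch counts adjacent letter pairs and converts the counts into conditional probabilities. It is not the notebooks' own code: the function names `bigram_counts` and `transition_probabilities` are hypothetical, and the two-word list stands in for real corpus data.

```python
from collections import Counter, defaultdict


def bigram_counts(words):
    """Count adjacent letter pairs across a list of words (hypothetical helper)."""
    counts = Counter()
    for word in words:
        letters = word.lower()
        # zip pairs each letter with its successor: "moon" -> mo, oo, on
        counts.update(zip(letters, letters[1:]))
    return counts


def transition_probabilities(counts):
    """Convert pair counts into P(second letter | first letter)."""
    totals = defaultdict(int)
    for (first, _), n in counts.items():
        totals[first] += n
    return {pair: n / totals[pair[0]] for pair, n in counts.items()}


# Toy stand-in for a corpus; real bigram data would be far larger.
counts = bigram_counts(["letter", "moon"])
probs = transition_probabilities(counts)
```

Here `counts` plays the role of the frequency weights and `probs` the probability weights. Self-looping pairs such as ("o", "o") appear as ordinary keys whose two letters are the same. For the layout step, one option (an assumption, not necessarily what the notebooks use) is networkx's `spring_layout`, which implements the Fruchterman-Reingold force-directed algorithm.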

Data available for:

A fully interactive version of molecularBigrams is also available on my website.

Binder


To do:

  • Add bigram data for a different corpus
  • Handle self-loops ("oo," "aa," etc.) (all but the Python Jupyter Notebook)
  • Do a version with Markov sequences

About

Experiments with letter bigram frequency and probability
