- Extracted all word types and their frequencies from the SnapshotBROWN.pos.all.txt file.
- Sorted the word types in decreasing order of frequency and drew a chart of frequency against rank in the ordered list (Zipf's Law). Words were not stemmed, and punctuation was ignored.
- Generated a bigram grammar from the same file.
- Performed add-one smoothing.
- Showed the grammar before and after smoothing for the sentence "A similar resolution passed in the Senate".
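
The steps above can be sketched in Python roughly as follows. This is a minimal illustration under stated assumptions, not the project's actual code: the layout of SnapshotBROWN.pos.all.txt is not described here, so the sketch works on a small hypothetical `TOY_CORPUS` of plain sentences instead of parsing that file. The function names, the lowercasing step, the regex tokenizer, and the `<s>` / `</s>` sentence markers are all assumptions. Add-one smoothing is taken to be the usual estimate P(w2 | w1) = (C(w1 w2) + 1) / (C(w1) + V), where V is the number of types that can follow a context.

```python
import re
from collections import Counter

# Hypothetical toy corpus; a stand-in for text extracted from the Brown file.
TOY_CORPUS = [
    "A similar resolution passed in the Senate.",
    "The resolution passed in the House.",
    "A similar bill failed in the Senate.",
]


def tokenize(text):
    """Lowercase (an assumption) and keep alphabetic runs only.

    Punctuation and digits are dropped; no stemming is applied.
    """
    return re.findall(r"[a-z]+(?:'[a-z]+)*", text.lower())


def ranked_frequencies(tokens):
    """Return (rank, word, count) triples in decreasing order of count."""
    counts = Counter(tokens)
    # Ties are broken alphabetically so the ordering is deterministic.
    ordered = sorted(counts.items(), key=lambda kv: (-kv[1], kv[0]))
    return [(rank, w, c) for rank, (w, c) in enumerate(ordered, start=1)]


def bigram_counts(sentences):
    """Count contexts and bigrams per sentence, padded with <s> and </s>."""
    context, bigrams = Counter(), Counter()
    for sentence in sentences:
        toks = ["<s>"] + tokenize(sentence) + ["</s>"]
        for prev, cur in zip(toks, toks[1:]):
            context[prev] += 1
            bigrams[(prev, cur)] += 1
    return context, bigrams


def mle_prob(prev, cur, context, bigrams):
    """Unsmoothed estimate C(prev cur) / C(prev); 0.0 for an unseen context."""
    total = context[prev]
    return bigrams[(prev, cur)] / total if total else 0.0


def add_one_prob(prev, cur, context, bigrams, vocab_size):
    """Add-one estimate (C(prev cur) + 1) / (C(prev) + V)."""
    return (bigrams[(prev, cur)] + 1) / (context[prev] + vocab_size)


def sentence_table(sentence, context, bigrams, vocab_size):
    """List (prev, cur, unsmoothed, smoothed) for each bigram in a sentence."""
    toks = ["<s>"] + tokenize(sentence) + ["</s>"]
    return [
        (p, c, mle_prob(p, c, context, bigrams),
         add_one_prob(p, c, context, bigrams, vocab_size))
        for p, c in zip(toks, toks[1:])
    ]


if __name__ == "__main__":
    tokens = [t for s in TOY_CORPUS for t in tokenize(s)]
    for rank, word, count in ranked_frequencies(tokens)[:5]:
        print(rank, word, count)

    context, bigrams = bigram_counts(TOY_CORPUS)
    # V: every type seen in second position (all word types plus </s>).
    vocab_size = len({cur for _, cur in bigrams})
    query = "A similar resolution passed in the Senate"
    for prev, cur, mle, smoothed in sentence_table(query, context, bigrams, vocab_size):
        print(f"{prev:>12} {cur:<12} {mle:.3f} {smoothed:.3f}")
```

The `(rank, count)` pairs from `ranked_frequencies` are what a Zipf chart would plot, typically on log-log axes with a plotting library of choice. Smoothing moves probability mass from seen bigrams to unseen ones, so every bigram in the query sentence gets a nonzero estimate after smoothing.
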

Current Version: v1.0.0.3

Last Update: 04.07.2018 (Time: 05:45 P.M.)