Bislama language resources
-
Updated
Mar 5, 2015
Bislama language resources
Turkish English Parallel Corpus Generator
St. Petersburg corpus of hagiographic texts
Data from a corpus of written Hawaiian
NLTK libraries and Machine Learning Algorithms in use.
A collection of small corpuses of interesting data, from dariusk/corpora
Corpora designed for different NLP tasks
A growing corpus of fortune cookies (for NLP and fun). Add your fortunes!
Command-line corpus tools
A pos-tagging library with Viterbi, CYK and SVO -> XSV translator made as part of my final exam for the Cognitive System course in Department of Computer Science.
Named Entity Recognition data for Biblioteca Virtual Miguel de Cervantes
Dataset containing Semantic Relations and Metadata, for Training and Evaluating Distributional Semantic Models in English and Mandarin Chinese
Benchmarking various tools for counting word and phrase frequency in corpora [for windows]
Compared writing styles of two authors with different personalities and designation using nltk
Add a description, image, and links to the corpora topic page so that developers can more easily learn about it.
To associate your repository with the corpora topic, visit your repo's landing page and select "manage topics."