Treebanks modified from PROIEL and Perseus.
Updated Jun 1, 2018
A module to quickly create Corpus objects containing TTR, tokenized sentences, lexical density, class frequencies and more.
A tool for determining distances between multimodal annotations.
Paper that Lena Baunaz and I are working on as part of my SNSF-funded 'Focus in diachrony' research project at the University of Cambridge, UK.
All scripts needed to exploit the French corpus and create the associated database for the CODIM Project.
Heuristics and cognitive biases in public discourse on climate change: linguistic data analysis
Open Corpus Workbench with TEITOK Docker compose file
Corpus linguistics final project for the course COMM 313: Computational Text Analysis at the University of Pennsylvania. Aims to determine how the anti-vaccination movement has evolved on social media before and during the COVID-19 pandemic.
Easy Text Annotator
(Module in ongoing development) Retrieves the parsed content of Wikipedia articles. Created to gather text corpus data quickly and easily, but can be freely used for other purposes too.
Supplementary materials for: Are online news comments like face-to-face conversation? A multi-dimensional analysis of an emerging register (version 1.0)
Code for the thesis "A Corpus-Based Case Analysis on Syntactic Complexity in Russian ESL Learners’ Writing".
Annotator combining different NLP pipelines.
Repository of the dataset and R code for the study of HAPPINESS metaphors in Classical Malay and Indonesian (published in Review of Cognitive Linguistics)
Workbench for corpus tools accessing the Sydney Speaks corpus
This project uses data science to analyze Federal Reserve policy statements and seeks insight into their sentiment
An assortment of word-lists and micro dictionaries in English. Especially suited to English language learning tasks.