Emacs Lisp corpus. Code collected from many-many projects for you to query it!
-
Updated
Oct 15, 2017 - Emacs Lisp
Emacs Lisp corpus. Code collected from many-many projects for you to query it!
Create a wiki corpus using a wiki dump file for Natural Language Processing
Korpus ręcznie sklasyfikowanych komentarzy do uczenia maszynowego (filtrowanie komentarzy obraźliwych)
Public Domain Words and Texts for Conlangs
Scripts de bots, web scrappings e web crawlers para pesquisa.
Spam-ham-Classification
Asturian language corpus for FreeLing
Discursos presidenciales de Latinoamérica en español
A corpus for the Zazaki and Gorani languages
Estonian TIMEX Annotated Corpora \ Eesti keele ajaväljendimärgendustega korpused
Tidy concordances, collocates, and wordlist
Collection of open source javascript projects
Repositório para disponibilização de bases de dados do Wikipedia e Simple Wikipedia pré-processadas, além de scripts de pré-processamento e geração de bases em Python.
Markov Model to detect the Parts of Speech Tagging.
This repository contains freely available and licensed code and annotated data in order to investigate and evaluate verbal processes in systemic functional linguistics (SFL) (initially with a focus on second language acquisition (SLA))
Code for final assignment for Corpus Studies course at the University of Antwerp (2022)
🌐 ANT Corpus website repository.
Data for HindiRC
Data pipeline for the coco-explorer app.
Add a description, image, and links to the corpus-data topic page so that developers can more easily learn about it.
To associate your repository with the corpus-data topic, visit your repo's landing page and select "manage topics."