(Ongoing module in development) Getting Wikipedia articles parsed content. Created for getting text corpuses data fast and easy. But can be freely used for other purpuses too
-
Updated
Jan 3, 2023 - Python
(Ongoing module in development) Getting Wikipedia articles parsed content. Created for getting text corpuses data fast and easy. But can be freely used for other purpuses too
Utilities for Processing the bAbi Tasks Corpus
The AP Exam Corpus Project is a Python application that generates corpora for AP exams.
Digital Literacy for Philologists (NRU HSE, 2018)
Utilities for Processing the Saarbrücken Corpus of Spoken English
It can help you to convert srt file into CN-? parallel corpus
Python API for extracting data from the MPQA corpus
Tool to generate lists of Bengali words and transcriptions matching given phonological descriptions
branches of https://victorio.uit.no/langtech/trunk/tools/CorpusTools used by Giellatekno.UiT.no for corpus gathering.
Corpus analysis of plain text and providing Type-Token Ratio as well as some other statistics.
Vietnamese corpus search tools and statistical analysis
This repository contains freely available and licensed code and annotated data in order to investigate and evaluate verbal processes in systemic functional linguistics (SFL) (initially with a focus on second language acquisition (SLA))
Cod yr ap Paldaruo i iOS ar gyfer torfoli casglu corpws lleferydd | Code for the Paldaruo speech corpus crowdsourcing ap for iOS
Utilities for Processing the Dialogue State Tracking Challenge 3 Corpus
Flotsam is a moderation tool to supplement Jetsam, for IRC logs stored in the Driftwood format. It identifies flagged content and aggregates a per-user metric. Written in Rust.
A fast, small, and portable Windows application for searching large text corpora, with regex and right-to-left support.
Workbench for corpus tools accessing the Sydney Speaks corpus
A very simple concordancer with XML support.
Recon is a Java-based tool for the annotation of relations among textual elements and semantic concepts.
Companion website for "Corpus Approaches to Language in Social Media" - source and build versions
Add a description, image, and links to the corpus-tools topic page so that developers can more easily learn about it.
To associate your repository with the corpus-tools topic, visit your repo's landing page and select "manage topics."