Skip to content
#

corpora

Here are 154 public repositories matching this topic...

Collection of text corpora for publicly available speeches from Mexican president Andres Manuel Lopez Obrador (AMLO) sourced from YouTube. The dataset includes his daily morning conferences (conferencias mañaneras) 😴🪿

  • Updated Nov 1, 2023
  • Python

In this project we are tryinbg to create unredactor. Unredactor will take a redacted document and the redacted flag as input, inreturn it will give the most likely candidates to fill in redacted location. In this project we are only considered about unredacting names only. The data that we are considering is imdb data set with many review files.…

  • Updated Nov 10, 2021
  • Python

Improve this page

Add a description, image, and links to the corpora topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the corpora topic, visit your repo's landing page and select "manage topics."

Learn more