This repository contains the resources our team used through the course of the CLEF competition.
-
Updated
May 27, 2022 - Jupyter Notebook
This repository contains the resources our team used through the course of the CLEF competition.
Download speech datasets (English and non-English) for Automatic Speech Recognition
top dataset for voice conversion models
Synthetic sounds datasets and real sounds datasets of waterflow sounds for the repo 'Neural-Texture-Sound-Synthesis-with-physically-driven-continuous-controls'.
[v.1.0] Lingualibre Languages Gallery in VueJS.
KATube is a tool to automate the process of creating datasets for training Text-To-Speech (TTS) and Speech-To-Text (STT) models. From a list of YouTube playlists or YouTube channels, KATube will generate dataset with audios and texts.
A powerful and easy-to-use web scrapper for collecting data from the web. Supports scraping of images, text, videos, meta data, and more. Ideal for machine learning and deep learning engineers. Download and extract data with just one line of code
Multi-Language Dataset Cleaner/Creator for Mozilla's DeepSpeech Framework
A library built for easier audio self-supervised training, downstream tasks evaluation
This package aims at simplifying the download of the AudioSet dataset.
open-source audio datasets
Python library for handling audio datasets.
A collection of datasets for the purpose of emotion recognition/detection in speech.
🔊 A comprehensive list of open-source datasets for voice and sound computing (95+ datasets).
Add a description, image, and links to the audio-datasets topic page so that developers can more easily learn about it.
To associate your repository with the audio-datasets topic, visit your repo's landing page and select "manage topics."