🔊 A comprehensive list of open-source datasets for voice and sound computing (95+ datasets).
Updated Mar 15, 2024
🗣️ A book and repo to get you started programming voice computing applications in Python (10 chapters and 200+ scripts).
🤖 An automated machine learning framework for audio, text, image, video, or .CSV files (50+ featurizers and 15+ model trainers). Python 3.6 required.
📁 This repo makes it easy to download the raw audio files from AudioSet (32.45 GB, 632 classes).
♂️♀️ Detect a person's gender from a voice file (90.7% +/- 1.3% accuracy).
🎵 A repository for manually annotating files to create labeled acoustic datasets for machine learning.
🦁 Nala is an agile open-source voice assistant framework (20+ actions).
🎤 A quick library to extract pause lengths from audio files.
📊 Easily apply audio-related machine learning models trained on the AudioSet dataset (527+ models/classes).
Landing page for MooersLab repository
Generate Voice In commands for a new writing project
Automated Expansions of English Contractions For Serenade
Slides for a talk at PyTexas 2024
Starter repository for learning how to make machine learning models with voice data.