🔊 A comprehensive list of open-source datasets for voice and sound computing (95+ datasets).
-
Updated
Mar 15, 2024
🔊 A comprehensive list of open-source datasets for voice and sound computing (95+ datasets).
🗣️ A book and repo to get you started programming voice computing applications in Python (10 chapters and 200+ scripts).
🤖 An automated machine learning framework for audio, text, image, video, or .CSV files (50+ featurizers and 15+ model trainers). Python 3.6 required.
♂️♀️ Detect a person's gender from a voice file (90.7% +/- 1.3% accuracy).
📁 This repo makes it easy to download the raw audio files from AudioSet (32.45 GB, 632 classes).
🦁 Nala is an agile open-source voice assistant framework (20+ actions).
📊 Easily apply audio-related machine learning models trained on the AudioSet dataset (527+ models/classes).
🎤 quick library to extract pause lengths from audio files.
🎵 A repository for manually annotating files to create labeled acoustic datasets for machine learning.
Starter repository for learning how to make machine learning models with voice data.
Generate Voice In commands for a new writing project
Automated Expansions of English Contractions For Serenade
Landing page for MooersLab repository
Slides to talk at PyTexas 2024
Add a description, image, and links to the voice-computing topic page so that developers can more easily learn about it.
To associate your repository with the voice-computing topic, visit your repo's landing page and select "manage topics."