Hugging Face Audio coursework
-
Updated
Sep 7, 2023 - Jupyter Notebook
Hugging Face Audio coursework
Created an ASR (Automatic Speech Recognition) system that takes in individual recordings. Each recording represents a sentence composed of 5-10 English language digits, separated by adequate pauses. The system involves segmenting the sentence using a classifier, differentiating between background and foreground sounds.
A compilation of libraries, case studies, resources, and research papers revolving around deep learning/machine learning for audio
[UAI 2024 paper] DistriBlock: Identifying adversarial audio samples by leveraging characteristics of the output distribution.
Whisper Transcription Service
ASR course past paper revision work for the University of Edinburgh
Speech Recording Tool
Gutural and scream automatic speech recognition (ASR) system using a fine-tuned version of OpenAI's Whisper model
Baidu TTS(Text-To-Speech), ASR(Automatic-Speech-Recognition) Demo for PC
Timestamped ASR microservice
CMUSphinx Website
Trained Transformer model for Speech Recognition
WER, MER, WIL of Whisper vs Vosk vs Google transcribators comparator
Different Task Guides for Audio Data
Speech Recognition with Neural Networks
🎯 🇧🇯 This dataset was created for speech research purposes and contains about 676 recordings of participants reading a script in Dendi as spoken in Parakou, one sentence at a time. Each example includes the audio files and the associated text. The audio is high-quality and recorded in a quiet environment. The dataset is multi-speaker, containing…
DSTA BrainHack Today-I-Learned AI 2023
Bangla Automatic Speech Recognition
Add a description, image, and links to the automatic-speech-recognition topic page so that developers can more easily learn about it.
To associate your repository with the automatic-speech-recognition topic, visit your repo's landing page and select "manage topics."