Production First and Production Ready End-to-End Speech Recognition Toolkit
-
Updated
May 24, 2024 - Python
Production First and Production Ready End-to-End Speech Recognition Toolkit
This is an ASR corpus for Bemba language. It contains read speech from diverse publicly available Bemba sources; Literature Books, Radio/TV shows transcripts, Youtube Video transcripts, Online sources. The corpus has 14, 438 utterances culminating into over 24 hours of speech.
[UAI 2024 paper] DistriBlock: Identifying adversarial audio samples by leveraging characteristics of the output distribution.
A fast CPU-first video/audio transcriber for generating caption files with Whisper and CTranslate2, hosted on Hugging Face Spaces.
تفريغ المواد المرئية أو المسموعة إلى نصوص
OpenAI Whisper ASR Webservice API
Multilingual Multitask Multipurpose Medical Speech Recognition
⚡ TensorFlowASR: Almost State-of-the-art Automatic Speech Recognition in Tensorflow 2. Supported languages that can use characters or subwords
speechlib is a library that can do speaker diarization, transcription and speaker recognition on an audio file to create transcripts with actual speaker names
Transcribe, translate, diarize, annotate and subtitle video (and audio) with Whisper ... fast!
A reactive and remote-ready version of STT engine for Commbase
A proactive version of STT engine for Commbase
Bash function to ease the transcription of audio files with OpenAI's whisper.
Scripts for data generation, scoring and data manifest preparation for CHiME-8 DASR task.
Live speech to text transcription.
Collection of resources on the applications of Large Language Models (LLMs) in Audio AI.
Deep Learning based Automatic Speech Recognition with attention for the Nvidia Jetson.
CMUSphinx Website
Interactive web tool for automatically ⚙️ transcribing and subtitling videos from URL or file uploads in your chosen language. The transcript appears alongside the video player, complete with embedded subtitles.
🐍📦 Rapidly calculate and analyze the Word Error Rate (WER) with this powerful yet lightweight Python package.
Add a description, image, and links to the automatic-speech-recognition topic page so that developers can more easily learn about it.
To associate your repository with the automatic-speech-recognition topic, visit your repo's landing page and select "manage topics."