#

automatic-speech-recognition

Here are 288 public repositories matching this topic...

wenet-e2e / wenet

Production First and Production Ready End-to-End Speech Recognition Toolkit

pytorch transformer speech-recognition automatic-speech-recognition production-ready whisper asr conformer e2e-models

Updated May 24, 2024
Python

csikasote / BembaSpeech

This is an ASR corpus for Bemba language. It contains read speech from diverse publicly available Bemba sources; Literature Books, Radio/TV shows transcripts, Youtube Video transcripts, Online sources. The corpus has 14, 438 utterances culminating into over 24 hours of speech.

automatic-speech-recognition low-resource-languages bemba

Updated May 23, 2024

matiuste / DistriBlock

[UAI 2024 paper] DistriBlock: Identifying adversarial audio samples by leveraging characteristics of the output distribution.

machine-learning automatic-speech-recognition uncertainty-quantification adversarial-examples

Updated May 23, 2024
Python

winstxnhdw / CapGen

A fast CPU-first video/audio transcriber for generating caption files with Whisper and CTranslate2, hosted on Hugging Face Spaces.

docker caddy automatic-speech-recognition whisper asr fastapi uvicorn-gunicorn huggingface huggingface-spaces ctranslate2

Updated May 23, 2024
Python

ieasybooks / tafrigh

تفريغ المواد المرئية أو المسموعة إلى نصوص

python youtube subtitles srt vtt automatic-speech-recognition whisper audio-processing asr stable-whisper faster-whisper ctranslate2 whisper-jax

Updated May 20, 2024
Python

ahmetoner / whisper-asr-webservice

OpenAI Whisper ASR Webservice API

docker speech speech-recognition automatic-speech-recognition speech-to-text asr openai-whisper

Updated May 20, 2024
Python

leduckhai / MultiMed

Multilingual Multitask Multipurpose Medical Speech Recognition

machine-learning natural-language-processing deep-learning artificial-intelligence automatic-speech-recognition

Updated May 20, 2024
Python

TensorSpeech / TensorFlowASR

⚡ TensorFlowASR: Almost State-of-the-art Automatic Speech Recognition in Tensorflow 2. Supported languages that can use characters or subwords

tensorflow speech-recognition jasper automatic-speech-recognition speech-to-text ctc conformer deepspeech2 tflite rnn-transducer end2end tensorflow2 contextnet tflite-model tflite-convertion subword-speech-recognition streaming-transducer

Updated May 19, 2024
Python

NavodPeiris / speechlib

speechlib is a library that can do speaker diarization, transcription and speaker recognition on an audio file to create transcripts with actual speaker names

ai automatic-speech-recognition transcription speaker-recognition speaker-verification speaker-diarization whisper-ai faster-whisper

Updated May 19, 2024
Python

th-schmidt / whisply

Transcribe, translate, diarize, annotate and subtitle video (and audio) with Whisper ... fast!

subtitles speech-recognition automatic-speech-recognition speech-to-text whisper-ai

Updated May 21, 2024
Python

mydroidandi / commbase-stt-whisper-reactive-p

A reactive and remote-ready version of STT engine for Commbase

android python ssh raspberry-pi remote-control engine assistant speech-recognition recorder automatic-speech-recognition stt assistive-technology remote-access-tool secure-shell openai-whisper commbase

Updated May 19, 2024
Python

mydroidandi / commbase-stt-whisper-proactive-p

A proactive version of STT engine for Commbase

python engine speech-recognition automatic-speech-recognition speech-to-text stt asr commbase libcommbase commbase-stt-whisper-p commbase-stt-vosk-p

Updated May 18, 2024
Python

MooersLab / bash-whisper-transcription

Bash function to ease the transcription of audio files with OpenAI's whisper.

audio bash automation automatic-speech-recognition speech-to-text beginner-friendly stt whisper automate-the-boring-stuff asr bash-function audio-messages audio-file-trancription

Updated May 18, 2024
Python

chimechallenge / chime-utils

Scripts for data generation, scoring and data manifest preparation for CHiME-8 DASR task.

speech-recognition automatic-speech-recognition speech-processing speech-separation speech-enhancement far-field-speech-recognition diarization multi-speaker-asr meeting-transcription

Updated May 16, 2024
Python

EricApgar / live-speech-to-text

Live speech to text transcription.

raspberry-pi offline automatic-speech-recognition asr hugging-face

Updated May 14, 2024
Python

awesome-large-audio-models

EmulationAI / awesome-large-audio-models

Collection of resources on the applications of Large Language Models (LLMs) in Audio AI.

music-information-retrieval automatic-speech-recognition speech-to-text audio-processing music-ai music-processing large-language-models foundational-models speech-ai audio-ai large-audio-models speech-llms large-language-model-speech

Updated May 14, 2024

bricewalker / Hey-Jetson

Deep Learning based Automatic Speech Recognition with attention for the Nvidia Jetson.

Updated May 20, 2024
Jupyter Notebook

QubitPi / cmusphinx.github.io

CMUSphinx Website

jekyll documentation automatic-speech-recognition cmusphinx

Updated May 9, 2024
HTML

LD239 / WebTranscript

Interactive web tool for automatically ⚙️ transcribing and subtitling videos from URL or file uploads in your chosen language. The transcript appears alongside the video player, complete with embedded subtitles.

open-source web translation video-player video-annotation automatic-translation webvtt web-tool automatic-speech-recognition transcripts whisper web-tools transcript-editor automatic-transcription subtitles-generator webvtt-subtitles whisper-ai

Updated May 7, 2024
JavaScript

analyticsinmotion / werpy

🐍📦 Rapidly calculate and analyze the Word Error Rate (WER) with this powerful yet lightweight Python package.

python nlp metrics pandas levenshtein-distance automatic-speech-recognition speech-to-text stt asr python-package wer word-error-rate stt-benchmark asr-evaluation

Updated May 21, 2024
Python

Improve this page

Add a description, image, and links to the automatic-speech-recognition topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the automatic-speech-recognition topic, visit your repo's landing page and select "manage topics."