speech-recognition

A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.

pytorch speech-recognition vad punctuation whisper audio-visual-speech-recognition speaker-diarization voice-activity-detection conformer pretrained-model rnnt dfsmn paraformer speechgpt speechllm

Updated May 15, 2024
Python

akscf / mod_whisper_asr

Star

Freeswitch ASR module to working with wisper_cpp

speech-recognition freeswitch speech-to-text whisper-cpp

Updated May 15, 2024
C

akscf / mod_google_asr

Star

Google cloud speech-to-text service for Freeswitch

speech-recognition freeswitch speech-to-text

Updated May 15, 2024
C

alibaba-damo-academy / FunClip

Star

Open-source, accurate and easy-to-use video speech recognition & clipping tool, LLM based AI clipping intergrated.

speech-recognition speech-to-text gradio video-clip subtitles-generator video-subtitles llm gradio-python-llm

Updated May 15, 2024
Python

edenai / edenai-apis

Star

Eden AI: simplify the use and deployment of AI technologies by providing a unique API that connects to the best possible AI engines

python nlp api natural-language-processing text-to-speech ocr ai computer-vision aggregator machine-translation image-processing speech-recognition speech-to-text optical-character-recognition ai-as-a-service video-recognition pre-trained-model document-parsing

Updated May 15, 2024
Python

k2-fsa / sherpa

Star

Speech-to-text server framework with next-gen Kaldi

python cpp websocket pytorch speech-recognition transducer asr ctc end-to-end-asr

Updated May 15, 2024
C++

flozi00 / atra

Star

An open source NLP as a service project focused on providing state of the art systems with ease. Training and inference by simple docker commands

chatbot speech transformers inference speech-recognition asr llm stable-diffusion

Updated May 15, 2024
Jupyter Notebook

sonhm3029 / Realtime-ASR-React-Native-and-Whisper

Star

This project implement end to end realtime speech recognition with PhoWhisper in Backend and frontend in React Native

react-native realtime speech-recognition speech-to-text whisper asr realtime-speech-recognition phowhiper

Updated May 15, 2024
JavaScript

Uriiol1808 / Harmon-AI

Star

Music Genre Classification/Speech Recognition/Lyrics Anlaysis...

data-science natural-language-processing speech-recognition music-genre-classification lyrics-analysis

Updated May 15, 2024
Jupyter Notebook

sebinbenjamin / wav2vec_demo

Star

A Python tool for transcribing speech from audio files using the Wav2Vec 2.0 model. Supports multilingual transcription, automatic audio chunking, and easy setup

transformers pytorch speech-recognition hugging-face wav2vec2

Updated May 15, 2024
Python

Chenyme / Chenyme-AAVT

Star

这是一个全自动（音频）视频翻译项目。利用Whisper识别声音，AI大模型翻译字幕，最后合并字幕视频，生成翻译后的视频。

speech-recognition whisper video-translation gpt-4 faster-whisper

Updated May 15, 2024
Python

echogarden-project / echogarden

Star

Easy-to-use speech toolset. Written in TypeScript. Includes tools for synthesis, recognition, alignment, speech translation, language detection, source separation and more.

text-to-speech speech language-detection speech-synthesis speech-recognition speech-to-text source-separation language-identification forced-alignment speech-translation speech-alignment

Updated May 15, 2024
TypeScript

Improve this page

Add a description, image, and links to the speech-recognition topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the speech-recognition topic, visit your repo's landing page and select "manage topics."

Learn more

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

speech-recognition

Here are 4,601 public repositories matching this topic...

deepgram / deepgram-js-sdk

huggingface / transformers

ggerganov / whisper.cpp

openvinotoolkit / openvino

geniusrise / geniusrise-audio

yandex-cloud-examples / yc-speechkit-web-ui

huuquyet / PhoWhisper-next

verbio-technologies / python-verbio-speech-center

alibaba-damo-academy / FunASR

akscf / mod_whisper_asr

akscf / mod_google_asr

alibaba-damo-academy / FunClip

edenai / edenai-apis

k2-fsa / sherpa

flozi00 / atra

sonhm3029 / Realtime-ASR-React-Native-and-Whisper

Uriiol1808 / Harmon-AI

sebinbenjamin / wav2vec_demo

Chenyme / Chenyme-AAVT

echogarden-project / echogarden

Improve this page

Add this topic to your repo