speech-recognition

Here are 4,651 public repositories matching this topic...

dictation-toolbox / dragonfly

Speech recognition framework allowing powerful Python-based scripting and extension of Dragon NaturallySpeaking (DNS), Windows Speech Recognition (WSR), Kaldi and CMU Pocket Sphinx

python speech-recognition

Updated Jun 11, 2024
Python

Detilisi / Umbrella

Star

A voice-operated emailing mobile application that allows you to compose and send email messages through voice commands.

text-to-speech automation sqlite-database mvvm entity-framework clean-architecture speech-recognition cqrs-pattern intent-recognition communitytoolkit maui-app

Updated Jun 11, 2024
C#

huggingface / transformers

Star

🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.

Updated Jun 11, 2024
Python

modelscope / FunASR

Star

A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.

pytorch speech-recognition vad punctuation whisper audio-visual-speech-recognition speaker-diarization voice-activity-detection conformer pretrained-model rnnt dfsmn paraformer speechgpt speechllm

Updated Jun 11, 2024
Python

modelscope / FunClip

Star

Open-source, accurate and easy-to-use video speech recognition & clipping tool, LLM based AI clipping intergrated.

speech-recognition speech-to-text gradio video-clip subtitles-generator video-subtitles llm gradio-python-llm

Updated Jun 11, 2024
Python

openvinotoolkit / openvino

Star

OpenVINO™ is an open-source toolkit for optimizing and deploying AI inference

nlp natural-language-processing ai computer-vision deep-learning transformers inference speech-recognition yolo recommendation-system performance-boost good-first-issue openvino diffusion-models stable-diffusion generative-ai llm-inference optimize-ai deploy-ai

Updated Jun 11, 2024
C++

TheSoftDiamond / Kazushin

Star

Customizable TTS Chat Bot using OpenAI & Google Cloud TTS/ElevenLabs

python text-to-speech twitch ai chatbot tts speech-recognition openai speech-to-text gpt googlecloud gemini-api twitchio elevenlabs

Updated Jun 11, 2024
Python

omarx11 / chatin-v2

Sponsor

Star

Talk to Rawan voice-to-voice using speech recognition or text-to-speech, with elevenlabs technology and chatgpt on the web.

bot website text-to-speech ai nextjs chatbot speech-recognition tailwindcss speach-to-text vercel supabase chatgpt elevenlabs

Updated Jun 11, 2024
JavaScript

Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with punctuation, Streaming TTS with text frontend, Speaker Verification System, End-to-End Speech Translation and Keyword Spotting. Won NAACL2022 Best Demo Award.

Updated Jun 11, 2024
Python

DmitryRyumin / ICASSP-2023-24-Papers

Star

ICASSP 2023-2024 Papers: A complete collection of influential and exciting research papers from the ICASSP 2023-24 conferences. Explore the latest advancements in acoustics, speech and signal processing. Code included. Star the repository to support the advancement of audio and signal processing!

Updated Jun 11, 2024
Python

ictnlp / StreamSpeech

Star

StreamSpeech is an “All in One” seamless model for offline and simultaneous speech recognition, speech translation and speech synthesis.

Updated Jun 11, 2024
Python

occ-ai / obs-localvocal

Star

OBS plugin for local speech recognition and captioning using AI

plugin translation ai livestream live-streaming speech-recognition speech-to-text obs transcription obs-studio whisper realtime-translator obs-studio-plugin realtime-transcribe openai-whisper whisper-cpp real-time-transcription

Updated Jun 11, 2024
C++

deepgram / deepgram-python-sdk

Star

Official Python SDK for Deepgram's automated speech recognition APIs.

python speech-recognition hacktoberfest asr deepgram automated-speech-recognition

Updated Jun 11, 2024
Python

Umbaji / NMT-Melinda--Dataset

Sponsor

Star

Official repository for the Opensource Textdataset for NMT for local langues in West Africa (EWE Corpus)

data ai speech-recognition nmt

Updated Jun 10, 2024

Picovoice / web-voice-processor

Star

A library for real-time voice processing in web browsers

javascript real-time browser worker realtime voice-commands microphone speech-recognition webaudio-api pcm web-browser speech-to-text audio-processing wake-word-detection downsampling voice-processing

Updated Jun 10, 2024
TypeScript

Macoron / whisper.unity

Star

Running speech to text model (whisper.cpp) in Unity3d on your local machine.

unity3d speech-recognition openai speech-to-text stt whisper asr

Updated Jun 10, 2024
Metal

frank038 / gspeechread

Star

A simple speech-to-text and text-to-speech program/frontend.

linux text-to-speech python3 gtk3 speech-recognition speech-to-text

Updated Jun 10, 2024
Python

LeonardoSPereira / ExpertNotes

Star

Aplicação com o objetivo de permitir ao usuário de salvar notas, seja por áudio ou texto / Application aimed at allowing the user to save notes, either through audio or text.

react typescript speech-recognition tailwindcss vite radix-ui sonner