speech-to-text

Here are 2,890 public repositories matching this topic...

ictnlp / StreamSpeech

StreamSpeech is an “All in One” seamless model for offline and simultaneous speech recognition, speech translation and speech synthesis.

Updated Jun 11, 2024
Python

occ-ai / obs-localvocal

Star

OBS plugin for local speech recognition and captioning using AI

plugin translation ai livestream live-streaming speech-recognition speech-to-text obs transcription obs-studio whisper realtime-translator obs-studio-plugin realtime-transcribe openai-whisper whisper-cpp real-time-transcription

Updated Jun 11, 2024
C++

OpenVoiceOS / status

Star

Open Voice OS Status Page

status text-to-speech translator monitoring alerting cuda sam nvidia tts uptime stats speech-to-text stt piper ovos upptime openvoiceos fasterwhisper mimic3

Updated Jun 11, 2024
Markdown

ErcinDedeoglu / WhisperDock

Star

Dockerized Whisper C++ speech-to-text API for easy deployment and rapid integration. Offering the latest stable and nightly builds for efficient audio transcription.

api docker machine-learning speech-to-text audio-transcription whisper-cpp

Updated Jun 11, 2024
C++

jianchang512 / pyvideotrans

Star

Translate the video from one language to another and add dubbing. 将视频从一种语言翻译为另一种语言，并添加配音

text-to-speech speech-to-text video-transition

Updated Jun 11, 2024
Python

Picovoice / web-voice-processor

Star

A library for real-time voice processing in web browsers

javascript real-time browser worker realtime voice-commands microphone speech-recognition webaudio-api pcm web-browser speech-to-text audio-processing wake-word-detection downsampling voice-processing

Updated Jun 10, 2024
TypeScript

Macoron / whisper.unity

Star

Running speech to text model (whisper.cpp) in Unity3d on your local machine.

unity3d speech-recognition openai speech-to-text stt whisper asr

Updated Jun 10, 2024
Metal

AssemblyAI / assemblyai-java-sdk

Star

The AssemblyAI Java SDK provides an easy-to-use interface for interacting with the AssemblyAI API, which supports async and real-time transcription, audio intelligence models, as well as the latest LeMUR models.

java ai speech-to-text transcription stt asr assemblyai llm

Updated Jun 10, 2024
Java

mezbaul-h / june

Star

A CLI app to interact with LLMs via text or audio using Hugging Face Transformers, with customizable models and generation parameters.

python text-to-speech tts cli-app speech-to-text command-line-tool huggingface large-language-models llm

Updated Jun 10, 2024
Python

frank038 / gspeechread

Star

A simple speech-to-text and text-to-speech program/frontend.

linux text-to-speech python3 gtk3 speech-recognition speech-to-text

Updated Jun 10, 2024
Python

davidmartinrius / speech-dataset-generator

Star

🔊 Create labeled datasets, enhance audio quality, identify speakers, support diverse dataset types. 🎧👥📊 Advanced audio processing.

text-to-speech audio-analysis speech-recognition speech-to-text dataset-generation audio-processing

Updated Jun 10, 2024
Python

ggerganov / whisper.cpp

Sponsor

Star

Port of OpenAI's Whisper model in C/C++

inference transformer speech-recognition openai speech-to-text whisper

Updated Jun 10, 2024
C

mkiol / dsnote

Star

Speech Note Linux app. Note taking, reading and translating with offline Speech to Text, Text to Speech and Machine translation.

text-to-speech translator translation offline machine-translation sailfishos tts speech-synthesis speech-recognition speech-to-text nmt linux-desktop stt asr flatpak-applications

Updated Jun 10, 2024
C++

WhiskerWeirdo / BanterBrain-Buddy

Star

BanterBrain Buddy is a Windows based Speech-To-Text to LLM to Text-To-Speech client-program for general entertainment or as a streaming companion.

text-to-speech ai speech-to-text twitch-bot gpt streamer-tool youtube-bot vtuber llm

Updated Jun 10, 2024
C#

leon-ai / leon

Star

🧠 Leon is your open-source personal assistant.

Updated Jun 10, 2024
Python

Xewdy444 / Playwright-reCAPTCHA

Star

A Python library for solving reCAPTCHA v2 and v3 with Playwright

library recaptcha solver asyncio speech-to-text playwright

Updated Jun 10, 2024
Python

ChetanXpro / nodejs-whisper

Sponsor

Star

Introducing NodeJS Bindings for Whisper - the CPU version of OpenAI's Whisper, as initially crafted in C++ by ggerganov.

ai cpp ml speech-recognition openai timestamp speech-to-text whisper whisper-nodejs nodejs-whisper

Updated Jun 10, 2024
TypeScript

gunarakulangunaretnam / real-time-language-translator

Star

A voice recognition-based tool for translating languages in real-time.

text-to-speech language-translation speech-recognition speech-to-text streamlit

Updated Jun 10, 2024
Python

k2-fsa / sherpa-onnx

Star

Speech-to-text, text-to-speech, and speaker recongition using next-gen Kaldi with onnxruntime without Internet connection. Support embedded systems, Android, iOS, Raspberry Pi, RISC-V, x86_64 servers, websocket server/client, C/C++, Python, Kotlin, C#, Go, NodeJS, Java, Swift

android windows macos linux raspberry-pi ios text-to-speech csharp cpp dotnet speech-to-text aarch64 mfc risc-v asr arm32 onnx vits openkylin

Updated Jun 10, 2024
C++

guibranco / talabat-hackathon-2022

Star

🏃 💡 Talabat Hackathon 2022 API project

api aws text-to-speech service hackathon aws-polly speech-to-text amazon-web-services polly talabat speech-to-txt txt-to-speech amazon-text-to-speech

Updated Jun 10, 2024
C#

Improve this page

Add a description, image, and links to the speech-to-text topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the speech-to-text topic, visit your repo's landing page and select "manage topics."

Learn more

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

speech-to-text

Here are 2,890 public repositories matching this topic...

ictnlp / StreamSpeech

occ-ai / obs-localvocal

OpenVoiceOS / status

ErcinDedeoglu / WhisperDock

jianchang512 / pyvideotrans

Picovoice / web-voice-processor

Macoron / whisper.unity

AssemblyAI / assemblyai-java-sdk

mezbaul-h / june

frank038 / gspeechread

davidmartinrius / speech-dataset-generator

ggerganov / whisper.cpp

mkiol / dsnote

WhiskerWeirdo / BanterBrain-Buddy

leon-ai / leon

Xewdy444 / Playwright-reCAPTCHA

ChetanXpro / nodejs-whisper

gunarakulangunaretnam / real-time-language-translator

k2-fsa / sherpa-onnx

guibranco / talabat-hackathon-2022

Improve this page

Add this topic to your repo