speech-to-text

Here are 2,858 public repositories matching this topic...

crucials / spoken-words-counter

audio python ai vue speech-to-text whisper python-eel

Updated May 21, 2024
Vue

k2-fsa / sherpa-onnx

Speech-to-text, text-to-speech, and speaker recognition using next-gen Kaldi with onnxruntime without Internet connection. Supports embedded systems, Android, iOS, Raspberry Pi, RISC-V, x86_64 servers, websocket server/client, C/C++, Python, Kotlin, C#, Go, NodeJS, Java, Swift

android windows macos linux raspberry-pi ios text-to-speech csharp cpp dotnet speech-to-text aarch64 mfc risc-v asr arm32 onnx vits openkylin

Updated May 21, 2024
C++

ChetanXpro / nodejs-whisper

Introducing NodeJS Bindings for Whisper - the CPU version of OpenAI's Whisper, as initially crafted in C++ by ggerganov.

ai cpp ml speech-recognition openai timestamp speech-to-text whisper whisper-nodejs nodejs-whisper

Updated May 21, 2024
TypeScript

OpenVoiceOS / status

Open Voice OS Status Page

status text-to-speech translator monitoring alerting cuda sam nvidia tts uptime stats speech-to-text stt piper ovos upptime openvoiceos fasterwhisper mimic3

Updated May 21, 2024
Markdown

ErcinDedeoglu / WhisperDock

Dockerized Whisper C++ speech-to-text API for easy deployment and rapid integration. Offering the latest stable and nightly builds for efficient audio transcription.

api docker machine-learning speech-to-text audio-transcription whisper-cpp

Updated May 21, 2024
C++

tim-roethig-db / amondin

A simple and private transcription tool able to segment speakers and convert audio to text.

open-source local language-detection private speech-to-text transcription speaker-segmentation speaker-diariazation speaker-detection

Updated May 20, 2024
Python

smalltong02 / keras-llm-robot

A web UI project for learning about large language models. This project includes features such as chat, quantization, fine-tuning, prompt engineering templates, and multimodality.

text-to-speech chatbot gemini knowledgebase speech-to-text vectorization multimodal faiss rag milvus streamlit llm code-interpreter chatgpt pgvector fastchat

Updated May 20, 2024
Python

KevKibe / African-Whisper

🚀 Seamlessly fine-tune Whisper model on a multi-lingual dataset and deploy to prod.

speech speech-recognition speech-to-text whisper asr speech-translation speech-transcription

Updated May 20, 2024
Python

SYSTRAN / faster-whisper

Faster Whisper transcription with CTranslate2

deep-learning inference transformer speech-recognition openai speech-to-text quantization whisper

Updated May 20, 2024
Python

guibranco / talabat-hackathon-2022

🏃 💡 Talabat Hackathon 2022 API project

api aws text-to-speech service hackathon aws-polly speech-to-text amazon-web-services polly talabat speech-to-txt txt-to-speech amazon-text-to-speech

Updated May 20, 2024
C#

3eeps / llmon-py

Local web UI for large language models. Supports the GGUF format. Runs LLM inference with support for STT/TTS and function calling.

text-to-speech gui local chatbot web-ui tts image-recognition webui speech-to-text stt llm moondream llm-inference function-calling gguf sdxl-turbo

Updated May 20, 2024
Python

nanihadesuka / NovelDokusha

Android web novel reader

android kotlin translator webnovel reader light-novel light-novels novel speech-to-text android-app epub-reader reading-app jetpack-compose light-novel-reader

Updated May 21, 2024
Kotlin

ggerganov / whisper.cpp

Port of OpenAI's Whisper model in C/C++

inference transformer speech-recognition openai speech-to-text whisper

Updated May 20, 2024
C

alphacep / vosk-api

Offline speech recognition API for Android, iOS, Raspberry Pi and servers with Python, Java, C# and Node

Updated May 20, 2024
Jupyter Notebook

deepgram / deepgram-js-sdk

Official JavaScript SDK for Deepgram's automated speech recognition APIs.

javascript typescript ai speech-recognition speech-to-text hacktoberfest asr deepgram automated-speech-recognition

Updated May 20, 2024
TypeScript

Xewdy444 / Playwright-reCAPTCHA

A Python library for solving reCAPTCHA v2 and v3 with Playwright

library recaptcha solver asyncio speech-to-text playwright

Updated May 20, 2024
Python

Mohamad-Hussein / speech-assistant

Desktop application for Linux and Windows that uses distil-whisper models from HuggingFace to enable real-time offline speech-to-text dictation.

desktop-app translation offline speech speech-to-text transcription dictation whisper huggingface openai-whisper whisper-ai distil-whisper

Updated May 20, 2024
Python

occ-ai / obs-cleanstream

CleanStream is an OBS plugin that uses AI to remove unwanted words and utterances from live audio streams

plugin ai speech-to-text obs transcription whisper

Updated May 20, 2024
C++

matthiasn / lotti

Achieve your goals and keep your data private with Lotti. This life tracking app is designed to help you stay motivated and on track, all while keeping your personal information safe and secure. Now with on-device speech recognition.

windows macos ios journal health speech-recognition time-tracker speech-to-text android-app flutter linux-app fitness-app

Updated May 20, 2024
Dart

MahmoudAshraf97 / whisper-diarization

Automatic Speech Recognition with Speaker Diarization based on OpenAI Whisper

speech speech-recognition speech-to-text whisper asr speaker-diarization

Updated May 20, 2024
Jupyter Notebook
