-
Updated
May 21, 2024 - Vue
speech-to-text
Here are 2,858 public repositories matching this topic...
Speech-to-text, text-to-speech, and speaker recongition using next-gen Kaldi with onnxruntime without Internet connection. Support embedded systems, Android, iOS, Raspberry Pi, RISC-V, x86_64 servers, websocket server/client, C/C++, Python, Kotlin, C#, Go, NodeJS, Java, Swift
-
Updated
May 21, 2024 - C++
Introducing NodeJS Bindings for Whisper - the CPU version of OpenAI's Whisper, as initially crafted in C++ by ggerganov.
-
Updated
May 21, 2024 - TypeScript
Open Voice OS Status Page
-
Updated
May 21, 2024 - Markdown
Dockerized Whisper C++ speech-to-text API for easy deployment and rapid integration. Offering the latest stable and nightly builds for efficient audio transcription.
-
Updated
May 21, 2024 - C++
A simple and private transcription tool able to segment speakers and convert audio to text.
-
Updated
May 20, 2024 - Python
A web UI Project In order to learn the large language model. This project includes features such as chat, quantization, fine-tuning, prompt engineering templates, and multimodality.
-
Updated
May 20, 2024 - Python
🚀 Seamlessly fine-tune Whisper model on a multi-lingual dataset and deploy to prod.
-
Updated
May 20, 2024 - Python
Faster Whisper transcription with CTranslate2
-
Updated
May 20, 2024 - Python
🏃 💡 Talabat Hackathon 2022 API project
-
Updated
May 20, 2024 - C#
Local webui for Large Language Models. Supports the GGUF format. Inference LLMs with support for STT/TTS and function calling.
-
Updated
May 20, 2024 - Python
Android web novel reader
-
Updated
May 21, 2024 - Kotlin
Port of OpenAI's Whisper model in C/C++
-
Updated
May 20, 2024 - C
Offline speech recognition API for Android, iOS, Raspberry Pi and servers with Python, Java, C# and Node
-
Updated
May 20, 2024 - Jupyter Notebook
Official JavaScript SDK for Deepgram's automated speech recognition APIs.
-
Updated
May 20, 2024 - TypeScript
A Python library for solving reCAPTCHA v2 and v3 with Playwright
-
Updated
May 20, 2024 - Python
Desktop application for Linux and Windows that utilizes distil-whisper models from HuggingFace, to enable real-time offline speech-to-text dictation.
-
Updated
May 20, 2024 - Python
CleanStream is an OBS plugin that uses AI to clean live audio streams from unwanted words and utterances
-
Updated
May 20, 2024 - C++
Achieve your goals and keep your data private with Lotti. This life tracking app is designed to help you stay motivated and on track, all while keeping your personal information safe and secure. Now with on-device speech recognition.
-
Updated
May 20, 2024 - Dart
Automatic Speech Recognition with Speaker Diarization based on OpenAI Whisper
-
Updated
May 20, 2024 - Jupyter Notebook
Improve this page
Add a description, image, and links to the speech-to-text topic page so that developers can more easily learn about it.
Add this topic to your repo
To associate your repository with the speech-to-text topic, visit your repo's landing page and select "manage topics."