Official JavaScript SDK for Deepgram's automated speech recognition APIs.
-
Updated
May 15, 2024 - TypeScript
Official JavaScript SDK for Deepgram's automated speech recognition APIs.
🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
Port of OpenAI's Whisper model in C/C++
OpenVINO™ is an open-source toolkit for optimizing and deploying AI inference
Audio components for geniusrise framework
SpeechKit Web UI Example
Demo using PhoWhisper models of VinAI built with Transformers.js + Next.js
Python integration with the Verbio Speech Center Cloud. https://speechcenter.verbio.com/
A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.
Freeswitch ASR module to working with wisper_cpp
Google cloud speech-to-text service for Freeswitch
Open-source, accurate and easy-to-use video speech recognition & clipping tool, LLM based AI clipping intergrated.
Eden AI: simplify the use and deployment of AI technologies by providing a unique API that connects to the best possible AI engines
Speech-to-text server framework with next-gen Kaldi
An open source NLP as a service project focused on providing state of the art systems with ease. Training and inference by simple docker commands
This project implement end to end realtime speech recognition with PhoWhisper in Backend and frontend in React Native
Music Genre Classification/Speech Recognition/Lyrics Anlaysis...
A Python tool for transcribing speech from audio files using the Wav2Vec 2.0 model. Supports multilingual transcription, automatic audio chunking, and easy setup
这是一个全自动(音频)视频翻译项目。利用Whisper识别声音,AI大模型翻译字幕,最后合并字幕视频,生成翻译后的视频。
Easy-to-use speech toolset. Written in TypeScript. Includes tools for synthesis, recognition, alignment, speech translation, language detection, source separation and more.
Add a description, image, and links to the speech-recognition topic page so that developers can more easily learn about it.
To associate your repository with the speech-recognition topic, visit your repo's landing page and select "manage topics."