The macOS built-in `say` CLI for JavaScript
-
Updated
May 14, 2024 - TypeScript
The macOS built-in `say` CLI for JavaScript
AI比赛经验帖子 & 训练和测试技巧帖子 集锦(收集整理各种人工智能比赛经验帖)
An open source NLP as a service project focused on providing state of the art systems with ease. Training and inference by simple docker commands
ModelScope: bring the notion of Model-as-a-Service to life.
🤗 The largest hub of ready-to-use datasets for ML models with fast, easy-to-use and efficient data manipulation tools
Optimized local inference for LLMs with HuggingFace-like APIs for quantization, vision/language models, multimodal agents, speech, vector DB, and RAG.
VITS-based Voice Conversion focused on simplicity, quality and performance.
This repository lists publicly available datasets for visual-audio, speech and audio, and biomedical signal related tasks.
Machine learning speaker characteristics
Pronounce and Speech Text - Enter Word and Get the Pronunciation and Speech Text.
Desktop application for Linux and Windows that utilizes distil-whisper models from HuggingFace, to enable real-time offline speech-to-text dictation.
Dockerfile for WhisperX: Automatic Speech Recognition with Word-Level Timestamps and Speaker Diarization (Dockerfile, CI image build and test)
Easy-to-use speech toolset. Written in TypeScript. Includes tools for synthesis, recognition, alignment, speech translation, language detection, source separation and more.
Audio Codec Speech processing Universal PERformance Benchmark
High-Fidelity Neural Phonetic Posteriorgrams
Add a description, image, and links to the speech topic page so that developers can more easily learn about it.
To associate your repository with the speech topic, visit your repo's landing page and select "manage topics."