speech

Optimized local inference for LLMs with HuggingFace-like APIs for quantization, vision/language models, multimodal agents, speech, vector DB, and RAG.

speech multimodal rag edge-ai vector-database vision-transformer llm-inference

Updated May 14, 2024
Python

mishra-ankit / modi-speeches

Star

Dataset of Narendra Modi speeches released to encourage research and analysis

politics speech dataset india politicians modi

Updated May 14, 2024
JavaScript

OvidijusParsiunas / deep-chat

Sponsor

Star

Fully customizable AI chatbot component for your website

react chat files angular image ai component vue solid nextjs chatbot speech svelte openai cohere huggingface ai-chatbot react-chatbot chatgpt

Updated May 13, 2024
TypeScript

speechanddebate / tabroom

Star

Tabroom.com Legacy Perl/Mason Code

web speech forensics debate tabulation mocktrial

Updated May 13, 2024
JavaScript

sensein / senselab

Star

PipePal is a Python package that simplifies building pipelines for speech and voice analysis.

voice speech

Updated May 13, 2024
Python

IAHispano / Applio

Star

VITS-based Voice Conversion focused on simplicity, quality and performance.

text-to-speech ai voice speech pytorch rvc voice-conversion vc voice-cloning speech-to-speech vits voice-clone applio

Updated May 13, 2024
Python

MuSAELab / Multimodal-dataset-catalog

Star

This repository lists publicly available datasets for visual-audio, speech and audio, and biomedical signal related tasks.

speech healthcare dataset visual-audio deepfake biomedical-signal

Updated May 13, 2024

felixbur / nkululeko

Star

Machine learning speaker characteristics

machine-learning speech pytorch

Updated May 13, 2024
Python

weirongxu / auditory-reader

Star

📖 A Speech Reader, Support Epub, URL, Text.

speech epub reader utterance

Updated May 13, 2024
TypeScript

mskian / pronounce-and-speech

Star

Pronounce and Speech Text - Enter Word and Get the Pronunciation and Speech Text.

javascript css html text-to-speech rollup speech pronounce tailwindcss pronounciation

Updated May 13, 2024
JavaScript

Mohamad-Hussein / speech-assistant

Star

Desktop application for Linux and Windows that utilizes distil-whisper models from HuggingFace, to enable real-time offline speech-to-text dictation.

desktop-app translation offline speech speech-to-text transcription dictation whisper huggingface openai-whisper whisper-ai distil-whisper

Updated May 13, 2024
Python

jim60105 / docker-whisperX

Star

Dockerfile for WhisperX: Automatic Speech Recognition with Word-Level Timestamps and Speaker Diarization (Dockerfile, CI image build and test)

dockerfile docker-image speech speech-recognition speech-to-text whisper asr

Updated May 12, 2024
Dockerfile

echogarden-project / echogarden

Star

Easy-to-use speech toolset. Written in TypeScript. Includes tools for synthesis, recognition, alignment, speech translation, language detection, source separation and more.