A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models. |语音识别工具包,包含丰富的性能优越的开源预训练模型,支持语音识别、语音端点检测、文本后处理等,具备服务部署能力。
-
Updated
May 15, 2024 - Python
A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models. |语音识别工具包,包含丰富的性能优越的开源预训练模型,支持语音识别、语音端点检测、文本后处理等,具备服务部署能力。
End-to-End Speech Processing Toolkit
A PyTorch-based Speech Toolkit
A Repository for Single- and Multi-modal Speaker Verification, Speaker Recognition and Speaker Diarization
Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding
speechlib is a library that can do speaker diarization, transcription and speaker recognition on an audio file to create transcripts with actual speaker names
Automatic Speech Recognition with Speaker Diarization based on OpenAI Whisper
Research and Production Oriented Speaker Verification, Recognition and Diarization Toolkit
turnkey self-hosted offline transcription and diarization service with llm summary
Simplified diarization pipeline using some pretrained models - audio file to diarized segments in a few lines of code
Speaker Diarization, Recognition and Language Identification. Scripts to generate GT using our WebApp and Praat software
On-device speaker diarization powered by deep learning
Multilingual Automatic Speech Recognition with word-level timestamps and confidence
Full-stack Transcription-UI: Features OpenAI Whisper and NVIDIA NeMo, with Docker for easy deployment.
Aims to create a comprehensive voice toolkit for training, testing, and deploying speaker verification systems.
Subtitle generation w/ Speaker Diarization using Whisper and pyannote.audio
WhisperX Slack bot for transcribing audio files
Pyannote/speaker-diarization-3.1 is an open-source toolkit written in Python for speaker diarization, which is the task of determining "who spoke when" in an audio recording. It is based on the PyTorch machine learning framework and provides a set of trainable end-to-end neural building blocks that can be combined and jointly optimized.
Speaker diarization service
Add a description, image, and links to the speaker-diarization topic page so that developers can more easily learn about it.
To associate your repository with the speaker-diarization topic, visit your repo's landing page and select "manage topics."