# speaker-diarization

Here are 103 public repositories matching this topic...

NavodPeiris / speechlib

speechlib is a library that performs speaker diarization, transcription, and speaker recognition on an audio file to produce transcripts labeled with actual speaker names

ai automatic-speech-recognition transcription speaker-recognition speaker-verification speaker-diarization whisper-ai faster-whisper

Updated Jun 3, 2024
Python

modelscope / FunASR

A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.

pytorch speech-recognition vad punctuation whisper audio-visual-speech-recognition speaker-diarization voice-activity-detection conformer pretrained-model rnnt dfsmn paraformer speechgpt speechllm

Updated Jun 3, 2024
Python

transcriptionstream / transcriptionstream

Turnkey self-hosted offline transcription and diarization service with LLM summary

automation speech-recognition transcription whisper speaker-diarization diarization llm whisperx ollama mistral-7b

Updated Jun 2, 2024
Python

speechbrain / speechbrain

A PyTorch-based Speech Toolkit

Updated Jun 3, 2024
Python

juanmc2005 / diart

A Python package to build AI-powered real-time audio applications

real-time deep-learning transcription speaker-diarization streaming-audio voice-activity-detection speaker-embedding

Updated Jun 1, 2024
Python

katagaki / FiresideSubtitles

Video transcription, speaker diarization, and face detection in Python.

audio python opencv video dnn openai face-detection transcription speaker-diarization openai-whisper

Updated May 31, 2024
Python

pyannote / pyannote-audio

Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding

pytorch pretrained-models speaker-recognition speaker-verification speech-processing speaker-diarization voice-activity-detection speech-activity-detection speaker-change-detection speaker-embedding overlapped-speech-detection

Updated May 31, 2024
Jupyter Notebook

espnet / espnet

End-to-End Speech Processing Toolkit

deep-learning chainer end-to-end machine-translation pytorch speech-synthesis speech-recognition kaldi voice-conversion speaker-diarization speech-separation speech-enhancement spoken-language-understanding speech-translation singing-voice-synthesis

Updated May 30, 2024
Python

modelscope / 3D-Speaker

A Repository for Single- and Multi-modal Speaker Verification, Speaker Recognition and Speaker Diarization

speaker-verification speaker-diarization language-identification voxceleb modelscope campplus eres2net 3d-speaker rdino cnceleb

Updated May 30, 2024
Python

MahmoudAshraf97 / whisper-diarization

Automatic Speech Recognition with Speaker Diarization based on OpenAI Whisper

speech speech-recognition speech-to-text whisper asr speaker-diarization

Updated May 25, 2024
Jupyter Notebook

Wenhao-Yang / SpeakerVerifiaction-pytorch

Speaker Verification using PyTorch

python pytorch kaldi speaker-verification speaker-diarization

Updated May 23, 2024
Jupyter Notebook

wenet-e2e / wespeaker

Research and Production Oriented Speaker Verification, Recognition and Diarization Toolkit

Updated May 23, 2024
Python

luisst / SpeakerLID_GT_code

Speaker Diarization, Recognition and Language Identification. Scripts to generate ground truth (GT) using our WebApp and Praat software

speaker-recognition speaker-diarization

Updated May 22, 2024
Python

linto-ai / linto-diarization

Speaker diarization service

asr speaker-diarization speaker-identification linto

Updated May 24, 2024
Python

yinruiqing / pyannote-whisper

whisper asr speaker-diarization meeting-summarization pyannote chatgpt

Updated May 11, 2024
Python

cvqluu / simple_diarizer

Simplified diarization pipeline using some pretrained models - audio file to diarized segments in a few lines of code

speech-to-text transcription asr speaker-diarization colab-notebook diarization

Updated May 2, 2024
Python

Picovoice / falcon

On-device speaker diarization powered by deep learning

speaker-diarization

Updated Apr 29, 2024
Python

linto-ai / whisper-timestamped

Multilingual Automatic Speech Recognition with word-level timestamps and confidence

Updated Apr 22, 2024
Python

Joost385 / transcription-ui

Full-stack Transcription-UI: Features OpenAI Whisper and NVIDIA NeMo, with Docker for easy deployment.

transcription speaker-diarization

Updated Apr 19, 2024
TypeScript

nuaazs / VAF_2

Aims to create a comprehensive voice toolkit for training, testing, and deploying speaker verification systems.

microservices speech-recognition speaker-recognition antifraud speaker-diarization

Updated Apr 16, 2024
Python
