annotation generator for diarization task
Updated Sep 10, 2022 - Jupyter Notebook
Machine learning applied to soundscape audio.
Automatically set up the MSDWild dataset for use with pyannote-database (and pyannote-audio)
Our group's submission to the first DIHARD speaker diarization challenge held as a special session in INTERSPEECH '18.
Comparison of unsupervised speaker clustering algorithms
This research project aims to control the movements of characters in a maze game using real-time voice commands such as "Up", "Down", "Left", or "Right".
Pyannote/speaker-diarization-3.1 is an open-source toolkit written in Python for speaker diarization, which is the task of determining "who spoke when" in an audio recording. It is based on the PyTorch machine learning framework and provides a set of trainable end-to-end neural building blocks that can be combined and jointly optimized.
Speaker Diarization, Recognition and Language Identification. Scripts to generate ground truth (GT) using our WebApp and Praat software
pyannote.audio benchmark for NVIDIA GPUs
Semi-supervised Speaker Diarization with Gaussian Mixture Models
Speaker diarization simulation built with Python
Speech toolkit for audio analysis, diarization and transcription
Speaker Diarisation implemented in Python with the help of IBM Cloud's Watson, which provides a free speech-to-text API
A course project for DA 623: Computing with Signals. We investigate the use of Non-negative Matrix Factorization for speaker diarization and source separation.
Speaker Diarization using Python, Flask and HTML
An easy way to make accurate audio transcripts with the Whisper model and speaker diarization
Full-stack Transcription-UI: Features OpenAI Whisper and NVIDIA NeMo, with Docker for easy deployment.
WhisperX Slack bot for transcribing audio files
Subtitle generation with Speaker Diarization using Whisper and pyannote.audio