Scoring Toolkit for the Fearless Steps Challenge Phase-02 Tasks
-
Updated
Feb 16, 2021 - Python
Scoring Toolkit for the Fearless Steps Challenge Phase-02 Tasks
Voice activity detection (VAD) toolkit including DNN, bDNN, LSTM and ACAM based VAD. We also provide our directly recorded dataset.
Fork of the official kaldi.
Speaker change detection using SincNet and an LSTM/Transformer
A curated list of awesome voice activity detection
Detecting depressed Patient based on Speech Activity, Pauses in Speech and Using Deep learning Approach
The codebase for Data-driven general-purpose voice activity detection.
Repository for our Interspeech2020 general-purpose voice activity detection (GPVAD) paper
PyAnnote Voice Activity Detection (ONNX version)
The Voxseg implementation in PyTorch. Voxseg is a python library for voice activity detection (VAD) for speech/non-speech segmentation.
Unsupervised Voice Activity Detection by Modeling Source and System Information using Zero Frequency Filtering
Lightweight speech-to-speech web-based chat app combining speech recognition, LLM completion and text-to-speech. Implemented with Python (Flask) and vanilla JavaScript only.
CNN-based audio segmentation toolkit. Allows to detect speech, music, noise and speaker gender. Has been designed for large scale gender equality studies based on speech time per gender.
Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding
Voice activity detection and speaker gender segmentation audiovisual corpus
Add a description, image, and links to the speech-activity-detection topic page so that developers can more easily learn about it.
To associate your repository with the speech-activity-detection topic, visit your repo's landing page and select "manage topics."