Automagically synchronize subtitles with video.
-
Updated
Mar 18, 2024 - Python
Automagically synchronize subtitles with video.
A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models. |语音识别工具包,包含丰富的性能优越的开源预训练模型,支持语音识别、语音端点检测、文本后处理等,具备服务部署能力。
Voice activity detection (VAD) toolkit including DNN, bDNN, LSTM and ACAM based VAD. We also provide our directly recorded dataset.
faster_whisper GUI with PySide6
An audio/acoustic activity detection and audio segmentation tool
Voice Activity Detection based on Deep Learning & TensorFlow
ICASSP 2023-2024 Papers: A complete collection of influential and exciting research papers from the ICASSP 2023-24 conferences. Explore the latest advancements in acoustics, speech and signal processing. Code included. Star the repository to support the advancement of audio and signal processing!
Experimental code: sound file preprocessing to optimize Whisper transcriptions without hallucinated texts
Android Voice Activity Detection (VAD) library. Supports WebRTC VAD GMM, Silero VAD DNN, Yamnet VAD DNN models.
An Optimized Speech-to-Text Pipeline for the Whisper Model Supporting Multiple Inference Engine
A statistical model-based Voice Activity Detection
Python bindings of WebRTC Audio Processing
On-device voice activity detection (VAD) powered by deep learning
Pytorch implementation of SELF-ATTENTIVE VAD, ICASSP 2021
Enumerate user mode shared memory mappings on Windows.
A python library for voice activity detection (VAD) for speech/non-speech segmentation.
Extensible Android mobile voice framework: wakeword, ASR, NLU, and TTS. Easily add voice to any Android app!
Add a description, image, and links to the vad topic page so that developers can more easily learn about it.
To associate your repository with the vad topic, visit your repo's landing page and select "manage topics."