# vad

Here are 84 public repositories matching this topic...

smacke / ffsubsync

Automagically synchronize subtitles with video.

Updated Mar 18, 2024
Python

alibaba-damo-academy / FunASR

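A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models. | A speech recognition toolkit with a rich set of high-performing open-source pretrained models; it supports speech recognition, voice endpoint detection, text post-processing, and more, and can be deployed as a service.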

pytorch speech-recognition vad punctuation whisper audio-visual-speech-recognition speaker-diarization voice-activity-detection conformer pretrained-model rnnt dfsmn paraformer speechgpt speechllm

Updated May 9, 2024
Python

jtkim-kaist / VAD

Voice activity detection (VAD) toolkit including DNN, bDNN, LSTM and ACAM based VAD. We also provide our directly recorded dataset.

data speech dnn lstm speech-recognition attention vad voice-detection voice-activity-detection bdnn acam speech-activity-detection

Updated Jun 9, 2021
MATLAB

CheshireCC / faster-whisper-GUI

faster_whisper GUI with PySide6

openai vad whisper asr transcribe voice-transcription faster-whisper whisperx

Updated Apr 17, 2024
Python

amsehili / auditok

An audio/acoustic activity detection and audio segmentation tool

vad audio-data audio-activities audio-segmentation voice-detection voice-activity-detection

Updated Mar 30, 2023
Python

filippogiruzzi / voice_activity_detection

Voice Activity Detection based on Deep Learning & TensorFlow

python machine-learning deep-neural-networks deep-learning time-series tensorflow speech artificial-intelligence speech-recognition vad resnet deeplearning time-series-classification voice-activity-detection librispeech speech-detection librispeech-dataset mfcc-features

Updated Mar 24, 2023
Python

Baidu-AIP / speech-vad-demo

Integrates the WebRTC VAD to split audio files

webrtc speech vad webrtc-vad

Updated Aug 26, 2020
C

DmitryRyumin / ICASSP-2023-24-Papers

ICASSP 2023-2024 Papers: A complete collection of influential and exciting research papers from the ICASSP 2023-24 conferences. Explore the latest advancements in acoustics, speech and signal processing. Code included. Star the repository to support the advancement of audio and signal processing!

Updated May 9, 2024
Python

EtienneAb3d / WhisperHallu

Experimental code: sound file preprocessing to optimize Whisper transcriptions without hallucinated texts

text-to-speech sound-processing vad whisper audio-processing asr noise-removal vocals

Updated Feb 6, 2024
Python

gkonovalov / android-vad

Android Voice Activity Detection (VAD) library. Supports WebRTC VAD GMM, Silero VAD DNN, Yamnet VAD DNN models.

Updated Feb 12, 2024
C

shashikg / WhisperS2T

An Optimized Speech-to-Text Pipeline for the Whisper Model Supporting Multiple Inference Engines

deep-learning speech-recognition vad speech-to-text whisper asr tensorrt voice-activity-detection tensorrt-llm

Updated Apr 5, 2024
Jupyter Notebook

eesungkim / Voice_Activity_Detector

Statistical model-based Voice Activity Detection

vad voice-detection voice-activity-detection

Updated Nov 30, 2018
Jupyter Notebook

xiongyihui / python-webrtc-audio-processing

Python bindings of WebRTC Audio Processing

python vad ns agc webrtc-audio-processing

Updated Jan 22, 2019
C++

Picovoice / cobra

On-device voice activity detection (VAD) powered by deep learning

speech-recognition vad voice-activity-detection on-device voice-activity voice-activity-detector

Updated Apr 8, 2024
Python

voithru / voice-activity-detection

PyTorch implementation of Self-Attentive VAD (ICASSP 2021)

vad voice-activity-detection

Updated Oct 26, 2021
Python

0vercl0k / sic

Enumerate user mode shared memory mappings on Windows.

driver windows-10 windows-kernel vad shm shared-memory ntoskrnl prototype-pte

Updated Feb 14, 2021
C

fjchange / object_centric_VAD

A TensorFlow re-implementation of CVPR 2019 "Object-centric Auto-Encoders and Dummy Anomalies for Abnormal Event Detection in Video"

vad anomaly cvpr2019

Updated May 6, 2022
Python

xia-chu / webrtc_apm

An extraction of the APM-related code from WebRTC, including AEC/NS/AGC/VAD, plus MP3/AAC encoders and SoundTouch

webrtc mp3 aac jni vad ns agc soundtouch aec

Updated Jun 30, 2023
C

NickWilkinson37 / voxseg

A python library for voice activity detection (VAD) for speech/non-speech segmentation.

python python-library speech vad speech-processing voice-activity-detection speech-segmentation

Updated Sep 7, 2022
Python

spokestack / spokestack-android

Extensible Android mobile voice framework: wakeword, ASR, NLU, and TTS. Easily add voice to any Android app!

android text-to-speech nlu voice speech tts speech-synthesis voice-recognition speech-recognition vad asr voice-assistant natural-language-understanding voice-as-an-interface speech-api voice-activity-detection voice-synthesis wakeword wakeword-activation

Updated Oct 18, 2021
Java
