speech-activity-detection

Lightweight speech-to-speech web-based chat app combining speech recognition, LLM completion and text-to-speech. Implemented with Python (Flask) and vanilla JavaScript only.

Updated Mar 3, 2024
Python

ina-foss / inaSpeechSegmenter

Star

CNN-based audio segmentation toolkit. Allows to detect speech, music, noise and speaker gender. Has been designed for large scale gender equality studies based on speech time per gender.

music speech audio-analysis noise gender-equality segmentation gender praat gender-classification male female voice-activity-detection music-detection mirex speech-activity-detection speech-segmentation speech-music speaker-gender speech-detection

Updated Mar 15, 2024
Python

pyannote / pyannote-audio

Star

Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding

pytorch pretrained-models speaker-recognition speaker-verification speech-processing speaker-diarization voice-activity-detection speech-activity-detection speaker-change-detection speaker-embedding overlapped-speech-detection

Updated May 13, 2024
Jupyter Notebook

ina-foss / InaGVAD

Star

Voice activity detection and speaker gender segmentation audiovisual corpus

radio benchmark corpus media tv gender audio-segmentation voice-activity-detection gender-prediction speech-dataset gender-bias speech-activity-detection speaker-gender speech-corpus audio-dataset audiovisual-dataset acoustic-diversity gender-representation

Updated May 14, 2024
Jupyter Notebook

Improve this page

Add a description, image, and links to the speech-activity-detection topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the speech-activity-detection topic, visit your repo's landing page and select "manage topics."

Learn more

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

speech-activity-detection

Here are 16 public repositories matching this topic...

aditya-joglekar / FS02_Scoring_Toolkit

sajR / V-SAD

jtkim-kaist / VAD

vimalmanohar / kaldi

HHousen / speaker-change-detection

bigcash / awesome-vad

AmirHoseein99 / Depression-Engine

RicherMans / Datadriven-GPVAD

RicherMans / GPV

dangvansam / pyannote-onnx

rafaelgreca / voxseg-pytorch

idiap / zff_vad

KF-R / turk-chat

ina-foss / inaSpeechSegmenter

pyannote / pyannote-audio

ina-foss / InaGVAD

Improve this page

Add this topic to your repo