[FG 2024] "Audio-Visual Person Verification based on Recursive Fusion of Joint Cross-Attention"
-
Updated
May 14, 2024 - Python
[FG 2024] "Audio-Visual Person Verification based on Recursive Fusion of Joint Cross-Attention"
A PyTorch-based Speech Toolkit
A Repository for Single- and Multi-modal Speaker Verification, Speaker Recognition and Speaker Diarization
Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding
speechlib is a library that can do speaker diarization, transcription and speaker recognition on an audio file to create transcripts with actual speaker names
Research and Production Oriented Speaker Verification, Recognition and Diarization Toolkit
Offline speech recognition API for Android, iOS, Raspberry Pi and servers with Python, Java, C# and Node
kaldi-asr/kaldi is the official location of the Kaldi project.
The SpeechBrain project aims to build a novel speech toolkit fully based on PyTorch. With SpeechBrain users can easily create speech processing systems, ranging from speech recognition (both HMM/DNN and end-to-end), speaker recognition, speech enhancement, speech separation, multi-microphone speech processing, and many others.
DELTA is a deep learning based natural language and speech processing platform.
Unofficial reimplementation of ECAPA-TDNN for speaker recognition (EER=0.86 for Vox1_O when train only in Vox2)
UniSpeech - Large Scale Self-Supervised Learning for Speech
In defence of metric learning for speaker recognition
Official repository for RawNet, RawNet2, and RawNet3
Official implementation of the ICASSP 2024 paper: Emphasized Non-Target Speaker Knowledge in Knowledge Distillation for Speaker Verification
This repository contains audio samples and supplementary materials accompanying publications by the "Speaker, Voice and Language" team at Google.
A toolbox of audio models and algorithms based on MindSpore
Solution Code for Signal Processing Cup - 2024 by Team EigenSharks
Siamese Networks + general Encoder networks project
Deep learning for audio processing
Add a description, image, and links to the speaker-verification topic page so that developers can more easily learn about it.
To associate your repository with the speaker-verification topic, visit your repo's landing page and select "manage topics."