speaker-recognition

Here are 275 public repositories matching this topic...

NVIDIA / NeMo

A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)

machine-translation tts speech-synthesis neural-networks deeplearning speaker-recognition asr multimodal speech-translation large-language-models speaker-diariazation generative-ai

Updated May 14, 2024
Python

speechbrain / speechbrain

A PyTorch-based Speech Toolkit

Updated May 14, 2024
Python

pyannote / pyannote-audio

Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding

pytorch pretrained-models speaker-recognition speaker-verification speech-processing speaker-diarization voice-activity-detection speech-activity-detection speaker-change-detection speaker-embedding overlapped-speech-detection

Updated May 13, 2024
Jupyter Notebook

google / uis-rnn

This is the library for the Unbounded Interleaved-State Recurrent Neural Network (UIS-RNN) algorithm, corresponding to the paper Fully Supervised Speaker Diarization.

machine-learning clustering supervised-learning speaker-recognition speaker-diarization supervised-clustering uis-rnn

Updated Aug 28, 2023
Python

astorfi / 3D-convolutional-speaker-recognition

🔈 Deep Learning & 3D Convolutional Neural Networks for Speaker Verification

deep-learning convolutional-neural-networks speaker-recognition 3d

Updated Mar 3, 2020
Python

clovaai / voxceleb_trainer

In defence of metric learning for speaker recognition

metric-learning speaker-recognition speaker-verification voxceleb

Updated Mar 26, 2024
Python

mravanelli / SincNet

SincNet is a neural architecture for efficiently processing raw audio samples.

Updated Apr 28, 2021
Python

athena-team / athena

An open-source implementation of a sequence-to-sequence based speech processing engine

deployment tensorflow tts speech-synthesis transformer speech-recognition sequence-to-sequence unsupervised-learning speaker-recognition asr ctc wfst

Updated Dec 2, 2022
C++

IBM-Cloud / chatbot-watson-android

An Android ChatBot powered by Watson Services - Assistant, Speech-to-Text and Text-to-Speech on IBM Cloud.

android java ibm-watson-services conversation-service watson chatbot dialog speech intent workspace entity conversation android-studio speaker-recognition watson-services ibm-watson speaker-diarization ibm-cloud ibm-cloud-solutions

Updated Nov 17, 2021
Java

taylorlu / Speaker-Diarization

Speaker diarization with uis-rnn and speaker embedding with vgg-speaker-recognition

speaker-recognition speaker-diarization uis-rnn ghostvlad vgg-speaker-recognition

Updated Jul 1, 2021
Python

TaoRuijie / ECAPA-TDNN

Unofficial reimplementation of ECAPA-TDNN for speaker recognition (EER=0.86 on Vox1_O when trained only on Vox2)

speaker-recognition speaker-verification voxceleb1 voxceleb2 ecapa-tdnn

Updated Apr 11, 2024
Python

yeyupiaoling / VoiceprintRecognition-Pytorch

This project uses a variety of advanced voiceprint recognition models, such as EcapaTdnn, ResNetSE, ERes2Net, and CAM++. More models may be supported in the future. The project also supports the MelSpectrogram and Spectrogram data preprocessing methods.

pytorch voice-recognition speaker-recognition arcface ecapa-tdnn