Self-Supervised Speech Pre-training and Representation Learning Toolkit
-
Updated
Apr 28, 2024 - Python
Self-Supervised Speech Pre-training and Representation Learning Toolkit
speech to text with self-supervised learning based on wav2vec 2.0 framework
A live speech recognition using Facebooks wav2vec 2.0 model.
PyTorch implementation of "data2vec: A General Framework for Self-supervised Learning in Speech, Vision and Language" from Meta AI
Estimating the Age, Height, and Gender of a speaker with their speech signal. https://arxiv.org/pdf/2110.13653.pdf
Speech to Text with self-supervised learning based on wav2vec 2.0 framework using Hugging Face's Transformer
Wave2vec 2.0 Recognize pipeline
A library version of wav2vec 2.0 framework for Automatic Speech Recognition task.
Speeech Recognition for Indic languages.
Simple Python library, distributed via binary wheels with few direct dependencies, for easily using wav2vec 2.0 models for speech recognition
Wav2vec resources and models for Brazilian Portuguese
Training scripts for Speech-To-Text models for Ukrainian language
Fine-tuning wav2vec2 to for Pathological Speech Processing
A repo to make installation and training of a wav2vec model easier
Building a speaker identification & verification pipeline for Vietnamese voices 😪
Recognition of a medical diagnosis from speech.
Deep audio modeling
Add a description, image, and links to the wav2vec topic page so that developers can more easily learn about it.
To associate your repository with the wav2vec topic, visit your repo's landing page and select "manage topics."