Self-Supervised Speech Pre-training and Representation Learning Toolkit
Updated Apr 28, 2024 - Python
Live speech recognition using Facebook's wav2vec 2.0 model.
Fine-tuning wav2vec2 for Pathological Speech Processing
This repository contains scripts to prune Wav2vec2 using a neuroevolution-based method. More details about this method can be found in the paper Compressing Wav2vec2 for Embedded Applications.
Training scripts for Speech-To-Text models for the Ukrainian language
PyTorch implementation of "data2vec: A General Framework for Self-supervised Learning in Speech, Vision and Language" from Meta AI
Recognition of a medical diagnosis from speech.
Wav2vec resources and models for Brazilian Portuguese
Deep audio modeling
Speech to text with self-supervised learning based on the wav2vec 2.0 framework
Building a speaker identification & verification pipeline for Vietnamese voices 😪
Simple Python library, distributed via binary wheels with few direct dependencies, for easily using wav2vec 2.0 models for speech recognition
A library version of the wav2vec 2.0 framework for the Automatic Speech Recognition task.
Speech to Text with self-supervised learning based on the wav2vec 2.0 framework using Hugging Face's Transformers
Estimating the Age, Height, and Gender of a speaker from their speech signal. https://arxiv.org/pdf/2110.13653.pdf
Speech Recognition for Indic languages.