Fine-tuning XLS-R for Multi-Lingual ASR with 🤗 Transformers
-
Updated
Jul 12, 2022 - Jupyter Notebook
Fine-tuning XLS-R for Multi-Lingual ASR with 🤗 Transformers
AI model for speech disorder detection
Fine-tuning Multilingual Large Speech Recognition Models: Wav2vec and Whisper
This repository demonstrates development of Hindi ASR model using transformers.
Intent and Entity Extraction and Classification from audio files
SER and audio classification using both a Wav2Vec2 based model and an ASR->Bert pipeline, as well as utilizing a multimodal late-fusion model
A simple Speech Emotion Recognization (SER) project based on Wav2Vec2.
A natural language processing and machine learning project for a low resource langauge in Zambia.
Application to search for similar sound effects by voice and title.
Speaker recognition task using wav2vec2 model.
Speech Assessment API in NextJS
This repository contains code/papers/research on Speech or Audio Classification
This repository contains the implementation of our published paper titled 'Improving Automatic Speech Recognition with Dialect-Specific Language Models,' presented at SPECOM'23.
Pytorch implementation of INTEGRATED PARAMETER-EFFICIENT TUNING FOR GENERAL-PURPOSE AUDIO MODELS
Python Colab for speech recognition with wav2vec2. Since wav2vec2 requires heavy GPU I've come up with a way to run this on Google Colab as well as local machines with minimum GPU.
Spoken NER implementation based on Wav2Vec2-XLS-R with experiments on transfer learning
Review of Speech to text voice denoisers
Add a description, image, and links to the wav2vec2 topic page so that developers can more easily learn about it.
To associate your repository with the wav2vec2 topic, visit your repo's landing page and select "manage topics."