Here are 103 public repositories matching this topic...
Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with punctuation, Streaming TTS with text frontend, Speaker Verification System, End-to-End Speech Translation and Keyword Spotting. Won NAACL2022 Best Demo Award.
Updated
May 23, 2024
Python
Convert speech to text using HuggingFace, comparing Wav2Vec2 versus OpenAI Whisper
Updated
May 22, 2024
Jupyter Notebook
A Python tool for transcribing speech from audio files using the Wav2Vec 2.0 model. Supports multilingual transcription, automatic audio chunking, and easy setup
Updated
May 15, 2024
Python
This repo shows how to fine-tune the wav2vec 2.0 model and covers its prerequisites.
Updated
May 12, 2024
Jupyter Notebook
Spoken NER implementation based on Wav2Vec2-XLS-R with experiments on transfer learning
Updated
May 7, 2024
Python
A fast Khmer Forced Aligner powered by Wav2Vec2CTC and Phonetisaurus
Updated
May 2, 2024
Python
Self-Supervised Speech Pre-training and Representation Learning Toolkit
Updated
Apr 28, 2024
Python
Speaker recognition using the wav2vec2 model.
Updated
Apr 25, 2024
Python
Material for my lecture on Automatic Speech Recognition
Updated
Apr 24, 2024
Jupyter Notebook
A simple Speech Emotion Recognition (SER) project based on Wav2Vec2.
Updated
Apr 22, 2024
Python
[ICASSP 2024] Emotion Neural Transducer for Fine-Grained Speech Emotion Recognition
Updated
Apr 11, 2024
Python
😺 Research on Automatic Speech Recognition for dysarthric speech
Updated
Apr 9, 2024
Jupyter Notebook
Speech Assessment API in NextJS
Updated
Mar 22, 2024
TypeScript
This app is intended to automatically create a corpus for ASR systems using pseudo-labeling.
Updated
Feb 15, 2024
Python
Updated
Feb 5, 2024
Jupyter Notebook
Accent Classification Dissertation Project
Updated
Feb 5, 2024
Jupyter Notebook
Live speech recognition using Facebook's wav2vec 2.0 model.
Updated
Feb 4, 2024
Python
BALanced Execution through Natural Activation: a human-computer interaction methodology for running code.
Updated
Jan 29, 2024
Python
Fine-tuning Multilingual Large Speech Recognition Models: Wav2vec and Whisper
Updated
Jan 23, 2024
Python
[ICASSP 2023] Mingling or Misalignment? Temporal Shift for Speech Emotion Recognition with Pre-trained Representations
Updated
Dec 18, 2023
Python