#

wav2vec2

Here are 103 public repositories matching this topic...

PaddlePaddle / PaddleSpeech

Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with punctuation, Streaming TTS with text frontend, Speaker Verification System, End-to-End Speech Translation and Keyword Spotting. Won NAACL2022 Best Demo Award.

Updated May 23, 2024
Python

tracyreuter / NLP-speech-to-text

Convert speech to text using HuggingFace, comparing Wav2Vec2 versus OpenAI Whisper

nlp natural-language-processing sentiment-analysis speech-to-text punctuation huggingface wav2vec2 openai-whisper

Updated May 22, 2024
Jupyter Notebook

sebinbenjamin / wav2vec_demo

A Python tool for transcribing speech from audio files using the Wav2Vec 2.0 model. Supports multilingual transcription, automatic audio chunking, and easy setup

transformers pytorch speech-recognition hugging-face wav2vec2

Updated May 15, 2024
Python

Sarasadeghii / Sharif-Wav2vec2

This repo shows how to finetune the wav2vec2.0 model along with its prerequisites.

nlp speech-recognition speech-to-text language-model wer kenlm farsi-datasets wav2vec2 xlsr

Updated May 12, 2024
Jupyter Notebook

moncefbenaicha / SpokenNER

Spoken NER implementation based on Wav2Vec2-XLS-R with experiments on transfer learning

speech-recognition transfer-learning ner asr spoken-language-understanding wav2vec2 xlsr spoken-ner

Updated May 7, 2024
Python

seanghay / kfa

A fast Khmer Forced Aligner powered by Wav2Vec2CTC and Phonetisaurus

alignment cambodia khmer forced-alignment wav2vec2

Updated May 2, 2024
Python

s3prl

s3prl / s3prl

Self-Supervised Speech Pre-training and Representation Learning Toolkit

Updated Apr 28, 2024
Python

seb5433 / wav2vec2-speaker-recognition

Speaker recognition task using wav2vec2 model.

speaker-recognition fine-tuning speaker-recognition-systems wav2vec2

Updated Apr 25, 2024
Python

PeterGilles / Speech-Recognition-Lecture---Data-Science-in-Humanities

Material for my lecture on Automatic Speech Recognition

automatic-speech-recognition whisper asr luxembourgish wav2vec2

Updated Apr 24, 2024
Jupyter Notebook

JingleCate / SpeechEmotionRecog

A simple Speech Emotion Recognization (SER) project based on Wav2Vec2.

audio classification wav2vec2

Updated Apr 22, 2024
Python

ECNU-Cross-Innovation-Lab / ENT

[ICASSP 2024] Emotion Neural Transducer for Fine-Grained Speech Emotion Recognition

automatic-speech-recognition speech-emotion-recognition wav2vec2

Updated Apr 11, 2024
Python

jmaczan / asr-dysarthria

😺 Research on Automatic Speech Recognition for dysarthric speech

deep-learning automatic-speech-recognition asr self-supervised-learning dysarthric-speech wav2vec2 dysarthria

Updated Apr 9, 2024
Jupyter Notebook

aryanxxvii / lark

Speech Assessment API in NextJS

machine-learning nextjs pronunciation speech-recognition prisma huggingface phoneme-recognition wav2vec2 llm

Updated Mar 22, 2024
TypeScript

egorsmkv / asr-corpus-creator

This app is intended to automatically create a corpus for ASR systems using pseudo-labeling.

audio speech-recognition automatic-speech-recognition nemo whisper audio-processing asr wav2vec2

Updated Feb 15, 2024
Python

gulabpatel / Speech-to-Text

text-to-speech speech-to-text gtts deepspeech wav2vec2

Updated Feb 5, 2024
Jupyter Notebook

wizboz / dissertation

Accent Classification Dissertation Project

cnn-classification wav2vec2 masters-dissertation-project

Updated Feb 5, 2024
Jupyter Notebook

oliverguhr / wav2vec2-live

A live speech recognition using Facebooks wav2vec 2.0 model.

pyaudio speech speech-recognition speech-to-text asr wav2vec wav2vec2

Updated Feb 4, 2024
Python

balena

louisbrulenaudet / balena

BALanced Execution through Natural Activation : a human-computer interaction methodology for code running.

terminal transformers python3 speech-recognition execution speech-to-text sentence-similarity speech-to-function sentence-transformers wav2vec2

Updated Jan 29, 2024
Python

aitor-alvarez / large-speech-models

Fine-tuning Multilingual Large Speech Recognition Models: Wav2vec and Whisper

whisper asr asr-model speech-recognition-model wav2vec2 arabic-speech-recognition large-speech-models finetuning-wav2vec finetuning-whisper

Updated Jan 23, 2024
Python

ECNU-Cross-Innovation-Lab / ShiftSER

[ICASSP 2023] Mingling or Misalignment? Temporal Shift for Speech Emotion Recognition with Pre-trained Representations

speech-emotion-recognition hubert wav2vec2

Updated Dec 18, 2023
Python

Improve this page

Add a description, image, and links to the wav2vec2 topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the wav2vec2 topic, visit your repo's landing page and select "manage topics."