asr
Here are 1,017 public repositories matching this topic...
An ASR (Automatic Speech Recognition) system that takes individual recordings as input. Each recording is a sentence of 5-10 spoken English digits separated by sufficiently long pauses. The system segments the sentence with a classifier that distinguishes foreground from background sound.
Updated Sep 12, 2023 - Python
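The segmentation step described above could be sketched roughly as follows. This is not the repository's code: its classifier is not shown here, so a plain frame-energy threshold stands in for it. The function name `segment_by_energy` and the parameters `frame_len`, `threshold`, and `min_gap_frames` are assumptions made for illustration.

```python
from typing import List, Sequence, Tuple


def segment_by_energy(
    samples: Sequence[float],
    frame_len: int = 160,
    threshold: float = 0.01,
    min_gap_frames: int = 3,
) -> List[Tuple[int, int]]:
    """Return (start, end) sample indices of foreground regions.

    An energy threshold is used here as a stand-in for a trained
    foreground/background classifier (an assumption, not the original method).
    """
    n_frames = len(samples) // frame_len

    # Label each frame as foreground (True) or background (False).
    labels = []
    for i in range(n_frames):
        frame = samples[i * frame_len:(i + 1) * frame_len]
        energy = sum(x * x for x in frame) / frame_len
        labels.append(energy > threshold)

    segments = []
    start = None   # index of the first frame of the open segment
    silence = 0    # consecutive background frames seen inside the open segment
    for i, is_foreground in enumerate(labels):
        if is_foreground:
            if start is None:
                start = i
            silence = 0
        elif start is not None:
            silence += 1
            # Close the segment only after a long enough pause.
            if silence >= min_gap_frames:
                end = i - silence + 1
                segments.append((start * frame_len, end * frame_len))
                start = None
                silence = 0

    # Close a segment that is still open at the end of the recording.
    if start is not None:
        end = n_frames - silence
        segments.append((start * frame_len, end * frame_len))
    return segments
```

Each returned pair could then be passed to a per-digit recognizer. Short pauses below `min_gap_frames` do not split a segment, which roughly matches the requirement that digits be separated by adequate pauses.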
SPEAR-ASR and SPEAR-WakeUp Software Development Kit in Python for Linux
Updated Nov 22, 2021 - Python
🧑🏻🎓 📑 October '20 - April '21. Group university project on a Speech-to-Text Assessment Tool. It is a research project; most of the documentation is in a private GitLab repository.
Updated Jun 8, 2021 - Jupyter Notebook
ASR PyTorch project
Updated Oct 16, 2022 - Python
This is a machine learning project. The model takes video of a person's face as input and predicts the word. It uses TensorFlow and Keras to train Sequential models for prediction, with ReLU and softmax as activation functions.
Updated Aug 8, 2023 - Jupyter Notebook
The project, the final project for the KaggleX BIPOC Mentorship Program, aims to train two separate Hindi ASR models using Facebook's Wav2Vec2 (300M parameters) and OpenAI's Whisper-Small, respectively. The goal is to compare their performance across various Hindi accents and dialects, with a target WER (word error rate) below 13%.
Updated Nov 18, 2023 - Jupyter Notebook
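For reference, the WER target mentioned above is conventionally computed as the word-level edit distance (substitutions, deletions, insertions) divided by the number of reference words. A minimal sketch, assuming simple whitespace tokenization; the function name `word_error_rate` is hypothetical and is not taken from the project:

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """Word-level Levenshtein distance divided by the reference length."""
    ref = reference.split()
    hyp = hypothesis.split()
    if not ref:
        raise ValueError("reference must contain at least one word")

    # prev[j] is the edit distance between the reference words processed
    # so far and the first j hypothesis words.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i] + [0] * len(hyp)
        for j, hyp_word in enumerate(hyp, start=1):
            cost = 0 if ref_word == hyp_word else 1
            curr[j] = min(
                prev[j] + 1,         # deletion
                curr[j - 1] + 1,     # insertion
                prev[j - 1] + cost,  # match or substitution
            )
        prev = curr
    return prev[-1] / len(ref)
```

Under this definition, a target of less than 13% corresponds to `word_error_rate(...) < 0.13` averaged over the evaluation set. Real evaluations usually normalize text (casing, punctuation, script variants) before scoring, which this sketch leaves out.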
Updated Jan 29, 2024 - Jupyter Notebook
557-Hours-Kazakh-Spontaneous-Speech-Data
Updated Apr 18, 2024
Korean-Pronunciation-Dictionary
Updated Apr 18, 2024
Burmese Spontaneous Speech Data
Updated Apr 18, 2024
Indonesian conversational speech data
Updated Apr 18, 2024
French Child's Spontaneous Speech Data
Updated Apr 18, 2024
Dialect-Pronunciation-Dictionary
Updated Apr 19, 2024
194999-Uyghur-Pronunciation-Dictionary
Updated Apr 19, 2024
1044-Hours-Minnan-Dialect-Speech-Data-by-Mobile-Phone
Updated Apr 19, 2024
Vietnamese-Spontaneous-Dialogue-Telephony-speech-dataset
Updated Apr 19, 2024