#

asr

Here are 1,017 public repositories matching this topic...

elimu-ai / ml-asr.elimu.ai

Automated Speech Recognition

machine-learning ml asr

Updated Apr 15, 2022
Shell

BScUniversityCollaborations / automatic-speech-recognition

Created an ASR (Automatic Speech Recognition) system that takes in individual recordings. Each recording represents a sentence composed of 5-10 English language digits, separated by adequate pauses. The system involves segmenting the sentence using a classifier, differentiating between background and foreground sounds.

python classifier automatic-speech-recognition asr openslr mel-spectrogram recognition-algorithms

Updated Sep 12, 2023
Python

Think-A-Move / SPEAR-SDK-Python-Linux

SPEAR-ASR and SPEAR-WakeUp Software Development Kit in Python for Linux

Updated Nov 22, 2021
Python

lizunowa / project-asr-metrics

🧑🏻‍🎓 📑 October'20 - April'21. Group uni project. The project topic is Speech-to-Text Assessment Tool. It is a research-type project, most of the documentation is in a private GitLab repository.

asr asr-benchmark

Updated Jun 8, 2021
Jupyter Notebook

maximkm / DLA_ASR_HW

ASR pytorch project

transformers pytorch lm beam-search asr asr-model bpe

Updated Oct 16, 2022
Python

jevil25 / Lip-Read-ML-Model

This is a Machine Learning project. This model takes video of person face as input and predicts the word. It uses tensorflow and keras for training the model. It uses Sequential models for trainning and predicting. It used relu and softmax as activation functions

machine-learning tensorflow asr

Updated Aug 8, 2023
Jupyter Notebook

kingabzpro / hindiSpeechPro-Automatic-Speech-Recognization

The project,being part of Kagglex BIPOC Mentorship Program final project, aims to train two separate Hindi ASR models using the Facebook Wav2Vec2 (300M parameters) and OpenAI Whisper-Small models, respectively. The goal is to compare their performance, with a target WER of less than 13%, across various Hindi accents and dialects.

transformer speech-recognition whisper asr hindi-language wav2vec2

Updated Nov 18, 2023
Jupyter Notebook

marks038 / Test

Test Repo

test test1 calculators asr

Updated Feb 23, 2024

ZYancey / ASR-Jukebox

A Spotify Remote that operates using an ML Powered Automated Speech Recognization and Intent Detection Pipeline

spacy asr spotipy

Updated Nov 5, 2023
Python

alekseevskaia / audio_attack

asr adversarial-attacks

Updated Jan 29, 2024
Jupyter Notebook

Nexdata-AI / 557-Hours-Kazakh-Spontaneous-Speech-Data

557-Hours-Kazakh-Spontaneous-Speech-Data

speech-recognition speech-to-text asr kazakh spontaneous-speech-recognition

Updated Apr 18, 2024

Nexdata-AI / 444202-Korean-Pronunciation-Dictionary

Korean-Pronunciation-Dictionary

text lexicon speech-to-text pronunciation-dictionary asr

Updated Apr 18, 2024

Nexdata-AI / 212-Hours-Burmese-Spontaneous-Speech-Data

Burmese Spontaneous Speech Data

audio machine-translation speech-recognition asr voiceprint

Updated Apr 18, 2024

Nexdata-AI / 89-Hours-Indonesian-Conversational-Speech-Data-by-Telephone

Indonesian conversational speech data

audio speech-recognition asr conversational-ai call-center

Updated Apr 18, 2024

Nexdata-AI / 162-Hours-French-Children-Spontaneous-Speech-Data

French Child's Spontaneous Speech Data

audio machine-translation speech-recognition asr children-speech

Updated Apr 18, 2024

Nexdata-AI / 87166-Minnan-Dialect-Pronunciation-Dictionary

Dialect-Pronunciation-Dictionary

text lexicon speech-to-text pronunciation-dictionary asr

Updated Apr 19, 2024

Nexdata-AI / 194999-Uyghur-Pronunciation-Dictionary

194999-Uyghur-Pronunciation-Dictionary

speech-recognition pronunciation-dictionary asr uyghur

Updated Apr 19, 2024

Nexdata-AI / 1044-Hours-Minnan-Dialect-Speech-Data-by-Mobile-Phone

1044-Hours-Minnan-Dialect-Speech-Data-by-Mobile-Phone

speech-recognition speech-to-text minnan asr

Updated Apr 19, 2024

Nexdata-AI / Vietnamese-Spontaneous-Dialogue-Telephony-speech-dataset

Vietnamese-Spontaneous-Dialogue-Telephony-speech-dataset

speech-recognition speech-to-text asr spontaneous-speech-recognition

Updated Apr 19, 2024

vad-babushkin / docker-kaldi-gstreamer-server

Dockerfile for kaldi-gstreamer-server.

gstreamer kaldi asr

Updated Jul 14, 2017
Shell

Improve this page

Add a description, image, and links to the asr topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the asr topic, visit your repo's landing page and select "manage topics."