speech-recognition

DmitryRyumin / ICASSP-2023-24-Papers

ICASSP 2023-2024 Papers: A complete collection of influential and exciting research papers from the ICASSP 2023-24 conferences. Explore the latest advancements in acoustics, speech and signal processing. Code included. Star the repository to support the advancement of audio and signal processing!

Updated Jun 1, 2024
Python

Sritam-K-Behera / SER-WebApp

This project implements a Speech Emotion Recognition (SER) model using TensorFlow Lite, designed for deployment on microcontrollers such as the Arduino Nano 33 BLE. The model is trained on the RAVDESS dataset and recognizes seven emotions: Angry, Disgust, Fear, Happy, Neutral, Sad, and Surprise.

machine-learning speech-recognition streamlit

Updated Jun 1, 2024
Jupyter Notebook

deepgram / deepgram-python-sdk

Official Python SDK for Deepgram's automated speech recognition APIs.

python speech-recognition hacktoberfest asr deepgram automated-speech-recognition

Updated May 31, 2024
Python

deepgram / deepgram-go-sdk

Go SDK for Deepgram's automated speech recognition APIs.

go speech-recognition speech-to-text hacktoberfest deepgram

Updated May 31, 2024
Go

lobehub / lobe-tts

🎤 Lobe TTS - A high-quality & reliable TTS/STT library for Server and Browser

react nodejs text-to-speech edge tts speech-recognition speech-to-text stt bun auzre microsoft-speech-api opeanai lobehub

Updated Jun 1, 2024
TypeScript

Detilisi / Umbrella

A voice-operated emailing mobile application that allows you to compose and send email messages through voice commands.

text-to-speech automation sqlite-database mvvm entity-framework clean-architecture speech-recognition cqrs-pattern intent-recognition communitytoolkit maui-app

Updated May 31, 2024
C#

mkiol / dsnote

Speech Note Linux app. Note taking, reading and translating with offline Speech to Text, Text to Speech and Machine translation.

text-to-speech translator translation offline machine-translation sailfishos tts speech-synthesis speech-recognition speech-to-text nmt linux-desktop stt asr flatpak-applications

Updated May 31, 2024
C++

TranscribeJs / transcribe.js

Monorepo for Transcribe.js

javascript speech wasm speech-recognition speech-to-text whisper transcribe

Updated May 31, 2024
JavaScript

lhotse-speech / lhotse

Tools for handling speech data in machine learning projects.

audio python data machine-learning ai deep-learning speech pytorch speech-recognition kaldi

Updated May 31, 2024
Python

JSchmie / ScrAIbe-WebUI

WebUI for ScrAIbe

ai speech-recognition speech-to-text

Updated May 31, 2024
Python

botbahlul / crx-live-translate

Chrome/Edge browser extension that recognizes speech in any live audio/video stream, translates it for free (using the unofficial online Google Translate API), and displays the result as a live caption or subtitle.

javascript chrome edge voice-recognition speech-recognition browser-extension speech-to-text google-translate-api webkitspeechrecognition auto-caption auto-subtitle webkit-speech-recognition

Updated May 31, 2024
JavaScript

KevKibe / African-Whisper

🚀 Framework for seamless fine-tuning of Whisper model on a multi-lingual dataset and deployment to prod.

speech speech-recognition speech-to-text whisper asr speech-translation speech-transcription

Updated May 31, 2024
Python

botbahlul / js-live-audio-video-translate

HTML web template that recognizes speech in any live audio/video stream (using the Chrome webkitSpeechRecognition API), translates it for free (using the unofficial online Google Translate API), and displays the result as a live caption or subtitle.

javascript html web voice-recognition speech-recognition google-translate web-template google-translate-api webkitspeechrecognition auto-caption auto-subtitle webkit-speech-recognition

Updated May 31, 2024
JavaScript

Here are 4,632 public repositories matching this topic...

huggingface / transformers

openvinotoolkit / openvino

Adisol07 / SharpSpeech

octimot / StoryToolkitAI

wenet-e2e / wenet

compulim / web-speech-cognitive-services

thevickypedia / Jarvis

DmitryRyumin / ICASSP-2023-24-Papers
