Code for the paper: Audio to Score Matching by Combining Phonetic and Duration Information
Updated Jul 9, 2017 - Python
A voice user interface that recognizes the user's voice via the Sphinx library and executes commands. The system responds with a computer-generated voice and sound clips. There is also a server for storing and reacting to the data, and a client for connecting to the system.
A Python script that converts an Indonesian word to a phoneme sequence, used to generate a lexicon for training an Indonesian Automatic Speech Recognition system.
A simple Automatic Speech Recognition (ASR) model in TensorFlow that lets you focus on the deep neural network itself. It is easy to test popular cells (mostly LSTM and its variants) and models (unidirectional RNN, bidirectional RNN, ResNet, and so on). You are also welcome to experiment with self-defined cells or models.
Notebook notes on Automatic Speech Recognition
Some approaches based on deep learning to build the acoustic model for an end-to-end automatic speech recognition (ASR) pipeline.
A set of scripts to create acoustic models based on the OPS Aphasia database
Development of an acoustic model simulator
This is now the official location of the Kaldi project.
An algorithm that analyses the acoustic features of a voice and creates an acoustic classifier; useful for auto-speech-rater
A sub-repository, still under construction, for building an acoustic model for Mandarin speech recognition.
PyTorch implementation of the Factorized TDNN (TDNN-F) from "Semi-Orthogonal Low-Rank Matrix Factorization for Deep Neural Networks" and Kaldi
A Bash script designed to make training sphinx4 and pocketsphinx acoustic libraries faster and easier
Sequential adaptive elastic net (SAEN) approach, complex-valued LARS solver for weighted Lasso/elastic-net problems, and sparsity (or model) order detection with an application to single-snapshot source localization.
Some papers about automatic speech recognition
Acoustic event detection using the YAMNet model. The model is deployed using TensorFlow Serving in a Docker container, with a Flask API
PyTorch implementation of automatic speech recognition models.
A text-to-speech framework for Mandarin speech synthesis, including Chinese front-end processing, an acoustic model, a vocoder, and other tools.
Code for "Ok Google, What Am I Doing? Acoustic Activity Recognition Bounded by Conversational Assistant Interactions"