An advanced singing voice synthesis system with high fidelity, expressiveness, controllability and flexibility based on DiffSinger: Singing Voice Synthesis via Shallow Diffusion Mechanism
Updated May 12, 2024 - Python
A Python implementation of observer-based audibility modelling methods
Command line utility for forced alignment using Kaldi
[AAAI 2024] CTX-txt2vec, the acoustic model in UniCATS
Official code for "Learning Neural Acoustic Fields" (NeurIPS 2022)
Cellular automata based simulator for acoustic wave propagation with random obstacles.
SC-CNN: Effective Speaker Conditioning Method for Zero-Shot Multi-Speaker Text-to-Speech Systems
Automatic Speech Recognition (ASR), Speaker Verification, Speech Synthesis, Text-to-Speech (TTS), Language Modelling, Singing Voice Synthesis (SVS), Voice Conversion (VC)
Vector Quantized PPGs based Voice conversion
Foreign Accent Conversion by Synthesizing Speech from Phonetic Posteriorgrams (Interspeech'19)
Repository of an implementation of the matrix method for acoustic levitation simulations.
A Python library for measuring the acoustic features of speech (simultaneous speech, high entropy) and comparing them to those of native speech.
Code for: "Leveraging Sound and Wrist Motion to Detect Activities of Daily Living with Commodity Smartwatches"
A voice driven 3D chess game for learning Voice AI
Design and analysis of a military helmet with embedded bone conduction earphones that transfer commands directly to the soldier. Unlike a conventional earphone, it does not cover the ear canal, so it preserves the spatial awareness that can be critical in a warzone.
A BCNN prediction pipeline to discover mosquito sounds from audio.
Automated, end-to-end wakeword model maker using the Precise Wakeword Engine
🎵 A repository for manually annotating files to create labeled acoustic datasets for machine learning.
Acoustic mosquito detection code with Bayesian Neural Networks
My-Voice Analysis is a Python library for analyzing voice (simultaneous speech, high entropy) without the need for a transcription. It splits utterances and detects syllable boundaries, fundamental frequency contours, and formants.