一个使用C++编写的音频处理软件
-
Updated
May 23, 2024 - C
一个使用C++编写的音频处理软件
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
SA-toolkit: Speaker speech anonymization toolkit in python
Unofficial Parallel WaveGAN (+ MelGAN & Multi-band MelGAN & HiFi-GAN & StyleMelGAN) with Pytorch
Persian/Farsi text to speech(TTS) training using coqui tts
Speech synthesis (TTS) in low-resource languages by training from scratch with Fastpitch and fine-tuning with HifiGan
🇺🇦 Ukrainian RAD-TTS++ models (decoder + models with 3 voices) and HiFiGAN model
RADTTS + HiFiGAN vocoder
Training and Tunning a Text to speech model with Nvidia NeMo and Weights and Biases
Ultrafast GAN based Vocoder for Text to Speech
homework for deep generation. Combine FastSpeech2 with different vocoders ⭐REFERENCE (modify origin repos): https://github.com/ming024/FastSpeech2 https://github.com/NVIDIA/waveglow https://github.com/mindslab-ai/univnet https://github.com/jik876/hifi-gan
Include Basis-MelGAN, MelGAN, HifiGAN and Multiband-HifiGAN, maybe NHV in the future.
HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesis
Add a description, image, and links to the hifigan topic page so that developers can more easily learn about it.
To associate your repository with the hifigan topic, visit your repo's landing page and select "manage topics."