Realtime voice conversion support #3707
Labels
feature request
feature requests for making TTS better.
wontfix
This will not be worked on but feel free to help.
馃殌 Feature Description
Realtime voice conversion to build accent translation
Want streaming support in both encoder and vocoder
Solution
A streaming phone recognizer / encoder word bound boundary prediction , and then pass this to streaming synthesizer
Alternative Solutions
Additional context
The text was updated successfully, but these errors were encountered: