Skip to content

An experiment to confirm the effect of noise in comparison to Grad-TTS.

Notifications You must be signed in to change notification settings

reppy4620/ode_tts

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

9 Commits
 
 
 
 
 
 

Repository files navigation

ode_tts

This repo contains the implementation of my TTS model named ODE-TTS for my study.

The purpose of this repo is to confirm that effect of noise compared with grad_tts.

At least in this problem, noise in trajectries between distributions has important role that not only act as reguralization but also promotes to learn high-frequency features.

In my experiments, Grad-TTS generates high-fidelity mel-spectrograms properly by probability flow ode, but ODE-TTS cannot generate ones.

Then I have a question that if noise makes learning high-frequency features, adding constant noise to ode trajectries like Conditional Flow Matching can be effective?

As a result, it works a little better, but is significantly less effective than Grad-TTS.
Generated mel-spectrograms by ODE with noise are so smoothed.

About

An experiment to confirm the effect of noise in comparison to Grad-TTS.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages