
Different Audio quality among intermediate results #188

Open
auspicious3000 opened this issue Apr 14, 2020 · 2 comments

Comments

@auspicious3000

While training the vocoder, the training script writes to three folders: "audio", "dev_eval", and "train_no_dev_eval".
The audio samples in "dev_eval" and "train_no_dev_eval" all sound very good. However, the samples in "audio" all have an audible background hiss.
What's the difference between these three folders?
Is it possible that the samples written to the "audio" folder were generated differently from the ones in the other two folders?
Thanks!

@r9y9
Owner

r9y9 commented Apr 14, 2020

In short: the difference is whether teacher-forced generation is used.

  • dev_eval: Results for the development (validation) set. All waveforms are generated autoregressively (i.e. inference mode).
  • train_no_dev_eval: Results for the training set. All waveforms are generated autoregressively (i.e. inference mode).
  • audio: Results for the training set. All waveforms are generated with teacher forcing (i.e. training mode).
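
To make the distinction concrete, here is a minimal toy sketch of the two generation modes. It is not code from this repository: `toy_model`, `teacher_forced`, and `autoregressive` are hypothetical names, and the one-step predictor is an arbitrary stand-in for the vocoder. The only point is what each step is conditioned on.

```python
def toy_model(prev_sample):
    """Hypothetical one-step predictor: estimates x[t] from x[t-1].

    Stand-in for the real network; the formula is arbitrary.
    """
    return 0.5 * prev_sample + 1.0


def teacher_forced(target, model, initial=0.0):
    """Training mode: each step is conditioned on the ground-truth previous sample."""
    outputs = []
    prev = initial
    for sample in target:
        outputs.append(model(prev))
        prev = sample  # feed the ground truth, not the model's own output
    return outputs


def autoregressive(length, model, initial=0.0):
    """Inference mode: each step is conditioned on the model's own previous output."""
    outputs = []
    prev = initial
    for _ in range(length):
        prediction = model(prev)
        outputs.append(prediction)
        prev = prediction  # feed the prediction back in
    return outputs


ground_truth = [2.0, 4.0, 6.0]
tf_out = teacher_forced(ground_truth, toy_model)
ar_out = autoregressive(len(ground_truth), toy_model)
print("teacher-forced:", tf_out)
print("autoregressive:", ar_out)
```

In this sketch the teacher-forced output depends on the recorded waveform at every step, while the autoregressive output depends only on the initial value and the model itself. In a real vocoder both modes would also be conditioned on acoustic features, which the toy example leaves out.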

@auspicious3000
Author

Does it make sense that the waveform generated under training mode sounds worse than that generated under inference mode?
I mean, in training mode the model has access to the previous ground-truth samples, so the output should sound at least as good as the output from inference mode.
What's your opinion on this?
Thanks!
