Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

How to synthesis a speech which I need? #63

Open
sanena opened this issue Jan 3, 2022 · 1 comment
Open

How to synthesis a speech which I need? #63

sanena opened this issue Jan 3, 2022 · 1 comment

Comments

@sanena
Copy link

sanena commented Jan 3, 2022

Hello!It’s not long for me since I just learned voice conversion.So I have many questions, and one of which is how to appoint the source speech and the target speech so I can synthesis the speech which I need.
In short, Could you tell me how to restructure the demo.pkl?
If you can answer me ,I'll be very grateful!

@HJYblur
Copy link

HJYblur commented Apr 12, 2022

Actually,I believe the structure of demo.pkl is:

-sbmt[0](the data of speaker1)
--name of the speaker,e.g. P226
--one-hot vector of the speaker
--four components
---x0(the mel spectrum of the wav)
---f0(symbolize as pitch)
---length of x0 or f0
---a number I don't know the meaning

-sbmt[1](the same as above)
....
You can print them out to see the details.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants