Question: Using a pretrained encoder for getting the speaker embedding. #29

nischal-sanil · 2021-01-19T07:54:48Z

Hi,

Did you guys experiment using a pretrained encoder for getting the speaker embedding similar to your previous work (AutoVC).

PS: Amazing work by the way!

Thanks,

FurkanGozukara · 2021-01-23T08:43:57Z

@nischal-sanil did you make it work?

can you check my question please? #28

terbed · 2021-06-04T17:15:25Z

I have the same question @auspicious3000
Here you use the one-hot encoded embedding with a lent of 82 (the number of speakers it was pretrained), but could you generate a zeros-shot general embedding like in AutoVC. If I am correct the size of the used embedding was larger in that, I assume you cannot use that here.

So to wrap up: this method with the pretrained weights works only on the 82 speakers it was trained and conditioned on if we consider only the timbre conversion?

auspicious3000 · 2021-06-04T17:58:00Z

@terbed Yes. Unless you retrain the model.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Question: Using a pretrained encoder for getting the speaker embedding. #29

Question: Using a pretrained encoder for getting the speaker embedding. #29

nischal-sanil commented Jan 19, 2021

FurkanGozukara commented Jan 23, 2021

terbed commented Jun 4, 2021

auspicious3000 commented Jun 4, 2021

Question: Using a pretrained encoder for getting the speaker embedding. #29

Question: Using a pretrained encoder for getting the speaker embedding. #29

Comments

nischal-sanil commented Jan 19, 2021

FurkanGozukara commented Jan 23, 2021

terbed commented Jun 4, 2021

auspicious3000 commented Jun 4, 2021