Hi,
with data preview we have created 72 phonemes. Is there a way to train the model so that it doesn't use the existing phone_set file with 62 phonemes and can use up to 72 phonemes?
Thanks
```
size mismatch for fs2.encoder_embed_tokens.weight: copying a param with shape torch.Size([72, 256]) from checkpoint, the shape in current model is torch.Size([64, 256]).
size mismatch for fs2.encoder.embed_tokens.weight: copying a param with shape torch.Size([72, 256]) from checkpoint, the shape in current model is torch.Size([64, 256]).
```
What is the "current model" referred to in this error message? [72, 256] is our trained model.
I'm tracing the code and cannot find which model [64, 256] refers to. When I train the model with my own dataset, do I need to train something else?
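The mismatch means the checkpoint's phoneme embedding table (72 rows) is being loaded into a model that was built from a smaller phone set (64 rows, i.e. the vocabulary size the current run derived from its phone_set file). As a workaround while debugging, the shape-mismatched entries can be skipped when loading so only compatible weights are copied. A minimal PyTorch sketch, with illustrative names (`load_compatible` and the toy embeddings are not part of this repo's code):

```python
import torch
import torch.nn as nn

def load_compatible(model, state_dict):
    """Copy only the checkpoint parameters whose shapes match the
    current model; mismatched entries (e.g. the phoneme embedding
    table) are skipped and keep their freshly initialized values."""
    own = model.state_dict()
    skipped = []
    with torch.no_grad():
        for name, tensor in state_dict.items():
            if name in own and own[name].shape == tensor.shape:
                own[name].copy_(tensor)
            else:
                skipped.append(name)
    model.load_state_dict(own)
    return skipped

# Toy stand-in for the error above: a checkpoint embedding sized for
# 72 tokens loaded into a model built for 64.
ckpt_model = nn.Embedding(72, 256)
cur_model = nn.Embedding(64, 256)
skipped = load_compatible(cur_model, ckpt_model.state_dict())
print(skipped)  # the mismatched 'weight' entry is reported as skipped
```

Note this only silences the load error; the skipped embedding rows start untrained, so the real fix is to rebuild the phone_set file (and re-binarize the data) so the model is constructed with the full 72-phoneme vocabulary before loading or training.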