How to train a Chinese and English hybrid acoustic model by using MFA? #733

Raise-me-up · 2024-01-12T03:05:35Z

Hi, all

I need to generate alignments for a mixed Chinese and English dataset, but the pretrained models are either pure Chinese or pure English, so I have to train my own model. However, I can't find any tutorial. I don't know which phone set is suitable, how to build a dictionary, and so on. Any advice would be appreciated. Thanks!

NataliaShmueli · 2024-01-16T08:18:02Z

Hey! There are multiple ways you could do this. The easiest, in my opinion, would be to use IPA phone sets for both languages. MFA supports per-speaker dictionaries, so you could use a Mandarin dictionary and an English dictionary and assign them to speakers as described here:
https://montreal-forced-aligner.readthedocs.io/en/latest/user_guide/dictionary.html#per-speaker-dictionaries

Mind you, training on both languages together should also improve the model for each of them.
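
For illustration, here is a minimal sketch of generating the speaker-to-dictionary mapping file in Python. It assumes the layout described on the linked page: a YAML file with a `default` entry plus one `speaker: path` entry per speaker. The speaker names, dictionary paths, and the helper `build_speaker_dictionary_yaml` are all hypothetical, so adjust them to your corpus.

```python
from pathlib import Path


def build_speaker_dictionary_yaml(default_dict, speaker_dicts):
    """Return YAML text mapping each speaker to a dictionary path.

    Assumes plain speaker names and paths that need no YAML quoting.
    """
    lines = [f"default: {default_dict}"]
    # Sort speakers so the generated file is stable between runs.
    for speaker, path in sorted(speaker_dicts.items()):
        lines.append(f"{speaker}: {path}")
    return "\n".join(lines) + "\n"


if __name__ == "__main__":
    # Hypothetical speakers and paths, for illustration only.
    yaml_text = build_speaker_dictionary_yaml(
        "/data/dicts/english_ipa.dict",
        {
            "speaker_zh_01": "/data/dicts/mandarin_ipa.dict",
            "speaker_zh_02": "/data/dicts/mandarin_ipa.dict",
        },
    )
    Path("speaker_dictionaries.yaml").write_text(yaml_text, encoding="utf-8")
```

As far as I understand the linked page, the resulting YAML file is then passed to MFA in the place where a single dictionary path would normally go. Speakers not listed fall back to the `default` dictionary.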
