Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

How to train a Chinese and English hybrid acoustic model by using MFA? #733

Open
Raise-me-up opened this issue Jan 12, 2024 · 1 comment

Comments

@Raise-me-up
Copy link

Hi, all

I have a request to generate the alignments for the Chinese and English hybrid dataset, but the pretrained model is either pure Chinese or English one. Therefore, I have to train my own model. However, I can't find any tutorial. I don't know which phone set is suitable for me, and how to make a dictory, and so on. Any useful advice will be grateful. Thanks!

@NataliaShmueli
Copy link

Hey! There are multiple ways you could do this! The easiest way, in my opinion, would be using IPA for both. MFA can link to a dictionary-per-speaker model. You could use a Mandarin dictionary and an English dictionary and then use this methodology:
https://montreal-forced-aligner.readthedocs.io/en/latest/user_guide/dictionary.html#per-speaker-dictionaries

Mind you, this should also improve the model for each other.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants