Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Proper installation steps #14

Open
iamanigeeit opened this issue Feb 24, 2023 · 1 comment
Open

Proper installation steps #14

iamanigeeit opened this issue Feb 24, 2023 · 1 comment

Comments

@iamanigeeit
Copy link

iamanigeeit commented Feb 24, 2023

@Rongjiehuang I got this repo to work, but i had to correct some things. Hope it helps someone else.

  • Before starting the install, you should sudo apt install gfortran libopenblas-base. These are required but not specified.
  • Change environment.yaml to remove duplicates in scipy and numpy, and remove version requirements on scipy and numba (old vresions cause conflicts with numpy).
  • If you already installed CUDA yourself, remove the one installed with environment.yaml with pip uninstall nvidia_cublas_cu11 (or whatever version you have).
  • In modules/GenerSpeech/config/generspeech.yaml, change emotion_encoder_path to checkpoints/Emotion_encoder.pt
  • Add the Generspeech dir root to your sys.path, either by moving GenerSpeech.py to the GenerSpeech dir or adding these lines at the top of GenerSpeech (otherwise Python can't find the imports)
import sys, os
sys.path.insert(0, os.path.join(os.path.dirname(__file__), '..'))
  • Run mfa thirdparty download
  • In utils.hparams.py, lines 29 and 32 should remove help='location of the data corpus' becuase it's misleading. Line 41 needs to include remove=False.
  • Preprocessing fails at data_gen_utils line 299 if there is a word missing from mfa_dict.txt, because the TextGrid will skip the phones of the missing word. Actually, some common words are not in the dictionary like "her" (HH_ER1) and "processing" (P_R_AA1_S_EH0_S_IH0_NG). You have add them to the dictionary yourself. The correct way is to run mfa validate and append to mfa_dict.txt first (see this script),
  • Also you may want to use praatio as the standard TextGrid parser.
@CindyTing
Copy link

CindyTing commented Mar 29, 2023

Hi @iamanigeeit, thank you so much for your installation steps, it helps a lot! but I haven't set it up, so I may need to bother you with some questions. After I use these
two lines to install mfa

conda config --add channels conda-forge
conda install montreal-forced-aligner

I can't run mfa thirdparty download directly, it will have "thirdparty command not exist" error. May I know the version of kaldi pynini mfa you installed?

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants