From 3f0f20f2d12403629224347664b3e75c13b2c8e0 Mon Sep 17 00:00:00 2001 From: =?UTF-8?q?Rapha=C3=ABl=20Merx?= Date: Thu, 25 Jan 2024 02:54:38 +0800 Subject: [PATCH] MMS alignment README fixes (#5432) * Mention sox install through apt, on top of the Python wrapper * Fix argument name in example command --- examples/mms/data_prep/README.md | 6 +++--- 1 file changed, 3 insertions(+), 3 deletions(-) diff --git a/examples/mms/data_prep/README.md b/examples/mms/data_prep/README.md index 4a06e7fef9..ccc3e5bbe1 100644 --- a/examples/mms/data_prep/README.md +++ b/examples/mms/data_prep/README.md @@ -14,8 +14,8 @@ We describe the process of aligning long audio files with their transcripts and - Step 3: Install a few other dependencies ``` - pip install sox - pip install dataclasses + apt install sox + pip install sox dataclasses ``` - Step 4: Create a text file containing the transcript for a (long) audio file. Each line in the text file will correspond to a separate audio segment that will be generated upon alignment. @@ -29,7 +29,7 @@ We describe the process of aligning long audio files with their transcripts and - Step 5: Run forced alignment and segment the audio file into shorter segments. ``` - python align_and_segment.py --audio /path/to/audio.wav --textfile /path/to/textfile --lang --outdir /path/to/output --uroman /path/to/uroman/bin + python align_and_segment.py --audio /path/to/audio.wav --text_filepath /path/to/textfile --lang --outdir /path/to/output --uroman /path/to/uroman/bin ``` The above code will generated the audio segments under output directory based on the content of each line in the input text file. The `manifest.json` file consisting of the of segmented audio filepaths and their corresponding transcripts.