Folders and files

conf
local
pyscripts
scripts
README.md
asr_local.sh
cmd.sh
db.sh
path.sh
run_local_conformer_near_alimeeting.sh
run_local_multispeaker_conformer_alimeeting.sh

README.md

Automatic Speech Recognition (ASR)

Usage

For the ASR track, we provide two baseline systems: single-speaker ASR and multi-speaker ASR. For single-speaker ASR, run all steps in ./run_local_conformer_near_alimeeting.sh; for multi-speaker ASR, use ./run_local_multispeaker_conformer_alimeeting.sh.

The main stages:

We use the Conformer ASR model implementation in ESPnet2. Please install the latest ESPnet toolkit and copy all of our files to espnet/egs2/AliMeeting/asr.
Data preparation, language model training, and ASR model training are all included in asr_local.sh.
First, run ./run_local_conformer_near_alimeeting.sh to train the single-speaker ASR model. Then run ./run_local_multispeaker_conformer_alimeeting.sh to train the multi-speaker ASR model. You do not need to repeat data preparation for multi-speaker ASR training, since all of the preparation is done during the first training run.
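
The stages above can be sketched as two small shell helpers: one copies the recipe files into an existing ESPnet checkout, and the other runs the single-speaker script before the multi-speaker one. This is a minimal sketch, not part of the recipe: the function names `install_recipe` and `run_baselines`, their arguments, and the dry-run flag are assumptions made for illustration. Only the target path and the two script names come from the text above.

```shell
#!/usr/bin/env bash
# Sketch only: function names, arguments, and the dry-run flag are
# illustrative assumptions, not part of the recipe itself.

# Copy the recipe files into <espnet_root>/egs2/AliMeeting/asr.
install_recipe() {
    local src_dir="$1"       # directory holding the files of this recipe
    local espnet_root="$2"   # root of an existing ESPnet checkout
    local dest="${espnet_root}/egs2/AliMeeting/asr"

    mkdir -p "${dest}" && cp -r "${src_dir}/." "${dest}/"
}

# Run the single-speaker recipe first, then the multi-speaker one.
# The order matters because the first run also does the data preparation.
run_baselines() {
    local recipe_dir="$1"    # e.g. <espnet_root>/egs2/AliMeeting/asr
    local dry_run="${2:-0}"  # 1 = only print the commands
    local script

    for script in run_local_conformer_near_alimeeting.sh \
                  run_local_multispeaker_conformer_alimeeting.sh; do
        if [ ! -f "${recipe_dir}/${script}" ]; then
            echo "missing: ${recipe_dir}/${script}" >&2
            return 1
        fi
        if [ "${dry_run}" = "1" ]; then
            echo "(cd ${recipe_dir} && ./${script})"
        else
            # Run inside a subshell so the caller's working directory is kept.
            (cd "${recipe_dir}" && "./${script}") || return 1
        fi
    done
}
```

A call such as `install_recipe . /path/to/espnet` followed by `run_baselines /path/to/espnet/egs2/AliMeeting/asr 1` prints the two commands without starting any training, which is a cheap way to check that the files landed in the right place. Here `/path/to/espnet` is a placeholder for a local checkout.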

Reference

ESPnet
VBx
