Wait-k agent for offline ST systems

Agent for the paper: Does Simultaneous Speech Translation need Simultaneous Models?

To run the agent, make sure that SimulEval 1.0.2 (commit d1a8b2f) is installed, and set --port to a port that is free on your machine.

Set --source, --target, and --config as described in the Fairseq Simultaneous Translation repository. --model-path is the offline ST model checkpoint, and --waitk (passed as ${LAGGING} in the commands below) is the value of k_test used for wait-k inference (k_test = 3, 5, 7, 9, 11 in the paper).
The simultaneous output is saved in the location given by --output.

Fixed Word Detection

simuleval \
    --agent ${FBK_FAIRSEQ_ROOT}/examples/speech_to_text/simultaneous_translation/agents/v1_0/simul_offline_waitk.py \
    --source ${SRC_LIST_OF_AUDIO} \
    --target ${TGT_FILE} \
    --data-bin ${DATA_ROOT} \
    --config ${CONFIG_YAML} --gpu \
    --model-path ${ST_SAVE_DIR}/${CHECKPOINT_FILENAME} \
    --waitk ${LAGGING} \
    --speech-segment-factor 8 \
    --output ${OUT_DIR} \
    --port ${PORT} \
    --scores

Adaptive Word Detection

simuleval \
    --agent ${FBK_FAIRSEQ_ROOT}/examples/speech_to_text/simultaneous_translation/agents/v1_0/simul_offline_waitk.py \
    --source ${SRC_LIST_OF_AUDIO} \
    --target ${TGT_FILE} \
    --data-bin ${DATA_ROOT} \
    --config ${CONFIG_YAML} --gpu \
    --model-path ${ST_SAVE_DIR}/${CHECKPOINT_FILENAME} \
    --waitk ${LAGGING} \
    --speech-segment-factor 8 \
    --adaptive-segmentation \
    --vocabulary-type sentencepiece \
    --output ${OUT_DIR} \
    --port ${PORT} \
    --scores
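
The two commands above differ only in the --adaptive-segmentation and --vocabulary-type flags, so evaluating all the lagging values used in the paper (3, 5, 7, 9, 11) can be wrapped in a small loop. The following is a minimal sketch, not part of the original scripts: it assumes the same environment variables as the commands above, and the DRY_RUN and ADAPTIVE switches, the run_k helper, and the per-k output location ${OUT_DIR}/k${k} are hypothetical conveniences introduced for illustration.

```shell
#!/bin/sh
# Sweep the wait-k agent over the lagging values reported in the paper.
# ASSUMPTIONS (not in the original README): DRY_RUN, ADAPTIVE, run_k and the
# per-k output location ${OUT_DIR}/k${k} are illustrative only.

DRY_RUN="${DRY_RUN:-1}"     # 1 = only print each command, 0 = execute it
ADAPTIVE="${ADAPTIVE:-0}"   # 1 = add the adaptive word detection flags

AGENT="${FBK_FAIRSEQ_ROOT}/examples/speech_to_text/simultaneous_translation/agents/v1_0/simul_offline_waitk.py"

run_k() {
    # $1 is the value of k passed to --waitk.
    k="$1"
    # Build the command in the function's positional parameters so that
    # arguments containing spaces survive when the command is executed.
    set -- simuleval \
        --agent "${AGENT}" \
        --source "${SRC_LIST_OF_AUDIO}" \
        --target "${TGT_FILE}" \
        --data-bin "${DATA_ROOT}" \
        --config "${CONFIG_YAML}" --gpu \
        --model-path "${ST_SAVE_DIR}/${CHECKPOINT_FILENAME}" \
        --waitk "${k}" \
        --speech-segment-factor 8
    if [ "${ADAPTIVE}" = "1" ]; then
        set -- "$@" --adaptive-segmentation --vocabulary-type sentencepiece
    fi
    # Assumption: one output location per k, so runs do not share --output.
    set -- "$@" --output "${OUT_DIR}/k${k}" --port "${PORT}" --scores

    if [ "${DRY_RUN}" = "1" ]; then
        echo "$*"
    else
        "$@"
    fi
}

for lag in 3 5 7 9 11; do
    run_k "${lag}"
done
```

By default the sketch only prints the five commands so they can be checked first; setting DRY_RUN=0 executes them one after the other, and ADAPTIVE=1 switches from fixed to adaptive word detection.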

Citation

@inproceedings{papi-et-al-2022-does,
    title = "Does Simultaneous Speech Translation need Simultaneous Models?",
    author = "Papi, Sara and Gaido, Marco and Negri, Matteo and Turchi, Marco",
    booktitle = "Findings of the Association for Computational Linguistics: EMNLP 2022",
    year = "2022",
    address = "Abu Dhabi, United Arab Emirates",
    publisher = "Association for Computational Linguistics"
}