- BERTScore: Evaluating Text Generation with BERT
- Distil-Whisper: Robust Knowledge Distillation via Large-Scale Pseudo Labelling
- Line-Based Splitter to Generate Train/Dev/Test Datasets
bash ./bin/etl/train_dev_test_splitter_for_lines_data.sh "${DATA_LINES_PATH}" "${DEV_DATA_SIZE}" "${TEST_DATA_SIZE}"
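
The script itself is not shown here, so the following is only a minimal sketch of what a line-based splitter of this kind might do, not the actual contents of `train_dev_test_splitter_for_lines_data.sh`. It assumes one example per line, treats the two sizes as line counts, shuffles with GNU `shuf`, and writes to hypothetical output paths formed by appending `.dev`, `.test`, and `.train` to the input path. The function name `split_lines_data` is also an assumption.

```shell
#!/usr/bin/env bash
# Hypothetical sketch of a line-based train/dev/test splitter.
# Output names (<input>.dev, <input>.test, <input>.train) are assumptions.
set -euo pipefail

split_lines_data() {
  local data_path="$1" dev_size="$2" test_size="$3"
  local total

  # Count input lines; the arithmetic expansion strips any padding from wc.
  total=$(( $(wc -l < "${data_path}") ))

  # Refuse splits that would leave no training lines.
  if (( dev_size + test_size >= total )); then
    echo "dev + test sizes must be smaller than the line count (${total})" >&2
    return 1
  fi

  # Start from empty output files so a size of 0 still yields a file.
  : > "${data_path}.dev"
  : > "${data_path}.test"
  : > "${data_path}.train"

  # Shuffle once, then route each line by its position in the shuffled stream:
  # the first dev_size lines go to dev, the next test_size to test, the rest to train.
  shuf "${data_path}" | awk -v dev="${dev_size}" -v tst="${test_size}" -v out="${data_path}" '
    NR <= dev       { print > (out ".dev");  next }
    NR <= dev + tst { print > (out ".test"); next }
                    { print > (out ".train") }
  '
}
```

Under these assumptions, `split_lines_data "${DATA_LINES_PATH}" "${DEV_DATA_SIZE}" "${TEST_DATA_SIZE}"` would leave the input file untouched and produce three disjoint files that together contain every input line exactly once. The shuffle is not seeded, so repeated runs give different splits.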