ESPnet version 202310
What's Changed
- Support arbitrary language finetune for Whisper models. by @pengchengguo in #5344
- Update Dipco Data URL by @Fhrozen in #5391
- Update readme in TEMPLATE/svs1 by @linyueqian in #5394
- add gramvaani asr recipe by @bloodraven66 in #5366
- ESPnet-SPK: sampler by @Jungjee in #5365
- Adding general data augmentation methods for speech preprocessing by @Emrys365 in #5370
- Update of several SE recipes and some minor fixes by @Emrys365 in #5401
- Reproducing MIMOIRIS by @YoshikiMas in #5409
- Kathbath asr by @bloodraven66 in #5369
- Add pytorch2.0.1 to CI by @kamo-naoyuki in #5413
- [skip ci] Update README.md by @kamo-naoyuki in #5417
- In spec_augment.py, check whether an array is writeable before modifying it inplace by @mdecerbo in #5416
- Docker updates for local builds by @Fhrozen in #5406
- fix typo in TEMPLATE/svs1/README.md by @linyueqian in #5426
- Update install_mwerSegmenter.sh by @sw005320 in #5437
- Support Whisper-style training as a new task S2T by @pyf98 in #5120
- fix twice numpy installation issue by @kan-bayashi in #5447
- Add Whisper SOT recipe for Librimix by @LiChenda in #5371
- Update for the JOSS paper editor review by @neillu23 in #5418
- Add the VOiCES recipe for ASR by @Emrys365 in #5448
- Improve diacritic compatibility in data_prep.pl preprocessing scripts by @zuazo in #5445
- [WIP] create recipe for acesinger by @linyueqian in #5431
- Add BibleTTS recipe by @wyh2000 in #5436
- ASR2 CHiME4 & Gigaspeech Recipes by @yichen14 in #5434
- [pre-commit.ci] pre-commit autoupdate by @pre-commit-ci in #5427
- Simple fix to reduce test_slu_inference time by @siddhu001 in #5460
- Do not use root logger in Beamsearch by @vsd-vector in #5454
- Fix whisper test by @siddhu001 in #5464
- Add doc for OWSM by @pyf98 in #5463
- Speech-to-speech translation Task by @ftshijt in #4859
- AVSR recipes on LRS3 using pre-trained AV-HuBERT model by @ms-dot-k in #5456
- Support LoRA based large model finetuning. by @pengchengguo in #5400
- Multilingual Librispeech (MLS) refactor ASR1 recipe by @juice500ml in #5323
- Add phonemized LibriTTS ASR recipe by @akreal in #5466
- Update the Enh framework to support training with variable numbers of speakers by @Emrys365 in #5414
- speed up TFGridNet code by @zqwang7 in #5395
- [pre-commit.ci] pre-commit autoupdate by @pre-commit-ci in #5468
- ASR2 recipe on Tedlium3 dataset by @kohei0209 in #5331
- Create README.md in OWSM v1 by @pyf98 in #5489
- Update setup.py by @sw005320 in #5490
- Fix default value in ML-SUPERB by @ftshijt in #5492
- Fix bugs of Whisper SOT. by @pengchengguo in #5494
- Multilingual Librispeech ASR2 + ASR1 baselines by @juice500ml in #5441
- Add a new SE recipe combining five public corpora by @Emrys365 in #5484
- Update .mergify.yml by @kamo-naoyuki in #5502
- update version to 202310 by @kan-bayashi in #5501
New Contributors
- @linyueqian made their first contribution in #5394
- @mdecerbo made their first contribution in #5416
- @zuazo made their first contribution in #5445
- @wyh2000 made their first contribution in #5436
- @yichen14 made their first contribution in #5434
- @vsd-vector made their first contribution in #5454
- @ms-dot-k made their first contribution in #5456
- @juice500ml made their first contribution in #5323
- @kohei0209 made their first contribution in #5331
Full Changelog: v.202308...v.202310