Skip to content

ESPnet version 202205

Compare
Choose a tag to compare
@kan-bayashi kan-bayashi released this 28 May 02:21
· 6242 commits to master since this release
5fa6dcc

New Features

  • [New Features][ESPnet1][ESPnet2][ASR] Add quantization in ESPnet2 for asr inference #4349 by @pyf98
  • [New Features][ESPnet2][SE] Add svoice recipe for wsj0-2mix speech separation #4257 by @nateanl
  • [New Features][ESPnet2][SE] Merge Deep Clustering and Deep Attractor Network to enh separator #4110 by @earthmanylf
  • [New Features][ESPnet2][SE] Some improvements to current enh functions #4251 by @Emrys365
  • [New Features][ESPnet2][SE][Installation] Import fast_bss_eval and update some time-domain losses for enh task #4256 by @LiChenda
  • [New Features][ESPnet2][TTS] add e2e tts model: JETS #4364 by @imdanboy

Bugfix

  • [Bugfix][ESPnet1] Fix minimum input length for Conv2dSubsampling2 in check_short_utt #4378 by @akreal
  • [Bugfix][ESPnet1][ESPnet2] Minor fixes for the intermediate loss usage and Mask-CTC decoding #4374 by @YosukeHiguchi
  • [Bugfix][ESPnet2] Fix #4396 #4398 by @kamo-naoyuki
  • [Bugfix][ESPnet2] Fix a bug in utterance_mvn #4304 by @Emrys365
  • [Bugfix][ESPnet2] Minor fix for Mask-CTC forward function #4347 by @YosukeHiguchi
  • [Bugfix][ESPnet2] Wandb Minor Fix for Model Resume #4329 by @roshansh-cmu
  • [Bugfix][ESPnet2] fix the enh_s2t_task argument in espnet2/bin/st_inference.py #4323 by @simpleoier
  • [Bugfix][ESPnet2][MT][ST] fix bug in mt/st templates for having separate token lists #4149 by @brianyan918
  • [Bugfix][ESPnet2][Recipe] Fix aishell3 data preparation script #4277 by @LanceaKing
  • [Bugfix][ESPnet2][SE] Fix a bug in stats aggregation when PITSolver is used #4343 by @Emrys365
  • [Bugfix][ESPnet2][SE] fix for enhancement model loading compatibility #4259 by @LiChenda
  • [Bugfix][ESPnet2][ST] bug fixes in ST recipes #4341 by @chintu619
  • [Bugfix][ESPnet2][TTS] Fix optional data names for TTS #4355 by @kan-bayashi
  • [Bugfix][ESPnet2][TTS] fix a bug in Mandarin pypinyin_g2p_phone #4206 by @WeiGodHorse
  • [Bugfix][ESPnet2][TTS] fix loss = NaN in VITS with mixed precision #4356 by @kan-bayashi
  • [Bugfix][ESPnet2][streaming] Add unit test to streaming ASR inference #4352 by @espnetUser
  • [Bugfix][Installation] fix s3prl install by using legacy version. Temporal solution. #4399 by @simpleoier
  • [Bugfix][README] Fix typo #4338 by @ftshijt

Enhancement

  • [Enhancement][ESPnet1][ESPnet2][ASR][SE][SLU][ST] enh_s2t joint model #4226 by @simpleoier
  • [Enhancement][ESPnet2] Add progress bar to phonemization #4320 by @G-Thor
  • [Enhancement][ESPnet2][MT] Update show_translation_result.sh to show all decoding results under the given exp directory #4330 by @pyf98

Recipe

  • [Recipe][ESPnet1][ASR] Accented English Speech Recognition Challenge 2020 recipe (AESRC2020) #3898 by @brianyan918
  • [Recipe][ESPnet1][ESPnet2][ASR][README][Recipe] Add MediaSpeech ASR recipe #4183 by @AshibaWu
  • [Recipe][ESPnet2][ASR][README] recipee for Microsoft speech corpus for Indian Languages #4191 by @navya-yarrabelly
  • [Recipe][ESPnet2][ASR][README] Accented French Openslr57 ASR recipe (ESPnet2) (part of Homework3 MNLP) #4280 by @DanBerrebbi
  • [Recipe][ESPnet2][ASR][README] Add Mask-CTC results #4180 by @YosukeHiguchi
  • [Recipe][ESPnet2][ASR][README] Add ml_openslr63 ASR recipe #4173 by @bharaniuk
  • [Recipe][ESPnet2][ASR][README] Adding new recipe for Burmese (OpenSLR80) #4182 by @JainSameer06
  • [Recipe][ESPnet2][ASR][README] add chime6 recipe #4332 by @simpleoier
  • [Recipe][ESPnet2][ASR][SE][README] add egs2/chime4/enh_asr1 recipe and results #4316 by @simpleoier
  • [Recipe][ESPnet2][README][RNNT] updated librispeech-asr with rnn-t results #4281 by @chintu619
  • [Recipe][ESPnet2][README][SE] 2021 Clarity Challenge recipe #4210 by @popcornell
  • [Recipe][ESPnet2][README][SE] Add AISHELL-4 ENH recipe #4249 by @Emrys365
  • [Recipe][ESPnet2][README][SE] Add ConferencingSpeech 2021 recipe to egs2 #4192 by @Emrys365
  • [Recipe][ESPnet2][README][SE] Add ICASSP2021 DNS Challenge 2 recipe #4253 by @YoshikiMas
  • [Recipe][ESPnet2][README][SE] Add INTERSPEECH 2021 DNS Challenge 3 recipe #4238 by @YoshikiMas
  • [Recipe][ESPnet2][README][SE] Add results of ICASSP2021 DNS Challenge 2 recipe #4309 by @YoshikiMas
  • [Recipe][ESPnet2][README][SE] Rename egs2/clarity21/enh_2021 to egs2/clarity21/enh1 #4328 by @Emrys365
  • [Recipe][ESPnet2][README][SE] add convtasnet recipe for dns_ins20 #4314 by @muqiaoy
  • [Recipe][ESPnet2][README][SLU] Harpervalley recipe #4315 by @YushiUeda
  • [Recipe][ESPnet2][README][SLU] SLUE Voxpopuli base recipe #4262 by @siddhu001
  • [Recipe][ESPnet2][README][ST] CoVOST2 recipes #4300 by @ftshijt
  • [Recipe][ESPnet2][SLU][README] Update SLU results for ICASSP #4283 by @siddhu001

Others

  • [CI][Docker] Github Action Trigger Docker Build #4295 by @Fhrozen
  • [CI][Docker] Github Action for Docker build #4219 by @Fhrozen
  • [CI][ESPnet1][ESPnet2][Installation][README] Add isort checking to the CI tests #4372 by @kamo-naoyuki
  • [CI][ESPnet1][ESPnet2][Installation][README][mergify] Add pytorch=1.10.2 and 1.11.0 to ci configurations #4348 by @kamo-naoyuki
  • [CI][ESPnet2][ASR][SE] add integration test and fix the decoding in enh_asr and enh_st #4310 by @simpleoier
  • [CI][ESPnet2][New Features][SLU][ST][streaming] Add streaming ST/SLU #4243 by @D-Keqi
  • [CI][ESPnet2][ST] Add Test Functions for ST Train and Inference #4324 by @ftshijt
  • [CI][Installation] update install_pesq.sh #4265 by @LiChenda
  • [Documentation][ESPnet2][README][TTS] Minor update for JETS #4369 by @kan-bayashi
  • [Documentation][README] Change the order of README #4289 by @ftshijt
  • [Documentation][README] Update README.md #4284 by @sw005320

Acknowledgements

Special thanks to @AshibaWu, @D-Keqi, @DanBerrebbi, @Emrys365, @Fhrozen, @G-Thor, @JainSameer06, @LanceaKing, @LiChenda, @WeiGodHorse, @YoshikiMas, @YosukeHiguchi, @YushiUeda, @akreal, @bharaniuk, @brianyan918, @chintu619, @earthmanylf, @espnetUser, @ftshijt, @imdanboy, @kamo-naoyuki, @kan-bayashi, @muqiaoy, @nateanl, @navya-yarrabelly, @popcornell, @pyf98, @roshansh-cmu, @siddhu001, @simpleoier, @sw005320.