Skip to content

Releases: espnet/espnet

ESPnet Version 0.9.7

15 Jan 08:51
e9502fc
Compare
Choose a tag to compare

New Feature

  • [New Features][ESPnet1][ASR] Option for GTN CTC mode #2866 by @brianyan918
  • [New Features][ESPnet2][SE][README] Update to speech enhancement task #2649 by @LiChenda
  • [New Features][ESPnet2][ASR][README] Lightweight Sinc Convolutions for Espnet2 #2768 by @lumaku
  • [New Features][ESPnet2][Documentation] --freeze_param option #2787 by @kamo-naoyuki
  • [New Features][ESPnet2][TTS][README] Add a new G2P pyopenjtalk_accent_with_pause #2843 by @kan-bayashi
  • [New Features][ESPnet2][TTS][README] Add pyopenjtalk_accent g2p for ESPnet2 TTS #2781 by @ota
  • [New Features][ESPnet2][TTS][README] Support X-vector based multi-speaker TTS model in ESPnet2 #2800 by @kan-bayashi

Enhancement

  • [Enhancement][ESPnet1][ESPnet2] Add version info in args #2841 by @kan-bayashi
  • [Enhancement][ESPnet1][ESPnet2][ASR] AMI Recipe (Short UTT checker) #2802 by @ftshijt
  • [Enhancement][Installation] add default activate_python.sh #2788 by @kamo-naoyuki
  • [Enhancement][Installation] modified: check_install.py #2834 by @kamo-naoyuki
  • [Enhancement][Installation][Documentation][ESPnet1][ESPnet2] Change version info location #2840 by @kan-bayashi

Bugfix

Recipe

  • [Recipe][ESPnet1][ASR] Add LibriSpeech Conformer results for LibriCSS #2861 by @akreal
  • [Recipe][ESPnet1][ASR] Update Commonvoice Recipe with Conformer Settings #2739 by @ftshijt
  • [Recipe][ESPnet1][ASR] Update Russian open STT recipe for v1.01 of the dataset #2776 by @akreal
  • [Recipe][ESPnet1][ASR] Update models and results of Conformer. #2765 by @pengchengguo
  • [Recipe][ESPnet1][ESPnet2][ASR][README] ESPnet2 recipe for commonvoice #2793 by @hchung12
  • [Recipe][ESPnet1][VC][README] VCC2020 database #2754 by @unilight
  • [Recipe][ESPnet2][ASR][README] Update Dirha WSJ result #2756 by @kamo-naoyuki
  • [Recipe][ESPnet2][ASR][README] espnet2 hkust recipe #2863 by @kamo-naoyuki
  • [Recipe][ESPnet2][ASR][README] update the AMI result in espnet2 #2817 by @sw005320
  • [Recipe][ESPnet2][ASR][README] updated the laborotv result #2750 by @sw005320
  • [Recipe][ESPnet2][ASR][README] Update reverb result #2876 by @kamo-naoyuki
  • [Recipe][ESPnet2][ASR] Minor fix of laborotv recipe #2877 by @hfujihara
  • [Recipe][ESPnet2][TTS] Fix total number of iterations #2813 by @kan-bayashi
  • [Recipe][ESPnet2][TTS][README] Add libritts recipe for ESPnet2 #2807 by @kan-bayashi
  • [Recipe][ESPnet2][TTS][README] Add x-vector based configs for VCTK #2808 by @kan-bayashi
  • [Recipe][ESPnet2][TTS][README] Minor update TTS README #2818 by @kan-bayashi
  • [Recipe][ESPnet2][TTS][README] Update JSUT TTS results #2792 by @kan-bayashi
  • [Recipe][ESPnet2][TTS][README] Update JSUT results #2809 by @kan-bayashi
  • [Recipe][ESPnet2][TTS][README] Update JSUT results #2871 by @kan-bayashi
  • [Recipe][ESPnet2][TTS][README] Update LibriTTS results #2842 by @kan-bayashi
  • [Recipe][ESPnet2][TTS][README] Update VCTK results #2814 by @kan-bayashi
  • [Recipe][ESPnet2][TTS][README] Update libritts results #2828 by @kan-bayashi
  • [Recipe][ESPnet2][TTS][README] update latest CSMSC link address #2777 by @meowtech

Other

  • [CI][Documentation][Installation] Change warp-ctc and warp-transducer to extra #2748 by @kamo-naoyuki
  • [CI][README] Update ci setting #2848 by @kan-bayashi
  • [ASR][Documentation][ESPnet2] Sinc Convolutions - add documentation for plot_sinc_filters.py #2782 by @lumaku
  • [Documentation][ESPnet1] fixed some typos #2855 by @jumon
  • [Documentation][Installation] Update documentation #2757 by @kamo-naoyuki
  • [Installation][Refactoring] Move the dependencies coming from recipes #2740 by @kamo-naoyuki

Acknowledgements

Special thanks to @AdolfVonKleist, @LiChenda, @YosukeHiguchi, @akreal, @b-flo, @brianyan918, @ftshijt, @hchung12, @hfujihara, @jumon, @kamo-naoyuki, @kan-bayashi, @lumaku, @meowtech, @ota, @pengchengguo, @sw005320, @unilight, @yuekaizhang.

ESPnet Version 0.9.6

01 Dec 12:17
747c46d
Compare
Choose a tag to compare

New Feature

Bug fix

Recipe

  • [Recipe][ESPnet1] Extend model averaging condition in run scripts #2613 by @b-flo
  • [Recipe][ESPnet1][ASR] Enable multi-thread processing of json files. #2681 by @Peidong-Wang
  • [Recipe][ESPnet1][ASR] Update KsponSpeech conformer results #2624 by @jubang0219
  • [Recipe][ESPnet1][ASR] Update Voxforge with Conformer results #2642 by @YosukeHiguchi
  • [Recipe][ESPnet1][ASR] lang was being used before being parsed for user input #2654 by @siddalmia
  • [Recipe][ESPnet1][ASR][ESPnet2][Installation][README] espnet2 reverb recipe #2691 by @kamo-naoyuki
  • [Recipe][ESPnet1][ASR][README] Update Switchboard with conformer results #2697 by @Emrys365
  • [Recipe][ESPnet1][ASR][README] add librispeech conformer w/ speed perturbation + specaug #2617 by @yuekaizhang
  • [Recipe][ESPnet2][ASR] ASR template recipe: --srctexts -> --lm_train_text, --bpe_train_text #2660 by @kamo-naoyuki
  • [Recipe][ESPnet2][ASR] Add $token_type to asr_tag and lm_tag #2625 by @kamo-naoyuki
  • [Recipe][ESPnet2][ASR][Installation][README][Recipe] Laborotv recipe #2703 by @sw005320
  • [Recipe][ESPnet2][ASR][README] Add AISHELL w/o LM result #2718 by @kamo-naoyuki
  • [Recipe][ESPnet2][ASR][README] ESPnet2 recipe for TIMIT #2568 by @sknadig
  • [Recipe][ESPnet2][ASR][README] JSUT conformer recipe achieving 12.0/13.9 CER(%) for dev/eval1 #2720 by @hchung12
  • [Recipe][ESPnet2][ASR][README] Update README.md #2659 by @sw005320
  • [Recipe][ESPnet2][ASR][README] Update WSJ result #2628 by @kamo-naoyuki
  • [Recipe][ESPnet2][ASR][README] espnet2 librispeech with conformer #2687 by @sw005320
  • [Recipe][ESPnet2][README] Corpus README in egs2 #2713 by @sw005320
  • [Recipe][ESPnet2][README] update egs2/README.md #2719 by @Emrys365

Enhancement

  • [Enhancement][Documentation][ESPnet2] Add --init_param option #2680 by @kamo-naoyuki
  • [Enhancement][ESPnet1][ASR] Save model snapshot at every epoch even if save_interval_iters > 0 - for model averaging #2637 by @sknadig
  • [Enhancement][ESPnet2] Update wandb part #2708 by @kamo-naoyuki
  • [Enhancement][ESPnet2][ASR] Add *_stats_dir options in asr.sh #2724 by @kan-bayashi

Documentation

Refactoring

  • [Refactoring][ESPnet1][ASR][README] Refactor Mask CTC non-autoregressive ASR #2223 by @YosukeHiguchi
  • [Refactoring][ESPnet2] Added unicode support for generated configs #2672 by @Piteryo

Others

Acknowledgements

Special thanks to @Emrys365, @Fhrozen, @LiChenda, @Peidong-Wang, @Piteryo, @YosukeHiguchi, @b-flo, @hchung12, @jubang0219, @kamo-naoyuki, @kan-bayashi, @siddalmia, @sknadig, @sw005320, @yuekaizhang.

ESPnet Version 0.9.5

31 Oct 12:28
c370ab2
Compare
Choose a tag to compare

New Features

  • [New Features][ESPnet2][TTS] Support g2p=none for text with phonemes #2551 by @kan-bayashi
  • [New Features][ESPnet2][TTS] Add MCD evaluation script for ESPnet2-TTS #2554 by @kan-bayashi
  • [New Features][ESPnet1][ST] Conformer End-to-End Speech Translation #2523 by @hirofumi0810

Bugfix

  • [Bugfix][ESPnet1] CTC segmentation - package update #2566 by @lumaku
  • [Bugfix][ASR][ESPnet1] fix bug about att_ws in multi-enc case #2549 by @lzm0706
  • [Bugfix][ESPnet1] Conformer averaging model support for pytorch 1.6 #2604 by @siddalmia
  • [Bugfix][ESPnet1][ASR] Set built-in CTC for asr_recog #2588 by @lumaku
  • [Bugfix][ESPnet1][ASR][Installation] Transducer float16 loss bug fix #2496 by @GNroy

Refactoring

  • [Refactoring][ESPnet1][ASR] Refactor BeamSearchTransducer and ErrorCalculatorTrans #2538 by @b-flo

Recipe

  • [Recipe][ESPnet1][ASR] Alignment recipe for CSJ. #2531 by @jnishi
  • [Recipe][ESPnet1][ASR] New Recipe for KsponSpeech (Korean spontaneous speech; 969 hours) #2555 by @jubang0219
  • [Recipe][ESPnet1][ASR] Update TedLium3 conformer results #2600 by @LiChenda
  • [Recipe][ESPnet1][ASR] Update VIVOS models #2574 by @b-flo
  • [Recipe][ESPnet1][ASR] Update model link in Puebla-Nahuatl #2607 by @ftshijt
  • [Recipe][ESPnet1][ASR] Update tedlium2 with conformer results #2599 by @Emrys365
  • [Recipe][ESPnet1][ASR] update the JSUT recipe with conformer #2546 by @sw005320
  • [Recipe][ESPnet2][ASR] Add CSJ conformer config #2560 by @kan-bayashi
  • [Recipe][ESPnet2][ASR] Add CSJ conformer results #2552 by @kan-bayashi
  • [Recipe][ESPnet2][ASR] Small changes for aishell config #2586 by @kamo-naoyuki
  • [Recipe][ESPnet2][ASR] Update espnet2 AISHELL results #2580 by @kamo-naoyuki
  • [Recipe][ESPnet2][ASR] update JSUT espnet2 with pre-trained models #2563 by @sw005320
  • [Recipe][ESPnet2][TTS] Add JSSS recipe for ESPnet2-TTS #2558 by @kan-bayashi
  • [Recipe][ESPnet2][TTS] Update ESPnet2 TTS result #2542 by @kan-bayashi

CI

Other

  • [Installation] Install warpctc-pytorch wheel when torch version is 1.1 - 1.6 #2547 by @ysk24ok
  • [Installation] Modified requirements: "dataclasses; python_version < '3.7'", #2541 by @kamo-naoyuki
  • [Installation] Remove pip3 check in setup_python.sh #2567 by @ShigekiKarita

Acknowledgements

Special thanks to @Emrys365, @GNroy, @LiChenda, @ShigekiKarita, @b-flo, @ftshijt, @hirofumi0810, @jnishi, @jubang0219, @kamo-naoyuki, @kan-bayashi, @lumaku, @lzm0706, @siddalmia, @sw005320, @ysk24ok.

ESPnet Version 0.9.4

30 Sep 11:04
c1e8198
Compare
Choose a tag to compare

New Features

  • [New Features][ESPnet1][ASR] Transducer v4 #2444 by @b-flo
  • [New Features][ESPnet2] Support audio_format=flac.ark, wav.ark #2451 by @kamo-naoyuki
  • [New Features][ESPnet2][ASR] Support conformer encoder in ESPnet2 ASR #2515 by @kan-bayashi

Bugfix

Documentation

Recipe

Refactoring

  • [Refactoring] Modify uttid to "${spkid}-${uttid}" for trn files #2527 by @kamo-naoyuki
  • [Refactoring][ESPnet1][ASR][LM] Remove all future lines #2481 by @ShigekiKarita
  • [Refactoring][ESPnet1][ASR][MT][ST] Unify arguments #2506 by @hirofumi0810
  • [Refactoring][ESPnet1][ESPnet2][TTS] Refactor length regulator to improve the speed #2482 by @kan-bayashi
  • [Refactoring][ESPnet1][MT][ST] Refactor decoding for translation tasks #2501 by @hirofumi0810
  • [Refactoring][ESPnet2] Change add_scalars to add_scalar for tensorboard SummaryWriter #2525 by @kamo-naoyuki

CI

Other

Acknowledgements

Special thanks to @Fhrozen, @LiChenda, @ShigekiKarita, @b-flo, @hirofumi0810, @kamo-naoyuki, @kan-bayashi, @lumaku, @ruizhilijhu, @shigabeev, @yuekaizhang.

ESPnet Version 0.9.3

15 Sep 04:28
8f725c7
Compare
Choose a tag to compare

New Features

  • [New Features][ESPnet2] Implement --grad_clip_type #2399 by @kamo-naoyuki
  • [New Features][ESPnet2][ASR] Implement batch_score() method for ASR decoder and LM #2377 by @kamo-naoyuki
  • [New Features][ESPnet2][README][TTS] Support Conformer-based FastSpeech / FastSpeech2 #2413 by @kan-bayashi

Bugfix

Documentation

Enhancement

Recipe

  • [Recipe][ESPnet1][ASR] Add LibriCSS recipe #2246 by @akreal
  • [Recipe][ESPnet1][ASR] Update for the Official Split of YM Recipe #2435 by @ftshijt
  • [Recipe][ESPnet1][ESPnet2][ASR] Update CommonVoice for Latest Version #2455 by @ftshijt
  • [Recipe][ESPnet2][ASR] [zeroth korean] Not to use pipe format if feats_type=raw #2402 by @kamo-naoyuki
  • [Recipe][ESPnet2][ASR][README] espnet2 zeroth_korean recipe changing feats_type from fbank_pitch to raw. #2393 by @hchung12
  • [Recipe][ESPnet2][README][TTS] Add ESPnet2 TTS finetuning example recipe (JVS) #2465 by @kan-bayashi

CI

Acknowledgements

Special thanks to @LiChenda, @ShigekiKarita, @akreal, @ftshijt, @glynpu, @hchung12, @hirofumi0810, @jaesong, @jnishi, @kamo-naoyuki, @kan-bayashi, @mrazizi, @sw005320, @ysk24ok.

ESPnet Version 0.9.2

31 Aug 06:05
4fe8946
Compare
Choose a tag to compare

New Features

  • [New Features][ESPnet1] CTC segmentation #2301 by @lumaku
  • [New Features][ESPnet2] Support multiple averaged nbest models #2353 by @kamo-naoyuki
  • [New Features][ESPnet2] Support recursive add in pack_funcs and add images to packed model #2367 by @kamo-naoyuki

Bugfix

Documentation

  • [Documentation] updated comment on the documentation #2351 by @GauravPandey892
  • [Documentation][ESPnet2] Update TTS README #2316 by @kan-bayashi
  • [Documentation][ESPnet2][README] Update ESPnet2 TTS README #2376 by @kan-bayashi
  • [Documentation][ESPnet2][README][TTS] Update README #2330 by @kan-bayashi
  • [Documentation][Installation] Devide setup_python.sh into setup_venv.sh and setup_python.sh #2382 by @kamo-naoyuki
  • [Documentation][Installation] add a description about check install. #2360 by @sw005320
  • [Documentation][README] CTC segmentation - Demo #2347 by @lumaku
  • [Documentation][README] Update README.md #2379 by @kamo-naoyuki

Enhancement

  • [Enhancement][ESPnet2] Change the default inference model to averaged model instead of the best #2346 by @kamo-naoyuki
  • [Enhancement][ESPnet2][TTS] Add pitch and energy stats in packing #2350 by @kan-bayashi
  • [Enhancement][Installation] Add checking for pytorch-cuda compatibility in Makefile #2334 by @kamo-naoyuki
  • [Enhancement][Installation] Show raw error message when failed to import packages #2374 by @kamo-naoyuki

Refactoring

  • [Refactoring] Apply new version black #2366 by @kamo-naoyuki
  • [Refactoring][ASR][ESPnet2] Not to add _sp to $asr_exp if --asr_exp option is specified #2368 by @kamo-naoyuki
  • [Refactoring][CI][ESPnet1][ESPnet2][Installation] Add installers for sctk and sph2pipe and create tools/extra_path.sh #2332 by @kamo-naoyuki
  • [Refactoring][ESPnet1][Recipe] Disable preparation for lm in wsj recipe #2373 by @kamo-naoyuki
  • [Refactoring][ESPnet2] Update Task design #2345 by @kamo-naoyuki
  • [Refactoring][ESPnet2][SE] Remove unused option from enh.sh:--feats_normalize #2325 by @kamo-naoyuki

Recipe

  • [Recipe][ASR][ESPnet1] MGB-2 #2289 by @AmirHussein96
  • [Recipe][ASR][ESPnet1] Remove duplicated class definition of Conformer and update some new results of Aishell1 and Switchboard. #2364 by @pengchengguo
  • [Recipe][ASR][ESPnet2][README] ASR WSJ RESULT update: Tuning LM #2355 by @kamo-naoyuki
  • [Recipe][ASR][ESPnet2][README] add pretrained model link #2378 by @kamo-naoyuki

CI

Acknowledgements

Special thanks to @AmirHussein96, @Emrys365, @GauravPandey892, @Piteryo, @ShigekiKarita, @kamo-naoyuki, @kan-bayashi, @koji-okabe-hub, @lumaku, @pengchengguo, @sw005320.

ESPnet Version 0.9.1

15 Aug 07:34
3629c91
Compare
Choose a tag to compare

New Features

  • [New Features] Add metric option to checkpoint averaging for Transformer #2259 by @hirofumi0810
  • [New Features][ESPnet2] Generate run.sh in the experiment dir for resuming #2284 by @kamo-naoyuki
  • [New Features][ESPnet2] Support larger num_iters_per_epoch than the number of batches in small corpus #2255 by @kamo-naoyuki
  • [New Features][ESPnet2] Support torch native automatic mixed precision for espnet2 #2257 by @kamo-naoyuki

Documentation

Enhancement

Refactoring

  • [Refactoring][ESPnet2] Add some new features and a new recipe for the enhancement task #2238 by @Emrys365
  • [Refactoring][Documentation] Remove installation part of Python from Makefile #2245 by @kamo-naoyuki

Recipe

Bug fix

Others

Acknowledgements

Special thanks to @Emrys365, @ShigekiKarita, @hchung12, @hirofumi0810, @kamo-naoyuki, @kan-bayashi, @nzhoward, @placebokkk, @qmpzzpmq.

ESPnet Version 0.9.0

01 Aug 02:10
5410797
Compare
Choose a tag to compare

New Features

Enhancement

Recipe

Refactoring

Documentation

Bugfix

Acknowledgements

Special thanks to @Cescfangs, @Emrys365, @GNroy, @LiChenda, @YosukeHiguchi, @ftshijt, @hirofumi0810, @houwenxin, @ibkuroyagi, @kamo-naoyuki, @kan-bayashi, @nzhoward, @pengchengguo, @qmpzzpmq, @simpleoier, @sw005320, @takaaki-hori, @unilight, @yistLin.

ESPnet Version 0.8.0

16 Jun 06:33
7e193fd
Compare
Choose a tag to compare

ESPnet2

New Features

  • [New Features] Lightweight and Dynamic Convolutions. #1599 by @yuyfujit
  • [New Features] Implement Ngram scorer #1946 by @qmpzzpmq
  • [New Features] resampling in utils/compute-fbank-feats.py and utils/compute-stft-feats.py #2035 by @kamo-naoyuki

Enhancement

Documentation

  • [Documentation] fix a typo for the decoder add_argument_group #2030 by @sw005320
  • [Documentation] Update multiple GPU descriptions. #2016 by @sw005320
  • [Documentation] Finetuning doc + freezing parameters option #1897 by @b-flo

Bugfix

CI

Acknowledgements

Special thanks to @SeanNaren, @ShigekiKarita, @atozto9, @b-flo, @gullyboy007, @hirofumi0810, @houwenxin, @kamo-naoyuki, @qmpzzpmq, @sw005320, @takenori-y, @yuyfujit.

ESPnet Version 0.7.0

24 May 07:01
52382c6
Compare
Choose a tag to compare

Now, the ESPnet project moves on to a new endeavor! We launched espnet2, which aims to refine the modularities (chainer-free, kaldi-free), use a more customizable trainer, support distributed training, and achieve the scalability mainly led by @kamo-naoyuki with his great efforts and leadership. This project is one of the outcomes of our ESPnet hackathon in Tokyo 2019 with a lot of discussions about the design, new features, and community contributions. espnet2 currently supports main ASR recipes (with a well-designed recipe template) and limited TTS recipes. We maintain both espnet1 and espnet2, but gradually move to our development in espnet2. The ESPnet project is further accelerated!

ESPnet2

Bugfix

New Features

Enhancement

Documentation

Recipe

CI

Acknowledgements

Special thanks to @AdolfVonKleist, @Emrys365, @Fhrozen, @ShigekiKarita, @YosukeHiguchi, @beckgom, @b-flo, @ftshijt, @kamo-naoyuki, @kan-bayashi, @kdubovikov, @magictron, @qmeeus, @sknadig, @sw005320, @takenori-y, @yuekaizhang, @zh794390558