ESPnet version 202211
What's Changed
- Update muskits update by @ftshijt in #4616
- Muskit installation by @A-Quarter-Mile in #4617
- Sync Muskits branch with Master by @ftshijt in #4640
- Updates on Muskit Migration by @A-Quarter-Mile in #4631
- Update Muskits branch by @ftshijt in #4662
- Add stage 5 & stage 6 by @A-Quarter-Mile in #4649
- Muskit: rename & reorganize features by @A-Quarter-Mile in #4668
- Update Muskits branch by @ftshijt in #4671
- Muskits CI fixing by @ftshijt in #4672
- Muskits CI fix by @ftshijt in #4673
- Muskits - apply isort by @ftshijt in #4677
- Muskits CI fix by @ftshijt in #4678
- Muskit: Add tokenizer by @A-Quarter-Mile in #4676
- Muskits - various fix for CI test by @ftshijt in #4679
- Muskit: add recipe ofuton by @A-Quarter-Mile in #4681
- Muskits (CI fix) by @ftshijt in #4682
- Fix CI issue in muskits by @ftshijt in #4687
- Add dns_icassp22 Speech Enhancement Recipe by @slSeanWU in #4657
- Singing Voice Synthesis Task for ESPnet by @ftshijt in #4670
- Documentation of Tutorial and Muskits by @ftshijt in #4692
- Add tests on MacOS and Windows (only installation) by @kamo-naoyuki in #4669
- Add missing entries in readme by @ftshijt in #4699
- Support ST without texts in source language by @sophia1488 in #4688
- Update ConvInput for Transducer by @b-flo in #4720
- Small changes for standalone Transducer by @b-flo in #4722
- Fix input block tutorial documentation for Transducer by @b-flo in #4724
- Fix HF Pytest Errors by @siddhu001 in #4737
- Update to puebla-nahuatl recipe (some minor fixes) by @ftshijt in #4713
- Add espnet2 TTS recipe on M-AILABS by @Takaaki-Saeki in #4701
- Update outdated enh config files by @Emrys365 in #4719
- add src_sos & src_eos for mt task to address the index out of range w… by @simpleoier in #4736
- Add g2pk_explicit_space tokenizer by @jonghwanhyeon in #4718
- Fix JETS inference with GST (#4743) by @kan-bayashi in #4744
- Update on Muskit by @A-Quarter-Mile in #4700
- add fleurs conformer+sc-ctc results by @wanchichen in #4746
- Add recipe for OCR task on IAM handwriting dataset by @kenzheng99 in #4707
- Add Talromur2 recipe by @G-Thor in #4680
- Add multi-channel enh_asr for CHiME-4 by @YoshikiMas in #4706
- chunk_mask error by @aky15 in #4751
- fix wav2vec2 encoder mask bug by @simpleoier in #4772
- Add Hugging Face Transformers Decoder, Tokenizer and their example on SLURP by @akreal in #4099
- [Recipe PR] MELD: Multimodal EmotionLines Dataset by @realzza in #4771
- MultiIRIS follow up by @YoshikiMas in #4765
- Add CATSLU results for XLS-R with mBART-50 by @akreal in #4782
- Add MEDIA and PortMEDIA results for XLS-R with mBART-50 by @akreal in #4794
- Add SLUE-VoxPopuli results for WavLM with mBART-50 by @akreal in #4777
- Follow up for SLURP and CATSLU by @akreal in #4796
- Update README in chime4/enh_asr1 by @YoshikiMas in #4795
- fix parsing token_list by @imdanboy in #4778
- Use torchaudio functions for beamforming related operations in torch 1.12.1+ by @Emrys365 in #4638
- PIT E2E multi-speaker ASR and librimix recipe by @simpleoier in #4753
- Fix an audio format issue in some enh recipes by @YoshikiMas in #4799
- Fixing How2-2000h Data preparation and Seq Length Assert for Longformer Encoder by @roshansh-cmu in #4805
- Adding MFA scripts for LJSpeech by @iamanigeeit in #4801
- fix typo in espnet2_tutorial.md by @eltociear in #4811
- [WIP] E-Branchformer Encoder in ESPnet2 by @kkim-asapp in #4812
- Muskit update by @A-Quarter-Mile in #4783
New Contributors
- @A-Quarter-Mile made their first contribution in #4617
- @sophia1488 made their first contribution in #4688
- @kenzheng99 made their first contribution in #4707
- @realzza made their first contribution in #4771
- @iamanigeeit made their first contribution in #4801
- @eltociear made their first contribution in #4811
- @kkim-asapp made their first contribution in #4812
Full Changelog: v.202209...v.202211