Skip to content
@Audio-WestlakeU

Audio-WestlakeU

Audio Signal and Information Processing Lab at Westlake University

Pinned

  1. FullSubNet FullSubNet Public

    PyTorch implementation of "FullSubNet: A Full-Band and Sub-Band Fusion Model for Real-Time Single-Channel Speech Enhancement."

    Python 507 148

  2. NBSS NBSS Public

    The official repo of NBC & SpatialNet for multichannel speech separation, denoising, and dereverberation

    Python 157 19

  3. McNet McNet Public

    The official repo: "McNet: Fuse Multiple Cues for Multichannel Speech Enhancement", ICASSP 2023

    Python 89 11

  4. audiossl audiossl Public

    A library built for easier audio self-supervised training, downstream tasks evaluation

    Python 65 7

  5. RCT RCT Public

    This repo gives the code for the official implementation of RCT.

    Python 12 1

  6. FN-SSL FN-SSL Public

    PyTorch implementation of "FN-SSL: Full-Band and Narrow-Band Fusion for Sound Source Localization." [INTERSPEECH 2023]

    Python 59 5

Repositories

Showing 10 of 26 repositories
  • ATST-SED Public

    This repo includes the official implementations of "Fine-tune the pretrained ATST model for sound event detection".

    Jupyter Notebook 43 MIT 6 3 0 Updated May 8, 2024
  • NBSS Public

    The official repo of NBC & SpatialNet for multichannel speech separation, denoising, and dereverberation

    Python 157 MIT 19 17 0 Updated Apr 24, 2024
  • audiossl Public

    A library built for easier audio self-supervised training, downstream tasks evaluation

    Python 65 7 0 0 Updated Apr 18, 2024
  • RVAE-EM Public

    Official PyTorch implementation of "RVAE-EM: Generative speech dereverberation based on recurrent variational auto-encoder and convolutive transfer function" [ICASSP2024]

    Python 30 MIT 3 1 0 Updated Mar 20, 2024
  • pytorch_lightning_template_for_beginners Public

    A pytorch template for beginners based on pytorch_lightning

    Python 27 3 0 0 Updated Feb 1, 2024
  • FS-EEND Public

    The official Pytorch implementation of "Frame-wise streaming end-to-end speaker diarization with non-autoregressive self-attention-based attractors". [ICASSP 2024]

    Python 59 MIT 4 3 0 Updated Jan 24, 2024
  • UMA-ASR Public

    This repository is the official implementation of "Unimodal Aggregation for CTC-based Speech Recognition".

    Shell 11 3 1 0 Updated Dec 15, 2023
  • FN-SSL Public

    PyTorch implementation of "FN-SSL: Full-Band and Narrow-Band Fusion for Sound Source Localization." [INTERSPEECH 2023]

    Python 59 5 0 0 Updated Oct 10, 2023
  • FullSubNet Public

    PyTorch implementation of "FullSubNet: A Full-Band and Sub-Band Fusion Model for Real-Time Single-Channel Speech Enhancement."

    Python 507 MIT 148 36 1 Updated Aug 19, 2023

Top languages

Loading…

Most used topics

Loading…