
NASTAR: NOISE ADAPTIVE SPEECH ENHANCEMENT WITH TARGET-CONDITIONAL RESAMPLING

This is the official implementation of our paper. Our work applies resampling techniques to existing source datasets for the noise-adaptive speech enhancement task in a one-shot scenario. The resampling scheme achieves higher scores with less training data.

Current Information

This project is partially public. The SE source code has been released, and we will release the implementation of our resampling method once the paper is accepted. Checkpoints and logs for all noise adaptation settings, together with the testing data, are provided. The pseudo-noise and relevant-cohort demonstrations for NASTAR can be found here.

Installation

Note that our environment uses Python 3.7.10. To run the NASTAR experiments, clone the repository and install the dependencies with pip:

git clone https://github.com/ChangLee0903/NASTAR
cd NASTAR
# Install all the necessary packages
pip install -r requirements.txt

Steps and Usage

  1. Data Preprocessing:

    You can produce your own paired data with the Corruptor class in data.py and set the data paths in config/config.yaml. Note that all training noisy utterances are generated online. Our source noise dataset is provided by DNS-Challenge. The target_data directory contains the noise signal test.wav used for testing, as well as the pseudo-noise result pesudo.wav and the cohort set cohort.txt. Check the dataset section in config/config.yaml:

    dataset:
     train:
       speech: ../speech_data/LibriSpeech/train-clean-360
       noise: ../noise_data/DNS_noise
     dev:
       speech: ../speech_data/LibriSpeech/dev-clean
     test:
       data: ../NASTAR_VCB_test_data
     ...
     

    Since we only test the adapted model on one specific noise type, there is no need to set a noise dataset path under dev. Instead, we use --eval_noise to assign the noise signal for evaluation. To avoid randomness during testing, the testing data have been pre-mixed and placed in NASTAR_VCB_test_data. All noisy and clean utterances are saved as npy files.
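    The online corruption performed by Corruptor boils down to mixing a clean utterance with a noise clip at a sampled SNR. Below is a minimal sketch of that mixing step, assuming numpy float arrays at the same sample rate; the function name mix_at_snr and its signature are illustrative, not the repository's actual API:

    ```python
    import numpy as np

    def mix_at_snr(speech, noise, snr_db):
        """Scale `noise` so the speech-to-noise ratio equals `snr_db`, then add it.

        `speech` and `noise` are 1-D float arrays; the noise clip is tiled
        and truncated to match the length of the utterance.
        """
        # Repeat the noise clip if it is shorter than the utterance.
        reps = int(np.ceil(len(speech) / len(noise)))
        noise = np.tile(noise, reps)[:len(speech)]

        speech_power = np.mean(speech ** 2)
        noise_power = np.mean(noise ** 2)

        # Scaling factor that yields the requested SNR in dB.
        scale = np.sqrt(speech_power / (noise_power * 10 ** (snr_db / 10)))
        return speech + scale * noise
    ```

    Drawing snr_db at random per utterance is what makes the noisy training data "generated online" rather than fixed in advance.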

  2. Training the adapted SE model for each target noise condition:

    First, download the checkpoint of the pretrained model into the PTN directory, and make sure the --ae_ckpt argument is set correctly. Recipes for all the different settings are recorded in our repository, so you can simply run the script. Note that you can set --device to choose which CUDA device is used.

    bash train_NASTAR.sh
    
  3. Evaluation:

    Make sure all of your models have been trained and stored under the path given by --ckptdir. All results are stored in dictionaries and saved as pth files.

    • Check the ckpt directory
    • Run:
    python main.py --task test --n_jobs 16
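    A result pth file can be inspected with torch.load and aggregated with plain Python. The sketch below assumes the stored dictionary maps utterance IDs to per-metric scores; that structure (and the metric names) is an assumption for illustration, not the repository's documented format:

    ```python
    # Hypothetical result structure: {utterance_id: {metric_name: score}}.
    # A real file would instead be read with torch.load("results.pth").
    results = {
        "utt_001": {"pesq": 2.31, "stoi": 0.91},
        "utt_002": {"pesq": 2.58, "stoi": 0.93},
        "utt_003": {"pesq": 2.14, "stoi": 0.89},
    }

    def average_scores(results):
        """Average each metric over all utterances in the result dictionary."""
        totals = {}
        for scores in results.values():
            for metric, value in scores.items():
                totals[metric] = totals.get(metric, 0.0) + value
        return {metric: total / len(results) for metric, total in totals.items()}

    avg = average_scores(results)
    print(avg)
    ```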

Contact

Any bug report or improvement suggestion is appreciated!

e-mail: r08922a28@csie.ntu.edu.tw

About

Official Implementation of NASTAR in PyTorch
