Full-band and Narrow-band fusion Network for SSL

Introduction

This repository provides methods which based on full-band and narrow-band fusion network for sound source localization. The narrow-band module processes the along-time sequences to focus on learning these narrow-band spatial information. The full-band module processes the along-frequency sequence to focus on learning the full-band correlation of spatial cues, such as the linear relation of DP-IPD to frequency.

Methods

Two official implemented sound source localization methods are included:

Datasets

Source signals: from LibriSpeech database
Real-world multi-channel microphone signals: from LOCATA database

Quick start (will be update soon)

Preparation
- Download the required dataset and organize the data according to the data_org in the data folder.
- Generate multi-channel data, You can set data_num (in Simu.py) to control the size of the dataset. --train, -- test, --dev are used to control the generation of train dataset, test dataset, and validation dataset, respectively. The source data path of them are specified by dirs ['sousig_train '] in Opt.py.
```
python Simu.py --train/--test/--dev
```
- For DP-IPD regression, set is_doa = False (Model.FN_SSL), and use mse loss function, for DOA classification, set is_doa = True (Model.FN_SSL), and use ce loss function, meanwhile, the predgt2doa needs to be replaced synchronously. The initial Learning rate of doa classification is set to 5e-4.
```
net = at_model.FN_SSL(is_doa=True/False)
```
Training
- For train step, --gpu-id is used to specify the gpu, ---bz corresponds to the batch size of train process, validation process, and test process, respectively.
```
python Train.py --train --gpu-id [*] --bz * * * 
```
Evaluation
- In the inference stage, you can set checkpoints_dir (Predict. py) to select weights, we provide simulation dataset inference and locata dataset inference.
- For simulated data evaluation
```
python Predict.py --test --datasetMode simulate --bz * * *
```
- For LOCATA dataset evaluation
```
python Predict.py --test --datasetMode locata
```
Pytorch Lightning version
- We have re implemented FN-SSL using the Pytorch-lightning framework, which has a improvement in training speed compared to the torch.
- For Train,
```
python main.py fit --data.batch_size=[*,*] --trainer.devices=*,*
```
- For test,
```
python main.py test  --ckpt_path logs/MyModel/version_x/checkpoints/**.ckpt --trainer.devices=*,*
```
Pretrained models
- Using the FN_lightning model to load the lightning checkpoint in torch framework.

Framework	Task	Checkpoint
Lightning	DP-IPD regression	https://pan.baidu.com/s/1zRKpiqbSuo80Xu5ZRoS1gQ?pwd=6w51
Lightning	DOA classification	https://pan.baidu.com/s/1U1Wl5ZBZBItc2Vku7AyqNA?pwd=ceqm

more checkpoints will be update soon.

Citation

If you find our work useful in your research, please consider citing:

@InProceedings{wang2023fnssl,
    author = "Yabo Wang and Bing Yang and Xiaofei Li",
    title = "FN-SSL: Full-Band and Narrow-Band Fusion for Sound Source Localization",
    booktitle = "Proceedings of INTERSPEECH",
    year = "2023",
    pages = ""}

Reference code

Licence

MIT

Name		Name	Last commit message	Last commit date
Latest commit History 62 Commits
Lightning		Lightning
data		data
Dataset.py		Dataset.py
Learner.py		Learner.py
Model.py		Model.py
Module.py		Module.py
Opt.py		Opt.py
Predict.py		Predict.py
README.md		README.md
Simu.py		Simu.py
Train.py		Train.py
utils.py		utils.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Lightning

Lightning

data

data

Dataset.py

Dataset.py

Learner.py

Learner.py

Model.py

Model.py

Module.py

Module.py

Opt.py

Opt.py

Predict.py

Predict.py

README.md

README.md

Simu.py

Simu.py

Train.py

Train.py

utils.py

utils.py

Repository files navigation

Full-band and Narrow-band fusion Network for SSL

Introduction

Methods

Datasets

Quick start (will be update soon)

Citation

Reference code

Licence

About

Packages

Contributors 2

Languages

Audio-WestlakeU/FN-SSL

Folders and files

Latest commit

History

Repository files navigation

Full-band and Narrow-band fusion Network for SSL

Introduction

Methods

Datasets

Quick start (will be update soon)

Citation

Reference code

Licence

About

Topics

Resources

Stars

Watchers

Forks

Languages