Skip to content

zhu00121/two_stage_fusion

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

16 Commits
 
 
 
 
 
 
 
 

Repository files navigation

Physiology-inspired two-stage COVID-19 detection system

This repository contains scripts used to produce reults shown in our IEEE-TASLP paper 'COVID-19 Detection via Fusion of Modulation Spectrum and Linear Prediction Speech Features'.

Link to the paper: https://ieeexplore.ieee.org/document/10097559 (early-access version).

Hand-crafted features

We provided two types of feature sets to capture different abnormalities of articulation system:

  1. Modulation spectrum features
    We also have another repository which includes a toolbox that automatically extracts different versions of modulation spectrum, it has been used previously for characterizing unnatural speech, evaluating reverberation level, and many other applications in biomedical signal analysis. You can find the toolbox here: https://github.com/MuSAELab/modulation_filterbanks

  2. Linear Prediction (LP) features
    LP analysis has been used for separating excitation source and vocal tract, we further extracted low-level descriptors from LP residuals to characterize abnoarmal phonation pattern.

System overview

Repository structure

  • feature: stores extracted modulation features and LP features
  • script
    • LPfunc.py: Functions for LP analysis and feature extraction
    • Dico_LPmain.py: Extract LP features from DICOVA2 dataset
    • Cambridge_LPmain.py: Extract LP features from Cambridge dataset (track2)
    • Dico_MODmain.py: Extract modulation spectrogram features from DICOVA2 dataset
    • Cambridge_MODmain.py: Extract modulation spectrogram features from Cambridge dataset (track2)
    • two_stage.py: Functions to build a two-stage classification system
    • feature_eva.py: Main code to evaluate feature performance

Data availability

COVID-19 datasets experimented in our study can be obtained upon requests from the data holders. Please reach out to them to get access these datasets.

Citation

If you find our feature sets or the system useful, you may use the following format to cite our paper:

@ARTICLE{10097559,
  author={Zhu, Yi and Tiwari, Abhishek and Monteiro, João and Kshirsagar, Shruti and Falk, Tiago H.},
  journal={IEEE/ACM Transactions on Audio, Speech, and Language Processing}, 
  title={COVID-19 Detection via Fusion of Modulation Spectrum and Linear Prediction Speech Features}, 
  year={2023},
  volume={},
  number={},
  pages={1-14},
  doi={10.1109/TASLP.2023.3265603}}

Author

If you have any questions about the paper/code/data, please do not hesitate to reach me at Yi.Zhu@inrs.ca.

About

A physiology-inspired two-stage speech COVID-19 detection system

Topics

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages