ASV-anti-spoofing-with-EABN

Efficient Attention Branch Network with Combined Loss Function for Automatic Speaker Verification Spoof Detection

Many endeavors have sought to develop countermeasure techniques as enhancements to Automatic Speaker Verification (ASV) systems, in order to make them more robust against spoof attacks. As evidenced by the latest ASVspoof 2019 countermeasure challenge, models currently deployed for the task of ASV are, at their best, devoid of suitable degrees of generalization to unseen attacks. Upon further investigation of the proposed methods, it appears that a broader three-tiered view of the proposed systems, comprising the classifier, the feature extraction phase, and the model loss function, may to some extent lessen the problem. Accordingly, the present study proposes the Efficient Attention Branch Network (EABN) modular architecture with a combined loss function to address the generalization problem. The EABN architecture is based on attention and perception branches. The attention branch, which is also interpretable from a human's point of view, produces an attention mask meant to improve classification performance. The perception branch, on the other hand, is used for the primary task at hand, that is, spoof detection. The new EfficientNet-A0 architecture was employed for the perception branch, with nearly ten times fewer parameters and approximately seven times fewer floating-point operations than the top-performing SE-Res2Net50 network. The final evaluation results on the ASVspoof 2019 dataset suggest an EER = 0.86% and t-DCF = 0.0239 in the Physical Access (PA) scenario using the log-PowSpec input feature, EfficientNet-A0 for the perception branch, and the combined loss function. Furthermore, using the LFCC input feature, SE-Res2Net50 for the perception branch, and the combined loss function, the proposed model achieved EER = 1.89% and t-DCF = 0.507 in the Logical Access (LA) scenario, which, to the best of our knowledge, is the best single-system ASV spoofing countermeasure.

More details on the architecture, experiments, and results can be found in our published paper.
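
The results above are reported as equal error rate (EER), the operating point where the false acceptance and false rejection rates meet. As an illustration only, the sketch below computes an EER from two lists of countermeasure scores using the standard library. The function name `compute_eer` and the convention that higher scores mean "more bona fide" are assumptions made for this example; the repository's own eval_metrics.py may compute it differently.

```python
def compute_eer(bonafide_scores, spoof_scores):
    """Return (eer, threshold), assuming higher scores mean more bona fide."""
    thresholds = sorted(set(bonafide_scores) | set(spoof_scores))
    best = None
    for t in thresholds:
        # False rejection: bona fide trials scored below the threshold.
        frr = sum(s < t for s in bonafide_scores) / len(bonafide_scores)
        # False acceptance: spoof trials scored at or above the threshold.
        far = sum(s >= t for s in spoof_scores) / len(spoof_scores)
        gap = abs(far - frr)
        # Keep the threshold where the two error rates are closest.
        if best is None or gap < best[0]:
            best = (gap, (far + frr) / 2, t)
    return best[1], best[2]
```

The search is over observed scores only, so on small score lists the returned value is the midpoint of the two closest error rates rather than an interpolated crossing.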

Dependencies

Python and packages

This code was tested on Python 3.8 with PyTorch 1.9.0. Other packages can be installed by:
```
pip install -r requirements.txt
```

Kaldi-io-for-python

kaldi-io-for-python is a Python package used for reading and writing data in the Kaldi ark and scp formats. See README.md in kaldi-io-for-python for installation instructions.
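
For orientation, a Kaldi scp file is a plain-text index in which each line maps an utterance key to a location, typically written as `key path:offset` when it points into an ark archive. The sketch below parses such lines with the standard library only. `parse_scp_line` is a hypothetical helper written for this example and is not part of kaldi-io-for-python, which handles the actual matrix reading.

```python
def parse_scp_line(line):
    """Split one 'key path[:offset]' scp line into (key, path, offset)."""
    key, rxspec = line.strip().split(None, 1)
    path, sep, offset = rxspec.rpartition(":")
    if sep and offset.isdigit():
        # The entry points at a byte offset inside an ark archive.
        return key, path, int(offset)
    # No byte offset present: the entry points at a whole file.
    return key, rxspec, None
```
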

MATLAB

The LFCC feature adopted in this work is extracted via the MATLAB code provided by the ASVspoof2019 organizers.

Dataset

This work is conducted on the ASVspoof2019 dataset, which can be downloaded from https://datashare.ed.ac.uk/handle/10283/3336. It consists of two subsets, i.e. physical access (PA) for replay attacks and logical access (LA) for synthetic speech attacks.
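
As a rough illustration of how the bona fide and spoof labels of either subset can be loaded, the sketch below parses countermeasure protocol lines. It assumes a five-column layout (speaker ID, audio file name, environment ID or `-`, attack ID or `-`, key); please check this against the documentation shipped with the dataset. `parse_cm_protocol` is a hypothetical helper and not a function from this repository.

```python
def parse_cm_protocol(lines):
    """Map each audio file name to its 'bonafide' or 'spoof' key."""
    labels = {}
    for line in lines:
        fields = line.split()
        if len(fields) != 5:
            continue  # skip blank or malformed lines
        # Assumed column order: speaker, file name, environment, attack, key.
        _speaker, utt_id, _env, _attack, key = fields
        labels[utt_id] = key
    return labels
```

It accepts any iterable of strings, so an open protocol file can be passed directly.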

Start Your Project

This repository mainly consists of two parts: (i) feature extraction and (ii) system training and evaluation.

Feature extraction

Three features are adopted in this repo, i.e. Spec, LFCC and CQT. The top-level script for feature extraction is extract_feats.sh. Its first step (Stage 0) prepares the dataset and must be run before any feature extraction. The script also extracts the Spec (Stage 1) and CQT (Stage 2) features, while for LFCC extraction you need to run the ./baseline/write_feature_kaldi_PA_LFCC.sh and ./baseline/write_feature_kaldi_LA_LFCC.sh scripts. All features must then be truncated by Stage 4 of extract_feats.sh.

After setting your dataset directory in extract_feats.sh, you can run any stage (e.g. NUM) of extract_feats.sh with:

./extract_feats.sh --stage NUM

For LFCC extraction, you need to run

./baseline/write_feature_kaldi_LA_LFCC.sh
./baseline/write_feature_kaldi_PA_LFCC.sh
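
The truncation step (Stage 4) brings every utterance to a common number of frames so that features can be batched. This README does not describe how Stage 4 does this, so the sketch below only shows one common approach: cut long utterances and repeat short ones. The helper name `fix_length`, the repeat-padding strategy, and the target length are assumptions for illustration, not the repository's implementation.

```python
def fix_length(frames, target_len):
    """Truncate or repeat-pad a list of feature frames to target_len frames."""
    if not frames:
        raise ValueError("cannot fix the length of an empty utterance")
    if len(frames) >= target_len:
        # Long utterance: keep only the first target_len frames.
        return frames[:target_len]
    # Short utterance: repeat it until long enough, then cut to size.
    repeats = -(-target_len // len(frames))  # ceiling division
    return (frames * repeats)[:target_len]
```

Here each frame can be any object, for example a list of spectral coefficients for one time step.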

System training and evaluation

This repo supports different system architectures, as configured in the conf/training_mdl directory. You can specify the system architecture and acoustic features in start.sh, then run the command below to train and evaluate your models.

./start.sh

Remember to rename your runid in start.sh to differentiate each configuration.

Citation

Contact

Feel free to contact us for any further information through the channels below.

Amirmohammad Rostami

Mohammad Mehdi Homayounpour

Ahmad Nickabadi
