
Backdoor Attack on Hash-based Image Retrieval via Clean-label Data Poisoning

This repository provides the PyTorch implementation of our BMVC 2023 work: Backdoor Attack on Hash-based Image Retrieval via Clean-label Data Poisoning.

Abstract

A backdoored deep hashing model is expected to behave normally on original query images and to return images with the target label when a specific trigger pattern is present. To this end, we propose the confusing perturbations-induced backdoor attack (CIBA). It injects a small number of poisoned images with the correct label into the training data, which makes the attack hard to detect. To craft the poisoned images, we first propose confusing perturbations that disturb the hashing code learning, so that the hashing model relies more on the trigger. The confusing perturbations are imperceptible and are generated by optimizing the intra-class dispersion and inter-class shift in the Hamming space. We then employ a targeted adversarial patch as the backdoor trigger to improve the attack performance. We have conducted extensive experiments to verify the effectiveness of the proposed CIBA.
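
For intuition, the intra-class dispersion and inter-class shift terms can be written as a short PyTorch-style objective on the relaxed (tanh) hash outputs. The sketch below is only illustrative: the names confusing_loss and target_center, and the way clambda weights the two terms, are assumptions rather than the exact formulation in the released code.

import torch

def confusing_loss(poison_codes, target_center, clambda=0.8):
    """Illustrative confusing-perturbation objective (not the released code).

    poison_codes:  (N, K) relaxed hash outputs (tanh values in [-1, 1]) of the
                   perturbed images carrying the target label.
    target_center: (K,) anchor hash code of the target label, in {-1, +1}.
    clambda:       weight between the two terms (cf. the --clambda flag).
    """
    n, k = poison_codes.shape
    # Intra-class dispersion: make the poisoned codes disagree with each other.
    sim = poison_codes @ poison_codes.t() / k                   # pairwise agreement
    mask = ~torch.eye(n, dtype=torch.bool, device=sim.device)   # drop self-pairs
    dispersion = sim[mask].mean()
    # Inter-class shift: push the poisoned codes away from the target-label code.
    shift = (poison_codes * target_center).mean()
    # Minimizing both terms confuses the hash learning on these images, so the
    # model has to rely on the trigger to tie them to the target label.
    return dispersion + clambda * shift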

Pipeline of CIBA

Installation

This code has been tested in our local environment (Python 3.7), and we recommend using Anaconda to create a virtual environment:

conda create -n CIBA python=3.7

Then, activate the environment:

conda activate CIBA

Install PyTorch:

pip install torch==1.4.0 torchvision==0.5.0
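
To quickly verify the installation (assuming a CUDA-capable GPU is available), you can run:

python -c "import torch, torchvision; print(torch.__version__, torchvision.__version__, torch.cuda.is_available())"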

Data Preparation

  1. Please download the ImageNet dataset.
  2. The lists of training, database, and query images are given in data_prepare/imagenet/train.txt, data_prepare/imagenet/database.txt, and data_prepare/imagenet/query.txt. Note that you need to replace the image paths in these files with your own (for example, with the short script below).
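
Assuming each line of these list files starts with an absolute image path, the path prefix can be rewritten with a small script like the following; OLD_PREFIX and NEW_PREFIX are placeholders for your local paths.

# rewrite_paths.py -- update the ImageNet path prefix in the data lists.
from pathlib import Path

OLD_PREFIX = "/old/path/to/imagenet"   # placeholder: prefix used in the lists
NEW_PREFIX = "/your/path/to/imagenet"  # placeholder: your local ImageNet root

for name in ("train.txt", "database.txt", "query.txt"):
    list_file = Path("data_prepare/imagenet") / name
    list_file.write_text(list_file.read_text().replace(OLD_PREFIX, NEW_PREFIX))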

Get Started

Pre-trained model

You should first train the model on the clean dataset. The model will be saved to models/<dataset>_<arch>_<n-bits>_backdoor.

python train.py --arch vgg11 --dataset imagenet --n-bits 48 --gpu-id 0

Generate the trigger pattern

The trigger pattern will be saved to <path>/<target_label>/<trigger_size>. We provide the five target labels and the corresponding trigger patterns used in our experiments.

python generate_trigger_pattern.py --path models/imagenet_vgg11_48_backdoor --arch vgg11 --dataset imagenet --n-bits 48 --trigger_size 24 --target_label yurt --gpu-id 0
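
Conceptually, the targeted adversarial patch is optimized so that patched images hash close to the target label's code. The following sketch only illustrates that idea; model, loader, target_center, the patch location, and the hyperparameters are assumptions rather than the exact released implementation.

import torch

def generate_trigger(model, loader, target_center, trigger_size=24,
                     steps=2000, lr=0.01, device="cuda"):
    """Illustrative targeted-patch optimization (not the released code).

    model:         clean hashing model mapping images to K-dim code logits.
    loader:        batches of (images, labels), images in [0, 1].
    target_center: (K,) hash code of the target label, in {-1, +1}.
    """
    target_center = target_center.to(device)
    patch = torch.rand(3, trigger_size, trigger_size, device=device,
                       requires_grad=True)
    opt = torch.optim.Adam([patch], lr=lr)
    # Run up to `steps` batches (or until the loader is exhausted).
    for _, (imgs, _) in zip(range(steps), loader):
        imgs = imgs.to(device)
        patched = imgs.clone()
        # Stamp the patch in the bottom-right corner (location is an assumption).
        patched[:, :, -trigger_size:, -trigger_size:] = patch
        codes = torch.tanh(model(patched))      # relaxed hash codes
        # Pull the codes of patched images toward the target-label code.
        loss = -(codes * target_center).mean()
        opt.zero_grad()
        loss.backward()
        opt.step()
        patch.data.clamp_(0, 1)                 # keep the patch a valid image
    return patch.detach()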

Backdoor attack

We craft poisoned images by adding the trigger and the confusing perturbations to images with the target label. We then train the model on the poisoned dataset and test the backdoored model. The backdoored model will be saved to <path>/<target_label>/<trigger_size>/<poison_num>/<pert><clambda>.
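
As an illustration, a single poisoned training image can be assembled as sketched below; the L-infinity budget eps and the bottom-right patch location are assumptions, not necessarily the settings used in backdoor_attack.py.

import torch

def make_poisoned_image(img, trigger, delta, eps=8 / 255):
    """Illustrative assembly of one poisoned image (settings are assumptions).

    img:     (3, H, W) clean image with the target label, values in [0, 1].
    trigger: (3, t, t) trigger patch produced by generate_trigger_pattern.py.
    delta:   (3, H, W) confusing perturbation computed for this image.
    eps:     perturbation budget (assumed L-infinity bound).
    """
    # Add the imperceptible confusing perturbation, keeping it bounded.
    poisoned = (img + delta.clamp(-eps, eps)).clamp(0, 1)
    # Stamp the trigger patch (bottom-right corner, as an assumption).
    t = trigger.size(-1)
    poisoned[:, -t:, -t:] = trigger
    return poisoned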

The four backdoor attacks in our paper can be run as follows.

  • "Tri"
python backdoor_attack.py --path models/imagenet_vgg11_48_backdoor --arch vgg11 --dataset imagenet --n-bits 48 --poison_num 60 --trigger_size 24 --target_label yurt --pert non --gpu-id 0
  • "Tri+Noise"
python backdoor_attack.py --path models/imagenet_vgg11_48_backdoor --arch vgg11 --dataset imagenet --n-bits 48 --poison_num 60 --trigger_size 24 --target_label yurt --pert noise --gpu-id 0
  • "Tri+Adv"
python backdoor_attack.py --path models/imagenet_vgg11_48_backdoor --arch vgg11 --dataset imagenet --n-bits 48 --poison_num 60 --trigger_size 24 --target_label yurt --pert confusing --clambda 0 --gpu-id 0
  • "CIBA"
python backdoor_attack.py --path models/imagenet_vgg11_48_backdoor --arch vgg11 --dataset imagenet --n-bits 48 --poison_num 60 --trigger_size 24 --target_label yurt --pert confusing --clambda 0.8 --gpu-id 0

Citation

@inproceedings{gao2023ciba,
  title={Backdoor Attack on Hash-based Image Retrieval via Clean-label Data Poisoning},
  author={Gao, Kuofeng and Bai, Jiawang and Chen, Bin and Wu, Dongxian and Xia, Shu-Tao},
  booktitle={BMVC},
  year={2023}
}

Acknowledgements

This repository is mainly based on DTHA. Thanks for their wonderful work!
