This repository contains the source code from the paper:
Diagnosing Vulnerability of Variational Auto-Encoders to Adversarial Attacks
Anna Kuzina, Max Welling, Jakub M. Tomczak.
In this work, we explore adversarial attacks on Variational Auto-Encoders (VAEs). We show how to modify a data point so that it is encoded to a prescribed latent code (supervised attack) or to a drastically different code (unsupervised attack). We examine the influence of model modifications ($\beta$-VAE, NVAE) on the robustness of VAEs and suggest metrics to quantify it.
Below we provide examples of adversarial attacks on NVAE. The first column depicts target images; we then show adversarial inputs (on the left, with a yellow background) and their reconstructions (on the right).
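For intuition, here is a minimal sketch of a supervised attack, assuming a generic PyTorch VAE whose `encode` function returns the mean of the approximate posterior. The function name, the MSE objective, and all hyperparameters are illustrative stand-ins, not the exact procedure implemented in this repository.

```python
import torch
import torch.nn.functional as F

def supervised_attack(encode, x, x_target, eps=0.1, steps=50, lr=1e-2):
    """Craft a perturbation so that x + delta encodes close to x_target's code.

    `encode` is assumed to return the mean of the approximate posterior q(z|x).
    All hyperparameters here are illustrative, not the paper's settings.
    """
    with torch.no_grad():
        z_target = encode(x_target)  # the prescribed latent code
    delta = torch.zeros_like(x, requires_grad=True)
    opt = torch.optim.Adam([delta], lr=lr)
    for _ in range(steps):
        opt.zero_grad()
        z_adv = encode((x + delta).clamp(0.0, 1.0))
        # Supervised objective: match the target code. An unsupervised attack
        # would instead *maximize* the distance to encode(x), the original code.
        loss = F.mse_loss(z_adv, z_target)
        loss.backward()
        opt.step()
        with torch.no_grad():
            delta.clamp_(-eps, eps)  # keep the perturbation norm-bounded
    return (x + delta).detach().clamp(0.0, 1.0)
```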
To reproduce our results, please first install the required dependencies:

```bash
conda env create -f conda_requirements.yaml
```

This creates the conda environment `adv_vae` with the correct dependencies.
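Once created, the environment can be activated with:

```bash
conda activate adv_vae
```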
Parameters that can be specified via the argument parser can be found in the corresponding `config.py` file.
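For example, a run might override defaults from the command line like this (the flag name is hypothetical; check `config.py` for the actual parameters):

```bash
python run_experiments.py --beta 1.0  # hypothetical flag; see config.py
```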
We use wandb to track the experiments. Specify your API key and user name in the files `run_experiments.py`, `run_attack.py`, and `utils/wandb.py` before running the experiments below.
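As an alternative to editing the files, wandb credentials can usually be supplied via the official API; the project and entity names below are placeholders, as the repository's scripts define their own:

```python
import wandb

# Log in with your API key (or set the WANDB_API_KEY environment variable).
wandb.login(key="YOUR_API_KEY")

# Placeholder project/entity names for illustration only.
wandb.init(project="adv_vae", entity="your-username")
```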
1. Train VAE

```bash
bash runs/train_mnist.sh
```

2.1. Run supervised attack

```bash
bash runs/attack_mnist.sh
```

2.2. Run unsupervised attack

```bash
bash runs/attack_mnist_uns.sh
```
If you find this work useful in your research, please consider citing:
```
@article{kuzina2021diagnosing,
  title={Diagnosing Vulnerability of Variational Auto-Encoders to Adversarial Attacks},
  author={Kuzina, Anna and Welling, Max and Tomczak, Jakub M},
  journal={RobustML Workshop@ICLR 2021},
  year={2021},
}
```
This research was (partially) funded by the Hybrid Intelligence Center, a 10-year programme funded by the Dutch Ministry of Education, Culture and Science through the Netherlands Organisation for Scientific Research, https://hybrid-intelligence-centre.nl.