Crafting Adversarial Example Attacks on Policy Learners

A framework for experimental analysis of adversarial example attacks on policy learning in deep RL. The attack methodologies are detailed in our paper "Whatever Does Not Kill Deep Reinforcement Learning, Makes It Stronger" (Behzadan & Munir, 2017, https://arxiv.org/abs/1712.09344).

This project provides an interface between @openai/baselines and @tensorflow/cleverhans to facilitate the crafting and implementation of adversarial example attacks on deep RL algorithms. We also thank @andrewliao11/NoisyNet-DQN for inspiring our implementation of the NoisyNet algorithm for DQN.
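
As a rough sketch of how these pieces fit together, the snippet below wires a cleverhans (v2.0.0) FastGradientMethod attack onto a DQN Q-network. This is a minimal illustration rather than the project's actual code: sess, q_func, and obs_ph are hypothetical stand-ins for the TensorFlow session, Q-value function, and observation placeholder that a baselines-style DQN exposes.

from cleverhans.model import CallableModelWrapper
from cleverhans.attacks import FastGradientMethod

def make_fgsm_perturber(sess, q_func, obs_ph, eps=0.01):
    # Treat the Q-values as logits; FGSM then perturbs the observation
    # in the direction that degrades the agent's greedy action choice.
    wrapped = CallableModelWrapper(q_func, 'logits')
    fgsm = FastGradientMethod(wrapped, sess=sess)
    adv_obs_op = fgsm.generate(obs_ph, eps=eps, clip_min=0.0, clip_max=1.0)
    # Return a function mapping a batch of observations to their
    # adversarially perturbed counterparts.
    return lambda obs: sess.run(adv_obs_op, feed_dict={obs_ph: obs})

A perturber built this way can be applied to each observation before it reaches the policy.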

Dependencies

  • Python 3
  • cleverhans v2.0.0:
pip install -e git+https://github.com/tensorflow/cleverhans.git#egg=cleverhans
  • others (e.g., gym, ...)

Installation

git clone https://github.com/behzadanksu/rl-attack.git
cd rl-attack
pip install -e .

Examples

Two example scripts are included.

  • enjoy-adv.py: sample implementation of a test-time FGSM attack on pre-trained DQN Atari agents.
  • train.py: sample implementation of a training-time FGSM attack on DQN Atari agents.

Some example invocations on the Breakout environment:

  • Test-time, No attack, testing a DQN model of Breakout trained without parameter noise:
$> python3 enjoy-adv.py --env Breakout --model-dir ./data/Breakout/model-173000 --video ./Breakout.mp4
  • Test-time, No attack, testing a DQN model of Breakout trained with parameter noise (NoisyNet implementation):
$> python3 enjoy-adv.py --env Breakout --noisy --model-dir ./data/Breakout/model-173000 --video ./Breakout.mp4
  • Test-time, Whitebox FGSM attack, testing a DQN model of Breakout trained without parameter noise:
$> python3 enjoy-adv.py --env Breakout --model-dir ./data/Breakout/model-173000 --attack fgsm --video ./Breakout.mp4
  • Test-time, Whitebox FGSM attack, testing a DQN model of Breakout trained with parameter noise (NoisyNet implementation):
$> python3 enjoy-adv.py --env Breakout --noisy --model-dir ./data/Breakout/model-173000 --attack fgsm --video ./Breakout.mp4
  • Test-time, Blackbox FGSM attack, testing a DQN model of Breakout trained without parameter noise:
$> python3 enjoy-adv.py --env Breakout --model-dir ./data/Breakout/model-173000 --attack fgsm --blackbox --model-dir2 ./data/Breakout/model-173000-2 --video ./Breakout.mp4
  • Test-time, Blackbox FGSM attack, testing a DQN model of Breakout trained with parameter noise (NoisyNet implementation), replica model trained without parameter noise:
$> python3 enjoy-adv.py --env Breakout --noisy --model-dir ./data/Breakout/model-173000 --attack fgsm --blackbox --model-dir2 ./data/Breakout/model-173000-2 --video ./Breakout.mp4
  • Test-time, Blackbox FGSM attack, testing a DQN model of Breakout trained with parameter noise (NoisyNet implementation), replica model trained with parameter noise:
$> python3 enjoy-adv.py --env Breakout --noisy --model-dir ./data/Breakout/model-173000 --attack fgsm --blackbox --model-dir2 ./data/Breakout/model-173000-2 --noisy2 --video ./Breakout.mp4
  • Training-time, Whitebox attack, no parameter noise, injecting an adversarial example with 20% probability (see the sketch after this list):
$> python3 train.py --env Breakout --save-dir ./data/Breakout/ --attack fgsm --num-steps 200000000 --attack-prob 0.2
  • Training-time, Whitebox attack, NoisyNet parameter noise, injecting an adversarial example with 100% probability:
$> python3 train.py --env Breakout --noisy --save-dir ./data/Breakout/ --attack fgsm --num-steps 200000000 --attack-prob 1.0
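
The --attack-prob flag controls how often a crafted perturbation is actually injected during training. Below is a minimal sketch of that injection loop, with assumed names: env follows the gym step API, and perturb is a perturbation function such as the FGSM one sketched earlier.

import numpy as np

def poisoned_step(env, action, perturb, attack_prob=0.2, rng=np.random):
    # Step the environment normally, then, with probability attack_prob,
    # hand the learner a perturbed observation instead of the clean one.
    obs, reward, done, info = env.step(action)
    if rng.rand() < attack_prob:
        obs = perturb(obs)
    return obs, reward, done, info

With attack_prob=1.0, every observation the learner sees is perturbed, matching the last example above.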
