This is the latest version of my PyTorch implementation combining two papers: CURE and Fast Adversarial Training with FGSM.
To gain a clearer understanding of the CURE paper, I highly recommend watching this introductory video first.
Adversarial training, a method for learning robust deep networks, is typically assumed to be more expensive than traditional training because it requires constructing adversarial examples with a first-order method such as projected gradient descent (PGD). In this project, consistent with the findings of Fast Adversarial Training with FGSM, I make the surprising observation that it is possible to train empirically robust models using a much weaker and cheaper adversary, an approach previously believed to be ineffective because it leads to catastrophic overfitting (a sudden collapse of robust test accuracy under PGD attacks). This renders the method no more costly than standard training in practice.

Specifically, I show that adversarial training with the fast gradient sign method (FGSM), when combined with random initialization and the CURE curvature regularizer, is as effective as PGD-based training but has significantly lower cost. Furthermore, I show that FGSM adversarial training can be accelerated even further with standard techniques for efficient training of deep networks, allowing a robust CIFAR10 classifier with 48.3% robust accuracy against PGD attacks at ϵ = 8/255 to be trained in 6 minutes, compared to past work based on "free" adversarial training, which took 10 and 50 hours to reach comparable thresholds.
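As a concrete reference, here is a minimal sketch of the FGSM-with-random-initialization step described above. The function name and the step size `alpha` are illustrative assumptions, not this repo's actual API:

```python
import torch
import torch.nn.functional as F

def fgsm_example(model, x, y, epsilon=8/255, alpha=10/255):
    # Random start inside the epsilon ball: the ingredient that keeps
    # single-step FGSM training from catastrophically overfitting.
    delta = torch.empty_like(x).uniform_(-epsilon, epsilon).requires_grad_()
    loss = F.cross_entropy(model(x + delta), y)
    grad = torch.autograd.grad(loss, delta)[0]
    # One signed-gradient step, projected back into the epsilon ball
    # and clipped to the valid pixel range [0, 1].
    delta = (delta + alpha * grad.sign()).clamp(-epsilon, epsilon)
    return (x + delta).clamp(0, 1).detach()
```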
Previous implementations used NVIDIA Apex to accelerate training on GPU. However, since installing and using Apex is no longer straightforward (its mixed-precision utilities have been upstreamed into PyTorch), I implement Fast Adversarial Training with native PyTorch AMP gradient scalers.
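For reference, this is the standard torch.cuda.amp training pattern the repo adopts, assuming a `model`, an `optimizer`, and a `loader` of (x, y) batches; it is a minimal sketch, not necessarily the exact code in main.py:

```python
import torch
import torch.nn.functional as F

scaler = torch.cuda.amp.GradScaler()
for x, y in loader:
    x, y = x.cuda(), y.cuda()
    optimizer.zero_grad()
    with torch.cuda.amp.autocast():  # run the forward pass in mixed precision
        loss = F.cross_entropy(model(x), y)
    scaler.scale(loss).backward()    # scale the loss to avoid fp16 underflow
    scaler.step(optimizer)           # unscale gradients, then optimizer.step()
    scaler.update()                  # adjust the scale factor for the next step
```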
Run this command for "long-phase" training without catastrophic overfitting (CO), using the classical CURE regularizer:
```
python main.py --delta 'classic' --lr-schedule 'piecewise' --lr-max 0.1 --lr-min 0.0 --opt 'SGD' --lambda 10 --h 3 --epochs 200
```
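Here `--lambda` and `--h` correspond to the CURE regularizer weight λ and the finite-difference step h. Below is a hedged sketch of that penalty, λ‖∇ℓ(x + h·z) − ∇ℓ(x)‖² with z the normalized gradient-sign direction; names are illustrative and may differ from the repo's implementation:

```python
import torch
import torch.nn.functional as F

def cure_penalty(model, x, y, h=3.0, lam=10.0):
    # Assumes 4-D image batches (e.g. CIFAR10), x in shape (B, C, H, W).
    x = x.clone().detach().requires_grad_()
    loss = F.cross_entropy(model(x), y)
    grad_x = torch.autograd.grad(loss, x, create_graph=True)[0]
    # Ascent direction z = sign(grad), scaled to length h and treated
    # as a constant (detached), as in the CURE paper.
    z = grad_x.detach().sign()
    z = h * z / (z.reshape(z.size(0), -1).norm(dim=1).view(-1, 1, 1, 1) + 1e-8)
    grad_xh = torch.autograd.grad(
        F.cross_entropy(model(x + z), y), x, create_graph=True)[0]
    # Finite-difference estimate of the loss curvature along z.
    diff = (grad_xh - grad_x).reshape(x.size(0), -1)
    return lam * diff.pow(2).sum(dim=1).mean()
```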
Run this command for "short-phase" training without catastrophic overfitting (CO), using the classical CURE regularizer:
```
python main.py --delta 'FGSM' --lr-schedule 'cyclic' --lr-max 0.3 --lr-min 0.0 --opt 'SGD' --lambda 700 --h 3 --epochs 30 --batch-size 128 --betta 0.00 --gamma 0.00 --kapa 0.00 --hat 0
```
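The short phase leans on the cyclic (one-cycle) schedule selected by `--lr-schedule 'cyclic'`. Here is a minimal sketch of one common form, assuming a linear ramp up to `--lr-max` at mid-training and back down to `--lr-min`; the exact shape used by main.py may differ:

```python
import numpy as np

def cyclic_lr(step, total_steps, lr_min=0.0, lr_max=0.3):
    # Linear ramp from lr_min to lr_max at mid-training, then back down.
    return float(np.interp(step, [0, total_steps / 2, total_steps],
                           [lr_min, lr_max, lr_min]))
```

An aggressive one-cycle peak like 0.3 is what allows the short 30-epoch phase to converge quickly.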