Provable-Defense-against-Privacy-Leakage-in-Federated-Learning-from-Representation-Perspective

Official implementation of "Provable Defense against Privacy Leakage in Federated Learning from Representation Perspective"

The paper can be found at https://arxiv.org/pdf/2012.06043.pdf

Abstract

Federated learning (FL) is a popular distributed learning framework that can reduce privacy risks by not explicitly sharing private data. However, recent works have demonstrated that sharing model updates makes FL vulnerable to inference attacks. In this work, we show our key observation that the data representation leakage from gradients is the essential cause of privacy leakage in FL. We also provide an analysis of this observation to explain how the data representation is leaked. Based on this observation, we propose a defense against model inversion attacks in FL. The key idea of our defense is to learn to perturb the data representation such that the quality of the reconstructed data is severely degraded while FL performance is maintained. In addition, we derive a certified robustness guarantee for FL and a convergence guarantee for FedAvg after applying our defense. To evaluate our defense, we conduct experiments on MNIST and CIFAR10 for defending against the DLG attack and the GS attack. The results demonstrate that, without sacrificing accuracy, our proposed defense can increase the mean squared error between the reconstructed data and the raw data by as much as 160x for both the DLG attack and the GS attack, compared with baseline defense methods. Therefore, the privacy of the FL system is significantly improved.

Figure: Comparing our defense with the Gradient Compression defense under the GS attack.

Code

We provide the implementation of our defense against the DLG attack and the GS attack. Our code is developed based on the original DLG repo and the original GS repo.

Setup

pytorch=1.2.0
torchvision=0.4.0

Quick start

DLG attack

For the DLG attack, you can change the pruning rate of our defense by changing the percentile parameter in

thresh = np.percentile(deviation_f1_x_norm_sum.flatten().cpu().numpy(), 1)
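For illustration, below is a minimal sketch of how such a percentile threshold can be turned into a pruning mask over the defended layer's gradient. The names prune_representation_gradient, feature_grad, and deviation_norm_sum are assumptions introduced for this example (not the repo's exact variables), and the masking direction shown is only how we read the snippet above; consult the notebook code for the actual defense.

# Hypothetical sketch: turning a percentile threshold into a pruning mask.
# Assumes feature_grad and deviation_norm_sum have the same number of elements;
# raising `percentile` prunes more units.
import numpy as np
import torch

def prune_representation_gradient(feature_grad, deviation_norm_sum, percentile=1):
    # Per-unit deviation scores for the defended representation layer
    scores = deviation_norm_sum.flatten().cpu().numpy()
    # Same percentile call as in the snippet above
    thresh = np.percentile(scores, percentile)
    # Keep units at or above the threshold, zero out the rest
    keep = (np.abs(scores) >= thresh).astype(np.float32)
    mask = torch.from_numpy(keep).to(feature_grad.device).reshape(feature_grad.shape)
    return feature_grad * mask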

We also provide an implementation of the model compression defense; you can uncomment the corresponding code to try it.

GS attack

For the GS attack, you can reproduce the results for the car image in the paper by running

python reconstruct_image.py --target_id=-1 --defense=ours --pruning_rate=60 --save_image

You can try the model compression defense by running

python reconstruct_image.py --target_id=-1 --defense=prune --pruning_rate=60 --save_image

Remark

For computational efficiency, we use ||r|| / ||d(f(r))/dX|| to approximate ||r (d(f(r))/dX)^-1|| in the code. You can edit the code to compute ||r (d(f(r))/dX)^-1|| directly, which achieves better defense results at a higher computational cost.
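As a rough illustration of that approximation, here is a minimal sketch that scores each representation unit by |r_i| / ||d r_i / dX|| using autograd. The function representation_sensitivity and the model_features interface are hypothetical names introduced for this example, not the repo's code.

# Hypothetical sketch of the ||r|| / ||d(f(r))/dX|| approximation, per unit.
import torch

def representation_sensitivity(model_features, x):
    # Score each unit of the representation r = model_features(x) by
    # |r_i| divided by the norm of the gradient of r_i w.r.t. the raw input x.
    x = x.clone().detach().requires_grad_(True)
    r = model_features(x).flatten()
    scores = torch.zeros(r.numel())
    for i in range(r.numel()):
        # Gradient of the i-th representation unit with respect to the input
        grad_i, = torch.autograd.grad(r[i], x, retain_graph=True)
        scores[i] = r[i].detach().abs() / (grad_i.norm() + 1e-12)
    return scores

A score vector computed this way could then feed the same percentile thresholding shown in the DLG section above; pruning more aggressively trades accuracy for stronger degradation of the reconstruction.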
