RelaxLoss

This is a minimal implementation, in PyTorch 1.11, of the ICLR 2022 paper RelaxLoss: Defending Membership Inference Attacks without Losing Utility. The paper introduces a defense against membership inference attacks (MIA): an algorithm that makes the loss distributions of member (train) and non-member (test) samples similar. I was able to replicate this effect (depicted in Figure 1 of the paper).

From left to right: default training, RelaxLoss with α=0.5, RelaxLoss with α=1.0.

Simply run src/runner.sh to replicate these results. The code is also available in notebook format in notebooks/RelaxLoss.ipynb.
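
For reference, here is a minimal sketch of the per-batch training logic as I understand it from the paper's Algorithm 1. The function and variable names are my own, and it simplifies some details; the actual implementation lives in src/:

```python
import torch
import torch.nn.functional as F

def relaxloss_step(model, optimizer, x, y, alpha, epoch, num_classes):
    """One RelaxLoss training step (illustrative sketch, not the repo's exact code).

    alpha is the target loss level: instead of driving the training loss to zero,
    the defense relaxes it toward alpha so that the member and non-member loss
    distributions stay similar.
    """
    optimizer.zero_grad()
    logits = model(x)
    losses = F.cross_entropy(logits, y, reduction='none')
    loss = losses.mean()

    if loss > alpha:
        loss.backward()        # above the target: ordinary gradient descent
    elif epoch % 2 == 0:
        (-loss).backward()     # below the target: gradient ascent back toward alpha
        # (the parity choice for which epochs do ascent is arbitrary here)
    else:
        # Posterior flattening: for correctly classified samples, keep the
        # predicted confidence on the true class and spread the remaining
        # probability mass uniformly over the other classes, then minimize
        # the soft-label cross-entropy.
        with torch.no_grad():
            probs = F.softmax(logits, dim=1)
            conf = probs.gather(1, y.unsqueeze(1))
            soft = ((1 - conf) / (num_classes - 1)).expand_as(probs).clone()
            soft.scatter_(1, y.unsqueeze(1), conf)
        soft_ce = -(soft * F.log_softmax(logits, dim=1)).sum(dim=1)
        correct = logits.argmax(dim=1).eq(y)
        torch.where(correct, soft_ce, losses).mean().backward()

    optimizer.step()
    return loss.item()
```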

I've also tested the algorithm against a simple loss-thresholding attack: the attacker knows the model's average training loss and predicts a sample as a 'member' if its loss is below that average. Concretely, a single run on CIFAR10 yielded the following results (a sketch of the attack follows the table).

| Setting | Train Acc | Test Acc | Train Avg. Loss / Var | Test Avg. Loss / Var | MIA Balanced Acc. |
|---|---|---|---|---|---|
| Regular | 100% | 84.1% | 6.5e-4 / 3e-6 | 0.75 / 4.54 | 62.7% |
| RelaxLoss (α=0.5) | 94.3% | 83.3% | 0.46 / 0.35 | 0.72 / 0.77 | 55.4% |
| RelaxLoss (α=1.0) | 83.77% | 79.6% | 0.97 / 0.61 | 1.07 / 0.70 | 52.2% |
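
A minimal sketch of the thresholding attack and its balanced accuracy follows. The names are illustrative (not the repo's API), and it assumes the loaders yield (input, label) batches:

```python
import torch
import torch.nn.functional as F

@torch.no_grad()
def loss_threshold_attack(model, train_loader, test_loader, device='cpu'):
    """Loss-thresholding MIA: predict 'member' when a sample's loss falls
    below the model's average training loss."""
    model.eval()

    def per_sample_losses(loader):
        losses = []
        for x, y in loader:
            x, y = x.to(device), y.to(device)
            losses.append(F.cross_entropy(model(x), y, reduction='none'))
        return torch.cat(losses)

    train_losses = per_sample_losses(train_loader)   # members
    test_losses = per_sample_losses(test_loader)     # non-members
    threshold = train_losses.mean()

    tpr = (train_losses < threshold).float().mean()   # members correctly flagged
    tnr = (test_losses >= threshold).float().mean()   # non-members correctly rejected
    return 0.5 * (tpr + tnr)                          # balanced accuracy
```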