Techniques for improving the performance of differentially private (DP) GANs, as described in our paper:
- Alex Bie, Gautam Kamath*, Guojun Zhang*. Private GANs, revisited. In TMLR, 2023.
Figure: Tuning nD (number of D steps per G step) improves FID on MNIST.
Disclaimer: this is research code, not a production-grade DP implementation suitable for releasing real sensitive data. In particular, it does not address issues like secure RNG and floating-point vulnerabilities.
- See `requirements.txt`.
- Tested on: Python 3.11.4, PyTorch 2.0.1 (CUDA 11.7, cuDNN 8.5), Opacus 1.1.3.
- IMPORTANT: tests fail with the latest Opacus (1.4.0); there appear to be breaking changes after 1.1.3, so pin Opacus to 1.1.3 (e.g. `pip install opacus==1.1.3`).
```
pip install -e .        ## install
python -m pytest test   ## run tests (optional)
python train_dpgan.py   ## run ε=10 MNIST config
                        ## requires ~15GB VRAM, runs in ~8 hours on 1x V100
```
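As a sanity check on the privacy accounting, you can estimate ε for a given noise multiplier and number of discriminator steps with Opacus's RDP accountant. A minimal sketch; all hyperparameter values below are assumptions for illustration, not necessarily what the shipped config uses:

```python
# Hedged sketch: reproduce DP-SGD privacy accounting with Opacus's RDP
# accountant. All values below are assumed for illustration. Only
# discriminator steps consume privacy budget, since the generator never
# touches real data.
from opacus.accountants import RDPAccountant

n = 60_000            # MNIST training set size
bsz = 128             # expected batch size (assumed)
sigma = 1.0           # noise multiplier (assumed)
num_d_steps = 50_000  # total discriminator steps (assumed)
delta = 1e-5

accountant = RDPAccountant()
for _ in range(num_d_steps):
    accountant.step(noise_multiplier=sigma, sample_rate=bsz / n)

print(f"epsilon = {accountant.get_epsilon(delta=delta):.2f} at delta = {delta}")
```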
See intermediate eval results and other diagnostics with TensorBoard. TensorBoard logs are saved in `logs/<dataset>/<run>/`. To view them:

```
tensorboard --logdir logs   ## then visit localhost:6006 in a web browser
```

Checkpoints are saved in `results/<dataset>/<run>/`.
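For a quick qualitative check without the full eval script, you can load a generator checkpoint and dump a sample grid. A minimal, self-contained sketch of the load-and-sample pattern; `ToyGenerator` below is a stand-in, not this repo's architecture, and the checkpoint path is a placeholder:

```python
# Hedged sketch: load a saved generator and write a grid of samples.
# ToyGenerator is NOT this repo's model; swap in the repo's generator
# class so the state_dict keys match.
import torch
import torch.nn as nn
from torchvision.utils import save_image

class ToyGenerator(nn.Module):
    """Stand-in DCGAN-style generator producing 28x28 grayscale images."""

    def __init__(self, z_dim=100):
        super().__init__()
        self.z_dim = z_dim
        self.net = nn.Sequential(
            nn.ConvTranspose2d(z_dim, 64, 7, 1, 0), nn.ReLU(),  # 1x1 -> 7x7
            nn.ConvTranspose2d(64, 32, 4, 2, 1), nn.ReLU(),     # 7x7 -> 14x14
            nn.ConvTranspose2d(32, 1, 4, 2, 1), nn.Tanh(),      # 14x14 -> 28x28
        )

    def forward(self, z):
        return self.net(z.view(-1, self.z_dim, 1, 1))

g = ToyGenerator()
# With a real checkpoint (placeholder path):
# g.load_state_dict(torch.load("results/mnist/<run>/<g-checkpoint>.pt",
#                              map_location="cpu"))
g.eval()
with torch.no_grad():
    save_image(g(torch.randn(64, g.z_dim)), "samples.png", nrow=8, normalize=True)
```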
After training is done, to run FID and accuracy eval on a checkpoint:

```
python scripts/eval_checkpoint.py --path results/<dataset>/<run>/<g-checkpoint>.pt
```
By default, this: (1) creates folders of `.png` files for real and generated data; and (2) runs `pytorch-fid` and classifier training from those folders. You can add the `--in_memory` flag to skip the image saving and loading, which yields similar but not identical numbers.
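If you want FID outside of `eval_checkpoint.py`, the same folder-based computation can be done with `pytorch-fid`'s Python API. A minimal sketch; the folder paths are placeholders for the directories the eval script writes:

```python
# Hedged sketch: compute FID directly from the two .png folders that the
# eval script writes out, using pytorch-fid's Python API.
import torch
from pytorch_fid.fid_score import calculate_fid_given_paths

device = "cuda" if torch.cuda.is_available() else "cpu"
fid = calculate_fid_given_paths(
    ["path/to/real_pngs", "path/to/generated_pngs"],  # placeholder paths
    batch_size=50,
    device=device,
    dims=2048,  # standard InceptionV3 pool3 feature dimension
)
print(f"FID: {fid:.2f}")
```

The equivalent CLI is `python -m pytorch_fid path/to/real_pngs path/to/generated_pngs`.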
Write your own training configurations in a `config.yaml`. To use it, run:

```
python train_dpgan.py --config config.yaml
```

See `exp_configs/example.yaml` for an example config file that you can modify.
Some important configs you might want to experiment with (see the sketch after this list for how they map onto a DP training loop):

- `bsz`: expected batch size.
- `num_d_steps`: total number of discriminator steps.
- `d_steps_per_g_step`: frequency of generator steps, relative to discriminator steps.
- `dp`: toggles between DP and non-private training.
- `sigma`: noise multiplier for DP.
- `max_physical_bsz`: used to simulate large batch sizes; experiment with this on your setup to maximize throughput without OOMing.
- `ds`: enables adaptive discriminator step frequency.
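As a rough guide to what these knobs do, here is a toy, self-contained sketch (not this repo's training code) of how `bsz`, `sigma`, `d_steps_per_g_step`, and `max_physical_bsz` typically map onto an Opacus DP-GAN loop. DP-SGD wraps only the discriminator, since the generator never sees real data:

```python
# Hedged, toy sketch of an Opacus DP-GAN loop; models, data, and values
# are stand-ins for illustration, not this repo's code.
import torch
import torch.nn as nn
from torch.utils.data import DataLoader, TensorDataset
from opacus import PrivacyEngine
from opacus.utils.batch_memory_manager import BatchMemoryManager

bsz, max_physical_bsz = 512, 128         # expected vs. physical batch size
sigma, d_steps_per_g_step, z_dim = 1.0, 5, 64

x_all = torch.rand(4096, 784)            # stand-in for flattened MNIST
loader = DataLoader(TensorDataset(x_all), batch_size=bsz)

d = nn.Sequential(nn.Linear(784, 256), nn.LeakyReLU(0.2), nn.Linear(256, 1))
g = nn.Sequential(nn.Linear(z_dim, 256), nn.ReLU(), nn.Linear(256, 784), nn.Sigmoid())
d_opt = torch.optim.Adam(d.parameters(), lr=2e-4)
g_opt = torch.optim.Adam(g.parameters(), lr=2e-4)
bce = nn.BCEWithLogitsLoss()

# DP-SGD wraps the discriminator only: per-example clipping + Gaussian noise.
privacy_engine = PrivacyEngine()
d, d_opt, loader = privacy_engine.make_private(
    module=d, optimizer=d_opt, data_loader=loader,
    noise_multiplier=sigma,   # config: sigma
    max_grad_norm=1.0,        # per-example gradient clipping bound
)

# BatchMemoryManager feeds physical chunks of <= max_physical_bsz examples
# while the optimizer accumulates them into one logical (expected) batch.
with BatchMemoryManager(
    data_loader=loader, max_physical_batch_size=max_physical_bsz, optimizer=d_opt
) as mem_loader:
    for step, (x_real,) in enumerate(mem_loader):
        # Discriminator step: the only privacy-consuming update.
        d_opt.zero_grad()
        x_fake = g(torch.randn(len(x_real), z_dim)).detach()
        real_loss = bce(d(x_real), torch.ones(len(x_real), 1))
        fake_loss = bce(d(x_fake), torch.zeros(len(x_fake), 1))
        (real_loss + fake_loss).backward()
        d_opt.step()
        # Generator step every d_steps_per_g_step discriminator steps; it
        # sees no real data, so it needs no DP machinery. (For simplicity
        # this counts physical batches; with max_physical_bsz < bsz you
        # would count actual optimizer steps instead.)
        if (step + 1) % d_steps_per_g_step == 0:
            g_opt.zero_grad()
            g_loss = bce(d(g(torch.randn(bsz, z_dim))), torch.ones(bsz, 1))
            g_loss.backward()
            g_opt.step()
```

Note that `BatchMemoryManager` only changes how each expected batch is chunked through memory; the privacy accounting still sees the full expected batch size.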
Some settings used in the paper can be found in `exp_configs/`.
Selected benchmark numbers, obtained by running the configs in this repo:

| ε | Dataset | Adaptive nD? | FID | Acc. (%) | Mem. | Config |
|---|---------|--------------|-----|----------|------|--------|
| ∞ | MNIST | ✗ | 3.4 | 97.1 | 6GB | `mnist-nonpriv.yaml` |
| 10 | MNIST | ✗ | 19.4 | 93.0 | 15GB | `mnist-eps10-50dsteps.yaml` |
| 10 | MNIST | ✓ | 13.3 | 94.4 | 25GB | `mnist-eps10-adaptive.yaml` |
Figure: Generated MNIST images @ ε=10, adaptive nD.
Repo structure from:
- Patrick J. Mineault & The Good Research Code Handbook Community. The Good Research Code Handbook. Zenodo. doi:10.5281/zenodo.5796873. 2021.
The original non-private GAN implementation is adapted from Hyeonwoo Kang's code, which is an implementation of DCGAN:
- Alec Radford, Luke Metz, Soumith Chintala. Unsupervised representation learning with deep convolutional generative adversarial networks. In ICLR 2016.
This implementation makes heavy use of Opacus:
- Ashkan Yousefpour, Igor Shilov, Alexandre Sablayrolles, Davide Testuggine, Karthik Prasad, Mani Malek, John Nguyen, Sayan Ghosh, Akash Bharadwaj, Jessica Zhao, Graham Cormode, Ilya Mironov. Opacus: User-friendly differential privacy library in PyTorch. 2021.
If you find this code useful, please consider citing us:
```
@article{dpgan-revisit,
  title   = {Private {GAN}s, revisited},
  author  = {Alex Bie and Gautam Kamath and Guojun Zhang},
  journal = {Trans. Mach. Learn. Res.},
  volume  = {2023},
  year    = {2023},
  url     = {https://openreview.net/forum?id=9sVCIngrhP}
}
```