GitHub - petermchale/diffusion: Implementation of a Denoising Diffusion Probabilistic Model with some mathematical background.

Acknowledgement

This is a lightly edited version of the following implementation of a Denoising Diffusion Probabilistic Model (DDPM):

https://www.youtube.com/watch?v=S_il77Ttrmg&list=WL&index=4

Theory

See math.ipynb.

Diffusion model

A UNet is used to model the mapping from noisy image to denoised image (see model.py).
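As a rough illustration of where the noisy images come from, here is a minimal NumPy sketch of the standard DDPM forward (noising) process. The linear schedule values (1000 steps, betas from 1e-4 to 0.02) are the defaults from the original DDPM paper and are an assumption here, not something read out of model.py; the function names are hypothetical.

```python
import numpy as np

def linear_beta_schedule(num_steps=1000, beta_start=1e-4, beta_end=0.02):
    # Assumed schedule (DDPM paper defaults); the repo may use different values.
    return np.linspace(beta_start, beta_end, num_steps)

def add_noise(x0, t, alpha_bars, rng):
    # Closed-form sample of x_t given x_0:
    #   x_t = sqrt(alpha_bar_t) * x_0 + sqrt(1 - alpha_bar_t) * noise
    noise = rng.standard_normal(x0.shape)
    xt = np.sqrt(alpha_bars[t]) * x0 + np.sqrt(1.0 - alpha_bars[t]) * noise
    return xt, noise

betas = linear_beta_schedule()
alpha_bars = np.cumprod(1.0 - betas)  # cumulative product of (1 - beta_t)

rng = np.random.default_rng(0)
x0 = rng.uniform(-1.0, 1.0, size=(3, 32, 32))  # stand-in for a CIFAR10-sized image
t = 10
xt, noise = add_noise(x0, t, alpha_bars, rng)
```

A pair such as (xt, x0) is the kind of (noisy image, clean image) example a denoising network can be trained on. At the last step alpha_bars is close to zero, so the image is almost pure Gaussian noise.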

Sanity-check diffusion model

I overfitted the diffusion model to a single training image (racoon.jpg) and checked that the model generates exactly this image when presented with random noise (racoon.ipynb).
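The logic of this sanity check can be sketched without any training. Below, a hypothetical "perfectly overfitted" denoiser always returns the single training image, and standard DDPM ancestral sampling (written in terms of the predicted clean image) is run from pure noise. The schedule, the number of steps, and all names are assumptions for illustration and are not taken from racoon.ipynb.

```python
import numpy as np

def sample(denoiser, shape, num_steps=50, beta_start=1e-4, beta_end=0.02, seed=0):
    # Ancestral sampling using the DDPM posterior q(x_{t-1} | x_t, x0_hat).
    rng = np.random.default_rng(seed)
    betas = np.linspace(beta_start, beta_end, num_steps)  # assumed schedule
    alphas = 1.0 - betas
    alpha_bars = np.cumprod(alphas)

    x = rng.standard_normal(shape)  # start from pure Gaussian noise
    for t in reversed(range(num_steps)):
        x0_hat = denoiser(x, t)  # model's estimate of the clean image
        abar_prev = alpha_bars[t - 1] if t > 0 else 1.0
        coef_x0 = np.sqrt(abar_prev) * betas[t] / (1.0 - alpha_bars[t])
        coef_xt = np.sqrt(alphas[t]) * (1.0 - abar_prev) / (1.0 - alpha_bars[t])
        mean = coef_x0 * x0_hat + coef_xt * x
        if t > 0:
            var = betas[t] * (1.0 - abar_prev) / (1.0 - alpha_bars[t])
            x = mean + np.sqrt(var) * rng.standard_normal(shape)
        else:
            x = mean  # no noise is added at the final step
    return x

# Stand-in for the single training image, scaled to [-1, 1].
target = np.random.default_rng(1).uniform(-1.0, 1.0, size=(3, 8, 8))

def overfit_denoiser(x, t):
    # Hypothetical overfitted model: ignores its input and returns the target.
    return target

generated = sample(overfit_denoiser, target.shape)
```

With this idealized denoiser the final step returns the target itself, whatever noise the sampler started from. That is the behaviour the sanity check looks for in the trained model.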

Generating CIFAR10-like images

I then trained the model on CIFAR10 for 160 epochs on a single-GPU g4dn.xlarge EC2 instance, which took about 6 hours (statistics from the training run: https://api.wandb.ai/links/peter-thomas-mchale/76bqm8a8 ). The trained model is about 240 MB in size. Finally, I used the trained model to generate new images, some of which look realistic (generate-from-cifar10.ipynb).

Further improvements

Most of the images generated from the CIFAR10 model are not realistic. There are two possible, and opposite, explanations:

The labels were not one-hot encoded (and masked out), as they were in the conditional diffusion model described here. It's possible that, without masking of the labels, the model over-relies on them and does not learn how to generate images at all.
The conditional diffusion model trained above may actually be ignoring the labels, which could be corrected using classifier or classifier-free guidance, as described in Luo 2022.
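Both ideas can be sketched in a few lines of NumPy. This is a minimal illustration under stated assumptions, not code from this repository: NULL_LABEL, mask_labels, one_hot and guided_prediction are hypothetical names, and the guidance rule is the usual classifier-free form applied to the model's conditional and unconditional predictions.

```python
import numpy as np

NULL_LABEL = -1  # hypothetical sentinel meaning "label masked out"

def mask_labels(labels, drop_prob, rng):
    # During training, randomly replace labels with NULL_LABEL so the model
    # also learns to denoise without any class information.
    drop = rng.random(labels.shape) < drop_prob
    return np.where(drop, NULL_LABEL, labels)

def one_hot(labels, num_classes):
    # One-hot encode labels; a masked label becomes an all-zero row.
    out = np.zeros((labels.shape[0], num_classes))
    rows = np.flatnonzero(labels != NULL_LABEL)
    out[rows, labels[rows]] = 1.0
    return out

def guided_prediction(pred_cond, pred_uncond, guidance_scale):
    # Classifier-free guidance: move from the unconditional prediction
    # towards (and, for scale > 1, beyond) the conditional one.
    return pred_uncond + guidance_scale * (pred_cond - pred_uncond)

rng = np.random.default_rng(0)
labels = np.array([3, 7, 1, 9])
masked = mask_labels(labels, drop_prob=0.1, rng=rng)
encoded = one_hot(masked, num_classes=10)
```

At sampling time the model would be evaluated twice per step, once with the real label and once with the masked label, and the two outputs combined with guided_prediction. A scale of 1 recovers the purely conditional prediction, and larger values push the sample to depend more strongly on the label.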

Other resources

See also https://github.com/petermchale/minDiffusion

