GAN Testing Playground (WIP)

Project about testing techniques about training GANs and their stability

General info

This project contains code for some of the most know types of GAN (Generative Adverserial Network). I am using this repo to play with these types of networks to get better understanding how they work and how to properly train them.

Previous version of this reposritory was moved to branch: old
Disclaimer: This repository is more like proof of concept than download and run!
Some scripts might not work because I'm refactoring so fast and forgot to test it.

Content

DCGAN - GAN for generating new images from latent vector
WGAN(GC) - GAN for generating new images from latent vector
Conditional GAN - GAN for generating new images from latent vector and labels
Pix2Pix using GAN - Model for transforming a image
CycleGAN - GAN for transforming a image
ProGAN - GAN for generating new images from latent vector, using progressive growing models and GP loss

Project Folder Structure

- gans (scripts for each GAN except training scripts)
- media (folder with media files of repo)
- training scripts

Setup

pip install -r requirements.txt

Dependencies

- Python3.10
- PyTorch 1.11.0

Utility

preprocess_dataset.py - Script for mass rescaling images to target size and optionaly splitting them to training and testing parts
clean_small_images.py - Clean low resolution images from dataset

Results

DCGAN

Mnist dataset (64x64 grayscale) - 100k iters, batch size 128
Celeb dataset (64x64 color, 200000 images) - batch size 128
Unstable training and colapsed after few more epochs
No need for more training, because its by design prone to fails
1. Generated - 3M iters
2. Colapsed network (3.3M iters)

WGAN

More stable training in comparison to DCGAN but slower to train and capacity of model is smaller because of hard clamping weights

Celeb dataset (64x64 color, 200000 images) - batch size 64
1. Generated - 3M iters
2. Generated - 6M iters
Celeb dataset (64x64 color, 200000 images) - batch size 64
Model with replaced batch norm lazers with instance norm layers
Stability of model is improved
1. Generated - 600k iters
2. Generated - 6M iters

WGAN-GP

Celeb dataset (64x64 color, 200000 images) - batch size 64
Generated - 500k iters
SOCOFing dataset (64x64 gray, 6000 images) - batch size 32 \
1. Generated 100k iters
2. Generated 100k iters - pixel suffle

Conditional GAN - Based on WGAN-GP

Mnist dataset (64x64 grayscale) - batch size 64
1. Generated - 200k iters
2. Real

Pix2Pix using GAN

Maps segmentation (256x256 color, 2000 images) - batch size 16, 200 epochs
In order: Input, Real, Generated
Anime coloring (256x256 color, 16000 images) - batch size 16, 300k iters
In order: Input, Real, Generated
Fingerprint correction (256x256 color, 6000 images) - batch size 8, 400k iters
In order: Input, Real, Generated

ProGAN

Generated 16x16
Generated 32x32
Generated 64x64
Generated 128x128
Generated 256x256

CycleGAN

Photo to Monet (256x256 color, +-1500 images) - batch size 1, 1.5M iters \
1. Input
2. Converted

TODO

Testing setup

Hardware:
    Processor: I7-9700KF 4.8GHz
    RAM: HyperX Fury RGB 32GB (2x16GB) DDR4 3200MHz
    GPU: GIGABYTE GeForce RTX 2080 SUPER 8G
    SSD: Intel 660p M.2 2TB SSD NVMe

Editor: PyCharm (always latest version)

Resources

Name		Name	Last commit message	Last commit date
Latest commit History 590 Commits
gans		gans
media		media
.gitignore		.gitignore
LICENSE.md		LICENSE.md
README.md		README.md
clean_small_images.py		clean_small_images.py
go_through_dataset.py		go_through_dataset.py
preprocess_dataset.py		preprocess_dataset.py
requirements.txt		requirements.txt
train_condgan.py		train_condgan.py
train_cyclegan.py		train_cyclegan.py
train_dcgan.py		train_dcgan.py
train_pix2pix.py		train_pix2pix.py
train_progan.py		train_progan.py
train_srgan.py		train_srgan.py
train_wgan.py		train_wgan.py
train_wgan_gp.py		train_wgan_gp.py

License

Matesxs/GAN-Playground

Folders and files

Latest commit

History

Repository files navigation

GAN Testing Playground (WIP)

Project about testing techniques about training GANs and their stability

Table of contents

General info

Content

Project Folder Structure

Setup

Dependencies

Utility

Results

DCGAN

WGAN

WGAN-GP

Conditional GAN - Based on WGAN-GP

Pix2Pix using GAN

ProGAN

CycleGAN

TODO

Testing setup

Resources

Basic DCGAN

WGAN (Wasserstein GAN)

WGAN-GP

Pix2Pix using GAN

CycleGAN

ProGAN

SRGAN (Super Resolution GAN)

VQGAN (Vector Quantized GAN)

VQ-VAE (Vector QuantisedVariational AutoEncoder)

SR Resnet

ESDR (Enhanced Deep Residual Networks for Single Image Super-Resolution)

Perceptual Loss

GAN Stability and diagnostics

Inception score

Some resources might be missing, I started researching this topic long before this repository was created!

About

Topics

Resources

License

Stars

Watchers

Forks

Languages