# Image Generator Implemented by VAE with Transformer Encoder

## Problem Formulation

Build a generative model, `colored_pi_generator`, whose output is a 5-dimensional point drawn from the same distribution as the points in the dataset.

## Input

- `pi_xs.npy`: a NumPy array of length 5000 containing the x-coordinate of each point in the dataset
- `pi_ys.npy`: a NumPy array of length 5000 containing the y-coordinate of each point in the dataset
- An image

  *(input image)*

## Output

- An image that comes from the same distribution as the points in the dataset
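The two coordinate files can be stacked into a single point cloud before training. This is a minimal sketch using the file names and folder from the repo's instructions; the helper names (`stack_points`, `load_dataset`) are illustrative, not the repo's actual API.

```python
import numpy as np

def stack_points(xs, ys):
    """Combine x/y coordinate arrays into an (N, 2) point cloud."""
    return np.stack([np.asarray(xs), np.asarray(ys)], axis=1)

def load_dataset(root="./gen_ml_quiz_content"):
    """Load the provided coordinate files from the repo's default data folder."""
    xs = np.load(f"{root}/pi_xs.npy")  # shape (5000,)
    ys = np.load(f"{root}/pi_ys.npy")  # shape (5000,)
    return stack_points(xs, ys)        # shape (5000, 2)
```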

## Method: VAE (Variational Autoencoder) with Transformer Encoder Layer

### Model Architecture

- Model A: input dimension 1×1×25000 (branch `vae_with_transformer_encoder_layer_1distribution`)

  *(model framework)*

- Model B: input dimension 5000×5×1 (branch `main`)

  *(model framework)*

- A VAE describes an observation in latent space probabilistically. Accordingly, the encoder outputs a probability distribution for each latent attribute rather than a single value.
- Encoder
  - Linear Layer → ReLU Activation Function → Transformer Encoder Layer
- Generator
  - Linear Layer → ReLU Activation Function → Transformer Encoder Layer → Linear Layer → Sigmoid Activation Function
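The encoder/generator stacks above can be sketched in PyTorch as follows. This is an illustrative sketch, not the repo's actual implementation: the layer widths (`hidden_dim`, `latent_dim`, `input_dim`) are assumptions, and only `num_head=4` comes from the hyperparameter table.

```python
import torch
import torch.nn as nn

class TransformerVAE(nn.Module):
    """Sketch of the Linear -> ReLU -> TransformerEncoderLayer stacks above.
    Dimensions are illustrative, not the repo's."""

    def __init__(self, input_dim=2, hidden_dim=64, latent_dim=16, num_head=4):
        super().__init__()
        # Encoder: Linear -> ReLU -> Transformer Encoder Layer
        self.enc = nn.Sequential(
            nn.Linear(input_dim, hidden_dim),
            nn.ReLU(),
            nn.TransformerEncoderLayer(d_model=hidden_dim, nhead=num_head,
                                       batch_first=True),
        )
        # The encoder describes a distribution per latent attribute.
        self.to_mu = nn.Linear(hidden_dim, latent_dim)
        self.to_logvar = nn.Linear(hidden_dim, latent_dim)
        # Generator: Linear -> ReLU -> Transformer Encoder Layer -> Linear -> Sigmoid
        self.dec = nn.Sequential(
            nn.Linear(latent_dim, hidden_dim),
            nn.ReLU(),
            nn.TransformerEncoderLayer(d_model=hidden_dim, nhead=num_head,
                                       batch_first=True),
            nn.Linear(hidden_dim, input_dim),
            nn.Sigmoid(),
        )

    def forward(self, x):  # x: (batch, seq, input_dim)
        h = self.enc(x)
        mu, logvar = self.to_mu(h), self.to_logvar(h)
        # Reparameterization trick: z = mu + sigma * eps
        z = mu + torch.exp(0.5 * logvar) * torch.randn_like(mu)
        return self.dec(z), mu, logvar
```

The final Sigmoid keeps generated values in [0, 1], which suits normalized coordinates and pixel intensities.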

### Loss Function

- Reconstruction loss ($Loss_{recons}$): how well the VAE reconstructs the input from the latent space

- Kullback–Leibler divergence loss ($Loss_{kld}$): how close the latent distribution is to the standard normal distribution

- Beta ($\beta$): a weight, annealed over training, that alleviates the KL-vanishing issue

- Formula: $$Loss = (1-\beta) * Loss_{recons} + \beta * Loss_{kld}$$ where $\beta$ grows linearly from 0 to 1 as the number of epochs increases
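The annealed loss above can be sketched as follows. The schedule and the weighted sum follow the formula; the choice of MSE for the reconstruction term is an assumption (the repo may use a different reconstruction loss).

```python
import torch
import torch.nn.functional as F

def beta_schedule(epoch, total_epochs):
    """Linear annealing: beta grows from 0 to 1 as epochs increase."""
    return min(1.0, epoch / max(1, total_epochs - 1))

def vae_loss(recon, target, mu, logvar, beta):
    # Reconstruction term: how well the input is reconstructed (MSE assumed).
    recon_loss = F.mse_loss(recon, target)
    # KL divergence between N(mu, sigma^2) and the standard normal prior.
    kld = -0.5 * torch.mean(1 + logvar - mu.pow(2) - logvar.exp())
    # Annealed combination: (1 - beta) * recons + beta * kld.
    return (1 - beta) * recon_loss + beta * kld
```

Starting with $\beta \approx 0$ lets the model learn to reconstruct first, then gradually enforces the prior, which is the standard remedy for KL vanishing.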

## Results of Model A

**Training curves**

| Vanilla VAE | VAE with Transformer Encoder Layer |
| --- | --- |
| *(vae loss curve)* | *(transformer loss curve)* |

**Generative results**

| Epochs | 1 | 25 | 50 | 75 | 100 |
| --- | --- | --- | --- | --- | --- |
| Vanilla VAE | *(vae_1)* | *(vae_25)* | *(vae_50)* | *(vae_75)* | *(vae_100)* |
| VAE with Transformer Encoder Layer | *(transformer_1)* | *(transformer_25)* | *(transformer_50)* | *(transformer_75)* | *(transformer_100)* |

| Vanilla VAE | VAE with Transformer Encoder Layer |
| --- | --- |
| *(vanilla vae gif)* | *(transformer gif)* |

## Usage

- Build the environment:

  ```
  conda env create -f environment.yaml
  ```

- Put the data in `./gen_ml_quiz_content/`
- Run the script:

  ```
  python main.py
  ```

## Dataset

- Put all the data under the `./gen_ml_quiz_content` folder. For more details, refer to the Input section above.

## Hyperparameter Setting

- All experiments were run with the hyperparameters below. Feel free to set your own hyperparameters in `config.yaml`.

| epochs | batch_size | latent_dim | device | num_workers | result_path | gen_every_epochs | seed | retrain | num_head |
| --- | --- | --- | --- | --- | --- | --- | --- | --- | --- |
| 100 | 1 | 256 | 'cpu' | 4 | './result/' | True | 10 | True | 4 |
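Overriding these defaults via `config.yaml` could look like the sketch below. The key names mirror the table above; the loading/merging helpers are illustrative assumptions, not the repo's actual code.

```python
import yaml

# Default values mirroring the hyperparameter table above.
DEFAULTS = {
    "epochs": 100, "batch_size": 1, "latent_dim": 256, "device": "cpu",
    "num_workers": 4, "result_path": "./result/", "gen_every_epochs": True,
    "seed": 10, "retrain": True, "num_head": 4,
}

def merge_config(overrides, defaults=DEFAULTS):
    """Overlay user-supplied values on the defaults."""
    return {**defaults, **(overrides or {})}

def load_config(path="config.yaml"):
    """Read config.yaml (hypothetical helper) and merge it over the defaults."""
    with open(path) as f:
        return merge_config(yaml.safe_load(f))
```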

