Exploring-Autoencoders

Learning about autoencoders and latent spaces

Requirements:

  • PyTorch (with torchvision)
  • matplotlib

How to use:

The models are all in the Experiments folder. To run training and/or visualization for any one of them, run:

python3 main.py <experiment name>

You can include -v to save a visualization of that model to disk, and -r to resume from the previously saved model weights.

If you want to edit one of the experiments, just change the config.json of that model.
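
For example, assuming an experiment named vae exists in the Experiments folder (the name here is illustrative), a run that resumes from saved weights and also saves a visualization might look like:

python3 main.py vae -v -r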


Visualizations

Trivial solution for MNIST reconstruction:

I trained a neural network with no biases and no activation functions (i.e., a purely linear network) with the following architecture on the image reconstruction task:

784 (input) -> 784 (latent) -> 784 (output)

First, I trained the network until the photometric (pixel-wise reconstruction) loss on the MNIST validation set was near zero, training for 100 epochs with LR scheduling. Once I reached the loss I wanted, I computed (W2)(W1) and plotted the heatmap below.

(W2)(W1):

[Figure: heatmaps of (W2)(W1), unclipped (left) and clipped (right)]

In the unclipped plot, the off-diagonal values appear larger than they really are; I believe this is because there are very small negative values off the diagonal, which skews the color scale. I therefore clipped the values in the right plot so the identity matrix is clearly visible.

As the heatmap shows, the network appears to have learned an identity transformation, with W1 and W2 being (approximate) inverses of each other.
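
A minimal sketch of this setup, assuming flattened MNIST inputs and an MSE reconstruction loss (this mirrors the description above, not necessarily the repository's exact code):

import torch
import torch.nn as nn
import matplotlib.pyplot as plt

# Linear autoencoder with no biases and no activations: 784 -> 784 -> 784.
encoder = nn.Linear(784, 784, bias=False)
decoder = nn.Linear(784, 784, bias=False)
model = nn.Sequential(encoder, decoder)

# ... train `model` with nn.MSELoss() on flattened MNIST images ...

# After training, inspect the end-to-end linear map W2 @ W1.
with torch.no_grad():
    w2w1 = decoder.weight @ encoder.weight  # close to identity if the net learned id

plt.imshow(w2w1.numpy(), cmap="viridis")
plt.colorbar()
plt.title("W2 @ W1")
plt.savefig("w2w1.png")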

CAE, L1-constrained AE:

I've implemented and trained contractive autoencoders (CAE) and L1-constrained autoencoders. I won't write about them in detail.
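
For reference, the standard contractive penalty adds the squared Frobenius norm of the encoder's Jacobian to the reconstruction loss; for a single sigmoid encoder layer it has a closed form. The sketch below assumes such an encoder and is illustrative, not this repo's exact implementation:

import torch
import torch.nn.functional as F

def contractive_loss(x, x_hat, h, W_enc, lam=1e-4):
    # Reconstruction term.
    recon = F.mse_loss(x_hat, x)
    # For a sigmoid encoder h = sigmoid(x @ W_enc.T + b), the squared Frobenius
    # norm of dh/dx is sum_j (h_j * (1 - h_j))^2 * ||W_j||^2.
    dh = (h * (1 - h)) ** 2                # (batch, hidden)
    w_norms = (W_enc ** 2).sum(dim=1)      # (hidden,)
    penalty = (dh * w_norms).sum(dim=1).mean()
    return recon + lam * penalty

# An L1-constrained variant instead penalises the activations directly:
#   loss = F.mse_loss(x_hat, x) + lam * h.abs().mean()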

Denoising autoencoder

[Figure: denoising autoencoder reconstructions of corrupted MNIST digits]

I've implemented a simple denoising autoencoder on MNIST. The visualization above shows how the network has learned to reconstruct the corrupted parts of the image. For the noise, I drop out pixels in the image with a probability hyperparameter, essentially applying dropout to the input.
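
A minimal sketch of this corruption step (the drop probability and names are illustrative assumptions):

import torch

def corrupt(x, drop_prob=0.3):
    # Zero out each pixel independently with probability drop_prob (input dropout).
    mask = (torch.rand_like(x) > drop_prob).float()
    return x * mask

# The training target is the *clean* image:
#   x_hat = model(corrupt(x))
#   loss = F.mse_loss(x_hat, x)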

VAE

So far, I've trained a VAE with a Gaussian posterior on MNIST.

Here is a visualization. For each cell containing an image, I interpolate in latent space between two randomly sampled images from the dataset:

[Figure: VAE latent-space interpolation grid]
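
A minimal sketch of the two pieces involved here, assuming a diagonal-Gaussian posterior: the reparameterization trick used during training, the KL term, and the latent interpolation used for the grid above (names and step count are illustrative):

import torch

def reparameterize(mu, logvar):
    # Sample z ~ N(mu, sigma^2) in a differentiable way.
    std = torch.exp(0.5 * logvar)
    return mu + std * torch.randn_like(std)

def kl_divergence(mu, logvar):
    # KL(q(z|x) || N(0, I)) for a diagonal-Gaussian posterior, per batch element.
    return -0.5 * torch.sum(1 + logvar - mu.pow(2) - logvar.exp(), dim=1)

def interpolate(decoder, z_a, z_b, steps=8):
    # Linear interpolation in latent space between two encoded images.
    alphas = torch.linspace(0, 1, steps).unsqueeze(1)  # (steps, 1)
    zs = (1 - alphas) * z_a + alphas * z_b             # (steps, latent_dim)
    return decoder(zs)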

CNN VAE

I've also trained a VAE using a CNN encoder and decoder, where the decoder uses transposed convolutions. Here's a visualization:

[Figures: CNN VAE visualizations]
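
A rough sketch of what a transposed-convolution decoder for 28x28 MNIST images can look like (layer sizes and latent dimension are illustrative assumptions, not necessarily this repo's architecture):

import torch.nn as nn

class ConvDecoder(nn.Module):
    def __init__(self, latent_dim=16):
        super().__init__()
        self.fc = nn.Linear(latent_dim, 64 * 7 * 7)
        self.deconv = nn.Sequential(
            nn.ConvTranspose2d(64, 32, kernel_size=4, stride=2, padding=1),  # 7x7 -> 14x14
            nn.ReLU(),
            nn.ConvTranspose2d(32, 1, kernel_size=4, stride=2, padding=1),   # 14x14 -> 28x28
            nn.Sigmoid(),
        )

    def forward(self, z):
        h = self.fc(z).view(-1, 64, 7, 7)
        return self.deconv(h)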
