Experiments with Autoencoders

Alice, December 2019

This is the repository for the development of artificial neural networks with autoencoder-like architectures.

The deep learning framework of choice is Keras 2.2.4 with the TensorFlow 1.12.0 backend. The scripts are executed with Python 3.6.8. The networks are trained on multiple GPUs with CUDA 10.1.

Neural architectures

Standard autoencoders (AE and VAE)

The first set of neural models consists of simple autoencoders that take a single color image as input and are trained to reconstruct the same image as output, in a fully unsupervised way. We initially developed a standard autoencoder (AE), and then improved it into a variational autoencoder (VAE).
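
As a reference for the overall shape of these models, here is a minimal sketch of a convolutional VAE written for Keras 2.2.4 (functional API). The input size, filter counts and latent dimension are illustrative assumptions, not the values defined in src/arch.py.

import keras.backend as K
from keras.layers import Input, Conv2D, Conv2DTranspose, Flatten, Dense, Reshape, Lambda
from keras.models import Model

latent_dim = 128                                   # assumed size of the latent space

# encoder: color image -> mean and log-variance of the latent code
x_in = Input(shape=(64, 64, 3))
h = Conv2D(32, 3, strides=2, padding='same', activation='relu')(x_in)
h = Conv2D(64, 3, strides=2, padding='same', activation='relu')(h)
h = Flatten()(h)
z_mean = Dense(latent_dim)(h)
z_log_var = Dense(latent_dim)(h)

# reparameterization trick: z = mean + sigma * epsilon
def sample_z(args):
    mu, log_var = args
    eps = K.random_normal(shape=K.shape(mu))
    return mu + K.exp(0.5 * log_var) * eps

z = Lambda(sample_z)([z_mean, z_log_var])

# decoder: latent code -> reconstructed image
h = Dense(16 * 16 * 64, activation='relu')(z)
h = Reshape((16, 16, 64))(h)
h = Conv2DTranspose(32, 3, strides=2, padding='same', activation='relu')(h)
x_out = Conv2DTranspose(3, 3, strides=2, padding='same', activation='sigmoid')(h)

vae = Model(x_in, x_out)

# total loss = reconstruction term + KL divergence of the latent distribution
rec = K.mean(K.binary_crossentropy(K.flatten(x_in), K.flatten(x_out)))
kl  = -0.5 * K.mean(1 + z_log_var - K.square(z_mean) - K.exp(z_log_var))
vae.add_loss(rec + kl)
vae.compile(optimizer='adam')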

Topological autoencoders (MAE and MVAE)

A second set of neural models shares the encoder structure of the previous models, but explicitly tries to learn in the latent space a compressed representation of the locations of cars and lane markings in the scene. We first developed these architectures with a standard latent space (MAE), and then using variational Bayes (MVAE).

There are 3 different decoders reconstructing, respectively, the initial color image, a binary mask indicating the locations of other cars, and a binary mask for the locations of lane markings. The 2 "topological" branches of the network require supervised training, using segmented ground truth.
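
A minimal sketch of this multi-branch layout is given below: one shared encoder and three decoders, each with its own loss. Layer sizes, names and losses are assumptions for illustration, not the definitions in src/arch.py.

from keras.layers import Input, Conv2D, Conv2DTranspose, Flatten, Dense, Reshape
from keras.models import Model

def build_decoder(z, channels, name):
    # small deconvolutional decoder branch from the shared latent code
    h = Dense(16 * 16 * 64, activation='relu')(z)
    h = Reshape((16, 16, 64))(h)
    h = Conv2DTranspose(32, 3, strides=2, padding='same', activation='relu')(h)
    return Conv2DTranspose(channels, 3, strides=2, padding='same',
                           activation='sigmoid', name=name)(h)

x_in = Input(shape=(64, 64, 3))
h = Conv2D(32, 3, strides=2, padding='same', activation='relu')(x_in)
h = Conv2D(64, 3, strides=2, padding='same', activation='relu')(h)
z = Dense(128, activation='relu')(Flatten()(h))      # shared latent space

img_out  = build_decoder(z, 3, 'image')              # unsupervised reconstruction
cars_out = build_decoder(z, 1, 'cars_mask')          # supervised, segmented ground truth
lane_out = build_decoder(z, 1, 'lane_mask')          # supervised, segmented ground truth

mae = Model(x_in, [img_out, cars_out, lane_out])
mae.compile(optimizer='adam',
            loss={'image': 'mse',
                  'cars_mask': 'binary_crossentropy',
                  'lane_mask': 'binary_crossentropy'})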

Temporal autoencoders (RAE, RVAE and RMVAE)

The third set of neural models expands the previous architecture with the inclusion of temporal information. A sequence of frames is fed to a topological autoencoder, which produces a corresponding sequence of latent representations. This sequence is passed to a recurrent network that predicts a new sequence of future latent spaces, which are then expanded to images through the same decoders of the topological model.

This architecture is not aimed at predicting far into the future; rather, it is trained to obtain a more refined latent representation that also takes into account the temporal coherence between frames.
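
The temporal arrangement can be sketched as follows: a TimeDistributed encoder maps each frame to a latent vector, an LSTM predicts the sequence of future latent vectors, and the decoder of the topological model expands them back to images. The encoder and decoder sub-models are assumed to be built as in the sketches above; sequence length and sizes are illustrative.

from keras.layers import Input, LSTM, TimeDistributed
from keras.models import Model

seq_len, latent_dim = 8, 128                         # assumed sequence length and latent size

def build_recurrent_ae(encoder, decoder):
    # encoder: (64, 64, 3) -> (latent_dim,)   decoder: (latent_dim,) -> (64, 64, 3)
    frames_in  = Input(shape=(seq_len, 64, 64, 3))
    z_seq      = TimeDistributed(encoder)(frames_in)              # (batch, seq, latent)
    z_future   = LSTM(latent_dim, return_sequences=True)(z_seq)   # predicted latent sequence
    frames_out = TimeDistributed(decoder)(z_future)               # decoded future frames
    return Model(frames_in, frames_out)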

Predictive models (TIME, RTIME, RMTIME)

The last set of neural models focuses on long-term prediction of future sequences, using as input and output only the latent representations computed by the temporal autoencoders.
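
A sketch of such a purely latent-space predictor is shown below: it never sees images, only sequences of latent codes (as produced, for instance, by exec_feat.py). The sequence-to-sequence layout, layer sizes and sequence lengths are assumptions, not the exact architectures in src/arch.py.

from keras.layers import Input, LSTM, RepeatVector, TimeDistributed, Dense
from keras.models import Model

past_len, future_len, latent_dim = 8, 8, 128

z_past = Input(shape=(past_len, latent_dim))
state  = LSTM(256)(z_past)                           # summarize the observed latent sequence
h      = RepeatVector(future_len)(state)             # seed the future time steps
h      = LSTM(256, return_sequences=True)(h)
z_future = TimeDistributed(Dense(latent_dim))(h)     # one latent code per future frame

time_model = Model(z_past, z_future)
time_model.compile(optimizer='adam', loss='mse')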

Architecture diagrams of the VAE, MVAE and RMVAE models are provided in doc/ (e.g. doc/nets_0.png).

Test results

Hallucination

These are the results of testing the RMTIME model using the hallucination technique: the predicted output of the recurrent network is fed back as input to the next recursive iteration. In this way the network is truly "imagining" future road scenarios. The two animations (in video/) show the ground truth sequence on the right and the imaginary sequence produced by the network on the left, executed for 40 iterations.
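
The recursion can be sketched as follows, reusing the time_model and window sizes assumed in the latent predictor above; this is an illustration of how the loop might be organized, not the exact test code in src/tester.py.

import numpy as np

def hallucinate(time_model, z_seed, n_iter=40):
    # z_seed: array of shape (past_len, latent_dim) holding real encodings
    window   = z_seed.copy()
    imagined = []
    for _ in range(n_iter):
        z_pred = time_model.predict(window[np.newaxis])[0]       # (future_len, latent_dim)
        imagined.append(z_pred)
        # slide the window forward: after the first steps it contains only predictions
        window = np.concatenate([window, z_pred])[-len(z_seed):]
    return np.concatenate(imagined)       # decode these with the topological decoder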

Contents

  • README.md this file.
  • cnfg/ configuration files.
  • dataset/ structure of folders containing all data used to train the models.
  • imgs/ some visual predictions resulting from different experiments.
  • log/ collection of significant saved trained neural models, and summary tables.
  • math/ various Mathematica scripts for recreating neural models and for visualization.
  • models/ state-of-the-art pretrained models (like VGG16).
  • res/ folder where executions are automatically saved.
  • src/ Python scripts:
    • arch.py defines the architectures of neural models,
    • cnfg.py handles command line arguments,
    • exec_dset.py creates the structures of symlinks for building a dataset,
    • exec_eval.py loads and evaluates a saved model,
    • exec_feat.py generates a dataset of latent space encodings,
    • exec_lata.py is a collection of functions to analyze the latent space,
    • exec_main.py is the main file to execute training,
    • gener.py handles the Generator structures for parsing a dataset,
    • h5lib.py is a collection of utilities for loading weights from an HDF5 file,
    • mesg.py contains utilities for printing error messages,
    • pred.py defines a class for non-neural time prediction,
    • sample_sel.py contains a dictionary of manually-selected samples of different types of events,
    • tester.py collects functions for testing a trained model,
    • trainer.py contains the training routine.
  • video/ animations showing the best results.

Usage

To run the program, execute the main script src/exec_main.py. The script supports the following command line arguments:

exec_main.py [-h] -c <file> -g <num> [-f <frac>] [-l <model>] [-T] [-t] [-r] [-s] [-a] [-e] [-i] [-p]
  • -a, --accur execute accuracy evaluation on selected samples (-a) or on the whole test set (-aa) (it may take a while!).
  • -c <file>, --config <file> pass a configuration file describing the model architecture and training parameters.
  • -e, --eval execute evaluation routines.
  • -f <frac>, --fgpu <frac> set the fraction of GPU memory to allocate [default: 0.90] (see the sketch after this list).
  • -g <num>, --gpu <num> set the number of GPUs to use (0 if CPU) or list of GPU indices.
  • -h, --help show the help message with description of the arguments.
  • -i, --intrp execute interpolation tests.
  • -l <model>, --load <model> pass a folder or an HDF5 file to load, either as weights or as an entire model.
  • -p, --pred compute model predictions over a selected set of images.
  • -r, --redir redirect stderr and stdout to log files.
  • -s, --save archive configuration file (-s) and python scripts (-ss) used.
  • -t, --test execute testing routines.
  • -T, --train execute training of the model.
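
Under TensorFlow 1.12, the -f/--fgpu option presumably maps to the standard session configuration shown below; this is a sketch of the usual TF 1.x idiom, not necessarily the exact code in src/exec_main.py or src/cnfg.py.

import tensorflow as tf
import keras.backend as K

def set_gpu_fraction(frac=0.90):
    # limit the fraction of GPU memory that the Keras session may allocate
    config = tf.ConfigProto()
    config.gpu_options.per_process_gpu_memory_fraction = frac
    K.set_session(tf.Session(config=config))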

As an example, run the following command from the top-level autoencoder/ folder. It will train a new model on the first GPU of the machine, then test the results, save all the files required to reproduce the experiment, and redirect all console messages to log files:

$ python src/exec_main.py -c config/cnfg_file -g 0, -Ttssr

As another example, the following command will load an already trained model and execute all the test routines on CPU:

$ python src/exec_main.py -l log/nicemodel/nn_best.h5 -c log/nicemodel/config/cnfg_file -g 0 -taaeip
