
Learning Disentangled Representations with Latent Variation Predictability

This repository contains the code for the ECCV 2020 paper "Learning Disentangled Representations with Latent Variation Predictability".

Abstract

Latent traversal is a popular approach to visualizing disentangled latent representations. Given a variation in a single unit of the latent representation, it is expected that a single factor of variation of the data changes while all others stay fixed. However, this impressive experimental observation is rarely explicitly encoded in the objective function for learning disentangled representations. This paper defines the variation predictability of latent disentangled representations: given image pairs generated by latent codes varying in a single dimension, the varied dimension should be closely correlated with these image pairs if the representation is well disentangled. Within an adversarial generation process, we encourage variation predictability by maximizing the mutual information between latent variations and the corresponding image pairs. We further develop an evaluation metric that does not rely on ground-truth generative factors to measure the disentanglement of latent representations. The proposed variation predictability is a general constraint applicable to both the VAE and GAN frameworks for boosting the disentanglement of latent representations. Experiments show that the proposed variation predictability correlates well with existing ground-truth-required metrics, and the proposed algorithm is effective for disentanglement learning.
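The core of the method is easiest to see in code. Below is a minimal TensorFlow 1.x sketch of the VP regularizer described in the abstract; the generator G, predictor network Q, and perturbation size eps are illustrative assumptions, not this repo's actual names, and the real implementation may differ.

import tensorflow as tf

def vp_loss(G, Q, batch_size, z_dim, eps=1.0):
    # Sample a base latent code and pick one dimension to perturb.
    z = tf.random.normal([batch_size, z_dim])
    dim = tf.random.uniform([batch_size], 0, z_dim, dtype=tf.int32)
    z_varied = z + eps * tf.one_hot(dim, z_dim)  # vary a single latent dimension

    # Generate the image pair and ask Q which dimension was varied.
    pair = tf.concat([G(z), G(z_varied)], axis=-1)  # concatenate along channels
    logits = Q(pair)  # shape [batch_size, z_dim]

    # Maximizing mutual information between the varied dimension and the image
    # pair is lower-bounded (InfoGAN-style) by minimizing this cross-entropy.
    return tf.reduce_mean(
        tf.nn.sparse_softmax_cross_entropy_with_logits(labels=dim, logits=logits))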

Requirements

  • 64-bit Python 3.6 installation. We recommend Anaconda3 with numpy 1.14.3 or newer.
  • TensorFlow 1.14 or 1.15 with GPU support. The code does not support TensorFlow 2.0.

This project is based on StyleGAN2, which relies on custom TensorFlow ops that are compiled on the fly using NVCC. To test that your NVCC installation is working correctly, run:

nvcc test_nvcc.cu -o test_nvcc -run
| CPU says hello.
| GPU says hello.

For more detailed instructions on setting up the StyleGAN2 environment, see the StyleGAN2 repository.

CelebA Dataset

To prepare the tfrecord version of the CelebA dataset, first download the original aligned-and-cropped images from http://mmlab.ie.cuhk.edu.hk/projects/CelebA.html, then run the following command to create the tfrecord dataset:

python dataset_tool.py create_celeba /path/to/new_tfr_dir /path/to/downloaded_celeba_dir
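As an optional sanity check after conversion, you can count the records in the generated files. The snippet below assumes the *.tfrecords file pattern of the StyleGAN2 dataset_tool convention; the actual file names may differ.

import glob
import tensorflow as tf  # TensorFlow 1.x

for path in sorted(glob.glob('/path/to/new_tfr_dir/*.tfrecords')):
    count = sum(1 for _ in tf.python_io.tf_record_iterator(path))  # deprecated in TF2, fine in TF1
    print(path, count, 'records')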

Training

To train a model on the CelebA dataset, run this command:

sh train_run.sh

You can modify this script to train different model variants. Note that the --data-dir flag expects the parent directory of the actual dataset, while --dataset expects the name of the dataset directory itself; see the example below.
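For example, if the tfrecord dataset created above is at /path/to/datasets/celeba_tfr, the training invocation inside the script would look roughly as follows. The entry-point name run_training.py is an assumption carried over from the StyleGAN2 codebase this repo builds on; only the --data-dir and --dataset flags are taken from this README.

python run_training.py --data-dir=/path/to/datasets --dataset=celeba_tfr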

Evaluation

To evaluate a trained model with the variation predictability (VP) metric, first generate a dataset of image pairs:

sh run_pair_imgs.sh

Modify this script to match your result directory and the path of the trained network pkl.

Then use https://github.com/zhuxinqimac/VP-metric-pytorch to compute the VP score on the generated dataset. Run this evaluation procedure multiple times (e.g., 3) and average the scores to obtain the final score for your model.
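Conceptually, the VP score is the held-out accuracy of a lightweight classifier trained to predict which latent dimension was varied in each image pair. The sketch below illustrates this idea in PyTorch (matching the linked evaluation repo's framework); all names, shapes, and hyperparameters are illustrative assumptions, not that repository's actual API.

import torch
import torch.nn as nn

def vp_score(pairs, labels, z_dim, epochs=20, lr=1e-3, train_frac=0.8):
    """pairs:  float tensor [N, C, H, W], each pair concatenated along channels
       labels: long tensor [N], index of the varied latent dimension"""
    n_train = int(train_frac * len(pairs))
    clf = nn.Sequential(nn.Flatten(),
                        nn.Linear(pairs[0].numel(), 256), nn.ReLU(),
                        nn.Linear(256, z_dim))
    opt = torch.optim.Adam(clf.parameters(), lr=lr)
    loss_fn = nn.CrossEntropyLoss()
    for _ in range(epochs):  # full-batch training for brevity
        opt.zero_grad()
        loss_fn(clf(pairs[:n_train]), labels[:n_train]).backward()
        opt.step()
    with torch.no_grad():    # VP score = held-out accuracy
        pred = clf(pairs[n_train:]).argmax(dim=1)
        return (pred == labels[n_train:]).float().mean().item()

Averaging the score over a few independent runs, as suggested above, reduces the variance introduced by the classifier's random initialization.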

Citation

@inproceedings{VPdis_eccv20,
  author    = {Xinqi Zhu and Chang Xu and Dacheng Tao},
  title     = {Learning Disentangled Representations with Latent Variation Predictability},
  booktitle = {ECCV},
  year      = {2020}
}
