kylehkhsu/latent_quantization


Introduction

This codebase for disentangled representation learning accompanies the paper Disentanglement via Latent Quantization by Kyle Hsu, Will Dorrell, James C. R. Whittington, Jiajun Wu, and Chelsea Finn.

It is built on JAX, and uses Hydra for configuration, TensorFlow Datasets for data loading, and Weights & Biases for experiment tracking.

The code separates encoder architecture, decoder architecture, latent space design, and model objectives into modular components. These are combined using Hydra's partial object instantiation, configured through the *_partial options in configuration files. See the sketch below for an example.
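
To illustrate, here is a minimal sketch of how Hydra's partial instantiation works. The config key decoder_partial and the Decoder class are hypothetical stand-ins, not the repo's actual names; the point is that with _partial_: true, hydra.utils.instantiate returns a functools.partial whose remaining arguments are supplied later, when the modular components are combined.

import functools

import hydra.utils
from omegaconf import OmegaConf


class Decoder:
    # Hypothetical stand-in component; the repo's classes live in its own modules.
    def __init__(self, num_latents, hidden_size):
        self.num_latents = num_latents
        self.hidden_size = hidden_size


# A *_partial config entry (key and values illustrative only).
cfg = OmegaConf.create({
    "decoder_partial": {
        "_target_": "__main__.Decoder",
        "_partial_": True,   # instantiate() returns functools.partial, not an object
        "hidden_size": 256,
    }
})

decoder_partial = hydra.utils.instantiate(cfg.decoder_partial)
assert isinstance(decoder_partial, functools.partial)

# The remaining argument is filled in when the pieces are combined.
decoder = decoder_partial(num_latents=10)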

We also provide a standalone file for InfoMEC estimation for easy integration into other projects.

Installation

conda create -n latent_quantization python=3.10 -y && conda activate latent_quantization
git clone --recurse-submodules https://github.com/kylehkhsu/latent_quantization.git
pip install --upgrade "jax[cuda11_pip]" -f https://storage.googleapis.com/jax-releases/jax_cuda_releases.html
pip install -r requirements.txt
pip install -e .

Add environment variables to the conda activation script:

mkdir -p $CONDA_PREFIX/etc/conda/activate.d
echo 'CUDNN_PATH=$(dirname $(python -c "import nvidia.cudnn;print(nvidia.cudnn.__file__)"))' >> $CONDA_PREFIX/etc/conda/activate.d/env_vars.sh
echo 'export LD_LIBRARY_PATH=$CONDA_PREFIX/lib/:$CUDNN_PATH/lib:$LD_LIBRARY_PATH' >> $CONDA_PREFIX/etc/conda/activate.d/env_vars.sh

Datasets are downloaded via the TensorFlow Datasets API on first use.

To use Weights & Biases logging, create a free account at wandb.ai if you do not already have one. Then run wandb login and enter your account's API key.

Usage

Main entry points are in scripts. Each configurable script has a corresponding config file and launcher file.

train_ae.py trains autoencoder variants, including the quantized-latent autoencoder (QLAE), vanilla AE, VAE, TCVAE, BioAE, VQ-VAE, and others.

train_gan.py trains InfoWGAN-GP variants, including the quantized-latent InfoWGAN-GP and vanilla InfoWGAN-GP.

Both of these automatically log model and optimizer checkpoints. plot_mi.py and perturbations.py show how to retrieve a previous run's checkpoint for further analysis.

Example

To train an autoencoder variant, run python launchers/train_ae.py. This uses the configuration defaults in configs/train_ae.yaml. To override a default, append key=value to the command; for example, python launchers/train_ae.py model_partial=ae dataset=isaac3d trains a vanilla autoencoder on the Isaac3D dataset.

To run a sweep, add the --multirun flag. The sweep will run over all combinations of configurations specified in hydra.sweeper.params in the config file.

By default, --multirun invokes the SubmitIt launcher, which submits jobs to a Slurm cluster. Configure this here. To run locally instead, add hydra/launcher=submitit_local to the command.

InfoMEC estimation

A methodological contribution of our paper is a cohesively information-theoretic framework for disentanglement evaluation based on three complementary metrics: InfoM (modularity), InfoE (explicitness), and InfoC (compactness). See here for a standalone implementation that can be copied by itself into other projects.

Modularity and Compactness

This file contains code for InfoM and InfoC estimation.

Computing InfoM and InfoC involves estimating the normalized pairwise mutual information between individual latents and sources. We recommend the continuous-discrete estimator for continuous latents and the discrete-discrete estimator for discrete latents. We also log discrete-discrete estimation under various binning settings to demonstrate how sensitive this makes the evaluation of continuous latents. We recommend the sources normalization, i.e., normalizing by source entropy.

Next, the resulting matrix (the transpose of NMI in the paper) is heuristically pruned to remove inactive latents. Finally, the sparsity of each row (for InfoM) or column (for InfoC) of the matrix is computed via a ratio or gap. We recommend and report the ratio.
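
As a rough illustration, here is a minimal sketch of this pipeline under several assumptions: scikit-learn's continuous-discrete k-NN estimator (mutual_info_classif) stands in for the repo's MI estimation, the pruning threshold and the exact rescaling of the ratio are illustrative choices, and the function names are hypothetical. Consult the repo's metrics file for the estimator actually reported in the paper.

import numpy as np
from sklearn.feature_selection import mutual_info_classif


def nmi_matrix(latents, sources):
    # latents: (num_samples, num_latents) continuous codes
    # sources: (num_samples, num_sources) discrete ground-truth factors
    # Continuous-discrete MI per (latent, source) pair, normalized by the
    # entropy of each source (the sources normalization).
    columns = []
    for j in range(sources.shape[1]):
        _, counts = np.unique(sources[:, j], return_counts=True)
        p = counts / counts.sum()
        source_entropy = -(p * np.log(p)).sum()
        columns.append(mutual_info_classif(latents, sources[:, j]) / source_entropy)
    return np.stack(columns, axis=1)  # shape: (num_latents, num_sources)


def ratio_sparsity(v, eps=1e-12):
    # Fraction of mass in the largest entry, rescaled so that a uniform
    # vector scores 0 and a one-hot vector scores 1 (illustrative rescaling).
    k = v.shape[0]
    return (k * v.max() / (v.sum() + eps) - 1) / (k - 1)


def infom_infoc(nmi, active_threshold=0.1):
    # Prune latents that carry little information about any source
    # (the threshold here is an illustrative heuristic).
    active = nmi.max(axis=1) > active_threshold
    pruned = nmi[active]
    infom = np.mean([ratio_sparsity(row) for row in pruned])    # row-wise: modularity
    infoc = np.mean([ratio_sparsity(col) for col in pruned.T])  # column-wise: compactness
    return infom, infoc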

Explicitness

This file contains code for InfoE estimation.

InfoE involves computing the predictive linear information from the latents to each source, averaged over sources. We implement and log both classification (logistic regression) and regression (linear regression) formulations of this procedure. Since the datasets used in the paper all have discrete sources, we report only InfoE-classification.
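
Here is a minimal sketch of the classification formulation, under assumptions: held-out logistic-regression accuracy stands in for the repo's exact estimator (which may, for example, normalize against a chance baseline), and the function name and train/test split are illustrative.

import numpy as np
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import train_test_split


def infoe_classification(latents, sources, seed=0):
    # latents: (num_samples, num_latents) codes
    # sources: (num_samples, num_sources) discrete ground-truth factors
    scores = []
    for j in range(sources.shape[1]):
        x_train, x_test, y_train, y_test = train_test_split(
            latents, sources[:, j], test_size=0.25, random_state=seed
        )
        classifier = LogisticRegression(max_iter=1000).fit(x_train, y_train)
        scores.append(classifier.score(x_test, y_test))  # held-out accuracy per source
    return float(np.mean(scores))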

Citation

If you find this code useful for your work, please cite:

@inproceedings{hsu2023disentanglement,
  title={Disentanglement via Latent Quantization},
  author={Hsu, Kyle and Dorrell, Will and Whittington, James C. R. and Wu, Jiajun and Finn, Chelsea},
  booktitle={Neural Information Processing Systems},
  year={2023}
}
