DARLA

PyTorch implementation of the DARLA reinforcement learning pipeline, using PPO to learn a policy from the β-VAE's latent state.

1. Learn disentangled features of the environment, without supervision, from observations collected by a random agent.
2. Learn a policy for the source domain (in this case with PPO) using the learned state representation from step 1.
3. Test the policy from step 2 on the target domain.
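
The three steps above can be illustrated with a small, dependency-free sketch. It is not the code in `train.py` or `beta_vae`: the function names, the toy encoder and policy, and the default `beta=4.0` are all hypothetical, chosen only to show the shape of the idea. Step 1 is represented by the standard β-VAE objective (a reconstruction error plus a β-weighted KL term, with the usual closed form for a diagonal Gaussian posterior), and steps 2 and 3 by a policy that only ever sees the encoder's latent output.

```python
import math
from typing import Callable, List, Sequence


def kl_to_standard_normal(mu: Sequence[float], logvar: Sequence[float]) -> float:
    """Closed-form KL( N(mu, diag(exp(logvar))) || N(0, I) )."""
    return 0.5 * sum(m * m + math.exp(lv) - lv - 1.0 for m, lv in zip(mu, logvar))


def beta_vae_loss(recon_error: float, mu: Sequence[float],
                  logvar: Sequence[float], beta: float = 4.0) -> float:
    """Step 1 objective: reconstruction error plus beta-weighted KL term.

    beta=4.0 is an illustrative default, not a value taken from this repo.
    """
    return recon_error + beta * kl_to_standard_normal(mu, logvar)


def make_latent_policy(encode: Callable[[Sequence[float]], List[float]],
                       act: Callable[[List[float]], int]) -> Callable[[Sequence[float]], int]:
    """Steps 2 and 3: wrap a policy so it acts on encode(observation) only."""
    def policy(observation: Sequence[float]) -> int:
        return act(encode(observation))
    return policy


# Hypothetical stand-ins for a trained encoder and a trained PPO policy.
def toy_encode(observation: Sequence[float]) -> List[float]:
    # Keeps the first two features and drops the third (a domain-specific nuisance).
    return [observation[0], observation[1]]


def toy_act(latent: List[float]) -> int:
    # Picks action 1 when the latent sum is positive, else action 0.
    return int(sum(latent) > 0.0)


policy = make_latent_policy(toy_encode, toy_act)

source_obs = [0.5, 0.2, 0.9]    # source domain observation
target_obs = [0.5, 0.2, -3.0]   # same underlying state, different nuisance feature

print(policy(source_obs), policy(target_obs))
```

Because the toy encoder discards the feature that differs between the two domains, the wrapped policy picks the same action for both observations. That is the transfer behaviour step 3 is meant to test, here under idealised assumptions.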
