Unsupervised Representation Learning in Atari

Reproduced by: Shekar Ramaswamy, Lawrence Huang, Kendrick Tan, and Tyler Jiang

Original Code can be found here: https://github.com/mila-iqia/atari-representation-learning

Project Structure

.
├── data_representation
    └── get_data.py
        (collects episode data from Atari games)
├── data_viz (produce charts of experiment results)
├── encoders (holds encoder architectures and saved encoders trained with ST-DIM)
    └── rand_cnn.py
        (base CNN architecture used in Random-CNN and ST-DIM)
├── handlers
    ├── probe_handler.py
        (trains the probe (supervised and unsupervised flags specified by pipeline.py))
    └── stdim_handler.py
        (trains the encoder and bilinear layers using ST-DIM (InfoNCE ST-DIM, no ablations))
    └── cpc_handler.py

├── probe
    └── probe.py
        (regular probe and fully supervised probe (linear layer and linear layer + encoder respectively))
├── pipeline.py
    (runs the pipeline end-to-end (collects data, trains, validates, and tests the encoder))
└── test_stdim_handler.py
    (train and save the encoder using ST-DIM or run the probe with a trained encoder)

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

data_representation

data_representation

data_viz

data_viz

encoders

encoders

handlers

handlers

probe

probe

runs

runs

.gitattributes

.gitattributes

.gitignore

.gitignore

README.md

README.md

pipeline.py

pipeline.py

test_stdim_handler.py

test_stdim_handler.py

Repository files navigation

Unsupervised Representation Learning in Atari

Project Structure

About

Releases

Packages

Contributors 4

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 107 Commits
data_representation		data_representation
data_viz		data_viz
encoders		encoders
handlers		handlers
probe		probe
runs		runs
.gitattributes		.gitattributes
.gitignore		.gitignore
README.md		README.md
pipeline.py		pipeline.py
test_stdim_handler.py		test_stdim_handler.py

shekarramaswamy4/state-representation-replication

Folders and files

Latest commit

History

Repository files navigation

Unsupervised Representation Learning in Atari

Project Structure

About

Resources

Stars

Watchers

Forks

Languages