Self-Imitation-Learning with A2C

This is a PyTorch implementation of A2C + SIL (Self-Imitation Learning), which is basically the same as the OpenAI Baselines implementation. The paper can be found Here.
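For readers new to SIL, the following is a minimal, framework-free sketch of the self-imitation objective as described in the paper: past transitions are replayed, and only those whose stored return R exceeds the current value estimate V(s) contribute to the loss. The function names `discounted_returns` and `sil_losses` are hypothetical and are not taken from this repository; the actual code in sil_module.py uses PyTorch tensors and may differ in detail.

```python
def discounted_returns(rewards, gamma=0.99):
    """Compute the discounted return R_t for every step of one finished episode."""
    returns, running = [], 0.0
    for r in reversed(rewards):
        running = r + gamma * running
        returns.append(running)
    returns.reverse()
    return returns


def sil_losses(log_probs, values, returns):
    """Average SIL policy and value losses over a batch of stored transitions.

    log_probs[i] is log pi(a_i | s_i), values[i] is V(s_i) and returns[i]
    is the stored discounted return R_i. All names here are illustrative.
    """
    n = len(returns)
    # Clipped advantage max(R - V, 0): transitions that did no better than
    # the current value estimate are ignored.
    clipped = [max(R - v, 0.0) for R, v in zip(returns, values)]
    # In an autograd framework the clipped advantage would be treated as a
    # constant in the policy term, so gradients flow only through log_probs.
    policy_loss = sum(-lp * adv for lp, adv in zip(log_probs, clipped)) / n
    value_loss = sum(0.5 * adv ** 2 for adv in clipped) / n
    return policy_loss, value_loss


episode_returns = discounted_returns([0.0, 0.0, 1.0], gamma=0.5)
policy_loss, value_loss = sil_losses(
    log_probs=[-1.0, -2.0], values=[0.0, 2.0], returns=[1.0, 1.0]
)
print(episode_returns, policy_loss, value_loss)
```

In the second call only the first transition has a return above its value estimate, so it alone contributes to both losses.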

TODO List

Add PPO with SIL
Add more results

Requirements

python-3.5.2
openai-baselines
pytorch-0.4.0

Installation

Install OpenAI Baselines. The code currently requires an older version of openai-baselines (the commit checked out below); this will be fixed in the future.

# clone the openai baselines
git clone https://github.com/openai/baselines.git
cd baselines
git checkout 366f486
pip install -e .
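
After installation, a quick check along the following lines can confirm that the required packages are importable before starting a long training run. This is only a sketch: the helper name `missing_modules` is hypothetical and not part of this repository, and it assumes the packages are imported as `baselines` and `torch`.

```python
import importlib.util
import sys


def missing_modules(names):
    """Return the top-level module names that cannot be found on this system."""
    return [name for name in names if importlib.util.find_spec(name) is None]


print("python", sys.version.split()[0])
missing = missing_modules(["baselines", "torch"])
if missing:
    print("missing packages:", ", ".join(missing))
else:
    print("all required packages found")
```

The check only tests that the modules can be located; it does not verify that the installed versions match the ones listed under Requirements.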

How to use the code

Train the network (add --cuda only if you have a GPU):

python train.py --env-name 'PongNoFrameskip-v4' --cuda

Test the network:

python demo.py --env-name 'PongNoFrameskip-v4'

You can also run the plain A2C algorithm without SIL by adding the --no-sil flag:

python train.py --env-name 'PongNoFrameskip-v4' --cuda --no-sil
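
The three flags used in the commands above could be declared with argparse roughly as follows. This is an illustrative sketch, not the contents of arguments.py: the function name `build_parser`, the default environment and the help strings are assumptions, and the real script likely defines many more options.

```python
import argparse


def build_parser():
    """Build a parser for the flags shown in this README (illustrative only)."""
    parser = argparse.ArgumentParser(description="A2C + SIL options (sketch)")
    # The default value is an assumption made for this example.
    parser.add_argument("--env-name", type=str, default="PongNoFrameskip-v4",
                        help="Gym environment id")
    parser.add_argument("--cuda", action="store_true",
                        help="run on the GPU")
    parser.add_argument("--no-sil", action="store_true",
                        help="disable self-imitation learning (plain A2C)")
    return parser


# Parse an explicit argument list so the example runs without a command line.
args = build_parser().parse_args(["--env-name", "PongNoFrameskip-v4", "--no-sil"])
use_sil = not args.no_sil
print(args.env_name, args.cuda, use_sil)
```

Declaring --no-sil as a store_true flag keeps SIL enabled by default, which matches the way the training commands above are written.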

Training Performance

Due to limited time, I only ran Pong for 2 million steps. The results for MontezumaRevenge will be uploaded later!
There are also results for Freeway, which are consistent with the original paper.

Demo: FreewayNoFrameskip-v4

Acknowledgement

@junhyukoh for original code

TianhongDai/self-imitation-learning-pytorch
