Pytorch Reinforcement Learning

This repository contains the code for policy gradient algorithm incorporating with credit assignment mechanism.

Install Dependencies

Install Pytorch

pip install torch torchvision

install Tensorflow 2

pip install tensorflow=2.2

or

pip install tensorflow-gpu=2.2

Install OpenAI baseline (Tensorflow 2 version)

git clone https://github.com/openai/baselines.git -b tf2 && \
cd baselines && \
pip install -e .

Note: I haven't tested the code on Tensorflow 1 yet but it should work as well.

Install gym

pip install 'gym[atari]'

Install Park Platform. I modified the platform slightly to make it compatible with OpenAI's baseline.

git clone https://github.com/lehduong/park -b openai_baseline &&\
cd park && \
pip install -e .

Run experiments

python main.py --algo a2c --env-name PongNoFrameskip-v4

Acknowledgement

The started code is based on ikostrikov's repository

Name		Name	Last commit message	Last commit date
Latest commit History 27 Commits
assets		assets
core		core
scripts		scripts
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
conda_env.yml		conda_env.yml
enjoy.py		enjoy.py
evaluation.py		evaluation.py
main.py		main.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

assets

assets

core

core

scripts

scripts

.gitignore

.gitignore

LICENSE

LICENSE

README.md

README.md

conda_env.yml

conda_env.yml

enjoy.py

enjoy.py

evaluation.py

evaluation.py

main.py

main.py

Repository files navigation

Pytorch Reinforcement Learning

Install Dependencies

Run experiments

Acknowledgement

About

Releases

Packages

Languages

License

lehduong/Contrastive-Predictive-Coding-in-RL

Folders and files

Latest commit

History

Repository files navigation

Pytorch Reinforcement Learning

Install Dependencies

Run experiments

Acknowledgement

About

Topics

Resources

License

Stars

Watchers

Forks

Languages