Tensorflow-Reinforce

A collection of TensorFlow implementations of reinforcement learning models, evaluated in OpenAI Gym environments. Contributions and feedback are welcome. Disclaimer: these implementations are intended for educational purposes only (i.e., to learn deep RL myself). There is no guarantee that the models will work on your particular RL problem without changes.

Environments

This codebase works with both Python 2.7 and 3.5. The models are implemented in TensorFlow 1.0.

Models

Model	Code	References
Cross-Entropy Method	run_cem_cartpole	Cross-entropy method
Tabular Q Learning	rl/tabular_q_learner	Sutton and Barto, Chapter 8
Deep Q Network	rl/neural_q_learner	Mnih et al.
Double Deep Q Network	rl/neural_q_learner	van Hasselt et al.
REINFORCE Policy Gradient	rl/pg_reinforce	Sutton et al.
Actor-critic Policy Gradient	rl/pg_actor_critic	Mnih et al.
Deep Deterministic Policy Gradient	rl/pg_ddpg	Lillicrap et al.
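
For readers new to the models listed above, here is a minimal sketch of the tabular Q-learning update in plain Python. It is not the code in rl/tabular_q_learner: the function names, the five-state chain environment, and the hyperparameters are all assumptions chosen for illustration, and no TensorFlow or Gym is needed to run it.

```python
import random
from collections import defaultdict


def q_learning_update(q, state, action, reward, next_state, n_actions,
                      alpha=0.1, gamma=0.9, done=False):
    """Apply one tabular Q-learning backup in place and return the new value."""
    # Terminal transitions bootstrap from zero instead of the next state's value.
    best_next = 0.0 if done else max(q[(next_state, a)] for a in range(n_actions))
    target = reward + gamma * best_next
    q[(state, action)] += alpha * (target - q[(state, action)])
    return q[(state, action)]


def epsilon_greedy(q, state, n_actions, epsilon, rng):
    """Pick a random action with probability epsilon, else a greedy one."""
    if rng.random() < epsilon:
        return rng.randrange(n_actions)
    values = [q[(state, a)] for a in range(n_actions)]
    best = max(values)
    # Break ties at random so the untrained agent does not always pick action 0.
    return rng.choice([a for a, v in enumerate(values) if v == best])


def train_chain(n_states=5, episodes=500, epsilon=0.2, seed=0):
    """Train on a hypothetical chain: action 1 moves right, action 0 moves left.

    Reaching the rightmost state gives reward 1 and ends the episode.
    """
    rng = random.Random(seed)
    q = defaultdict(float)  # maps (state, action) -> estimated return
    for _ in range(episodes):
        state = 0
        for _ in range(100):  # cap the episode length
            action = epsilon_greedy(q, state, 2, epsilon, rng)
            next_state = max(0, state - 1) if action == 0 else state + 1
            done = next_state == n_states - 1
            reward = 1.0 if done else 0.0
            q_learning_update(q, state, action, reward, next_state, 2, done=done)
            state = next_state
            if done:
                break
    return q
```

Calling train_chain() returns the learned table. Comparing q[(s, 0)] with q[(s, 1)] for each non-terminal state s shows which direction the greedy policy prefers.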

License

MIT

