Breakout

Libraries:

Stable Baselines:

https://stable-baselines3.readthedocs.io/en/master/index.html

Stable Baselines contrib:

https://sb3-contrib.readthedocs.io/en/master/index.html

Algorithms:

A2C, DQN, PPO, QRDQN, RecurrentPPO and TRPO, going by the model folders in this repository.

Description:

Another famous Atari game. The dynamics are similar to Pong: you move a paddle and hit the ball into a brick wall at the top of the screen. Your goal is to destroy the brick wall. You can try to break through the wall and let the ball wreak havoc on the other side, all on its own! You have five lives. Detailed documentation can be found on the AtariAge page.
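To get a feel for the dynamics described above, a random policy can be rolled out for one episode. The snippet below is a minimal sketch, not code from the notebooks: it assumes a Gymnasium-style API (`reset` returning `(obs, info)` and `step` returning five values), and the helper names `run_random_episode` and `make_breakout` are hypothetical. The environment id `Breakout-v0` is taken from the notebook file names; whether it is registered depends on the installed gym/ALE version.

```python
def run_random_episode(env, seed=None, max_steps=10_000):
    """Play one episode with uniformly random actions and return the total reward."""
    env.reset(seed=seed)
    total_reward = 0.0
    for _ in range(max_steps):
        action = env.action_space.sample()  # random paddle move
        _, reward, terminated, truncated, _ = env.step(action)
        total_reward += reward
        if terminated or truncated:  # all lives lost or time limit reached
            break
    return total_reward


def make_breakout(env_id="Breakout-v0"):
    """Create the environment (assumption: the id is registered in your install)."""
    import gymnasium as gym  # imported lazily so the helper above has no hard dependency

    return gym.make(env_id)
```

Calling `run_random_episode(make_breakout())` gives a rough baseline score to compare trained agents against.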

Training steps:

Initial exploration across algorithms - 200K steps
Final training for PPO and RecurrentPPO - 5M steps
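The two phases above could be scripted roughly as follows. This is a sketch under stated assumptions rather than the code in the training notebooks: the helper names (`build_schedule`, `run_name`, `load_algo_class`, `train`), the policy choices, `n_envs=4`, and the 4-frame stacking are illustrative and not confirmed by this repository. The algorithm names and the `<ALGO>_<steps>` naming follow the model folders listed in this repository.

```python
EXPLORATION_STEPS = 200_000
FINAL_STEPS = 5_000_000

# Names follow the model folders in this repository.
EXPLORATION_ALGOS = ["A2C", "DQN", "PPO", "QRDQN", "RecurrentPPO", "TRPO"]
FINAL_ALGOS = ["PPO", "RecurrentPPO"]


def build_schedule():
    """Return (algorithm, timesteps) pairs: exploration runs first, then final runs."""
    schedule = [(name, EXPLORATION_STEPS) for name in EXPLORATION_ALGOS]
    schedule += [(name, FINAL_STEPS) for name in FINAL_ALGOS]
    return schedule


def run_name(algo, timesteps):
    """Build a save name such as 'PPO_5000000'."""
    return f"{algo}_{timesteps}"


def load_algo_class(name):
    """Look the algorithm up in stable_baselines3 first, then in sb3_contrib."""
    import stable_baselines3
    import sb3_contrib

    for module in (stable_baselines3, sb3_contrib):
        if hasattr(module, name):
            return getattr(module, name)
    raise ValueError(f"Unknown algorithm: {name}")


def train(algo_name, timesteps, env_id="Breakout-v0"):
    """Train one model and save it (assumed settings, not the notebooks' exact ones)."""
    from stable_baselines3.common.env_util import make_atari_env
    from stable_baselines3.common.vec_env import VecFrameStack

    # Standard Atari preprocessing plus 4 stacked frames (an assumption).
    env = VecFrameStack(make_atari_env(env_id, n_envs=4, seed=0), n_stack=4)
    # RecurrentPPO needs an LSTM policy; the others take the plain CNN policy.
    policy = "CnnLstmPolicy" if algo_name == "RecurrentPPO" else "CnnPolicy"
    model = load_algo_class(algo_name)(policy, env, verbose=1)
    model.learn(total_timesteps=timesteps)
    model.save(run_name(algo_name, timesteps))
    return model
```

Looping `for algo, steps in build_schedule(): train(algo, steps)` would run all eight jobs in order; the 5M-step runs take far longer than the exploration runs, which is presumably why only the better-performing algorithms were carried forward.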

A2C_200000
A2C_5000000
DQN_200000
DQN_5000000
PPO_200000
PPO_5000000
QRDQN_200000
RecurrentPPO_200000
RecurrentPPO_5000000
TRPO_200000
.gitattributes
1a_Breakout-v0_train.ipynb
1b_Breakout-v0-Evaluate.ipynb
1c_Breakout-v0-Test.ipynb
2a_Breakout-v0_recurrent_train.ipynb
2b_Breakout-v0-recurrent_Evaluate.ipynb
2c_Breakout-v0-recurrent_Test.ipynb
3a_Breakout-v0_final_train.ipynb
3b_Breakout-v0-final_Evaluate.ipynb
3c_Breakout-v0-final_Test.ipynb
Evaluation_across_models.png
LICENSE.md
Readme.md
get_algos.py
modelled.gif
random.gif

SwamiKannan/Breakout-v0-using-Stable-Baselines

Results:

Randomly acting agent:

Modelled agent:
