Playing Atari Game with Reinforcement Learning

Table comparing the average and total scores per episode when training on the game LunarLander-v2 with different learning algorithms

Lunar Lander game

Game description: Moving from the top of the screen to the landing pad and coming to zero speed earns about 100 to 140 points. If the lander moves away from the landing pad, it loses that reward. The episode ends if the lander crashes or comes to rest, receiving an additional -100 or +100 points, respectively. Each leg in contact with the ground is worth +10. Firing the main engine costs 0.3 points per frame. The game counts as solved at 200 points. Landing outside the landing pad is possible. Fuel is infinite, so an agent can learn to fly and then land on its first attempt. Four discrete actions are available: do nothing, fire left orientation engine, fire main engine, fire right orientation engine.
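The reward components listed in the description can be tallied by hand to see how an episode reaches the 200-point "solved" threshold. The sketch below is a simplified illustration, not the environment's actual reward function: `Action`, `episode_return`, `SOLVED_THRESHOLD`, and the parameter names are hypothetical, and the action indices assume the environment numbers the actions in the order the description lists them.

```python
from enum import IntEnum


class Action(IntEnum):
    """The four discrete actions, indexed in the listed order (an assumption)."""
    NOOP = 0
    FIRE_LEFT = 1
    FIRE_MAIN = 2
    FIRE_RIGHT = 3


def episode_return(approach_reward, main_engine_frames, leg_contacts, crashed):
    """Sum the reward components named in the game description.

    approach_reward: reward for reaching the pad at zero speed (about 100 to 140).
    main_engine_frames: number of frames the main engine fired (-0.3 each).
    leg_contacts: number of legs touching the ground (+10 each).
    crashed: True for a crash (-100), False for coming to rest (+100).
    """
    terminal = -100.0 if crashed else 100.0
    return approach_reward + 10.0 * leg_contacts - 0.3 * main_engine_frames + terminal


SOLVED_THRESHOLD = 200.0  # "Solved is 200 points"

# Illustrative inputs only; these numbers are not measurements from the repository.
soft_landing = episode_return(120.0, main_engine_frames=100, leg_contacts=2, crashed=False)
crash = episode_return(120.0, main_engine_frames=100, leg_contacts=0, crashed=True)
print(soft_landing, soft_landing >= SOLVED_THRESHOLD)
print(crash, crash >= SOLVED_THRESHOLD)
```

The comparison shows why fuel economy matters even though fuel is infinite: every main-engine frame subtracts from the return, so a long hover can keep an otherwise clean landing below the threshold.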

Random

Random agent: average r = -2.5, duration = 80 steps/episode.
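A random baseline like this one can be measured by sampling actions uniformly and averaging over episodes. The following is a minimal sketch, not the repository's code: `run_random_agent` and `FixedLengthEnv` are hypothetical names, and the stand-in environment exists only so the example runs without Gym installed. The loop accepts both the older four-value and the newer five-value `step` return conventions used by Gym-style environments.

```python
import random


def run_random_agent(env, episodes=10, n_actions=4, seed=None):
    """Play `episodes` episodes with uniformly random actions.

    Returns (average return per episode, average steps per episode).
    """
    rng = random.Random(seed)
    returns, lengths = [], []
    for _ in range(episodes):
        env.reset()
        total, steps, done = 0.0, 0, False
        while not done:
            result = env.step(rng.randrange(n_actions))
            if len(result) == 5:
                # Newer API: (obs, reward, terminated, truncated, info)
                _, reward, terminated, truncated, _ = result
                done = terminated or truncated
            else:
                # Older API: (obs, reward, done, info)
                _, reward, done, _ = result
            total += reward
            steps += 1
        returns.append(total)
        lengths.append(steps)
    return sum(returns) / episodes, sum(lengths) / episodes


class FixedLengthEnv:
    """Hypothetical stand-in: each episode lasts `length` steps at -1 reward per step."""

    def __init__(self, length=5):
        self.length = length
        self.t = 0

    def reset(self):
        self.t = 0
        return 0

    def step(self, action):
        self.t += 1
        return 0, -1.0, self.t >= self.length, {}


avg_return, avg_steps = run_random_agent(FixedLengthEnv(length=5), episodes=3, seed=0)
print(avg_return, avg_steps)
```

With Gym and its Box2D extras installed, an environment created by `gym.make("LunarLander-v2")` could be passed in place of the stand-in.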

Learned

Deep Q agent:

Watch the gameplay video at https://www.youtube.com/watch?v=D5ymUS7umPQ&t=2s
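For readers unfamiliar with Deep Q-learning, the core of the method is regressing Q(s, a) toward a one-step bootstrapped target, y = r + gamma * max_a' Q(s', a'), with the bootstrap term dropped on terminal transitions. The sketch below computes that target in plain Python for a small batch. It illustrates the standard technique only and is not taken from this repository: `td_targets`, the batch values, and the discount `gamma` are all assumptions.

```python
def td_targets(rewards, next_q_values, dones, gamma=0.99):
    """One-step Q-learning targets for a batch of transitions.

    rewards: list of floats, one per transition.
    next_q_values: list of lists, Q(s', a') for every action a' in the next state.
    dones: list of bools, True where the transition ended the episode.
    """
    targets = []
    for reward, q_next, done in zip(rewards, next_q_values, dones):
        # No bootstrapping past the end of an episode.
        bootstrap = 0.0 if done else gamma * max(q_next)
        targets.append(reward + bootstrap)
    return targets


# Made-up batch with four action values per state, matching the four actions.
batch_rewards = [-0.3, 10.0, -100.0]
batch_next_q = [
    [1.0, 2.0, 0.5, 0.0],
    [0.0, 0.0, 4.0, 1.0],
    [3.0, 3.0, 3.0, 3.0],
]
batch_dones = [False, False, True]
targets = td_targets(batch_rewards, batch_next_q, batch_dones, gamma=0.5)
print(targets)
```

In a full agent the `next_q_values` would come from a neural network (often a separate, slowly updated target network), and the loss would be taken between these targets and the network's predictions for the actions actually played.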

Actor-Critic agent (with experience replay):
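Experience replay stores past transitions in a fixed-size buffer and trains on random minibatches drawn from it, which breaks the correlation between consecutive frames. The following is a minimal sketch of such a buffer using only the standard library. It is not the repository's implementation: `ReplayBuffer`, `Transition`, and the capacity used here are hypothetical.

```python
import random
from collections import deque, namedtuple

# One stored step of interaction with the environment.
Transition = namedtuple("Transition", ["state", "action", "reward", "next_state", "done"])


class ReplayBuffer:
    """Fixed-capacity FIFO store of transitions with uniform random sampling."""

    def __init__(self, capacity):
        # A deque with maxlen silently drops the oldest item when full.
        self.memory = deque(maxlen=capacity)

    def push(self, state, action, reward, next_state, done):
        self.memory.append(Transition(state, action, reward, next_state, done))

    def sample(self, batch_size):
        # Uniform sampling without replacement from the stored transitions.
        return random.sample(list(self.memory), batch_size)

    def __len__(self):
        return len(self.memory)


buffer = ReplayBuffer(capacity=3)
for step in range(5):
    # Integer states stand in for real observations in this toy example.
    buffer.push(state=step, action=0, reward=-0.3, next_state=step + 1, done=False)

batch = buffer.sample(2)
print(len(buffer), [t.state for t in buffer.memory])
```

After five pushes into a buffer of capacity three, only the three most recent transitions remain, so old experience is gradually replaced as the policy improves.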

Dependencies

OpenAI Gym
Python
PyTorch
NumPy
Matplotlib

celisun/Playing_atari_with_Reinforcement_Learning
