Snake-AI

This project uses deep reinforcement learning (DRL) to play the Snake game automatically. The core DRL method is the discrete-action variant of PPO, which performs well in discrete action spaces just as PPO does in continuous ones. About half an hour of training is enough to produce a working snake agent.
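For readers unfamiliar with PPO on discrete actions, the clipped surrogate loss can be sketched in plain Python as follows. This is an illustrative sketch, not the code in ppo.py: the function names, the four-way action layout, and the clip value 0.2 are assumptions made for the example.

```python
import math


def softmax(logits):
    """Turn raw action scores into a probability distribution."""
    m = max(logits)  # subtract the max for numerical stability
    exps = [math.exp(x - m) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def ppo_clip_loss(new_logits, old_log_probs, actions, advantages, clip_eps=0.2):
    """Mean clipped surrogate loss over a batch (a value to minimize).

    new_logits    : per-sample action scores from the current policy
    old_log_probs : log-probability of the taken action under the old policy
    actions       : index of the action taken (assumed 0..3 for four directions)
    advantages    : advantage estimate for each sample
    """
    losses = []
    for logits, old_lp, a, adv in zip(new_logits, old_log_probs, actions, advantages):
        new_lp = math.log(softmax(logits)[a])
        ratio = math.exp(new_lp - old_lp)  # pi_new(a|s) / pi_old(a|s)
        clipped = min(max(ratio, 1.0 - clip_eps), 1.0 + clip_eps)
        # PPO takes the pessimistic (smaller) of the two surrogate terms.
        losses.append(-min(ratio * adv, clipped * adv))
    return sum(losses) / len(losses)
```

When the new policy equals the old one the ratio is 1 and the loss reduces to the negative mean advantage; once the ratio leaves the interval [1 - clip_eps, 1 + clip_eps] in the direction the advantage favors, the clipped term caps the objective so a single update cannot move the policy too far.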

Requirements

conda create -n ppo --yes --file conda.txt
conda activate ppo
pip install -r requirements.txt

Usage

Train

python train.py # after training, the training curve of the current round is shown automatically
python snake.py # evaluate latest saved model

Evaluate assigned model

python evaluate.py --weight ./model/act-weight_round3_472_82.5.pkl

Plot assigned reward log

python plotter.py --history ./logs/reward_round3_82.5.csv

Experiments

Round	1	2	3
Training curve
Evaluation
Reward_eat	+2.0	+2.0	+2.0
Reward_hit	-0.5	-1.0	-1.5
Reward_bit	-0.8	-1.5	-2.0
Avg record	≈19	≈23	≈28
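
The reward settings in the table above can be written as a small lookup, sketched below. The numbers come from the table; everything else is an assumption for illustration: the names REWARDS and step_reward are hypothetical, "hit" and "bit" are used only as the table's labels, and giving ordinary moves a reward of zero is a guess rather than something the project states.

```python
# Reward settings per round, copied from the table above.
REWARDS = {
    1: {"eat": 2.0, "hit": -0.5, "bit": -0.8},
    2: {"eat": 2.0, "hit": -1.0, "bit": -1.5},
    3: {"eat": 2.0, "hit": -1.5, "bit": -2.0},
}


def step_reward(event, round_id):
    """Return the reward for one step.

    event    : 'eat', 'hit', 'bit', or None for an ordinary move
    round_id : experiment round (1, 2, or 3)
    """
    if event is None:
        return 0.0  # assumption: ordinary moves are reward-neutral
    return REWARDS[round_id][event]
```

For example, step_reward("bit", 3) returns -2.0 while step_reward("bit", 1) returns -0.8, which is the only difference between the rounds: the food reward stays fixed and the two death penalties grow.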

Conclusions

Increasing the penalty for death leads to higher average records
The low death penalty strategy yields a low reward curve during training, yet it performs well in the demo
A particularly high reward for eating food can push the agent toward quick gains at the expense of long-term safety

Future work

Training time is too short to show the advantages of DRL over non-DRL methods (Snaqe)
The snake's zigzag movement looks ugly; try adding a penalty to the reward for excessive zigzagging
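
One possible way to approach the zigzag idea is to count direction changes over a short window of recent actions and subtract a small amount for each change beyond a threshold. The sketch below is only a starting point under stated assumptions: the class name ZigzagPenalty, the window of 8 steps, the threshold of 3 turns, and the penalty of 0.1 are all hypothetical and untested against the actual agent.

```python
from collections import deque


class ZigzagPenalty:
    """Penalize too many direction changes within a sliding window of actions."""

    def __init__(self, window=8, max_turns=3, penalty=0.1):
        self.recent = deque(maxlen=window)  # most recent actions only
        self.max_turns = max_turns          # turns tolerated inside the window
        self.penalty = penalty              # cost per turn above the threshold

    def update(self, action):
        """Record an action and return the reward adjustment (zero or negative)."""
        self.recent.append(action)
        moves = list(self.recent)
        # A "turn" is any step whose direction differs from the previous one.
        turns = sum(1 for prev, cur in zip(moves, moves[1:]) if prev != cur)
        excess = max(0, turns - self.max_turns)
        return -self.penalty * excess if excess else 0.0

    def reset(self):
        """Clear the history at the start of a new episode."""
        self.recent.clear()
```

The returned value would be added to the environment reward at each step. Keeping the threshold above zero matters, because the snake has to turn sometimes to reach food, and a penalty on every turn could discourage it from moving toward food at all.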

MuGeminorum/Snake-AI
