Skip to content

Training a vision-based agent with the Actor Critic model in an online environment, implementation in Tensorflow.

License

Notifications You must be signed in to change notification settings

andi611/AC-Actor-Critic-SlitherIO-Tensorflow

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

9 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Reinforcement Learning: Vision Based Agent trained with Actor Critic playing Slither.IO

  • Training a vision-based agent with the Actor Critic model in an online environment, implementation in Tensorflow.
  • Click on the image below to see the DEMO VIDEO: Demo Video

Requirements:

  • Tensorflow
  • Universe
  • Python

Introduction

  • In this report, we present the result of training a vision-based agent for Slither.io, an online massively multiplayer browser game that is partially supported by Universe (OpenAi), using Reinforcement Learning (RL) algorithms.
  • The framework we used is based on the Actor-Critic models, combining with convolutional neural networks (CNN).
  • During training, we apply several techniques to encourage exploration and keeping our agent at a high entropy state, successfully avoiding the dilemma of having a highly-peaked policy function (𝜋(𝑎|𝑠)) towards a few actions, a known problem with on-policy models.
  • The agent we trained requires only raw frames from the screen and game states from the AI side, without using opponents’ information. Therefore, the technique applied is general and suitable for training computer agents in other environments which uses raw frames directly.
  • Our agent is capable of playing against other human players online and survive in this massively multiplayer game, and is proficient at performing tricky moves upon the encounter of enemies, including intensive sharp turns, high speed twist, and circulations.

Environment

Model

Architecture

Screen Preprocess

Training Pipeline

Exploration Tuning Techniques

Usage

python3 ./src/train_AC.py
python3 ./src/play_AC.py

Detail

  • See the following files for more detail:
- description.pdf
- presentation.pdf

About

Training a vision-based agent with the Actor Critic model in an online environment, implementation in Tensorflow.

Topics

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages