Alpha-Omok

This is a project of the Reinforcement Learning KR group.

AlphaZero is a reinforcement learning algorithm that effectively combines MCTS (Monte Carlo Tree Search) with an actor-critic network. The Alpha-Omok team wanted to apply the AlphaZero algorithm to the famous board game Omok (Gomoku). Omok is a traditional game played on the same board as Go, so we thought it was a good fit for the AlphaZero algorithm. For now, the algorithm is implemented in PyTorch; a TensorFlow version will be released soon!
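As a concrete illustration of how AlphaZero combines the two parts, its tree search selects children with a PUCT rule: the value estimate Q (the critic side) plus an exploration bonus scaled by the network's prior P (the actor side). Below is a minimal sketch of that score; the `c_puct` constant and the toy child statistics are illustrative, not taken from this repository:

```python
import math

def puct_score(q, prior, parent_visits, child_visits, c_puct=1.0):
    # Exploitation term (mean value Q) plus exploration bonus U,
    # where U is scaled by the network's prior P for the move.
    u = c_puct * prior * math.sqrt(parent_visits) / (1 + child_visits)
    return q + u

# Toy example: the search descends into the child with the highest score.
children = [
    {"q": 0.2, "prior": 0.5, "visits": 10},
    {"q": 0.6, "prior": 0.1, "visits": 2},
]
best = max(children, key=lambda c: puct_score(c["q"], c["prior"], 30, c["visits"]))
```

Note how a rarely visited child with a high value estimate can still win the selection even against a child with a much larger prior.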

All the environments are implemented with pygame, so you need to install pygame to run the code in this repository!


Training Result

Play Demo (agent wins)

Project objective

There are four objectives to achieve in this project:

  1. MCTS on Tic-Tac-Toe
  2. MCTS on Omok
  3. AlphaZero on Omok
  4. Upload AlphaZero to the web

Documents


Description of the Folders

1_tictactoe_MCTS

TicTacToe Image

This folder implements MCTS for Tic-Tac-Toe. If you only want to study MCTS, check the files in this folder.

The files in the folder are described below (files in bold are the implementation code).

  • env: Tic-Tac-Toe environment code (made with pygame)
  • mcts_guide: MCTS does not play the game itself; it only recommends moves
  • mcts_vs: lets the user play against the MCTS algorithm
  • utils: utility functions used by the algorithm
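For reference, the core MCTS loop (selection, expansion, random-rollout simulation, backpropagation) can be sketched on Tic-Tac-Toe in pure Python. This is a minimal sketch of the general algorithm, not the code in this folder; the board is a flat list of 9 cells with players +1/-1 and 0 for empty:

```python
import math
import random

WIN_LINES = [(0, 1, 2), (3, 4, 5), (6, 7, 8), (0, 3, 6),
             (1, 4, 7), (2, 5, 8), (0, 4, 8), (2, 4, 6)]

def winner(board):
    for a, b, c in WIN_LINES:
        if board[a] != 0 and board[a] == board[b] == board[c]:
            return board[a]
    return None

def legal_moves(board):
    return [i for i, v in enumerate(board) if v == 0]

class Node:
    def __init__(self, board, player, parent=None, move=None):
        self.board, self.player = board, player      # `player` is to move
        self.parent, self.move = parent, move
        self.children, self.untried = [], legal_moves(board)
        self.visits, self.wins = 0, 0.0

    def best_child(self, c=1.4):                     # UCB1 selection
        return max(self.children, key=lambda ch: ch.wins / ch.visits
                   + c * math.sqrt(math.log(self.visits) / ch.visits))

def rollout(board, player):
    board = board[:]                                 # random playout to the end
    while True:
        w = winner(board)
        if w is not None:
            return w
        moves = legal_moves(board)
        if not moves:
            return 0                                 # draw
        board[random.choice(moves)] = player
        player = -player

def mcts(root_board, root_player, n_iter=400):
    root = Node(list(root_board), root_player)
    for _ in range(n_iter):
        node = root
        while not node.untried and node.children:    # 1. selection
            node = node.best_child()
        if node.untried and winner(node.board) is None:  # 2. expansion
            m = node.untried.pop()
            board = node.board[:]
            board[m] = node.player
            node.children.append(Node(board, -node.player, node, m))
            node = node.children[-1]
        result = rollout(node.board, node.player)    # 3. simulation
        while node is not None:                      # 4. backpropagation
            node.visits += 1
            if result == -node.player:               # win for the player who
                node.wins += 1                       #   moved into this node
            elif result == 0:
                node.wins += 0.5
            node = node.parent
    return max(root.children, key=lambda ch: ch.visits).move
```

Unlike AlphaZero, this plain variant has no neural network: leaf values come from random rollouts, and move priors are uniform.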

2_AlphaOmok

mini omok Image

This folder is for implementing the AlphaZero algorithm in the omok environment. There are two versions of omok (env_small: 9x9, env_regular: 15x15). The image above is a sample of a 9x9 omok game.
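As an illustration of the omok win condition these environments implement (five stones in a row), a check over a flat board array might look like the following. This is a hypothetical helper for explanation only, not the env_small/env_regular API:

```python
def has_five_in_a_row(board, size, player):
    # Scan every cell; from each stone of `player`, look for five in a row
    # horizontally, vertically, and along both diagonals.
    directions = [(1, 0), (0, 1), (1, 1), (1, -1)]
    for y in range(size):
        for x in range(size):
            if board[y * size + x] != player:
                continue
            for dx, dy in directions:
                if all(0 <= x + i * dx < size and 0 <= y + i * dy < size
                       and board[(y + i * dy) * size + (x + i * dx)] == player
                       for i in range(5)):
                    return True
    return False

# env_small-sized (9x9) example: player 1 places five stones in row 4.
small = [0] * (9 * 9)
for i in range(5):
    small[4 * 9 + 2 + i] = 1
```

The same function works for the 15x15 board by passing `size=15`.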

The files in the folder are described below (files in bold are the implementation code).

  • eval_main: code for evaluating the algorithm, both on a local PC and on the web
  • main: main training code of AlphaZero
  • model: network model (PyTorch)
  • agents: agent and MCTS algorithm
  • utils: utility functions used by the algorithm
  • WebAPI: implementation of the web API
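For intuition on what the training code learns from: AlphaZero pairs each visited state with the MCTS visit distribution π (the policy target) and the final game outcome z from the perspective of the player to move (the value target). A minimal sketch of assembling such targets from one finished self-play game; the tuple format here is assumed for illustration, not the repository's actual data format:

```python
def make_training_targets(history, game_result):
    # history: list of (state, pi, player) tuples from one self-play game,
    #   where pi is the MCTS visit distribution and player is +1 or -1.
    # game_result: +1 if player +1 won, -1 if player -1 won, 0 for a draw.
    # z is the outcome from the perspective of the player to move at `state`.
    return [(state, pi, game_result * player) for state, pi, player in history]

# Toy game of two positions in which player +1 eventually wins:
history = [("s0", [0.7, 0.3], 1), ("s1", [0.5, 0.5], -1)]
targets = make_training_targets(history, 1)   # z = +1 for s0, -1 for s1
```

The network is then trained to predict π with its policy head and z with its value head.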

Sample Image of Web Demo

simple board example


Future Work

  • Apply parallel computation to improve speed
  • Make a TensorFlow version of the code
  • Train the agent to solve the 15x15 Omok game
  • Apply the Renju rule
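As an example of what applying the Renju rule involves: Renju restricts black's moves, forbidding "overlines" (six or more stones in a row) as well as double-threes and double-fours. A sketch of just the overline check, as a hypothetical helper (not repository code), with black stones stored as 1 on a flat board:

```python
def is_overline(board, size, x, y):
    # Counts consecutive black stones (value 1) through (x, y) in each of
    # the four line directions; six or more is an overline, illegal for
    # black under Renju. (Double-threes and double-fours are not handled.)
    for dx, dy in [(1, 0), (0, 1), (1, 1), (1, -1)]:
        count = 1
        for sign in (1, -1):
            i = 1
            while (0 <= x + sign * i * dx < size
                   and 0 <= y + sign * i * dy < size
                   and board[(y + sign * i * dy) * size + (x + sign * i * dx)] == 1):
                count += 1
                i += 1
        if count >= 6:
            return True
    return False
```

In a full implementation, forbidden moves like these would be masked out of black's legal-move set before the search.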

Reference

  1. Mastering the Game of Go with Deep Neural Networks and Tree Search (Silver et al., Nature, 2016)
  2. Mastering the Game of Go without Human Knowledge (Silver et al., Nature, 2017)

AlphaOmok Team

mini omok team

Kyushik Min

Jungdae Kim

Taeyoung Kim

Woongwon Lee

About

Minimal version of DeepMind AlphaZero
