ThinkingTicTacToe

An attempt to create a Tic Tac Toe player which learns by self play inspired by Alpha Zero.

Read more about Alpha Zero and the algorithm: https://arxiv.org/pdf/1712.01815.pdf https://deepmind.com/documents/119/agz_unformatted_nature.pdf

Much inspiration follows thanks to Jeff Bradberry https://jeffbradberry.com/posts/2015/09/intro-to-monte-carlo-tree-search/

The Project development consists of three stages:

Stage 1: Using UCB algorithm in monte carlo search tree to pick the strongest play. Please download all the files in UCB folder. Run ThinkingTicTacToe.py to play against the UCB Player in a console.

Stage 2: Using the best moves from UCB algorithm to train a neural network which predicts best move. Please download all the files in SL folder. Run alphaZeroTrain_SL.py to train the neural network. You will need keras and tensorflow for this. The number of games to be trained for are specified in variable called gamesTrainBatch. Once training is complete, you can run playWithAlphaZero.py to play against the trained network.

Stage 3: Using AlphaZero algorithm to do reinforcement learning Under progress

Name		Name	Last commit message	Last commit date
Latest commit History 123 Commits
SL		SL
UCB		UCB
res		res
README.md		README.md
agz_unformatted_nature.pdf		agz_unformatted_nature.pdf
alphaZeroMCTS.py		alphaZeroMCTS.py
alphaZeroTrain_RL.py		alphaZeroTrain_RL.py
convNeuralNetwork.py		convNeuralNetwork.py
dNNTF.py		dNNTF.py
deepNeuralNetwork.py		deepNeuralNetwork.py
my_model		my_model
playWithAlphaZero.py		playWithAlphaZero.py
testGui.py		testGui.py
tttBoard.py		tttBoard.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

SL

SL

UCB

UCB

res

res

README.md

README.md

agz_unformatted_nature.pdf

agz_unformatted_nature.pdf

alphaZeroMCTS.py

alphaZeroMCTS.py

alphaZeroTrain_RL.py

alphaZeroTrain_RL.py

convNeuralNetwork.py

convNeuralNetwork.py

dNNTF.py

dNNTF.py

deepNeuralNetwork.py

deepNeuralNetwork.py

my_model

my_model

playWithAlphaZero.py

playWithAlphaZero.py

testGui.py

testGui.py

tttBoard.py

tttBoard.py

Repository files navigation

ThinkingTicTacToe

About

Releases

Packages

Contributors 3

Languages

Neo-The1/ThinkingTicTacToe_archived

Folders and files

Latest commit

History

Repository files navigation

ThinkingTicTacToe

About

Resources

Stars

Watchers

Forks

Languages