You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Congratulation to DeepMind! This is a reengineering implementation (on behalf of many other git repo in /support/) of DeepMind's Oct19th publication: [Mastering the Game of Go without Human Knowledge]. The supervised learning approach is more practical for individuals. (This repository has single purpose of education only)
AlphaZero implementation for Othello, Connect-Four and Tic-Tac-Toe based on "Mastering the game of Go without human knowledge" and "Mastering Chess and Shogi by Self-Play with a General Reinforcement Learning Algorithm" by DeepMind.
Implementations of a Multilayer Perceptron, a Convolutional Neural Network, and Monte Carlo Tree Search applied to Tic-Tac-Toe and Connect Four. A learning project for me to construct the elements in Google's AlphaZero from the ground up.
An illustration program which visualizes the MCTS mechanism inside AlphaZero in order to provide a better understanding of how an AI makes decisions. 一个通过可视化AlphaZero中的蒙特卡洛树搜索来解释AI决策方式的程序。