AlphaZero Implementation

This repository contains an implementation of AlphaZero, a Deep Reinforcement Learning (DRL) algorithm for playing complex strategy games. AlphaZero, developed by DeepMind, is known for its impressive performance in games like Go, Chess, and Shogi.

Introduction

AlphaZero combines deep neural networks with Monte Carlo Tree Search (MCTS) to achieve superhuman performance in various board games. This implementation aims to provide a clear and adaptable codebase for experimenting with AlphaZero in different games.

Features

Modular Design: The implementation is designed to be modular, allowing easy integration with different board games.
Efficient Training: Utilizes parallelism and GPU acceleration for efficient training of neural networks.
Self-Play: Implements self-play mechanism to generate training data.
Model Evaluation: Evaluates the trained models against baseline agents or human players.
Visualization: Provides tools for visualizing gameplay and training progress. (In-progress)

Supported Games

Requirements

Python 3.x
PyTorch
NumPy

Getting Started

Clone this repository:

git clone https://github.com/yourusername/AlphaZero-Implementation.git'

Create and activate a new environment:

conda create -n alphazero
conda activate alphazero

Install conda an pip dependencies:

conda env update -f conda_environment.yml
pip install -r requirements.txt

Run AlphaZero.ipynb notebook

python AlphaZero.ipynb

TicTacToe GUI (Player vs Agent)

Navigate to cd TicTacToe_UI
Navigate to cd backend
Run app.py file to start the backend server (Make sure environment is activated)

python app.py

In a separate terminal, navigate to cd frontend
Start frontend server by:

npm install
npm start

ConnectFour GUI (Player vs Agent)

Navigate to cd ConnectFour_UI
Navigate to cd backend
Run app.py file to start the backend server (Make sure environment is activated)

python app.py

In a separate terminal, navigate to cd frontend
Start frontend server by:

npm install
npm start

Name		Name	Last commit message	Last commit date
Latest commit History 55 Commits
ConnectFour_UI		ConnectFour_UI
TicTacToe_UI		TicTacToe_UI
AlphaZero.ipynb		AlphaZero.ipynb
README.md		README.md
conda_requirements.yml		conda_requirements.yml
environment.yml		environment.yml
model_2.pt		model_2.pt
model_4_TicTacToe.pt		model_4_TicTacToe.pt
model_7_ConnectFour.pt		model_7_ConnectFour.pt
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

ConnectFour_UI

ConnectFour_UI

TicTacToe_UI

TicTacToe_UI

AlphaZero.ipynb

AlphaZero.ipynb

README.md

README.md

conda_requirements.yml

conda_requirements.yml

environment.yml

environment.yml

model_2.pt

model_2.pt

model_4_TicTacToe.pt

model_4_TicTacToe.pt

model_7_ConnectFour.pt

model_7_ConnectFour.pt

requirements.txt

requirements.txt

Repository files navigation

AlphaZero Implementation

Introduction

Features

Supported Games

Requirements

Getting Started

TicTacToe GUI (Player vs Agent)

ConnectFour GUI (Player vs Agent)

About

Releases

Packages

Contributors 3

Languages

iamutk4/Reinforcement-Learning-AlphaZero

Folders and files

Latest commit

History

Repository files navigation

AlphaZero Implementation

Introduction

Features

Supported Games

Requirements

Getting Started

TicTacToe GUI (Player vs Agent)

ConnectFour GUI (Player vs Agent)

About

Topics

Resources

Stars

Watchers

Forks

Languages