MasterAI-2.0-1vs1-NoLimit

Introduction

MasterAI v2.0 is an iterative algorithm derived from MasterAI v1.0. It combines deep reinforcement learning with search in imperfect-information games and achieves superhuman performance in heads-up no-limit Texas Hold’em. Furthermore, it is a major step toward developing technologies for multi-agent interaction in the real world.

Technology

1. The MasterAI v2.0 algorithm generalizes the paradigm of self-play reinforcement learning, deep learning, and search to very large imperfect-information games. It makes decisions by factoring in the probability distribution over the beliefs each player might hold about the current state of the game, and it uses the counterfactual regret minimization (CFR) algorithm to search efficiently.

2. Our experiments confirmed that MasterAI does converge to an approximate Nash equilibrium in two-player zero-sum games.
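To make the idea of regret minimization converging to an approximate Nash equilibrium concrete, here is a minimal sketch of regret matching (the update rule at the core of CFR) run in self-play on a tiny zero-sum matrix game. It is not the MasterAI implementation: the function names, the 2x2 payoff matrix, and the iteration count are all assumptions chosen for illustration, and a real poker solver applies the same update at every information set of a far larger game tree.

```python
def regret_matching(regrets):
    """Turn cumulative regrets into a strategy: positive regrets, normalized."""
    positive = [max(r, 0.0) for r in regrets]
    total = sum(positive)
    if total > 0:
        return [p / total for p in positive]
    return [1.0 / len(regrets)] * len(regrets)  # no positive regret: play uniformly


def solve_matrix_game(payoff, iterations=20000):
    """Self-play regret matching on a zero-sum matrix game.

    payoff[i][j] is the row player's payoff; the column player gets its negative.
    Returns the average strategies of the row and column players.
    """
    n_rows, n_cols = len(payoff), len(payoff[0])
    row_regret, col_regret = [0.0] * n_rows, [0.0] * n_cols
    row_sum, col_sum = [0.0] * n_rows, [0.0] * n_cols
    for _ in range(iterations):
        row = regret_matching(row_regret)
        col = regret_matching(col_regret)
        # Expected value of each pure action against the opponent's current mix.
        row_values = [sum(payoff[i][j] * col[j] for j in range(n_cols))
                      for i in range(n_rows)]
        col_values = [-sum(payoff[i][j] * row[i] for i in range(n_rows))
                      for j in range(n_cols)]
        row_ev = sum(row[i] * row_values[i] for i in range(n_rows))
        col_ev = sum(col[j] * col_values[j] for j in range(n_cols))
        for i in range(n_rows):
            row_regret[i] += row_values[i] - row_ev
            row_sum[i] += row[i]
        for j in range(n_cols):
            col_regret[j] += col_values[j] - col_ev
            col_sum[j] += col[j]
    return ([s / iterations for s in row_sum],
            [s / iterations for s in col_sum])


def exploitability(payoff, row, col):
    """Sum of both players' best-response values; 0 at an exact Nash equilibrium."""
    n_rows, n_cols = len(payoff), len(payoff[0])
    best_row = max(sum(payoff[i][j] * col[j] for j in range(n_cols))
                   for i in range(n_rows))
    best_col = max(-sum(payoff[i][j] * row[i] for i in range(n_rows))
                   for j in range(n_cols))
    return best_row + best_col


# Hypothetical 2x2 zero-sum game, chosen only for illustration.
PAYOFF = [[2.0, -1.0], [-1.0, 1.0]]
row_avg, col_avg = solve_matrix_game(PAYOFF)
print(row_avg, col_avg, exploitability(PAYOFF, row_avg, col_avg))
```

For this particular matrix the exact equilibrium has both players choosing their first action with probability 2/5, so the averaged strategies should drift toward that mix while the exploitability shrinks toward zero. Note that it is the average strategy, not the last iterate, that carries the convergence guarantee.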

Technical bottlenecks

We encountered several technical bottlenecks when training the model within the CFR framework. For instance, the large state space leads to a heavy computational load:

1. Training requires a very large amount of computation (2,560,000 × 1,750 in the paper).

2. Inference and search at deployment time are too slow: 3 to 5 seconds.

3. The abstracted CFR betting tree (400BB) has too many nodes: more than 400 million.
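The betting-tree bottleneck can be illustrated with a toy node counter. The sketch below is an assumption-laden simplification and not the abstraction used in this repository: it models a single betting round between two players, treats fold, call, and a check behind as terminal nodes, works in integer chips, and offers only the listed pot-fraction raises plus all-in. The function name `count_nodes` and all of its parameters are hypothetical. Its counts are not comparable to the 400 million figure above; it only shows how quickly the tree grows as bet sizes are added.

```python
def count_nodes(pot, acting, other, to_call, fractions, first_to_act=True):
    """Count nodes in a simplified one-street, two-player betting tree.

    pot      -- chips already in the pot (including any unmatched bet)
    acting   -- remaining stack of the player to act
    other    -- remaining stack of the opponent
    to_call  -- chips the acting player must add to match the last bet
    fractions-- allowed raise sizes as fractions of the pot after calling
    """
    nodes = 1  # the decision node itself
    if to_call > 0:
        nodes += 2  # fold and call are both terminal in this toy model
    elif first_to_act:
        # Check: the opponent now acts and may check behind or bet.
        nodes += count_nodes(pot, other, acting, 0, fractions, first_to_act=False)
    else:
        nodes += 1  # checking behind ends the street

    # A raise is capped by what the raiser has left and what the opponent can call.
    max_raise = min(acting - to_call, other)
    if max_raise > 0:
        sizes = {max_raise}  # all-in is always available
        for f in fractions:
            amount = int(f * (pot + to_call))
            if 0 < amount < max_raise:
                sizes.add(amount)
        for amount in sizes:
            nodes += count_nodes(pot + to_call + amount, other,
                                 acting - to_call - amount, amount,
                                 fractions, first_to_act=False)
    return nodes


# Hypothetical settings: 4 chips in the pot, 100 chips behind for each player.
for fractions in ([1.0], [0.5, 1.0], [0.5, 1.0, 2.0]):
    print(fractions, count_nodes(4, 100, 100, 0, fractions))
```

Each added bet size enlarges the set of raises at every decision node, so the count compounds with raise depth. A full game tree also multiplies this by the number of streets and deeper stacks, which is why the choice of bet abstraction matters so much for tree size.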

Contact us

The Master team is constantly exploring innovation in AI algorithms and hopes that like-minded technical experts from around the world will communicate and exchange ideas here, or join us to make MasterAI bigger and stronger together. Please feel free to contact us at masterai918@gmail.com.
