# DeepCFR.jl


Julia implementation of Deep Counterfactual Regret Minimization (Brown et al., 2019).

```julia
using CounterfactualRegret
using CounterfactualRegret.Games
import CounterfactualRegret as CFR
using StaticArrays
using DeepCFR

# Use Rock-Paper-Scissors, the default CounterfactualRegret.jl matrix game
RPS = MatrixGame()

#=
The information state type of a matrix game is `Int`,
so extend the `vectorized` method to convert it to a vector
that can be passed through a Flux.jl network.
=#
DeepCFR.vectorized(::MatrixGame, I) = SA[Float32(I)]

sol = DeepCFRSolver(
    RPS;
    buffer_size = 100*10^3,
    batch_size  = 128,
    traversals  = 10,
    on_gpu      = false
)

# Train the Deep CFR solver for 1000 iterations
train!(sol, 1_000, show_progress=true)

I0 = DeepCFR.vectorized(RPS, 0) # vectorized information state for player 1
I1 = DeepCFR.vectorized(RPS, 1) # vectorized information state for player 2

sol(I0) # return strategy for player 1
sol(I1) # return strategy for player 2
```
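In zero-sum RPS the unique Nash equilibrium is uniform play, so the returned strategies should approach `[1/3, 1/3, 1/3]`. One way to sanity-check a strategy is its exploitability against the RPS payoff matrix; the helper below is a self-contained illustration, not part of DeepCFR.jl:

```julia
# Row player's payoff matrix for Rock-Paper-Scissors
A = [ 0 -1  1;
      1  0 -1;
     -1  1  0]

# Best pure-response value against a mixed strategy σ:
# 0 at the uniform Nash equilibrium, positive anywhere else
exploitability(σ) = maximum(A * σ)

exploitability([1/3, 1/3, 1/3]) # → 0.0
exploitability([1.0, 0.0, 0.0]) # always rock → 1.0 (beaten by paper)
```

A trained solver's strategy `sol(I0)` can be passed to this helper directly; its exploitability should shrink toward zero as iterations increase.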

## Define custom Flux.jl networks

```julia
using Flux

in_size  = 1 # information state vector has length 1 (see `DeepCFR.vectorized` above)
out_size = 3 # 3 actions: rock, paper, scissors

#=
A strategy is a probability distribution, so the network output must sum to 1.
A simple solution is to softmax the output.
=#
strategy_network = Chain(Dense(in_size, 40), Dense(40, out_size), softmax)

# Regret/value outputs do not need to be normalized
value_network = Chain(Dense(in_size, 20), Dense(20, out_size))

# Deep CFR requires one value network per player (two here)
sol = DeepCFRSolver(
    RPS;
    strategy = strategy_network,
    values   = (value_network, deepcopy(value_network))
)
```
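The trailing `softmax` in the strategy network is what guarantees a valid probability distribution over actions. A minimal plain-Julia sketch of what it computes (the code above uses Flux's own `softmax`):

```julia
# Exponentiate and normalize, shifting by the maximum for numerical stability
my_softmax(x) = (e = exp.(x .- maximum(x)); e ./ sum(e))

y = my_softmax([2.0, -1.0, 0.5])
sum(y)                  # → 1.0 (up to floating-point rounding)
all(v -> v >= 0, y)     # → true: every entry is a valid probability
```

Because the regret/value networks are trained on (possibly negative) counterfactual regrets, they omit this normalization.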
