Reversi-WANN

Overview

Reinforcement learning for the game of Reversi using a Weight Agnostic Neural Network (WANN).

Based on the article at weightagnostic.github.io and on my other repository, Reversi-RL.
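For readers new to the weight-agnostic idea, the sketch below shows what it means to evaluate one fixed topology under several shared weight values. It is an illustration only and not the code in wann_src: the `forward` helper, the 4-node topology, and the tanh activation are all hypothetical. The only values taken from this README are the six weight values listed under Results.

```python
import numpy as np

def forward(adjacency, shared_weight, inputs):
    """Run a small feed-forward net in which every connection shares one weight.

    adjacency[i, j] == 1 means node i feeds node j; nodes are assumed to be
    listed in topological order (inputs first, output last).
    """
    weights = adjacency * shared_weight
    n_nodes = adjacency.shape[0]
    act = np.zeros(n_nodes)
    act[:len(inputs)] = inputs
    # Activate non-input nodes in order; tanh is an assumed activation.
    for j in range(len(inputs), n_nodes):
        act[j] = np.tanh(act @ weights[:, j])
    return act[-1]

# Hypothetical topology: 2 inputs -> 1 hidden node -> 1 output,
# plus a skip connection from the second input to the output.
adjacency = np.array([
    [0, 0, 1, 0],
    [0, 0, 1, 1],
    [0, 0, 0, 1],
    [0, 0, 0, 0],
])

# The same topology is scored once per shared weight value.
weight_values = [-2.0, -1.0, -0.5, 0.5, 1.0, 2.0]
outputs = [forward(adjacency, w, np.array([1.0, -1.0])) for w in weight_values]
```

A topology search then favours structures whose behaviour stays useful across the whole range of shared weights, rather than tuning individual connection weights.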

Running

Training

python wann_train.py -p p/reversi_5_4.json -n 8

Testing

python wann_test.py -p p/reversi_5_4.json -r 1000 -i champions/reversi_5_4.out -v True

Results

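Fitness may be interpreted as accuracy. Each fitness entry is reported for the shared weight value at the same position in the Weight Values row.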

Reversi 5x4

[***]   Fitness:          [0.48 0.45 0.46 0.8  0.77 0.75]
[***]   Weight Values:    [-2.  -1.  -0.5  0.5  1.   2. ]

Reversi 5x5

[***]   Fitness:         [0.58 0.56 0.55 0.4  0.43 0.43] 
[***]   Weight Values:   [-2.  -1.  -0.5  0.5  1.   2. ]

Reversi 8x8

[***]   Fitness:          [0.51 0.49 0.49 0.55 0.54 0.54] 
[***]   Weight Values:    [-2.  -1.  -0.5  0.5  1.   2. ]
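The numbers above can be summarized with a short script. This is a minimal sketch that assumes each fitness entry lines up by position with the weight value below it. The `summarize` helper is hypothetical and is not part of this repository.

```python
# Values copied from the Results section above.
weight_values = [-2.0, -1.0, -0.5, 0.5, 1.0, 2.0]
fitness = {
    "5x4": [0.48, 0.45, 0.46, 0.80, 0.77, 0.75],
    "5x5": [0.58, 0.56, 0.55, 0.40, 0.43, 0.43],
    "8x8": [0.51, 0.49, 0.49, 0.55, 0.54, 0.54],
}

def summarize(scores):
    """Return (best shared weight, its fitness, mean fitness over all weights)."""
    best_idx = max(range(len(scores)), key=scores.__getitem__)
    mean = sum(scores) / len(scores)
    return weight_values[best_idx], scores[best_idx], round(mean, 3)

summary = {board: summarize(scores) for board, scores in fitness.items()}
for board, (w, best, mean) in summary.items():
    print(f"Reversi {board}: best weight {w:+.1f}, fitness {best:.2f}, mean {mean:.3f}")
```

Under that alignment assumption, the 5x4 champion scores clearly higher with positive shared weights, the 5x5 champion with negative ones, and the 8x8 champion stays close to 0.5 for every weight value.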


klima7/Reversi-WANN
