Skip to content

Latest commit

 

History

History
37 lines (28 loc) · 1.4 KB

README.md

File metadata and controls

37 lines (28 loc) · 1.4 KB

Learning to Play Imperfect-Information Games by Imitating an Oracle Planner

This repository contains source code for Pommerman experiments in the paper titled "Learning to Play Imperfect-Information Games by Imitating an Oracle Planner" by Rinu Boney, Alexander Ilin, Juho Kannala and Jarno Seppänen.

Dependencies:

Oracle Planner with Full Observability

The planning results reported in the paper can be reproduced by running:

python plan.py

Optional arguments:

Parameter Default Description
--planner fdts 'fdts' or 'mcts' or 'mcs'
--mab ts 'ts' or 'ucb'
--n_simulations 100 number of planning rollouts at each time-step
--horizon 20 depth of planning rollouts in FDTS and MCS
--n_threads 5 number of games to play in parallel
--n_episodes 100 total number of games to play

Training Follower Policy with Partial Observability

The follower results reported in the paper can be reproduced by running:

python follow.py

License

This project is licensed under the terms of the GPL-3.0 License. See LICENSE file for details.