Hanabi ToM

Implementation of nested theory of mind belief estimation & implicit communication intrinsic rewards proposed in Theory of Mind for Deep Reinforcement Learning in Hanabi

Citation

@misc{fuchs2019theory,
      title={Theory of Mind for Deep Reinforcement Learning in Hanabi}, 
      author={Andrew Fuchs and Michael Walton and Theresa Chadwick and Doug Lange},
      year={2019},
      eprint={2101.09328},
      archivePrefix={arXiv},
      primaryClass={cs.AI}
}

Name		Name	Last commit message	Last commit date
Latest commit History 20 Commits
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
belief_model.py		belief_model.py
container_start.sh		container_start.sh
new_experiment.sh		new_experiment.sh
prob_based_agent.py		prob_based_agent.py
run_experiment.py		run_experiment.py
train.py		train.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

.gitignore

.gitignore

LICENSE

LICENSE

README.md

README.md

belief_model.py

belief_model.py

container_start.sh

container_start.sh

new_experiment.sh

new_experiment.sh

prob_based_agent.py

prob_based_agent.py

run_experiment.py

run_experiment.py

train.py

train.py

Repository files navigation

Hanabi ToM

Citation

About

Releases

Packages

Languages

License

mwalton/ToM-hanabi-neurips19

Folders and files

Latest commit

History

Repository files navigation

Hanabi ToM

Citation

About

Topics

Resources

License

Stars

Watchers

Forks

Languages