Structured Multi-Agent World Models

Winner of CS4246 Project Competition 2021. Read the paper: PDF

In multi-agent reinforcement learning, the difficulty of generalising to diverse interactions remains a challenge. Inspired by model-based reinforcement learning, we present Structured Multi-Agent World Models (SMAWM), a world model that encompasses other agents in a compositional structure, to provide a strong inductive bias for generalising to novel interactions among multiple agents in the environment. We show that reinforcement learning with the agent-factored state representation outperforms that with a purely connectionist world model despite using much fewer parameters. We further show that SMAWM learns an effective representation that is capable of much higher accuracy in forward prediction for planning, and propose future extensions that can likely scale SMAWM to environments of higher complexity.

Project Breakdown

Generate datasets - generate_dataset.py
Train world model - python train_wm.py
Train SMAWM - python train_swm.py
Train RL agents - python train_rl.py
Run experiments - python run_experiments.py
Run analysis - python run_analysis.py

Results

Performance of SMAWM exceeds World Models across all settings of interest.
Performance of SMAWM does not change significantly as parameter count increases.
SMAWM has higher prediction accuracy than World Models for very short time steps.

Future Extensions

Opponent modeling to explicitly model joint action or joint policies of agents.
Graph-VRNN to overcome environment stochasticity and partial observability.
SMAWM as a model-based reinforcement learning method with online planning.

Name		Name	Last commit message	Last commit date
Latest commit History 182 Commits
analysis		analysis
baselines		baselines
datasets		datasets
experiments		experiments
figures		figures
models		models
.gitignore		.gitignore
20_NewJunJie_WuYujin.pdf		20_NewJunJie_WuYujin.pdf
LICENSE		LICENSE
README.md		README.md
generate_dataset.py		generate_dataset.py
policies.py		policies.py
requirements.txt		requirements.txt
run_analysis.py		run_analysis.py
run_experiments.py		run_experiments.py
tools.py		tools.py
train_rl.py		train_rl.py
train_swm.py		train_swm.py
train_wm.py		train_wm.py

License

jetnew/smawm

Folders and files

Latest commit

History

Repository files navigation

Structured Multi-Agent World Models

Project Breakdown

Results

Future Extensions

About

Resources

License

Stars

Watchers

Forks

Languages