ModeRL: Mode-constrained Model-based Reinforcement Learning via Gaussian Processes

This repo contains the code and source docs for our paper:

Mode-constrained Model-based Reinforcement Learning via Gaussian Processes
Proceedings of the 26th International Conference on Artificial Intelligence and Statistics (AISTATS) 2023
Aidan Scannell, Carl Henrik Ek, Arthur Richards

Model-based reinforcement learning (RL) algorithms do not typically consider environments with multiple dynamic modes, where it is beneficial to avoid inoperable or undesirable modes. We present a model-based RL algorithm that constrains training to a single dynamic mode with high probability. This is a difficult problem because the mode constraint is a hidden variable associated with the environment’s dynamics. As such, it is 1) unknown a priori and 2) we do not observe its output from the environment, so cannot learn it with supervised learning. We present a nonparametric dynamic model which learns the mode constraint alongside the dynamic modes. Importantly, it learns latent structure that our planning scheme leverages to 1) enforce the mode constraint with high probability, and 2) escape local optima induced by the mode constraint. We validate our method by showing that it can solve a simulated quadcopter navigation task whilst providing a level of constraint satisfaction both during and after training.

Install

Create a virtual environment:

cd /path/to/moderl
python -m venv moderl-venv
source moderl-venv/bin/activate

Install ModeRL in editable mode with dependencies needed for experiments:

pip install -e ".[experiments]"

Running and plotting

See experiments/ for detailed instructions on running all of the experiments in the paper. As an example, the ModeRL experiment with a schedule that tightens the constraint level during training can be run with:

cd ./experiments
python train.py +experiment=constraint_schedule

See the example notebook to see how to use ModeRL in practice.

Citation

@proceedings{scannell2023moderl,
    title={Mode-constrained Model-based Reinforcement Learning via Gaussian Processes},
    author={Scannell, Aidan and Ek, Carl Henrik and Richards, Arthur},
    booktitle = {International {{Conference}} on {{Artificial Intelligence}} and {{Statistics}}},
    year={2023}
}

Name		Name	Last commit message	Last commit date
Latest commit History 1,150 Commits
examples		examples
experiments		experiments
moderl		moderl
paper		paper
poster		poster
subtrees		subtrees
.flake8		.flake8
.gitattributes		.gitattributes
.gitignore		.gitignore
.isort.cfg		.isort.cfg
.pre-commit-config.yaml		.pre-commit-config.yaml
README.md		README.md
requirements.txt		requirements.txt
setup.py		setup.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

examples

examples

experiments

experiments

moderl

moderl

paper

paper

poster

poster

subtrees

subtrees

.flake8

.flake8

.gitattributes

.gitattributes

.gitignore

.gitignore

.isort.cfg

.isort.cfg

.pre-commit-config.yaml

.pre-commit-config.yaml

README.md

README.md

requirements.txt

requirements.txt

setup.py

setup.py

Repository files navigation

ModeRL: Mode-constrained Model-based Reinforcement Learning via Gaussian Processes

Install

Running and plotting

Citation

About

Releases

Packages

Languages

aidanscannell/ModeRL

Folders and files

Latest commit

History

Repository files navigation

ModeRL: Mode-constrained Model-based Reinforcement Learning via Gaussian Processes

Install

Running and plotting

Citation

About

Topics

Resources

Stars

Watchers

Forks

Languages