Model predictive control with deep learning system model and Bayesian last layer trust regions

Accompanying repository for our work "Model predictive control with deep learning system model and Bayesian last layer trust regions".

Abstract

Deep neural networks have proven to be an efficient and expressive black-box model to learn complex nonlinear relationships from large amounts of data. They are also increasingly popular in system identification for model predictive control. One of the main caveats of neural networks in this setting is their lack of uncertainty quantification. Especially for economic MPC, unreasonable predictions in regions without training data might lead to poor and potentially dangerous control behavior. Bayesian neural networks (BNN) try to alleviate this problem but add significant complexity to both training and inference, rendering their application for MPC infeasible. Bayesian last layer (BLL) is a simplified BNN, representing a compromise between tractability and expressiveness. Most importantly, training and point estimates are unchanged in comparison to regular NN. The BLL covariance computation is strongly related to Gaussian Processes (GP) but in contrast to them, BLL does not suffer from the same scaling issues. While BLL cannot be used for probabilistic guarantees in most cases, we propose to define a trust region based on the computed covariance. We demonstrate in an empirical investigation that our economic MPC formulation with BLL trust region constraint leads to well behaved closed-loop trajectories, where the equivalent formulation without trust region leads to poor closed-loop performance.

About this repository

Dear visitor,

thank you for checking our the supplementary materials to our work. If you have questions, remarks, technical issues etc. feel free to use the discussions page of this repository. We are looking forward to your feedback and the discussion.

The purpose of this repository is to provide all the materials needed to recreate our results and to allow to experiment yourself with the code. Most importantly, we are providing here the investigated system model (towing kite). All of our results are created using Python code (using Jupyter Notebooks) and the toolboxes:

numpy
matplotlib
scipy
CasADi

As well as our own toolbox do-mpc. We heavily rely on the newly developed do-mpc sampling tools, for example for data generation and for statistical validation of our findings. For a comprehensive guide to this feature, please watch this video.

Organization of this repository

BLL_Fundamentals
- BLL_Fundamentals: Used to generate the results shown in Figure 1 in our paper.
Kite_Investigation
- Kite Introduction: Statement of the investigated system and all required parameters. Showcasing an examplary MPC closed trajectory and how to obtain it.
- system
  - Python files for model, simulator, (MPC) controller used to generate closed-loop trajectories
  - main.py file (creating similar results as in the Kite Introduction Jupyter Notebook)
- sampling Sample system to create training data for GP/NN system model
  - create_sampling_plan.py: Define 100 cases (varying initial states, etc.)
  - sample_sytem.py: Create closed-loop trajectories for the defined cases
- surrogate Train GP and NN system model
  - train_test_val_data_prep: Load and analyze sampled sequences. Create training, testing and validation data for GP and NN.
  - nn_kite_modell: Jupyter notebook from which the NN system model was trained.
  - gp_kite_modell: Jupyter notebook from which the GP system model was trained.
  - nn_vs_gp_sys_id: Jupyter notebook to compare GP and NN system model in open-loop predictions (used for Figure 2 in our paper)
- mpc: MPC with surrogate NN/GP model(s)
  - Python files for (GP / NN)model, simulator, (MPC) controller
  - validation_sampling: Validate MPC controller based on surrogate model
    - create_sampling_plan.py: Define 20 cases
    - sample_sytem.py: Create closed-loop trajectories for the defined cases
    - validation_evalution: Jupyter notebook to compare the obtained results (used for Figure 3 in our paper)

Please notice that we do not include data to reproduce the results due to space restrictions of this repository. We do, however, provide all the tools to re-sample the exact same data that we used (random seeds are fixed).

Name		Name	Last commit message	Last commit date
Latest commit History 36 Commits
BLL_Fundamentals		BLL_Fundamentals
Kite_Investigation		Kite_Investigation
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

BLL_Fundamentals

BLL_Fundamentals

Kite_Investigation

Kite_Investigation

.gitignore

.gitignore

LICENSE

LICENSE

README.md

README.md

Repository files navigation

Model predictive control with deep learning system model and Bayesian last layer trust regions

Abstract

About this repository

Organization of this repository

About

Releases

Packages

Languages

License

4flixt/2022_BLL_MPC_Materials

Folders and files

Latest commit

History

Repository files navigation

Model predictive control with deep learning system model and Bayesian last layer trust regions

Abstract

About this repository

Organization of this repository

About

Resources

License

Stars

Watchers

Forks

Languages