Implementing Tube-Based Robust MPC Controller #24

Federico-PizarroBejarano · 2022-04-19T00:52:59Z

Implementing the Robust Tube Model Predictive Control based on:
D. Mayne, M. Seron and S. Raković "Robust model predictive control of constrained linear systems with bounded disturbance," in Automatica 41(2): 219–224. 2005. doi: https://doi.org/10.1016/ j.automatica.2004.08.019

… the MPC controllers, modified tracking example accordingly. Re-organized some code in MPC so that it flowed better. Created a new controller called tube_mpc by copy-pasting linear_mpc code.

… the disturbance bounds from running linear MPC.

…I) set.

…oal is actually [0, 1].

… mRPI, making x0 an optimization variable, and then using that with the optimal feedback controller.

…ptimization

… to compute the mRPI set. It seems to work well for cartpole. tube_mpc will throw an exception if you ask it to compute mRPI for anything other than cartpole and 1d quadrotor. Users now have the option to manually specify an mRPI.

…cart-pole to be feasible. fixed a bug with x_init.

…an run the experiments

…log violations on original and tightened constraints

Federico-PizarroBejarano · 2022-05-03T20:27:41Z

Non TubeMPC Changes:

Changed iterations to max_steps in the PID controller and its experiments to match the variable name in the MPC controllers
Put YAML files in pid experiment in their own folder to match structure of other experiments
Updated pytope to version 0.0.5 in the toml file
Did a large restructuring of the LQR code to update it to current controller standards. For example, it accepted all the arguments to create its own environment rather than getting an env_func as is standard for other controllers. Also, it did not keep track of the data at each iteration (obs, info, done, reward, and action) so this was added. I would recommend completely stripping down LQR to the basic functionality since it incorporates a bunch of extra code that does not seem to be the current standard controller architecture and isn't useful (like why does it run multiple evaluations? this is part of the experiment design, not the controller).
Switched use_prev_start to warmstart everywhere since this is the current standard used in most of the MPC implementations
Added two components to linearMPC:

terminal ingredients in the form of a terminal constraint and a terminal cost which allowed the linearMPC to actually function on the cartpole environment. These can be turned on or off using the terminal ingredients flag
A "backup" controller where, if the linearMPC is infeasible, it switches to solving the MPC problem without state or input constraints to ideally find a feasible solution. This can be turned on and off using the use_backup flag. This allowed us to test the linearMPC in challenging scenarios without it becoming infeasible and quitting early.

Extended MPC's reset to reset more if a different environment is passed to the run function. Also updated the run function to quit once "done" is sent by the environment as this seems to be the intended behaviour of "done".

Federico-PizarroBejarano · 2022-05-03T22:16:59Z

Execution Instructions

Set the number of tests to run using the current configuration, num_tests in mpc_experiment.py
Turn external disturbances on and off in the respective environment configuration files cartpole_stabilization.yaml and cartpole_tracking.yaml.
Determine whether to train the TubeMPC controller using the disturbed or undisturbed environment by selecting the correct train_env in mpc_experiment.py
Set the correct task (stabilization or tracking) and the correct controller (linear_mpc, tube_mpc, lqr) in the executable file mpc_experiment.sh.
Run the experiment by calling the executable in the command line ./mpc_experiment.sh.

Expected Output

The submitted results were for 14 scenarios, each with 100 trials and took approximately 8hrs to run. The results after each trial are printed to the terminal and the aggregated results are exported as a pickle. Here are the expected results for undisturbed scenarios with 1 trial:

LQR Stabilization:

NUM SUCCESSES: 1
NUM VIOLATIONS: 3
AVG ITERATIONS: 38.0
RMSE: 0.6202432706102726

Linear MPC Stabilization

NUM SUCCESSES: 1
NUM VIOLATIONS: 0
AVG ITERATIONS: 43.0
RMSE: 0.5957017873213217

Tube MPC Stabilization

NUM SUCCESSES: 1
NUM VIOLATIONS: 5
AVG ITERATIONS: 34.0
RMSE: 0.6639842760497662

LQR Tracking

NUM SUCCESSES: 1
NUM VIOLATIONS: 0
AVG ITERATIONS: 201.0
RMSE: 0.0435026456247442

Linear MPC Tracking

NUM SUCCESSES: 0
NUM VIOLATIONS: 0
AVG ITERATIONS: 201.0
RMSE: 0.1492713725180259

Tube MPC Tracking

NUM SUCCESSES: 0
NUM VIOLATIONS: 0
AVG ITERATIONS: 106.0
RMSE: inf

Federico-PizarroBejarano · 2022-06-06T20:52:53Z

I have an even leaner version of LQR coded up that removes all the extra logging and debugging that the other controllers don't have and leaves only the core LQR. I prefer this. Let me know if you want me to add this simpler LQR to this PR since I am already cutting down LQR in this PR

keenan-burnett and others added 15 commits April 15, 2022 18:15

modified pid to use the same argument for iterations --> max_steps as…

8cb22eb

… the MPC controllers, modified tracking example accordingly. Re-organized some code in MPC so that it flowed better. Created a new controller called tube_mpc by copy-pasting linear_mpc code.

modified MPC code to work with the tracking example.

78144ff

Added a new experiment folder for MPC work.

82ab2c4

tested tube mpc, does something different than linear MPC. I estimate…

69279d5

… the disturbance bounds from running linear MPC.

added in the stabilizing controller.

051c6dc

Reorganizing config files for easier testing and changes

d046bd0

Adding learning to tube MPC

aebe4e1

resolving merge conflict.

29793b8

working on incorporating the minimal robust positively invariant (mRP…

bd6625b

…I) set.

working more on the mRPI set.

1b4b1ba

fixed some issues with the drone not stabilizing correctly. default g…

d346a25

…oal is actually [0, 1].

finished my implementation of tube mpc with correct setup for finding…

ef4d5d4

… mRPI, making x0 an optimization variable, and then using that with the optimal feedback controller.

bug fix

7410c84

fixed bugs in tube mpc.

df23ccf

Fixing some issues with constraint tightening and simplifying setup_o…

79b6533

…ptimization

Federico-PizarroBejarano requested a review from JacopoPan April 19, 2022 00:52

Federico-PizarroBejarano assigned keenan-burnett, Federico-PizarroBejarano and vivek-uka Apr 19, 2022

JacopoPan added the new controller AER1517 course assignments, contributions, etc. label Apr 19, 2022

keenan-burnett and others added 10 commits April 19, 2022 14:07

resolving merge conflict.

fef7dfa

finished merging in federico's changes. had to change value of r for …

c73b3fb

…cart-pole to be feasible. fixed a bug with x_init.

Added violation tracking and backup controllers to MPC such that we c…

cb3473a

…an run the experiments

Extended tubeMPC to take in lower bounds for disturbance and RPI and …

5ecf872

…log violations on original and tightened constraints

Making sure origin is in disturbance set and mRPI set

38dbe60

Setting up experiments and updating LQR

d6061da

Setting up experiment

292ff0a

Adding terminal set and constraints on linearMPC and tubeMPC

e68e56e

Making variable names consistent for GP MPC

c9bfbb7

Federico-PizarroBejarano added 3 commits April 22, 2022 20:53

Extending linearMPC terminal ingredients for trajectory tracking task

44ceaa3

Adding trajectory tracking to experiment

3a3509c

Final experiment setup

c220c20

Federico-PizarroBejarano marked this pull request as ready for review April 26, 2022 02:39

JacopoPan requested review from adamhall and Justin-Yuan April 26, 2022 16:31

Federico-PizarroBejarano added 2 commits May 3, 2022 18:17

Cleaning up final code

49cefac

Cleaning up config files

cb99f5a

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Implementing Tube-Based Robust MPC Controller #24

Implementing Tube-Based Robust MPC Controller #24

Federico-PizarroBejarano commented Apr 19, 2022

Federico-PizarroBejarano commented May 3, 2022

Federico-PizarroBejarano commented May 3, 2022

Federico-PizarroBejarano commented Jun 6, 2022

Implementing Tube-Based Robust MPC Controller #24

Are you sure you want to change the base?

Implementing Tube-Based Robust MPC Controller #24

Conversation

Federico-PizarroBejarano commented Apr 19, 2022

Federico-PizarroBejarano commented May 3, 2022

Non TubeMPC Changes:

Federico-PizarroBejarano commented May 3, 2022

Execution Instructions

Expected Output

LQR Stabilization:

Linear MPC Stabilization

Tube MPC Stabilization

LQR Tracking

Linear MPC Tracking

Tube MPC Tracking

Federico-PizarroBejarano commented Jun 6, 2022