05 - Robust MPC
Saverio Bolognani
$x_{k+1} = f(x_k, u_k, w_k)$

Model mismatch
$x_{k+1} = \tilde f(x_k, u_k)$

Missing dynamics / non-Markovianity
$x_{k+1} = f(x_k, z_k, u_k), \qquad z_{k+1} = f_z(x_k, z_k, u_k)$

Linearization
$x_{k+1} = \tilde A x_k + \tilde B u_k$
And more
▶ time discretization
▶ quantization
▶ time-varying parameters
▶ ...
The main tool against model mismatch/disturbances: feedback.
By determining∗ the optimal control policy at the current state x, we incorporate all
the past information in the decision.
Parametric optimization
$$u_0^*(x) \ \text{determined by} \ \min_{u,\,x} \ \sum_{k=0}^{K-1} g_k(x_k, u_k) + g_K(x_K)$$
* determining =
▶ evaluating a policy, in very special cases: LQR, Explicit MPC, ...
▶ solving a program in real time, in general: tracking MPC, Economic MPC, ...
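As a concrete illustration of the general case (solving a program in real time), here is a minimal receding-horizon sketch, not from the slides: the finite-horizon program above is parametrized in the measured state x and re-solved at every step. The double-integrator model, horizon, weights, and input bound are placeholder choices.

```python
import numpy as np
import cvxpy as cp

# Placeholder double-integrator model and weights (illustrative only)
A = np.array([[1.0, 0.1],
              [0.0, 1.0]])
B = np.array([[0.005],
              [0.1]])
Q, R, K = np.eye(2), 0.1 * np.eye(1), 20

x0 = cp.Parameter(2)              # the program is parametrized in the current state x
x = cp.Variable((2, K + 1))
u = cp.Variable((1, K))

cost = sum(cp.quad_form(x[:, k], Q) + cp.quad_form(u[:, k], R) for k in range(K))
cost += cp.quad_form(x[:, K], Q)  # terminal cost g_K
constraints = [x[:, 0] == x0]
constraints += [x[:, k + 1] == A @ x[:, k] + B @ u[:, k] for k in range(K)]
constraints += [cp.abs(u) <= 1.0]
problem = cp.Problem(cp.Minimize(cost), constraints)

# Receding-horizon loop: measure x, solve for u_0^*(x), apply it, repeat
state = np.array([1.0, 0.0])
for t in range(50):
    x0.value = state
    problem.solve()
    u0 = u[:, 0].value            # u_0^*(x): only the first input is applied
    state = A @ state + B @ u0    # plant simulated here with the nominal model
```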
We can do better than that if we have prior information on the disturbance.
[Figure: past trajectory up to the current time k, the nominal prediction, and an ensemble of predicted trajectories obtained with the same input sequence but different disturbance realizations.]
$x_{k+1} = f(x_k, u_k, w_k)$ with, for instance:
▶ finite disturbances: $w_k \in \{w^0, w^1, \ldots, w^p\}$
▶ disturbance set: $w_k \in \operatorname{co}\{w^0, w^1, \ldots, w^p\}$
▶ probability distribution: $w_k \sim \mathcal{W}$
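A small numerical sketch (mine, with a placeholder scalar model) of the ensemble picture above: the same input sequence is applied repeatedly while the disturbance $w_k$ is sampled from a bounded set, producing a bundle of trajectories around the nominal (w = 0) prediction.

```python
import numpy as np

rng = np.random.default_rng(0)
a, b, d = 0.9, 1.0, 1.0          # placeholder scalar model x_{k+1} = a x_k + b u_k + d w_k
K, n_samples, x_init = 20, 50, 1.0
u_seq = 0.1 * np.ones(K)         # one fixed input sequence for the whole ensemble

# Nominal prediction (w = 0)
nominal = np.empty(K + 1); nominal[0] = x_init
for k in range(K):
    nominal[k + 1] = a * nominal[k] + b * u_seq[k]

# Ensemble: same inputs, different disturbance realizations w_k drawn from [-0.2, 0.2]
ensemble = np.empty((n_samples, K + 1)); ensemble[:, 0] = x_init
for s in range(n_samples):
    for k in range(K):
        w = rng.uniform(-0.2, 0.2)
        ensemble[s, k + 1] = a * ensemble[s, k] + b * u_seq[k] + d * w

spread = ensemble.max(axis=0) - ensemble.min(axis=0)   # how the bundle widens over time
print(spread.round(3))
```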
What is performance under uncertainty?
Cost
▶ Nominal cost
▶ Worst case cost
▶ Expected cost
▶ ...and more (e.g. depending on risk tolerance)
Constraints
▶ Guaranteed satisfaction of constraints
▶ Constraint satisfaction with high probability
▶ Constraint satisfaction for a number of samples (Monte Carlo, scenario approach)
▶ Bound on expected violation (CVaR, conditional value at risk)
▶ ...and more
In this course
Example of infeasible robust trajectory
▶ cost $g(x, u) = x^2 + u^2$
▶ state constraint $|x_k| \le 1$, unconstrained input
▶ horizon $K = 5$
No fixed input sequence is robustly feasible for this example, but a simple feedback policy is:
$$u_k = -x_k$$
as it yields the closed-loop dynamics $x_{k+1} = w_k$, which is clearly within bounds.
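The slide does not state the dynamics explicitly; the following sketch assumes the scalar model $x_{k+1} = x_k + u_k + w_k$ with $|w_k| \le 1$, which is consistent with the claim that $u_k = -x_k$ yields $x_{k+1} = w_k$. It contrasts a fixed (open-loop) input sequence with the feedback law over sampled disturbance sequences.

```python
import numpy as np

# Assumed scalar dynamics x_{k+1} = x_k + u_k + w_k with |w_k| <= 1 (not stated on the slide)
rng = np.random.default_rng(1)
K, x_init = 5, 0.0

def simulate(policy, w_seq):
    x, traj = x_init, [x_init]
    for k in range(K):
        x = x + policy(k, x) + w_seq[k]
        traj.append(x)
    return np.array(traj)

open_loop = lambda k, x: 0.0      # a fixed input sequence (here u_k = 0)
feedback  = lambda k, x: -x       # u_k = -x_k  =>  x_{k+1} = w_k

viol_ol = viol_fb = 0
for _ in range(1000):
    w = rng.uniform(-1.0, 1.0, K)
    viol_ol += np.any(np.abs(simulate(open_loop, w)) > 1.0)
    viol_fb += np.any(np.abs(simulate(feedback, w)) > 1.0)
print(viol_ol, viol_fb)           # the feedback policy never violates |x_k| <= 1
```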
In solving the open-loop finite-time optimal control problem at the core of the MPC routine, we are looking for a feasible input sequence
$$u_0, u_1, \ldots, u_{K-1}.$$
A more powerful alternative is to look for a feasible control policy
$$u_k = \pi_k(x_k), \qquad k = 0, \ldots, K-1,$$
or, equivalently,
$$u_k = \theta_k(w_0, \ldots, w_{k-1}), \qquad k = 0, \ldots, K-1.$$
Unfortunately, computing the optimal robust closed-loop control policies is
extremely hard.
If we could do that, we would have solved our optimal control problems via
dynamic programming.
Not surprisingly, a linear state update with quadratic cost is one of the very few cases in which this problem is tractable.
Robust LQR
(also known as min-max LQR, $H_\infty$ LQR, two-player LQR)
Special (solvable) case
$$V(x) = \min_{u} \max_{w} \ \sum_{k=0}^{K-1} \Big( x_k^\top Q x_k + u_k^\top R u_k - \gamma^2 w_k^\top w_k \Big) + x_K^\top S x_K, \qquad Q, S \succeq 0, \ R \succ 0,$$
subject to the linear dynamics $x_{k+1} = A x_k + B u_k + D w_k$.
Dynamic programming solution
1 Initialize the value function with the terminal cost: $V_K(x) = x^\top S x$.
2 Assume, inductively, that $V_{k+1}(x) = x^\top P_{k+1} x$.
3 Solve the min-max step (Isaacs equation)
$$V_k(x) = \min_u \max_w \ \Big\{ x^\top Q x + u^\top R u - \gamma^2 w^\top w + V_{k+1}(A x + B u + D w) \Big\}$$
and obtain the optimal input $u_k^*(x)$ and the optimal disturbance $w_k^*(x)$.
4 Prove that $V_k$ is a quadratic form: $V_k(x) = x^\top P_k x$.
5 Iterate backwards until $u_0^*$.
Solution of the Isaacs equation
Maximization over w yields a linear function of x and u.
Proof:
The argument of the min-max in $V_k(x)$ can be rewritten, using the inductive assumption $V_{k+1}(x) = x^\top P_{k+1} x$, as
$$x^\top Q x + u^\top R u - \gamma^2 w^\top w + (Ax + Bu + Dw)^\top P_{k+1} (Ax + Bu + Dw).$$
This expression is concave in w as long as $\gamma^2 I - D^\top P_{k+1} D \succ 0$, so the maximum over w is attained where the gradient with respect to w vanishes, that is when
$$\hat w = (\gamma^2 I - D^\top P_{k+1} D)^{-1} D^\top P_{k+1} (Ax + Bu).$$
Solution of the Isaacs equation
Minimization over u (assuming the worst-case $\hat w_k$) yields a linear function of x.
Proof:
We can plug the expression for the worst-case disturbance, $\hat w_k(x, u) = \Lambda x + \Gamma u$ with $\Lambda = (\gamma^2 I - D^\top P_{k+1} D)^{-1} D^\top P_{k+1} A$ and $\Gamma = (\gamma^2 I - D^\top P_{k+1} D)^{-1} D^\top P_{k+1} B$, into the expression for $V_k(x)$ and obtain
$$V_k(x) = \min_u \ x^\top Q x + u^\top R u - \gamma^2 (\Lambda x + \Gamma u)^\top (\Lambda x + \Gamma u) + \big(Ax + Bu + D(\Lambda x + \Gamma u)\big)^\top P_{k+1} \big(Ax + Bu + D(\Lambda x + \Gamma u)\big).$$
Notice that this is a standard LQR problem now, for which we know that the
optimal solution is a linear state feedback, i.e. u = Kx.
The expression for K can be computed by zeroing the gradient with respect to u.
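Carrying out this gradient computation explicitly (a step not spelled out on the slide), with the shorthand $M = A + D\Lambda$ and $N = B + D\Gamma$:
$$\nabla_u \Big[ u^\top R u - \gamma^2 (\Lambda x + \Gamma u)^\top (\Lambda x + \Gamma u) + (Mx + Nu)^\top P_{k+1} (Mx + Nu) \Big] = 0$$
$$\Longrightarrow \quad \big( R - \gamma^2 \Gamma^\top \Gamma + N^\top P_{k+1} N \big)\, u = \big( \gamma^2 \Gamma^\top \Lambda - N^\top P_{k+1} M \big)\, x,$$
so $K = \big( R - \gamma^2 \Gamma^\top \Gamma + N^\top P_{k+1} N \big)^{-1} \big( \gamma^2 \Gamma^\top \Lambda - N^\top P_{k+1} M \big)$.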
Recursive definition of $V_k$
We can finally prove that the value function is quadratic, by simple substitution of the linear forms of $u_k^*$ and $w_k^*$ in x, namely $u_k^* = Kx$ and $w_k^* = Hx$ with $H = \Lambda + \Gamma K$:
$$V_k(x) = x^\top Q x + x^\top K^\top R K x - \gamma^2 x^\top H^\top H x + \big((A + BK + DH)x\big)^\top P_{k+1} \big((A + BK + DH)x\big)$$
$$= x^\top \underbrace{\big( Q + K^\top R K - \gamma^2 H^\top H + (A + BK + DH)^\top P_{k+1} (A + BK + DH) \big)}_{P_k} \, x.$$
Things you would have to verify
▶ convexity in u at all steps
▶ concavity in w at all steps (requires γ large enough)
▶ invertibility of the Hessians → unique minimizers/maximizers
▶ positive semidefiniteness of $P_k$
Offline computation
Similarly to the LQR case, this entire computation can be performed offline.
Online part in a receding horizon scheme:
u0∗ (x) = K0 x
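A numerical sketch of the offline backward recursion (my own code, with placeholder matrices), following the formulas on the previous slides: starting from $P_K = S$, each step computes the worst-case disturbance gains Λ, Γ, the feedback gain K, and $P_k$; online, only $u_0^*(x) = K_0 x$ needs to be evaluated.

```python
import numpy as np

def minmax_lqr(A, B, D, Q, R, S, gamma, horizon):
    """Backward recursion for the finite-horizon min-max (H-infinity-type) LQR."""
    P = S.copy()
    gains = []
    for _ in range(horizon):
        # Maximization over w: w_hat = Lam x + Gam u (requires gamma^2 I - D'PD > 0)
        W = np.linalg.inv(gamma**2 * np.eye(D.shape[1]) - D.T @ P @ D)
        Lam, Gam = W @ D.T @ P @ A, W @ D.T @ P @ B
        # Minimization over u with the worst-case w plugged in: u = K x
        M, N = A + D @ Lam, B + D @ Gam
        K = np.linalg.solve(R - gamma**2 * Gam.T @ Gam + N.T @ P @ N,
                            gamma**2 * Gam.T @ Lam - N.T @ P @ M)
        H = Lam + Gam @ K                      # worst-case disturbance gain: w = H x
        Acl = A + B @ K + D @ H
        P = Q + K.T @ R @ K - gamma**2 * H.T @ H + Acl.T @ P @ Acl   # P_k
        gains.append(K)
    gains.reverse()                            # gains[0] is K_0
    return gains, P

# Placeholder data (illustrative only)
A = np.array([[1.0, 0.1], [0.0, 1.0]])
B = np.array([[0.0], [0.1]])
D = np.array([[0.05], [0.1]])
Q, R, S = np.eye(2), np.eye(1), np.eye(2)
gains, P0 = minmax_lqr(A, B, D, Q, R, S, gamma=2.0, horizon=20)
u0 = gains[0] @ np.array([1.0, 0.0])           # online: u_0^*(x) = K_0 x
```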
Two ways to derive $u_0^*(x)$
▶ Dynamic programming: automatically returns the desired control law $u_0^*(x)$ from the offline computation. Corresponds to infinite-time optimal control in the limit $K \to \infty$.
▶ Parametric optimization: the desired control law $u_0^*(x)$ is obtained by parametrizing the online optimization problem in x.
[Figure: state, input, and disturbance trajectories under three disturbance realizations w1, w2, w3, comparing the nominal case (w = 0), the closed-loop response, and the open-loop response.]
A tradeoff
Instead of optimizing over arbitrary policies, we can optimize over policies parametrized by a finite-dimensional vector v:
$$u_k(x_k) = \pi_k(x_k; v)$$
Feedback MPC
$$\min_{v,\,x} \ \max_{w} \ \sum_{k=0}^{K-1} g_k\big(x_k, \pi_k(x_k; v)\big) + g_K(x_K)$$
Examples of policies
Open-loop policy
$$v \in \mathbb{R}^{K}, \qquad u_k(x_k) = \pi_k(x_k; v) = v_k$$
Parametrized feedback policy (linear combination of basis functions $\theta_m$)
$$v \in \mathbb{R}^{KM}, \qquad u_k(x_k) = \pi_k(x_k; v) = \sum_{m=1}^{M} v_{km}\, \theta_m(x_k)$$
Closed-loop policy
any $\pi_k(x_k)$, with no finite-dimensional parametrization
Feedback MPC
A policy parametrized in a vector of parameters v:
$$v_k \in \mathbb{R}^{M}, \qquad u_k(x_k) = \pi_k(x_k; v) = \sum_{m=1}^{M} v_{km}\, \theta_m(x_k)$$
The optimal trajectory is determined by the affine policy $u_k = v_k + L x_k$ applied to the linear dynamics $x_{k+1} = A x_k + B u_k + D w_k$, that is,
$$x_{k+1} = (A + BL) x_k + B v_k + D w_k$$
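To see why fixing L keeps the problem tractable, the recursion can be stacked: with $u_k = v_k + L x_k$ the whole predicted trajectory is affine in the parameters v and in the disturbances w. A small sketch (mine, generic matrices) building these stacked maps:

```python
import numpy as np

def stacked_maps(A, B, D, L, K):
    """Return F, G, E such that the stacked state trajectory satisfies
    x_stack = F x0 + G v_stack + E w_stack
    under the pre-stabilized dynamics x_{k+1} = (A + B L) x_k + B v_k + D w_k."""
    n, m, p = A.shape[0], B.shape[1], D.shape[1]
    Acl = A + B @ L
    F = np.vstack([np.linalg.matrix_power(Acl, k) for k in range(K + 1)])
    G = np.zeros(((K + 1) * n, K * m))
    E = np.zeros(((K + 1) * n, K * p))
    for k in range(1, K + 1):
        for i in range(k):
            blk = np.linalg.matrix_power(Acl, k - 1 - i)
            G[k * n:(k + 1) * n, i * m:(i + 1) * m] = blk @ B
            E[k * n:(k + 1) * n, i * p:(i + 1) * p] = blk @ D
    return F, G, E
```

Since the state is affine in v for every fixed disturbance realization, state and input constraints evaluated at the extreme disturbance sequences remain linear in v, i.e., the optimization over v stays convex.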
Remark: joint optimization of L and v
With the parametrization $u_k = v_k + L x_k$, the policy classes are nested, from the largest to the smallest:
▶ closed-loop policies: any $\pi_k(x_k)$
▶ time-varying affine policies: $\pi_k(x_k) = v_k + L_k x_k$
▶ affine policies with fixed L: $\pi_k(x_k) = v_k + L x_k$
▶ open-loop policies: $\pi_k(x_k) = v_k$ (i.e., $L = 0$)
Optimization over disturbance-feedback policies
Consider control policies of the form
$$u_k = \sum_{i=0}^{k-1} M_{ki} w_i + v_k.$$
The resulting robust finite-horizon problem, with the matrices $M_{ki}$ and the vectors $v_k$ as decision variables, is convex.
$$u_k = \sum_{i=0}^{k-1} M_{ki} w_i + v_k.$$
Notice first that a feedback from the disturbance w (with a unit delay) is equivalent to a feedback from the state, as past disturbances can be reconstructed from the observed trajectory: for the linear model, $D w_{k-1} = x_k - A x_{k-1} - B u_{k-1}$.
Computational complexity
Despite being convex, this problem can be computationally very hard, because of the huge number of constraints (proportional to the cardinality of the set $\mathcal{W}$).
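A small illustration (mine) of both points: the strict causality of the disturbance-feedback parametrization (M is lower block triangular with a zero diagonal, because of the unit delay) and the growth of the number of robust constraints when they are enforced at every vertex disturbance sequence, which is one concrete way the size of the disturbance set enters.

```python
import numpy as np
from itertools import product

K, p = 5, 1                                     # horizon and disturbance dimension (placeholders)
vertices = [np.array([-1.0]), np.array([1.0])]  # vertices of the disturbance set W

# Strictly causal disturbance feedback u_k = sum_{i<k} M_{ki} w_i + v_k:
# only the blocks with i < k are decision variables (unit delay).
free_blocks = [(k, i) for k in range(K) for i in range(k)]

# Enforcing the constraints robustly by enumeration requires one copy of the
# state/input constraints per vertex disturbance sequence.
vertex_sequences = list(product(vertices, repeat=K))

print(len(free_blocks), "free blocks in M")       # K(K-1)/2 = 10
print(len(vertex_sequences), "vertex sequences")  # 2**K = 32, grows exponentially in K
```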
Robust MPC: summary
The control engineer flowchart
This work is licensed under a
Creative Commons Attribution-ShareAlike 4.0 International License
https://bsaver.io/COCO