Pontryagin Principle of Maximum Time-Optimal Control: Constrained Control, Bang-Bang Control
Zdeněk Hurák
April 27, 2017
From the conditions derived earlier,

$$H_{y'} = 0, \qquad (1)$$

$$L_{y'y'} \geq 0, \qquad (2)$$

we concluded that the Hamiltonian is not only stationary along the extremal; it is actually maximized, since

$$H_{y'y'} = -L_{y'y'} \leq 0. \qquad (3)$$
Pontryagin's principle of maximum; time-optimal control
or, equivalently, as
The essence of the celebrated Pontryagin's principle is that the above condition, the maximization of the Hamiltonian, is the actual necessary condition of optimality. The condition

$$H_u = 0 \qquad (9)$$

is just its consequence in the situation when $H_u$ exists and the set of allowable controls is not bounded. Let us emphasize the fact that the control u comes from some bounded set U by writing Pontryagin's principle as follows.
Theorem 1 (Pontryagin's principle of maximum). For a given system and a given optimality criterion, let u* ∈ U be an optimal control. Then there is a variable called the costate which together with the state satisfies the Hamilton canonical equations

$$\dot{x} = \nabla_\lambda H, \qquad (10)$$

$$\dot{\lambda} = -\nabla_x H, \qquad (11)$$

where

$$H(t, x, u, \lambda) = \lambda^T(t)\, f(x, u, t) - L(t, x, u) \qquad (12)$$

and

$$H(t, x^*, u^*, \lambda^*) \geq H(t, x^*, u, \lambda^*), \qquad u \in U. \qquad (13)$$

Moreover, the corresponding boundary conditions must hold.
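To appreciate what the maximum condition (13) adds over the stationarity condition (9), consider the typical time-optimal situation in which the Hamiltonian is affine in a scalar control restricted to U = [−1, 1]: the stationarity condition is then useless, while the maximum condition picks a vertex of U. A minimal sketch in Python (the numbers below are illustrative, not from the lecture):

    import numpy as np

    # Hamiltonian affine in the scalar control: H(u) = h0 + h1*u, with u in U = [-1, 1].
    # If h1 != 0, H_u = h1 never vanishes, so (9) gives nothing; the maximum
    # condition (13) still determines the optimal control uniquely.
    def argmax_H_on_interval(h1):
        """Return the u in [-1, 1] that maximizes h0 + h1*u (h0 is irrelevant)."""
        return float(np.sign(h1)) if h1 != 0 else 0.0   # any u in U is optimal if h1 == 0

    for h1 in [2.3, -0.7, 0.0]:
        print(f"h1 = {h1:+.1f}  ->  maximizing u = {argmax_H_on_interval(h1):+.1f}")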
Figure 1: Optimizing over curves with one of their end points on a curve. (The sketch shows the optimal curve y*(x), a perturbed curve y(x) with perturbation δy(x), and the right end of the interval moving from b to b + db.)
The trick is that the stretching or shrinking of the interval of the independent variable is done by perturbing the stationary value of the right end b of the interval with the same α that we use to perturb the functions y and y'. That is, b is perturbed by ∆b = α∆x and the perturbed cost functional is then

$$J(y^* + \alpha\eta) = \int_a^{b+\alpha\Delta x} L\big(x,\; y^* + \alpha\eta,\; (y^*)' + \alpha\eta'\big)\,\mathrm{d}x. \qquad (15)$$

Note that we have a minor technical problem here since y* is only defined on the interval [a, b]. But there is an easy fix: we define a continuation of the function to the right of b in the form of a linear approximation given by the derivative of y* at b. We will exploit it in a while.
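Written out explicitly (this is just the extension the previous paragraph describes, stated as a formula), the continuation can be taken as

$$y^*(x) := y^*(b) + (y^*)'(b)\,(x - b), \qquad x \in (b,\, b + \alpha\Delta x],$$

which matches y* in value and slope at x = b.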
Now, in order to find the variation δJ, we can either proceed by fitting the Taylor expansion of the above perturbed cost functional to the general Taylor expansion and identifying the first-order term in α, or, alternatively (in fact, equivalently), we can use the already stated fact that

$$\delta J = \left.\frac{\mathrm{d}}{\mathrm{d}\alpha} J(y^* + \alpha\eta)\right|_{\alpha=0} \alpha. \qquad (16)$$
In order to find this derivative, we have to observe that the variable with respect to which we are differentiating also appears in the upper bound of the integral. Therefore we cannot simply interchange differentiation and integration. This situation is handled by the well-known Leibniz rule for differentiation under the integral sign; look it up in its full generality yourself. In our case it leads to
$$\left.\frac{\mathrm{d}}{\mathrm{d}\alpha} J(y^* + \alpha\eta)\right|_{\alpha=0} = \int_a^b \left( L_y - \frac{\mathrm{d}}{\mathrm{d}x} L_{y'} \right)\eta(x)\,\mathrm{d}x + L_{y'}\big|_b\,\eta(b) + L\big|_b\,\Delta x, \qquad (17)$$

which after multiplication by α gives the variation of the functional

$$\delta J = \int_a^b \left( L_y - \frac{\mathrm{d}}{\mathrm{d}x} L_{y'} \right)\delta y(x)\,\mathrm{d}x + L_{y'}\big|_b\,\delta y(b) + L\big|_b\,\underbrace{\Delta x\,\alpha}_{\Delta b}, \qquad (18)$$
where the first two terms on the right are already known to us. The only new one is the third term. The reasoning now is pretty much the same as in the fixed-interval, free-end case. Among the admissible variations there are also those with δy vanishing at b and with ∆b = 0; for these the last two terms disappear, and since the variation δJ must vanish for them too, the integral must be zero, which gives rise to the familiar Euler-Lagrange equation. But then the integral vanishes for all variations, and hence the last two terms must together be zero as well. It does not hurt to rewrite them in a complete notation to dispel any ambiguity:

$$L_{y'}(b, y(b), y'(b))\,\delta y(b) + L(b, y(b), y'(b))\,\Delta b = 0. \qquad (19)$$
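For reference, the general form of the Leibniz rule invoked above (a standard result, quoted here for convenience) reads, for sufficiently smooth F, a and b,

$$\frac{\mathrm{d}}{\mathrm{d}\alpha}\int_{a(\alpha)}^{b(\alpha)} F(x,\alpha)\,\mathrm{d}x = \int_{a(\alpha)}^{b(\alpha)} \frac{\partial F(x,\alpha)}{\partial \alpha}\,\mathrm{d}x + F\big(b(\alpha),\alpha\big)\,\frac{\mathrm{d}b}{\mathrm{d}\alpha} - F\big(a(\alpha),\alpha\big)\,\frac{\mathrm{d}a}{\mathrm{d}\alpha}.$$

In our case a is fixed and db/dα = ∆x, which is precisely where the boundary term L|_b ∆x in (17) comes from.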
Now, in order to get some more insight into the above condition, the relation between the participating objects can be explored further. We will do it using Fig. 1, augmented with a few more labels; see Fig. 2 below.
Note that we have included a brand new label here, namely dyf for the perturbation of the value of the function y(·) at the end of the interval (taking into consideration that the length of the interval can change as well). We can now write

$$y^*(b + \Delta b) + \delta y(b + \Delta b) = y^*(b) + \mathrm{d}y_f, \qquad (20)$$
Figure 2: Optimizing over curves with one end of the interval of the independent variable x set free and relaxing also the value of the function there. (The sketch shows y*(x) and a perturbed y(x); at the right end, the total change dyf is composed of δy(b) and the term y*'(b) db due to the shift of the end point from b to b + db.)
which after approximating each term with the first two terms of its Taylor expansion gives

$$y^*(b) + (y^*)'(b)\,\Delta b + \delta y(b) + \delta y'(b)\,\Delta b = y^*(b) + \mathrm{d}y_f. \qquad (21)$$

Note that the last product on the left can be neglected since it is a product of two factors that are both of order one in the perturbation variable α. In other words, we approximate δy(b + ∆b) by δy(b). In addition, the term y*(b) can be subtracted from both sides. From what remains after these cancellations, we can conclude that

$$(y^*)'(b)\,\Delta b + \delta y(b) = \mathrm{d}y_f, \qquad (22)$$

or, equivalently,

$$\delta y(b) = \mathrm{d}y_f - (y^*)'(b)\,\Delta b. \qquad (23)$$
We will now substitute this into the general form of the boundary equation (19):

$$L_{y'}(b, y(b), y'(b)) \cdot \big(\mathrm{d}y_f - (y^*)'(b)\,\Delta b\big) + L(b, y(b), y'(b))\,\Delta b = 0. \qquad (24)$$

Collecting the terms with the two independent perturbation variables dyf and ∆b, we rearrange the above expression into

$$L_{y'}(b, y(b), y'(b))\,\mathrm{d}y_f + \big(L(b, y(b), y'(b)) - L_{y'}(b, y(b), y'(b))\,(y^*)'(b)\big)\,\Delta b = 0. \qquad (25)$$

Now, since dyf and ∆b are assumed independent, the corresponding terms must vanish simultaneously and independently, that is,

$$L_{y'}(b, y(b), y'(b)) = 0, \qquad (26)$$

$$L(b, y(b), y'(b)) - L_{y'}(b, y(b), y'(b))\,(y^*)'(b) = 0. \qquad (27)$$

Note that the first condition actually constitutes n scalar conditions whereas the second one is just a single scalar condition, hence n + 1 boundary conditions in total.
We now switch to the optimal control setting: the cost to be minimized is

$$J = \phi(x(t_f), t_f) + \int_{t_i}^{t_f} L(x, u, t)\,\mathrm{d}t \qquad (28)$$

subject to

$$\dot{x}(t) = f(x, u, t), \qquad x(t_i) = r_i. \qquad (29)$$
We have already seen that the integrand of the augmented cost function now contains not only the term that corresponds to the Lagrange multiplier but also the term coming from the penalty on the state at the final time, that is,

$$L_{\mathrm{aug}}(x, u, \lambda, t) = L(x, u, t) + \underbrace{\frac{\partial \phi}{\partial t} + (\nabla_x \phi)^T \frac{\mathrm{d}x}{\mathrm{d}t}}_{\frac{\mathrm{d}\phi(x(t), t)}{\mathrm{d}t}} + \lambda^T\big(\dot{x} - f(x, u, t)\big). \qquad (30)$$
Since here we assume that the final time and the state at the final time are independent, this single condition breaks down into two boundary conditions¹, which, in turn, enforces the scalar boundary condition (on top of those other n conditions)

$$H(x(t), u(t), \lambda(t), t)\big|_{t=t_f}\, \mathrm{d}t_f = 0. \qquad (36)$$
¹ Note that here we commit the common abuse of notation of writing the functions to be differentiated as explicitly dependent on tf, such as in ∂φ(x(tf), tf)/∂t. Instead, we should perhaps keep writing it as ∂φ(x(t), t)/∂t |_{t=tf}, but that is tiring and the formulas look cluttered.
Now, if neither the system equations nor the cost function of the optimal control problem depend explicitly on time, that is, if ∂H/∂t = 0, the Hamiltonian remains constant along the optimal solution (trajectory), that is, H(x*(t), u*(t), λ*(t)) = const. Combined with the previous result (the boundary value of H at the end of the free time interval is zero), we obtain the powerful conclusion that the Hamiltonian evaluated along the optimal trajectory is always zero in the free-final-time scenario: H(x*(t), u*(t), λ*(t)) = 0 for all t.
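The constancy claim follows from a short computation along the extremal (a sketch; the case of a control on the boundary of U needs a slightly more careful argument, but the conclusion is the same). Using the canonical equations (10)-(11) and the fact that H is maximized with respect to u,

$$\frac{\mathrm{d}H}{\mathrm{d}t} = (\nabla_x H)^T \dot{x} + (\nabla_u H)^T \dot{u} + (\nabla_\lambda H)^T \dot{\lambda} + \frac{\partial H}{\partial t} = (\nabla_x H)^T \nabla_\lambda H + 0 - (\nabla_\lambda H)^T \nabla_x H + \frac{\partial H}{\partial t} = \frac{\partial H}{\partial t},$$

where the u-term vanishes because in the interior of U the maximization gives ∇_u H = 0, while between switches of a bang-bang control u̇ = 0.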
Remark on notation. In the previous lecture (notes) we already discussed the unfortunate discrepancy in the definitions of the Hamiltonian in the literature. Perhaps there is no need to come back to this topic because you are now aware of the problem, but I will do it anyway. My only motivation is to have the formulas at hand.
Recall that the ambiguity starts with the definition of the augmented Lagrangian. I could have just as easily written, instead of (30), the following:

$$\hat{L}_{\mathrm{aug}}(x, u, \hat\lambda, t) = L(x, u, t) + \underbrace{\frac{\partial \phi}{\partial t} + (\nabla_x \phi)^T \frac{\mathrm{d}x}{\mathrm{d}t}}_{\frac{\mathrm{d}\phi(x(t), t)}{\mathrm{d}t}} + \hat\lambda^T\big(f(x, u, t) - \dot{x}\big), \qquad (40)$$
which can be rewritten, in the case of ∂φ(x(tf), tf)/∂t = 0 and using the alternative definition of the Hamiltonian $\hat{H} = L + \hat\lambda^T f$, as

$$\big(\nabla_x \phi - \hat\lambda\big)^T\Big|_{t=t_f}\, \mathrm{d}x_f + \hat{H}(x, u, \hat\lambda, t)\Big|_{t=t_f}\, \mathrm{d}t_f, \qquad (42)$$

whose vanishing for independent dxf and dtf gives the boundary conditions $\hat\lambda(t_f) = \nabla_x\phi\big|_{t=t_f}$ and $\hat{H}\big|_{t=t_f} = 0$.
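For quick reference, the two conventions are related as follows (a summary under the identification λ̂ = −λ, which makes the two augmented Lagrangians (30) and (40) coincide):

$$\hat{H}(t, x, u, \hat\lambda) = L + \hat\lambda^T f = -\big(\lambda^T f - L\big) = -H(t, x, u, \lambda), \qquad \dot{x} = \nabla_{\hat\lambda}\hat{H}, \qquad \dot{\hat\lambda} = -\nabla_x \hat{H},$$

so the canonical equations keep the same form, while the maximization of H over u ∈ U in (13) turns into a minimization of Ĥ over u ∈ U.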
2.2 Free final time but the final state on a prescribed curve
2.2.1 Calculus of variations setting
We will now investigate the special case in which the final value of the solution y(x) is required to lie on the curve described by ψ(x), that is,

$$y^*(b + \Delta b) + \delta y(b + \Delta b) = \psi(b + \Delta b). \qquad (43)$$
(Figure: the optimal curve y*(x), a perturbed curve y(x), and the target curve ψ(x) on which the right end point must lie.)
Expanding both sides to first order, neglecting the second-order terms as before, and using the fact that y*(b) = ψ(b), we get

$$y^*(b) + (y^*)'(b)\,\Delta b + \delta y(b) = \underbrace{y^*(b)}_{\psi(b)} + \psi'(b)\,\Delta b, \qquad (45)$$

from which δy(b) = (ψ'(b) − (y*)'(b)) ∆b. We substitute this into the boundary condition (19), which after cancelling the common ∆b term yields

$$L_{y'}(b, y(b), y'(b)) \cdot \big(\psi'(b) - y'(b)\big) + L(b, y(b), y'(b)) = 0. \qquad (47)$$
This is just one scalar boundary condition. But the n conditions stating that y(x) = ψ(x) at the right end of the interval must be added. Altogether, we have n + 1 boundary conditions.
Anyway, the above single equation is called the transversality condition, for a reason to be illuminated by the next example.
Example 2.2. To get some insight, consider again the minimum-distance problem. This time we want to find the shortest distance from a point to a curve given by φ(x). The answer is intuitive, but let us see what our rigorous tools offer here. The EL equation stays intact, therefore we know that the shortest path is a line. It starts at (a, 0), but in order to determine its end, we need to invoke the other boundary condition. Remember that the Lagrangian is

$$L = \sqrt{1 + (y')^2} \qquad (48)$$

and

$$L_{y'} = \frac{y'}{\sqrt{1 + (y')^2}}. \qquad (49)$$

Substituting into the transversality condition (47) and simplifying gives y'(b) φ'(b) = −1, that is, the slopes of the two curves at their intersection are negative reciprocals of each other. The interpretation of this result is that our desired curve y hits the target curve φ in a perpendicular (transverse) direction.
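A quick numerical sanity check of this perpendicularity claim, done in Python with an arbitrarily chosen target curve and start point (both are illustrative, not from the notes): we brute-force the closest point on the curve and verify that the product of the slopes is close to −1.

    import numpy as np

    # An arbitrary target curve phi(x) and its derivative, plus a fixed start point (a, ya).
    phi  = lambda x: 2.0 + 0.5 * np.sin(x)
    dphi = lambda x: 0.5 * np.cos(x)
    a, ya = 0.0, 0.0

    # Brute-force the end point b that minimizes the straight-line distance to the curve.
    xs = np.linspace(-3.0, 3.0, 200001)
    dist = np.hypot(xs - a, phi(xs) - ya)
    b = xs[np.argmin(dist)]

    slope_path  = (phi(b) - ya) / (b - a)   # slope y'(b) of the optimal straight path
    slope_curve = dphi(b)                   # slope phi'(b) of the target curve
    print("y'(b) * phi'(b) =", slope_path * slope_curve)   # expected: close to -1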
Understanding the boundary conditions is crucial, so let us have yet another look at the result just derived. Using the Hamiltonian H = y'L_{y'} − L of the calculus of variations, it can be written as

$$L_{y'}(b)\,\psi'(b) - H(b) = 0.$$

It follows that for a free length of the interval but a fixed value of the variable at its end, that is, for

$$\psi(x) = c, \qquad c \in \mathbb{R}, \qquad (53)$$

the transversality condition simplifies to

$$H(b) = 0. \qquad (54)$$
In the optimal control setting, the final state is now required to lie on a prescribed curve, x(tf) = ψ(tf), while the cost is minimized subject to the system dynamics ẋ = f(x, u, t). Translating the above derived transversality condition from the domain (and notation) of the calculus of variations into this setting gives

$$\left[(\nabla_x \phi + \lambda)^T\, \dot\psi(t) + L + \frac{\partial \phi}{\partial t} - \lambda^T f(x, u, t)\right]_{t=t_f} = 0. \qquad (59)$$
(Figure: the optimal control u*(t) plotted against the switching quantity bᵀλ(t); it equals +1 when bᵀλ(t) is positive and −1 when it is negative, a relay characteristic saturating at ±1.)
We do not know λ2(t). In order to get it, we may need to solve the costate equations. Indeed, we can solve them independently of the state equations, since they are decoupled from them:

$$\begin{bmatrix}\dot\lambda_1\\ \dot\lambda_2\end{bmatrix} = -\begin{bmatrix}0 & 0\\ 1 & 0\end{bmatrix}\begin{bmatrix}\lambda_1\\ \lambda_2\end{bmatrix}, \qquad (74)$$

from which it follows that

$$\lambda_1(t) = c_1 \qquad (75)$$

and

$$\lambda_2(t) = c_2 - c_1 t \qquad (76)$$

for some constants c1 and c2. To determine the constants, we will finally have to bring the boundary conditions into the game; one relation among them follows from the condition H(tf) = 0.
We can now sketch possible profiles of the switching function. A few characteristic versions are shown in Fig. 5. What we have learnt is that the costate λ2, being an affine function of time, goes through zero at most once during the whole control interval. Therefore we will have at most one switch of the control signal. This is a valuable observation.
We are approaching the final stage of the derivation. So far we have learnt that we only need to consider u(t) = 1 and u(t) = −1. The state equations can easily be integrated to get

$$v(t) = v(0) + ut, \qquad y(t) = y(0) + v(0)\,t + \tfrac{1}{2}ut^2.$$
(Figure 5: characteristic profiles of the switching function λ2(t) on the control interval from t0 to tf.)
To visualize this in the y-v plane, express t from the first equation and substitute into the second one:

$$u\,(y - y(0)) = v(0)\,(v - v(0)) + \tfrac{1}{2}(v - v(0))^2,$$

which is a family of parabolas parameterized by (y(0), v(0)). These are visualized in Fig. 6.
There is a distinguished curve in the figure, composed of two branches. It is special in that for every state starting on this curve, the system is brought to the origin by the corresponding constant setting of the control (and no further switching). This curve, called the switching curve, can be expressed as

$$y = \begin{cases} \tfrac{1}{2}v^2, & v < 0,\\ -\tfrac{1}{2}v^2, & v > 0, \end{cases}$$

or, more compactly,

$$y = -\tfrac{1}{2}\,v\,|v|.$$
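A minimal plotting sketch (Python with matplotlib; the initial conditions below are arbitrary picks) that reproduces a picture like Fig. 6 directly from the formulas just derived:

    import numpy as np
    import matplotlib.pyplot as plt

    v = np.linspace(-2.0, 2.0, 400)

    # Family of parabolas: solve u*(y - y0) = v0*(v - v0) + 0.5*(v - v0)**2 for y.
    for y0, v0 in [(-2.0, 0.0), (0.0, 1.0), (1.0, -1.0), (2.0, 0.0)]:   # arbitrary initial states
        for u in (+1.0, -1.0):
            y = y0 + (v0 * (v - v0) + 0.5 * (v - v0) ** 2) / u
            plt.plot(y, v, linewidth=0.8)

    # The switching curve y = -0.5*v*|v| (its two branches meet at the origin).
    plt.plot(-0.5 * v * np.abs(v), v, 'r', linewidth=2.0)
    plt.xlabel('y'); plt.ylabel('v'); plt.xlim(-3, 3); plt.ylim(-2, 2)
    plt.show()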
The final step can be done using this figure. Point your finger anywhere in the plane and follow the state trajectory that emanates from that particular point and along which you can get to the origin with at most one switch. Clearly the strategy is to set u such that it brings us to the switching curve (the red one in the figure) and then, after switching, follow the curve. That is it. This control strategy can be written as

$$u^*(y, v) = \begin{cases} -1, & y + \tfrac{1}{2}v|v| > 0,\\ +1, & y + \tfrac{1}{2}v|v| < 0,\\ -\operatorname{sign}(v), & y + \tfrac{1}{2}v|v| = 0. \end{cases}$$
Figure 6: Typical trajectories for both u(t) = 1 and u(t) = −1. Red is the switching curve.
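Before discussing the simulation results, here is a minimal forward-Euler simulation of the feedback law above, written in Python as a sketch only (the lecture's own experiment was done in Simulink; the initial state, step size and horizon below are illustrative choices):

    import numpy as np

    def u_star(y, v):
        """Time-optimal feedback for the double integrator, as derived above."""
        s = y + 0.5 * v * abs(v)           # signed "distance" from the switching curve
        if s > 0:
            return -1.0
        if s < 0:
            return +1.0
        return -float(np.sign(v))

    dt, T = 1e-3, 4.0                      # illustrative step size and horizon
    y, v = 2.0, 0.0                        # illustrative initial state
    switches, u_prev = 0, None
    for _ in range(int(T / dt)):
        u = u_star(y, v)
        if u_prev is not None and u != u_prev:
            switches += 1
        u_prev = u
        y, v = y + v * dt, v + u * dt      # double-integrator dynamics, forward Euler

    print(f"final state: y = {y:.4f}, v = {v:.4f}, control switches: {switches}")

Running this, one typically finds far more than the single ideal switch once the state gets close to the origin, which is exactly the phenomenon discussed next.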
(Figure: simulation of the resulting bang-bang control of the double integrator: the control u, the velocity v, and the position y plotted against time in seconds.)

This is actually not quite what we see in the plot above, is it? We can see two switches in the control signal. The first one happened at about 2.2 s, but the second happened close to 3.5 s. In fact, what you would experience if you ran the code in Simulink is the error statement "At time 3.449489782192745, simulation hits (1000) consecutive zero crossings." and the simulation stops. Obviously, what is going on is that the simulator is tempted to include not just two but in fact a huge number of switches in the control signal as it approaches the origin. This is quite characteristic of bang-bang control, a phenomenon called chattering. In this particular example you may decide to ignore it, since both state variables are already close enough to the origin and you may want to declare the control task as finished². Generally, though, this chattering phenomenon needs to be handled somehow. Any suggestion how to reduce it?

² Remember that we still consider control over a finite time interval, even though its length is a tunable parameter. Hence, after reaching the end of the interval, the task is over.
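One common remedy, sketched here only as a suggestion and not taken from the lecture notes, is to soften the hard sign: stop switching once the state is within a small tolerance of the origin, and/or replace the relay by a saturated proportional band around the switching curve, which is also the basic idea behind the PTOS-type schemes mentioned below. A hypothetical Python variant of the feedback (the function name and the tolerance values are made up for illustration):

    import numpy as np

    def u_softened(y, v, band=0.05, tol=1e-2):
        """Bang-bang law with two anti-chattering modifications (illustrative values):
        a dead zone of radius `tol` around the origin and a proportional band of
        width `band` around the switching curve instead of a hard sign."""
        if abs(y) < tol and abs(v) < tol:
            return 0.0                                  # close enough: stop commanding the relay
        s = y + 0.5 * v * abs(v)                        # signed distance from the switching curve
        return float(np.clip(-s / band, -1.0, 1.0))     # saturates at +-1 away from the curve

    print(u_softened(2.0, 0.0), u_softened(0.001, 0.0))

The price is that the behaviour near the target is no longer exactly time-optimal; trading this off systematically is, roughly, the concern of the PTOS literature mentioned in the next section.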
4 Further reading
This lecture was prepared using [1], in particular chapter 3 (application of the calculus of variations to the general problem of optimal control) and chapter 4 (Pontryagin's principle). We did not talk about the proof of Pontryagin's principle at the lecture and we do not even ask the students to go through the proof in the book. Understanding the result, its roots in the calculus of variations and how it removes the deficiencies of the calculus-of-variations-based results will suffice for our purposes.
The transition from the calculus of variations to optimal control, especially when it comes to the definition of the Hamiltonian, is somewhat tricky. Unfortunately, it is not discussed satisfactorily in the literature. Even Liberzon leaves it as an (unsolved) exercise to the student (exercises 3.5 and 3.6). Other major textbooks avoid the topic altogether. The only treatment can be found in the famous journal paper [2], in particular the section "The first fork in the road: Hamilton" on page 39. The issue is so delicate that the authors even propose to distinguish the two types of Hamiltonian by referring to one of them as the control Hamiltonian.
Time-optimal control for linear systems, in particular bang-bang control for a double integrator, is described in sections 4.4.1 and 4.4.2. The material is quite standard and can be found in many books and lecture notes. What is not covered, however, is the fact that without any adjustment the bang-bang control is very troublesome from an implementation viewpoint. A dedicated research thread has evolved, driven especially by the needs of the hard disk drive industry, under the name proximate time-optimal servomechanism (PTOS). Many dozens of papers can be found with this keyword in the title.
References
[1] Daniel Liberzon. Calculus of Variations and Optimal Control Theory: A Concise
Introduction. Princeton University Press, December 2011.
[2] H. J. Sussmann and J. C. Willems. 300 years of optimal control: from the brachystochrone to the maximum principle. IEEE Control Systems, 17(3):32–44, 1997.