Introduction To Nonlinear Control Lecture # 5 State Feedback Stabilization
Introduction To Nonlinear Control Lecture # 5 State Feedback Stabilization
Introduction To Nonlinear Control Lecture # 5 State Feedback Stabilization
Lecture # 5
– p. 1/?
We want to stabilize the system
ẋ = f (x, u)
0 = f (xss , uss )
xδ = x − xss , uδ = u − uss
def
ẋδ = f (xss + xδ , uss + uδ ) = fδ (xδ , uδ )
fδ (0, 0) = 0
uδ = γ(xδ ) ⇒ u = uss + γ(x − xss )
– p. 2/?
State Feedback Stabilization: Given
ẋ = f (x, u) [f (0, 0) = 0]
find
u = γ(x) [γ(0) = 0]
s.t. the origin is an asymptotically stable equilibrium point of
ẋ = f (x, γ(x))
– p. 3/?
Linear Systems
ẋ = Ax + Bu
(A, B) is stabilizable (controllable or every uncontrollable
eigenvalue has negative real part)
u = −Kx
Typical methods:
Eigenvalue Placement
Eigenvalue-Eigenvector Placement
LQR
– p. 4/?
Linearization
ẋ = f (x, u)
ẋ = Ax + Bu
∂f ∂f
A= (x, u) ; B= (x, u)
∂x x=0,u=0 ∂u x=0,u=0
u = −Kx
– p. 5/?
Closed-loop system:
ẋ = f (x, −Kx)
∂f ∂f
ẋ = (x, −Kx) + (x, −Kx) (−K) x
∂x ∂u x=0
= (A − BK)x
– p. 6/?
Example (Pendulum Equation):
θ̈ = −a sin θ − bθ̇ + cT
0 = −a sin δ + cTss
x1 = θ − δ, x2 = θ̇, u = T − Tss
ẋ1 = x2
ẋ2 = −a[sin(x1 + δ) − sin δ] − bx2 + cu
" # " #
0 1 0 1
A= =
−a cos(x1 + δ) −b −a cos δ −b
x1 =0
– p. 7/?
" # " #
0 1 0
A= ; B=
−a cos δ −b c
h i
K= k1 k2
" #
0 1
A − BK =
−(a cos δ + ck1 ) −(b + ck2 )
a cos δ b
k1 > − , k2 > −
c c
a sin δ a sin δ
T = − Kx = − k1 (θ − δ) − k2 θ̇
c c
– p. 8/?
Notions of Stabilization
– p. 9/?
Semiglobal Stabilization: The origin of ẋ = f (x, γ(x)) is
asymptotically stable and γ(x) can be designed such that
any given compact set (no matter how large) can be
included in the region of attraction (Typically u = γp (x) is
dependent on a parameter p such that for any compact set
G, p can be chosen to ensure that G is a subset of the
region of attraction )
– p. 10/?
Example
ẋ = x2 + u
Linearization:
ẋ = u, u = −kx, k > 0
Closed-loop system:
ẋ = −kx + x2
– p. 11/?
The control u = −kx does not achieve global stabilization
The control
u = −x2 − kx
achieves global stabilization because it yields the linear
closed-loop system ẋ = −kx whose origin is globally
exponentially stable
– p. 12/?
Practical Stabilization
ẋ = f (x, u) + g(x, u, t)
f (0, 0) = 0, g(0, 0, t) 6= 0
kg(x, u, t)k ≤ δ, ∀ x ∈ Dx , u ∈ Du , t ≥ 0
There is no control u = γ(x), with γ(0) = 0, that can make
the origin of
– p. 13/?
Definition: The system
ẋ = f (x, u) + g(x, u, t)
kx(t)k ≤ ε, ∀t≥T
– p. 14/?
With practical stabilization, we may have
local practical stabilization
regional practical stabilization
global practical stabilization, or
semiglobal practical stabilization
depending on the region of initial states
– p. 15/?
Example
ẋ = x2 + u + d(t), |d(t)| ≤ δ, ∀ t ≥ 0
V = 12 x2 ⇒ V̇ = x3 − kx2 + xd(t)
V̇ ≤ − k3 x2 − x2 k3 − |x| − |x| k3 |x| − δ
V̇ ≤ − k3 x2 , for 3δ
k
≤ |x| ≤ k
3
3δ 3δ
Take k <ε ⇔ k≥ ε
By choosing k large enough we can achieve semiglobal
practical stabilization
– p. 16/?
ẋ = x2 + u + d(t)
V = 12 x2 ⇒ V̇ = −kx2 + xd(t)
V̇ ≤ − k2 x2 − |x| k2 |x| − δ
2δ
V̇ ≤ − k2 x2 , ∀ |x| ≥
k
By choosing k large enough we can achieve global practical
stabilization
– p. 17/?
Normal Form
– p. 18/?
∂(Lf h)
Lg Lf h(x) = g(x)
∂x
∂(Lf h)
L2f h(x) = Lf Lf h(x) = f (x)
∂x
∂(Lk−1
f h)
Lkf h(x) = Lf Lk−1
f h(x) = f (x)
∂x
L0f h(x) = h(x)
ẏ = Lf h(x) + Lg h(x) u
Lg h(x) = 0 ⇒ ẏ = Lf h(x)
(2) ∂(Lf h)
y = [f (x) + g(x)u] = L2f h(x) + Lg Lf h(x) u
∂x
– p. 19/?
Lg Lf h(x) = 0 ⇒ y (2) = L2f h(x)
– p. 20/?
Example
ẏ = ẋ1 = x2
ÿ = ẋ2 = −x1 + ε(1 − x21 )x2 + u
Relative degree = 2 over R2
Example
– p. 21/?
Definition: a map z = T (x) is a diffeomorphism on a
domain D if there is an inverse map T −1 (·) such that
x = T −1 (z) for all z ∈ T (D), and both T (·) and T −1 (·)
are continuously differentiable
– p. 22/?
Change of variables:
φ1 (x)
..
.
φn−ρ (x) φ(x) η
def def
z = T (x) = − − − = −−− = −−−
h(x) ψ(x) ξ
..
.
ρ−1
Lf h(x)
– p. 23/?
∂φ
η̇ = [f (x) + g(x)u] = f0 (η, ξ) + g0 (η, ξ)u
∂x
ξ̇i = ξi+1 , 1≤i≤ρ−1
ξ̇ρ = Lρf h(x) + Lg Lρ−1
f h(x) u
y = ξ1
– p. 24/?
η̇ = f0 (η, ξ)
ξ̇i = ξi+1 , 1≤i≤ρ−1
ρ ρ−1
ξ̇ρ = Lf h(x) + Lg Lf h(x) u
y = ξ1
0 1 0 ... 0 0
0 0 1 ... 0
0
.. ... .. .
..
Ac = . . , Bc =
.
.. 0 1
0
0 ... ... 0 0 1
h i −Lρf h
Cc = 1 0 ... 0 0 , α= ρ−1 , γ = Lg Lρ−1
f h
Lg Lf h
– p. 25/?
Normal Form:
η̇ = f0 (η, ξ)
ξ̇ = Ac ξ + Bc γ(x)[u − α(x)], γ(x) 6= 0
y = Cc ξ
Zero Dynamics
– p. 26/?
Example
2 + x23
ẋ1 = −x1 + u, ẋ2 = x3 , ẋ3 = x1 x3 + u, y = x2
1+ x23
ẏ = ẋ2 = x3
ÿ = ẋ3 = x1 x3 + u ⇒ ρ = 2
L2f h(x)
γ = Lg Lf h(x) = 1, α = − = −x1 x3
Lg Lf h(x)
– p. 27/?
Find φ(x) such that
2+x23
∂φ h i 1+x23
∂φ ∂φ ∂φ
φ(0) = 0, g(x) = , , 0 =0
∂x ∂x1 ∂x2 ∂x3
1
and h iT
T (x) = φ(x) x2 x3
is a diffeomorphism
∂φ 2 + x23 ∂φ
· + =0
∂x1 1+ x23 ∂x3
– p. 28/?
h iT
T (x) = −x1 + x3 + tan−1 x3 , x2 , x3
is a global diffeomorphism
η = −x1 + x3 + tan−1 x3 , ξ1 = x2 , ξ2 = x3
!
2 + ξ22
−η + ξ2 + tan−1 ξ2
η̇ = 1+ 2
ξ2
1+ ξ2
ξ̇1 = ξ2
−1
ξ̇2 = −η + ξ2 + tan ξ2 ξ2 + u
y = ξ1
– p. 29/?
Normal Form (ρ < n):
η̇ = f0 (η, ξ)
ξ̇ = Ac ξ + Bc γ(x)[u − α(x)]
y = Cc ξ
ξ̇ = Ac ξ + Bc γ(x)[u − α(x)]
y = Cc ξ
– p. 30/?
Given the system
ẋ = f (x) + g(x)u
ẋ = f (x) + g(x)u
y = h(x)
– p. 31/?
Feedback Linearization
ẋ = f (x) + G(x)u
f (0) = 0, x ∈ Rn , u ∈ Rm
Suppose there is a change of variables z = T (x), defined
for all x ∈ D ⊂ Rn , that transforms the system into form
ż = Az + Bγ(x)[u − α(x)]
u = α(x) + γ −1 (x)v ⇒ ż = Az + Bv
– p. 32/?
v = −Kz
ż = (A − BK)z
– p. 33/?
What can we say about the stability of x = 0 as an
equilibrium point of
−1
ẋ = f (x) + G(x) α(x) − γ (x)KT (x)
– p. 34/?
What information do we need to implement the control
Closed-loop system:
ż = (A − BK)z + Bδ(z)
δ = γ[α̂ − α + γ −1 KT − γ̂ −1 K T̂ ]
– p. 35/?
ż = (A − BK)z + Bδ(z) (∗)
– p. 36/?
Example (Pendulum Equation):
θ̈ = −a sin θ − bθ̇ + cT
a
x1 = θ − δ, x2 = θ̇, u = T − Tss = T − sin δ
c
ẋ1 = x2
ẋ2 = −a[sin(x1 + δ) − sin δ] − bx2 + cu
1
u= {a[sin(x1 + δ) − sin δ] − k1 x1 − k2 x2 }
c
" #
0 1
A − BK = is Hurwitz
−k1 −(k2 + b)
– p. 37/?
a 1
T =u+ sin δ = [a sin(x1 + δ) − k1 x1 − k2 x2 ]
c c
Let â and ĉ be nominal models of a and c
1
T = [â sin(x1 + δ) − k1 x1 − k2 x2 ]
ĉ
ẋ = (A − BK)x + Bδ(x)
âc − aĉ c − ĉ
δ(x) = sin(x1 + δ1 ) − (k1 x1 + k2 x2 )
ĉ ĉ
– p. 38/?
âc − aĉ c − ĉ
δ(x) = sin(x1 + δ1 ) − (k1 x1 + k2 x2 )
ĉ ĉ
|δ(x)| ≤ kkxk + ε
âc − aĉ c − ĉ âc − aĉ
q
k = + 2 2
k +k , ε= | sin δ1 |
1 2
ĉ ĉ ĉ
" # " #
p11 p12 p12
P = , PB =
p12 p22 p22
1
k< q
2 p212 + p222
sin δ1 = 0 ⇒ ε = 0
– p. 39/?
Is feedback linearization a good idea?
Example
ẋ = ax − bx3 + u, a, b > 0
u = −(k + a)x + bx3 , k > 0, ⇒ ẋ = −kx
−bx3 is a damping term. Why cancel it?
u = −(k + a)x, k > 0, ⇒ ẋ = −kx − bx3
Which design is better?
– p. 40/?
Example
ẋ1 = x2
ẋ2 = −h(x1 ) + u
h(0) = 0 and x1 h(x1 ) > 0, ∀ x1 6= 0
Feedback Linearization:
u = h(x1 ) − (k1 x1 + k2 x2 )
– p. 41/?
The control
V̇ = −x2 σ(x2 )
– p. 42/?
Partial Feedback linearization
η̇ = f0 (η, ξ)
ξ̇ = Aξ + Bγ(x)[u − α(x)]
– p. 43/?
u = α(x) + γ −1 (x)v
η̇ = f0 (η, ξ), ξ̇ = Aξ + Bv
Suppose the origin of η̇ = f0 (η, 0) is asymptotically stable
– p. 44/?
If the origin of η̇ = f0 (η, 0) is globally asymptotically stable,
will the origin of
– p. 45/?
Example
η̇ = − 12 (1 + ξ2 )η 3 , ξ̇1 = ξ2 , ξ̇2 = v
e(A−BK)t =
−k2 te−kt (1 − kt)e−kt
– p. 46/?
Peaking Phenomenon:
2 −kt k
max{k te }= → ∞ as k → ∞
t e
η02
η 2 (t) =
1 + η02 [t + (1 + kt)e−kt − 1]
If η02 > 1, the system will have a finite escape time if k is
chosen large enough
– p. 47/?
Lemma 13.2: The origin of
Proof: Use
Lemma 4.7: If ẋ1 = f1 (x1 , x2 ) is ISS and the origin of
ẋ2 = f2 (x2 ) is globally asymptotically stable, then the
origin of
ẋ1 = f1 (x1 , x2 ), ẋ2 = f2 (x2 )
is globally asymptotically stable
– p. 48/?
u = α(x) − γ −1 (x)KT2 (x)
– p. 49/?
Lemma 13.4
If kδ(z)k ≤ ε for all z and η̇ = f0 (η, ξ) is input-to-state
stable, then the state z is globally ultimately bounded by
a class K function of ε
– p. 50/?
Proof–First Part:
kξ(t)k≤ cε, ∀ t ≥ t0
∂V1
f0 (η, 0) ≤ −c3 kηk2
∂η
∂V1
≤ c4 kηk
∂η
– p. 51/?
V (z) = bV1 (η) + ξT P ξ
" #T " #
kηk kηk
V̇ ≤ − Q
kξk kξk
" #
bc3 −(kkP Bk + bc4 L/2)
Q=
−(kkP Bk + bc4 L/2) 1 − 2kkP Bk
b=k
Q is positive definite for sufficiently small k
– p. 52/?