Stochastic Disturbance Accommodating Control Using A Kalman Estimator
Stochastic Disturbance Accommodating Control Using A Kalman Estimator
Stochastic Disturbance Accommodating Control Using A Kalman Estimator
Nomenclature
(Ω, F , P) Complete probability space
E[·] Expectation operator
(A, B) True system state matrix and control distribution matrix, A ∈ Rn×n and B ∈ Rn×r
X(t, ω) ∈ Rn Stochastic state vector, t ∈ [t0 tf ] and ω ∈ Ω, for fixed t, X(t) is a random variable
Y(t, ω) ∈ Rm Stochastic output vector
u(t) ∈ Rr Input vector
W(t, ω) ∈ Rn External stochastic disturbance vector
V(t, ω) ∈ Rm
Measurement noise, assumed to be Gaussian white noise, fV (v) ∼ N 0, Rδ(τ )
(Am , Bm ) Assumed state matrix and assumed control distribution matrix, Am ∈ Rn×n and Bm ∈ Rn×r
Xm (t), Ym (t) State vector and output vector corresponding to the assumed system
C Known output matrix, C ∈ Rm×n
D(t, ω) ∈ Rn True lumped disturbance term
D m (t, ω) ∈ Rn Assumed lumped disturbance term
L1 (·), L2 (·) Linear mappings: Rn → Rn
V(t, ω) ∈ Rn
Zero mean Gaussian white noise process, fV (v) ∼ N 0, Qδ(τ )
x̄(t), ū(t) Nominal system states and control input, x̄(t) ∈ Rn , ū(t) ∈ Rr
Z(t) ∈ R2n , [XT (t) D T (t)]T , True augmented state vector
Zm (t) ∈ R2n , [XTm (t) D Tm (t)]T , Assumed augmented state vector
(F, D) True state matrix and control distribution matrix for the augmented system,
F ∈ R2n×2n and D ∈ R2n×r
ADm Assumed state matrix for disturbance term, ADm ∈ Rn×n
(Fm , Dm ) Assumed state matrix and control distribution matrix for the augmented system,
Fm ∈ R2n×2n and Dm ∈ R2n×r
W(t, ω) ∈ Rn
Zero mean Gaussian white noise process, fW (w) ∼ N 0, Qδ(τ )
(G, H) Known disturbance input matrix and output matrix for the augmented system,
G ∈ R2n×n and H ∈ Rm×2n
K(t) ∈ R2n×m Optimal observer gain or Kalman gain for the assumed system
∗ Graduate Student, Department of Mechanical & Aerospace Engineering, [email protected], Student Member AIAA.
† Assistant Professor, Department of Mechanical & Aerospace Engineering, [email protected], Member AIAA.
‡ Professor, Department of Mechanical & Aerospace Engineering, [email protected], Associate Fellow AIAA.
1 of 22
I. Introduction
System uncertainties and noisy measurements can obscure the development of a viable control law. The
main objective of a feedback controller design is to develop a compensator that will maintain given design
specifications in the presence of realistic ranges of uncertainty. A useful compensator that handles uncertainty
is Disturbance-Accommodating Control (DAC), which was first proposed by Johnson in 1971.1 Though the
traditional DAC only considers disturbance functions which exhibit waveform patterns over short intervals of
time,2 a more general formulation of DAC can accommodate the simultaneous presence of both “noise” type
disturbances and “waveform structured” disturbances.3
The disturbance-accommodating observer approach has shown to be extremely effective for disturbance
attenuation;4–6 however, the performance of the observer can significantly vary for different types of exoge-
nous disturbances, which is due to observer gain sensitivity. This paper presents a robust control approach
based on a significant extension of the disturbance accommodating control concept, which compensates for
both model parameter uncertainties and external disturbances by estimating a model-error vector (through-
out this paper we will use the phrase “disturbance term” to refer to this quantity) in real time that is used as
a signal synthesis adaptive correction to the nominal control input to achieve maximum performance. This
control approach utilizes a Kalman filter in the feedback loop for simultaneously estimating the system states
and the disturbance term from measurements.7 The estimated states are then used to develop a nominal
control law while the estimated disturbance term is used to make necessary corrections to the nominal control
input to minimize the effect of system uncertainties and the external disturbance. Similar developments of
disturbance accommodating controllers using Kalman filter can be found in Refs. 8 and 9. There are several
advantages of implementing the Kalman filter in the DAC approach: 1) tuning of the estimator parame-
ters, such as the process-noise covariance matrix, can be done easily unlike the standard DAC techniques
2 of 22
3 of 22
We do not have precise knowledge about the dynamics of the disturbance term. For simplicity, the distur-
bance term dynamics is modeled as
where ADm is Hurwitz. Equation (11) is used solely in the estimator design
" to estimate
# the true disturbance
Xm (t)
term. Now construct the assumed augmented state vector, Zm (t) = , the assumed model of the
D m (t)
system Eq. (9) can be written as
" # " #" # " # " #
Ẋm Am I(n×n) Xm Bm 0(n×1)
= + u+ (12)
Ḋ m 0(n×n) ADm Dm 0(n×r) W
The zero elements in the disturbance term dynamics are assumed for the sake of simplicity, the control
formulation given here is also valid if non-zero elements are assumed. Equation (12) can be written in terms
of the appended state vector, Zm , as
Let H = [C 0m×n ], then Y = HZ + V and Ym = HZm + V. Though the disturbance term is unknown,
assuming W(t) and V(t) possess certain stochastic properties, an optimal estimator such as a Kalman filter
can be implemented in the feedback loop to estimate the unmeasured system states and the disturbance
term from the noisy measurements. The estimator dynamics can be written as
˙
Ẑ(t) = Fm Ẑ(t) + Dm u(t) + K(t)[Y(t) − Ŷ(t)], Ẑ(t0 ) = Zm (t0 ) (16)
where K(t) is the Kalman gain and Ŷ = H Ẑ. The estimator dynamics can be rewritten as
˙
Ẑ(t) = Fm Ẑ(t) + Dm u(t) + K(t)H[Z(t) − Ẑ(t)] + K(t)V(t) (17)
Notice that the estimator uses the assumed system model given in Eq. (13) for the propagation stage and
the actual measurements from Eq. (15) for the update stage, i.e., Ẑ(t) = E[Zm (t)|{Yt . . . Y0 }]. The Kalman
4 of 22
The total control law, u(t), consists of a nominal control and necessary corrections to the nominal control
to compensate for the disturbance term as shown in Eq. (7b). The nominal control, ū, is selected so that
it guarantees the desired performance of the assumed system. For the system given in Eq. (5), the nominal
controller is given as
ū(t) = −Km X̂(t) (19)
where Km ∈ Rr×n is the feedback gain. While the nominal controller guarantees the desired performance of
the assumed model, the second term, −D(t), in Eq. (7b) ensures the complete cancelation of the disturbance
term which is compensating for the external disturbance and the model uncertainties. Now the control law
can be written in terms of the estimated system states and the estimated disturbance term as
" #
h i X̂(t)
T −1 T
u(t) = (Bm Bm ) Bm − Bm Km − I(n×n) = S Ẑ(t) (20)
D̂(t)
h i
T
where S = (Bm Bm )−1 BmT
−Bm Km −I . Notice that (Bm T
Bm ) is a nonsingular matrix since Bm is assumed
to have linearly independent columns. A summary of the proposed control scheme is given Table. 1.
It is important to note that if Q = 0, then D m (t) = D m (t0 ) = 0 and the total control law becomes just
the nominal control. If the nominal control, ū(t), on the true plant would result in an unstable system,
then selecting a small Q would also result in an unstable system. On the other hand, selecting a large Q
value would compel the estimator to completely rely upon the measurement signal and therefore the noise
associated with the measurement signal is directly transmitted into the estimates. This could result in noisy
control signal which could lead to problems, such as chattering and controller saturation. Also note that as
R, the measurement noise covariance, increases, the observer gain decreases and thus the observer fails to
update the propagated disturbance term based on measurements. For a highly uncertain system, selecting
a small Q or a large R will result in an unstable closed-loop system as shown in Ref. 12. A schematic
representation of the proposed controller is given in Fig. 1. In the next section a detailed stability analysis
is given, which investigates the dependency of closed-loop system stability on Q and R.
5 of 22
D̂(t) T
−(Bm Bm )−1 Bm
T
Estimator
error covariance is then presented. Finally it is shown that the system stability depends on a lower bound
requirement on Q and R−1 .
From hereon the explicit notation for time varying quantities is omitted when there is no risk of confusion.
Let Z̃ = Z − Ẑ be the estimation error, then the error dynamics can be written as
˙
Z̃˙ = Ż − Ẑ = F Z + DS Ẑ + GV − Fm Ẑ − Dm S Ẑ − KH[Z − Ẑ] − KV
= [Fm − KH + ∆F ]Z̃ + [∆F + ∆DS]Ẑ + GV − KV
Combining the error dynamics and the estimator dynamics we could write,
Z̃˙
" # " #" # " #" #
(Fm − KH + ∆F ) (∆F + ∆DS) Z̃ G −K V
˙ = KH (Fm + Dm S) Ẑ
+
0 K V
Ẑ
6 of 22
Zt
Z(t) = Φ(t, t0 )Z(t0 ) + Φ(t, τ )Γ(τ )G(τ )dτ (24)
t0
h Zt
P(t) = E Φ(t, t0 )Z(t0 )Z (t0 )Φ (t, t0 ) + Φ(t, t0 )Z(t0 )G T (τ )ΓT (τ )ΦT (t, τ )dτ +
T T
t0
Zt Zt Zt i
T T
Φ(t, τ )Γ(τ )G(τ )Z (t0 )Φ (t, t0 )dτ + Φ(t, τ1 )Γ(τ1 )G(τ1 )G T (τ2 )ΓT (τ2 )ΦT (t, τ2 )dτ1 dτ2
t0 t0 t0
Assuming G(t) and Z(t0 ) are uncorrelated we have E[G(t)Z T (t0 )] = E[Z(t0 )G T (t)] = 0. The initial P is
P(t0 ) = E[Z(t0 )Z T (t0 )]. Since V(t) and V(t) are uncorrelated, the expectation of G(τ1 )G T (τ2 ) is
" #
T Q 0
E[G(τ1 )G (τ2 )] = δ(τ1 − τ2 )
0 R
" #
Q 0
Let Λ = , now P(t) can be rewritten as
0 R
Zt
T
P(t) = Φ(t, t0 )P(t0 )Φ (t, t0 ) + Φ(t, τ )Γ(τ )ΛΓT (τ )ΦT (t, τ )dτ (25)
t0
Utilizing the fundamental properties of the evolution operator, the above equation can be rewritten as
Ṗ(t) =Υ(t)Φ(t, t0 )P(t0 )ΦT (t, t0 ) + Φ(t, t0 )P(t0 )ΦT (t, t0 )ΥT (t) + Γ(t)ΛΓT (t)+
Zt Zt
Υ(t) Φ(t, τ )Γ(τ )ΛΓ (τ )Φ (t, τ )dτ + Φ(t, τ )Γ(τ )ΛΓT (τ )ΦT (t, τ ))dτ ΥT (t)
T T
t0 t0
Therefore
Let
T T
" # " #
E[Z̃Z̃ ] E[Z̃Ẑ ]
PZ̃ PZ̃Ẑ
P(t) = T T =
E[ẐZ̃ ] E[ẐẐ ] PZ̃Ẑ PẐ
7 of 22
From the above equation, ṖZ̃ , ṖZ̃Ẑ , and ṖẐ can be written as
Since the model errors are unknown, the above equation cannot be utilized in the filter implementation.
B. Closed-Loop Stability and Transient Response Bound for Systems with No Uncertainties
A detailed analysis of the closed-loop system’s asymptotic stability in the mean when there are no uncer-
tainties is now given. As shown here, a transient bound on the system response mean can be obtained in
terms of the time varying correlation matrix. Most of the definitions and formulations given in this section
are similar to the ones given in Ref. 13 for deterministic systems.
Consider a case where there is no model error, i.e., F = Fm , D = Dm , and V(t) = W(t). If there is no
¯ = µ = 0. Now we could write
model error, then the estimator is unbiased, i.e., E[Z̃] ¯
Z̃
"
¯˙ ¯
#" # " #" #
Z̃
= (Fm − KH) 0 Z̃ G −K W
˙¯ KH (F + D S) ¯ + 0 K V
Ẑ m m Ẑ
Before discussing the stability analysis, a few definitions regarding the stability of stochastic processes are
presented.
˙
Definition 1. Given M ≥ 1 and β ∈ R, the system Z̄(t) = Ῡ(t)Z̄(t) + Γ(t)Ḡ(t) is said to be (M, β)-stable
in the mean if
where Φ̄(t, t0 ) is the evolution operator generated by Ῡ(t) and µZ̄ (t) = E[Z̄(t)].
Since most applications involve the case where β ≤ 0, (M, β)-stability guarantees both a specific decay
rate of the mean (given by β) and a specific bound on the transient behavior of the mean (given by M ).
˙
Definition 2. If a stochastic system, Z̄(t) = Ῡ(t)Z̄(t) + Γ(t)Ḡ(t), is (M, β)-stable in the mean, then the
transient bound of the system mean response for the exponential rate β is defined to be
n o
Mβ = inf M ∈ R; ∀t ≥ t0 :k Φ̄(t, t0 ) k≤ M eβ(t−t0 ) (33)
8 of 22
Therefore it is of interest to know the smallest β ∈ R such that k Φ̄(t, t0 ) k≤ eβ(t−t0 ) , t ≥ t0 . Given the
˙
system, Z̄(t) = Ῡ(t)Z̄(t) + Γ(t)Ḡ(t), which is (M, β)-stable in the mean, the transient bound Mβ of the
system mean can be readily obtained based on the premises of the following theorem.
Theorem 1. Suppose the system Z̄(t)˙ = Ῡ(t)Z̄(t) + Γ(t)Ḡ(t) is (M, β)-stable in the mean, then there exists
T
a continuously differentiable positive definite matrix function P̄(t) (P̄ = E[Z̄ Z̄ ]) satisfying the matrix
Lyapunov differential equation
˙
P̄(t) = Ῡ(t)P̄(t) + P̄(t)ῩT (t) + Γ(t)Λ̄ΓT (t) (34)
such that
Notice P̄(t) ≥ Φ̄(t, t0 )P̄(t0 )Φ̄T (t, t0 ) ≥ σmin (P̄(t0 ))Φ̄(t, t0 )Φ̄T (t, t0 ), i.e.,
Z̃˙
" # " #" # " #" #
(Fm − KH + ∆F ) (∆F + ∆DS) Z̃ G −K V
˙ = KH (Fm + Dm S) Ẑ
+
0 K V
Ẑ
where
" #
(∆F ) (∆F + ∆DS)
∆Υ(t) =
0 0
˙
In the previous section we analyzed the stability of the unperturbed system Z̄(t) = Ῡ(t)Z̄(t) + Γ(t)Ḡ(t).
Here we will analyze the stability of the perturbed system, Ż(t) = Ῡ(t)Z(t) + ∆Υ(t)Z(t) + Γ(t)G(t).
The correlation matrix P(t) = E[Z(t)Z T (t)] satisfies the following matrix Lyapunov differential equation
9 of 22
Theorem 2. The uncertain system, Ż(t) = Ῡ(t)Z(t) + ∆Υ(t)Z(t) + Γ(t)G(t), is (M, β)-stable in the mean
if
Proof. In order to show the asymptotic stability of the mean we consider the following equation:
The matrix P̄(t) is required to be a positive definite matrix, therefore P̄ −1 (t) exists and V [µZ (t)] > 0 for
all µZ (t) 6= 0. Since P̄(t)P̄ −1 (t) = I, the time derivative of P̄(t)P̄ −1 (t) is 0:
dh i
˙ P̄ −1 (t) + P̄(t)P̄˙ −1 (t) = 0
P̄(t)P̄ −1 (t) = P̄(t)
dt
Solving the above equation for P̄˙ −1 (t) and substituting Eq. (39) gives
Note
−∆ΥP̄ − P̄∆ΥT +LQLT + N R−1 N T = LQLT − P̄∆ΥT (N R−1 N T )−1 ∆ΥP̄+
h i h iT
P̄∆ΥT (N R−1 N T )−1 − I (N R−1 N T ) P̄∆ΥT (N R−1 N T )−1 − I
10 of 22
LQLT ≥ σmin (Q)LLT ≥ P̄∆ΥT (σmin (R−1 )N N T )−1 ∆ΥP̄ ≥ P̄∆ΥT (N R−1 N T )−1 ∆ΥP̄
Hence we have
Therefore (M, β)-stability in the mean is guaranteed if the inequality Eq. (38) is satisfied. Let Q∗ and
R is chosen so that the above inequality is satisfied. Now substituting Q∗ and R∗ into Eq. (37) we have
∗
P˙∗ (t) = (Ῡ(t) + ∆Υ(t))P ∗ (t) + P ∗ (t)(Ῡ(t) + ∆Υ(t))T + LQ∗ LT + N (t)R∗ −1 N T (t) (43)
Corollary 1. If the system given in Eq. (36) is (M, β)-stable in the mean, then there exists a continuously
differentiable positive definite symmetric matrix function P ∗ (t) given by Eq. (44) such that
where Mβ represents the transient bound of the perturbed system’s mean response.
Proof. If P ∗ (t) satisfies Eq. (44), then
P ∗ (t) ≥ [Φ̄(t, t0 )+Φ∆ (t, t0 )]P ∗ (t0 )[Φ̄(t, t0 )+Φ∆ (t, t0 )]T ≥ σmin (P ∗ (t0 ))[Φ̄(t, t0 )+Φ∆ (t, t0 )][Φ̄(t, t0 )+Φ∆ (t, t0 )]T
i.e.,
σmax (P ∗ (t)) ≥k [Φ̄(t, t0 ) + Φ∆ (t, t0 )]P ∗ (t0 )[Φ̄(t, t0 ) + Φ∆ (t, t0 )]T k≥ σmin (P ∗ (t0 )) k [Φ̄(t, t0 ) + Φ∆ (t, t0 )] k2
Now we have
Therefore the transient bound, Mβ2 , of the perturbed system can be obtained from
11 of 22
The exponentially stable in the mean implies the system matrix, Υ(t) = Ῡ(t) + ∆Υ(t), generates a stable
evolution operator, therefore P(t) has a bounded solution.16
every initial valuse Z 0 , the solution of Eq. (47) has the property that
Z(t) → 0 almost surely as t → ∞ (51)
The operator L{·} acting on V (z, t) is given by
1
LV (z, t) = lim E dV (Z(t), t)|Z(t) = z (52)
dt→0 dt
where dV (Z(t), t) can be calculated using the Itô Formula.
12 of 22
κ1 (k z k) ≤ V (z, t) ≤ κ2 (k z k) (53a)
LV (z, t) ≤ η(t) and (53b)
LV (z, t) ≤ η(t)+ k VzT (z, t)Γ(t) k2 −κ3 (k z k) (53c)
∂V
where Vz = ∂z . Then the conclusion of Theorem 3 still holds.
More detailed derivation and proof of this theorem can be found in Ref. 20. Notice that if the inequality
Eq. (38) is satisfied, then there exists a P(t) which satisfies the following equation:
Now we have
2 1 1 n o
dV (Z(t), P −1 (t)) ≈ Z T P −1 dZ + Z T dP −1 Z + Tr P −1 dZdZ T
Msup Msup Msup
2 2 1 n o
≈ Z T P −1 ΥZdt + Z T P −1 ΓΛ1/2 dB − Z T P −1 Υ + ΥT P −1 + P −1 ΓΛΓT P −1 Zdt+
Msup Msup Msup
1 n o
Tr P −1 ΥZZ ΥT dt2 + ΓΛ1/2 dBZ T ΥT dt + ΥZdB T Λ1/2 ΓT dt + ΓΛ1/2 dBdB T Λ1/2 ΓT
T
Msup
Now taking the conditional expectation, we obtain
2 1 n o
E dV (Z(t), P −1 (t))|Z(t) = z = z T P −1 Υzdt + Tr P −1 Υzz T ΥT dt2 + ΓΛΓT dt −
Msup Msup
1 n o
z T P −1 Υ + ΥT P −1 + P −1 ΓΛΓT P −1 zdt
Msup
Now we can calculate
2 1 n o 1 n o
LV (z, P −1 ) = z T P −1 Υz − z T P −1 Υ + ΥT P −1 + P −1 ΓΛΓT P −1 z + Tr P −1 ΓΛΓT
Msup Msup Msup
1 1 n o
=− z T P −1 ΓΛΓT P −1 z + Tr P −1 ΓΛΓT
Msup Msup
13 of 22
σmin {P −1 ΓΛΓT P −1 }
LV (z, P −1 ) ≤ 1 − k z k2
Msup
Notice that
1
lim k z(t) k2 ≤ =⇒ 1 − k k z k2 η(t)
t→∞ k(t)
Thus we do not have almost sure asymptotic stability for the stochastic system given in Eq. (47). In fact,
given a Υ(t) that generates an asymptotically stable evolution for the linear system in Eq. (47), the necessary
and sufficent condition for the almost sure asymptotic stability is
Detailed proof of this argument can be found in Ref. 21. Equation (54) constitutes the sufficent condition
for the almost sure asymptotic stability of a linear stochastic system gievn (M, β)-stability in the mean.
IV. Results
A detailed investigation of the above Lyapunov stability analysis through numerical simulations is given
in this section. For simulation purposes, we consider a two degree of freedom helicopter that pivots about
the pitch axis by angle θ and about the yaw axis by angle ψ. As shown in Fig. 2, the pitch is defined positive
when the nose of the helicopter goes up and the yaw is defined positive for a counterclockwise rotation. Also
in Fig. 2, there is a thrust force Fp acting on the pitch axis that is normal to the plane of the front propeller
and a thrust force Fy acting on the yaw axis that is normal to the rear propeller. Therefore a pitch torque
is being applied at a distance rp from the pitch axis and a yaw torque is applied at a distance ry from the
yaw axis. The gravitational force, Fg , generates a torque at the helicopter center of mass that pulls down
on the helicopter nose. As shown in Fig. 2, the center of mass is a distance of lcm from the pitch axis along
the helicopter body length.
14 of 22
The control input to the system are the input voltages of the pitch and yaw motors, Vm,p and Vm,y ,
respectively. Let u = [u1 u2 ]T = [Vm,p Vm,y ]T . Now the linearized equations can be rewritten as
θ̈ = a1 θ̇ + b1 u1 + b2 u2 − mheli glcm (57a)
ψ̈ = a2 ψ̇ + b3 u1 + b4 u2 (57b)
where
−Bp −By
a1 = 2 )
a2 = 2 )
(Jeq,p + mheli lcm (Jeq,y + mheli lcm
Kpp Kpy
b1 = 2 )
b2 = 2 )
(Jeq,p + mheli lcm (Jeq,p + mheli lcm
Kyp Kyy
b3 = 2 )
b4 = 2 )
(Jeq,y + mheli lcm (Jeq,y + mheli lcm
15 of 22
Y = Cx + V (60)
Ym = Cxm + V (61)
" #
1 0 0 0
where C = . Notice that the disturbance term, d = [0 0 dθ̇ dψ̇ ]T , can be written as
0 1 0 0
dθ̇ = (a1 − a1m )θ̇ + (b1 − b1m )u1 + (b2 − b2m )u2 − mheli glcm = △a1 θ̇ + △b1 u1 + △b2 u2 − mheli glcm (62a)
dψ̇ = (a2 − a2m )ψ̇ + (b3 − b3m )u1 + (b4 − b4m )u2 = △a2 ψ̇ + △b3 u1 + △b4 u2 (62b)
The first two zero elements in the disturbance term indicate the perfect knowledge of the system kinematics.
The disturbance term in vector notation can be written as
or in vector notation:
ẋ = Am x + Bm u + d (65)
Since the model uncertainty is only associated with the dynamics, only the nonzero elements of the dis-
turbance term need to be appended to the system states. Let the extended assumed state vector, Zm =
[xTm dθ̇m dψ̇m ]T . Now the assumed extended state-space equation can be written as
Żm = Fm Zm + Dm u + GW (67)
0 0 1 0 0 0
0 0 0 1 0 0
" # " # " #
0 0 a1m 0 1 0 B m 0 W (t)
where Fm = , Dm = , G = 4×2 , and W = 1
. The assumed
0
0 0 a2m 0 1 02×2 I2×2 W2 (t)
0 0 0 0 −1 0
0 0 0 0 0 −3
output equation can be written in terms of the appended state vector, zm , as
Ym = HZm + V (68)
16 of 22
x̄˙ = Am x̄ + Bm ū (70)
where the nominal controller is a linear quadratic regulator which minimizes the cost function
Z∞
1
(x(t) − xd )T Qx (x(t) − xd ) + uT (t)Ru u(t) dt
J= (71)
2
0
where xTd = [θd ψd 0 0], θd and ψd are some desired final values of θ and ψ, respectively, and Qx and Ru
are two symmetric positive definite matrices. The nominal control that minimizes the above cost function is
where Km is the feedback gain that minimizes the cost Eq. (71). Now the total control law can be written
in terms of the estimated states and the estimated disturbance term as
h i x̂ − xd
u = (Bm T
Bm )−1 Bm
T
− Bm Km − I2×2 dˆθ̇ = S Ẑ + Km xd (73)
ˆ
dψ̇
Since Eq. (65) does not contain any noise-like external disturbances, after substituting the above control law
into Eq. (63), the true disturbance-term dynamics can be written as
Equation (74) indicates that selecting a large Q or small R would amplify the measurement noise effect on
the disturbance term dynamics. This is clearly shown in the simulation results given next.
LQR Weighting
" Matrices
# " Covariance
# " Matrices #
10 0 q1 0 1 × 10−5 0
Ru = Q= R=
0 10 0 q2 0 1 × 10−5
1 × 10−3 0 0 0 0 0
2000 0 0 0 0 1 × 10−3 0 0 0 0
0 2000 0 0 0 0 0.1 0 0 0
Qx = P (t0 ) =
0 0 100 0
0 0 0 0.1 0 0
0 0 0 100 0 0 0 0 1 0
0 0 0 0 0 1
Table 3 shows the nominal controller and estimator matrices. Since the measurement noise covariance,
R, can be obtained from sensor calibration, the process noise matrix, Q, is treated as a tuning parameter.
Based on the weighting matrices given in Table 3, the feedback gain is calculated to be
" #
14.0529 1.5865 2.1762 0.3790
Km =
−1.5865 14.0529 −0.1712 3.6387
For simulation purposes the initial states are selected to be [θ0 ψ0 θ̇0 ψ̇0 ]T = [−45o 0 0 0]T and the
desired states θd and ψd are selected to be 45o and 30o , respectively.
17 of 22
−5
ψ(deg)
θ(deg)
1
−10
0
−15
−1
0 2 4 6 8 10 0 2 4 6 8 10
Time(sec) Time(sec)
(a) (b)
7 9
x 10 x 10
3 0.5
Desired Desired
2 Truth 0 Truth
Estimate Estimate
ψ̇(deg/sec)
1 −0.5
θ̇(deg/sec)
0 −1
−1 −1.5
−2 −2
−3 −2.5
0 2 4 6 8 10 0 2 4 6 8 10
Time(sec) Time(sec)
(c) (d)
7 6
x 10 x 10
2 12
u1 dst1
u2 10 dst2
dθ̇ and dψ̇ (deg/sec)
0 8
u1 and u2
6
−2 4
2
−4 0
−2
−6 −4
0 2 4 6 8 10 0 2 4 6 8 10
Time(sec) Time(sec)
(a) (b)
0
Stability Indicator
−50
−100
−150
0 2 4 6 8 10
Time(sec)
(c)
Figure 4. Control Input, Estimated Disturbance Term, and Stability Indicator: q1 = q2 = 0.10
18 of 22
50 35
30
Desired
Truth 25 Desired
Estimate Truth
ψ(deg)
θ(deg)
20 Estimate
0
15
10
−50 0
0 2 4 6 8 10 0 2 4 6 8 10
Time(sec) Time(sec)
(a) (b)
70
Desired Desired
150 Truth 60 Truth
Estimate Estimate
50
ψ̇(deg/sec)
θ̇(deg/sec)
100 40
30
50 20
10
0 0
−10
0 2 4 6 8 10 0 2 4 6 8 10
Time(sec) Time(sec)
(c) (d)
19 of 22
u1 and u2
10
5
0
0
−5
−10
−15 −5
0 2 4 6 8 10 0 2 4 6 8 10
Time(sec) Time(sec)
(a) (b)
7
x 10
15
Stability Indicator
10
0
0 2 4 6 8 10
Time(sec)
(c)
Figure 6. Control Input, Estimated Disturbance Term, and Stability Indicator: q1 = q2 = 1 × 104
50 40
Desired
Truth 30
Estimate
ψ(deg)
Desired
θ(deg)
0 20 Truth
Estimate
10
−50 0
0 2 4 6 8 10 0 2 4 6 8 10
Time(sec) Time(sec)
(a) (b)
120
Desired Desired
150 Truth 100 Truth
Estimate Estimate
80
ψ̇(deg/sec)
θ̇(deg/sec)
100 60
40
50 20
0
0 −20
−40
0 2 4 6 8 10 0 2 4 6 8 10
Time(sec) Time(sec)
(c) (d)
20 of 22
u1 and u2
4
0 2
0
−10
−2
−20 −4
0 2 4 6 8 10 0 2 4 6 8 10
Time(sec) Time(sec)
(a) (b)
V. Conclusions
This paper presents the formulation of a stochastic disturbance accommodating control with observer ap-
proach for linear time-invariant multi-input-multi-output systems which automatically detects and minimizes
the adverse effects of both model uncertainties and external disturbances on a controlled system. Assuming
all system uncertainties and external disturbances can be lumped in a disturbance term, this control ap-
proach utilizes a Kalman filter in the feedback loop for simultaneously estimating the system states and the
disturbance term from measurements. The estimated states are then used to develop a nominal control law
while the estimated disturbance-term is used to make necessary corrections to the nominal control input to
minimize the effect of system uncertainties and the external disturbance.
The stochastic stability analysis conducted on the controlled system reveals a lower-bound requirement
on the estimator matrices, Q and R−1 , to ensure stability in the mean or the mean-square stability of the
closed-loop system. If the nominal control on the true plant would result in an unstable system, then selecting
a small Q would also result in an unstable system. On the other hand, selecting a large Q value would compel
the estimator to completely rely upon the measurement signal and therefore the noise associated with the
measurement signal is directly transmitted to the estimates. This could result in noisy control signal which
could lead to problems, such as chattering and controller saturation. Also note that as R, the measurement
noise covariance, increases, the observer gain decreases and thus the observer fails to update the propagated
disturbance term based on the measurements. Thus for a highly uncertain systems, selecting a small Q or
a large R will result in an unstable closed-loop system. The stochastic Lyapunov style analysis indicates
that the controlled stochastic system is almost surely asymptotically stable if the noise distribution matrix,
Γ(t), satisfies a specific decay rate. Since the measurement noise covariance can be obtained from sensor
calibration, the process noise matrix Q is treated as a tuning parameter. The simulation results reveal that
if the selected Q is too low, then the system is unstable and if the selected Q is too large, then the resulted
control input is highly noisy. Simulation results also indicate that there is an optimal parameter that would
guarantee stability with minimal control input noise. Future research plans include developing an adaptive
law for Q that would guarantee asymptotic stability in the mean based on the stochastic Lyapunov analysis,
and also extending the current approach to nonlinear systems where the disturbance term also accommodate
for system nonlinearities.
Acknowledgments
This material is based upon work supported in part by U.S. Air Force Research Laboratory, Space Vehicles
Directorate. The authors would like to thank Khanh D. Pham for his contributions.
References
1 Johnson, C., “Accommodation of External Disturbances in Linear Regulator and Servomechanism Problems,” IEEE
Transactions on Automatic Control, Vol. AC-16, No. 6, December 1971, pp. 635–644.
2 Johnson, C. and Kelly, W., “Theory of Disturbance-Utilizing Control: Some Recent Developments,” Proceedings of IEEE
21 of 22
NAECON, Proceedings of the IEEE National Aerospace and Electronics Conference (NAECON , Dayton, OH, 2000, pp. 208–
211.
5 Joseph A. Profeta III, W. G. and Mickle, M. H., “Disturbance Estimation and Compesation in Linear Systems,” IEEE
Transactions on Aerospace and Electronic Systems, Vol. 26, No. 2, March 1990, pp. 225–231.
6 Kim, J.-H. and Oh, J.-H., “Disturbance Estimation using Sliding Mode for Discrete Kalman Filter,” Proceedings of the
37th IEEE Conference on Dicision and Control , Tampa, FL, 1998, pp. 1918–1919.
7 Sorrells, J. E., “Design of Disturbance Accommodating Controllers for Stochastic Environments,” Proceedings of the 1982
Proceedings of the 35th Southeastern Symposium on, March 2003, pp. 222–226.
10 Sorrells, J., “Adaptive Control Law Design for Model Uncertainty Compensation,” Tech. rep., Dynetics, Inc, Huntsville,
AL, 1989.
11 Crassidis, J. L. and Junkins, J. L., Optimal Estimation of Dynamic System, Chapman & Hall/CRC, Boca Raton, FL,
2004.
12 George, J. and Crassidis, J. L., “Sensitivity Analysis of Disturbance Accommodating Control with Kalman Filter Esti-
mation,” Proceedings of the AIAA Guidance, Navigation and Control Conference and Exhibit, Hilton Head, SC, 2007.
13 Hinrichsen, D. and Pritchard, A. J., Mathematical Systems Theory I; Modeling, State Space Analysis, Stability, and
Systems & Control: Foundations & Applications, Birkhäuser, Basel, Switzerland, 1st ed., 2003.
17 Soong, T. T. and Grigoriu, M., Random Vibration of Mechanical and Structural Systems, Prentice Hall, Englewood
Journal of Mathematical Analysis and Applications, Vol. 260, No. 2, August 2001, pp. 325–340.
21 Appleby, J. A. D., “Almost Sure Stability of Linear Itô-Volterra Equations with Damped Stochastic Perturbations,”
22 of 22