1.2894130
1.2894130
1.2894130
Kamal Youcef-Toumi
Associate Professor.
Systems With Unknown Dynamics
This paper focuses on the control of systems with unknown dynamics and deals with
Osamu Ito the class of systems described by x=f(x,t) +h(x,t) + B(x,t)u + d(t) where h(x,t)
Graduate Student. and d(t) are unknown dynamics and unexpected disturbances, respectively. A new
control method, Time Delay Control (TDC), is proposed for such systems. Under
Department of Mechanical Engineering,
the assumption of accessibility to all the state variables and estimates of their
Laboratory for Manufacturing and delayed derivatives, the TDC is characterized by a simple estimation technique that
Productivity, evaluates a function representing the effect of uncertainties. This is accomplished
Massachusetts Institute of Technology, using time delay. The control system's structure, stability and design issues are
Cambridge, MA 02139 discussed for linear time-invariant and single-input-single-output systems. Finally,
the control performance was evaluated through both simulations and experiments.
The theoretical and experimental results indicate that this control method shows ex-
cellent robustness properties to unknown dynamics and disturbances.
1 Introduction
Most of the well-developed control theory, either in the fre- ance is obtained. Therefore, this approach is restricted to
quency domain or in the time domain, deals with systems repetitive tasks only.
whose mathematical representations are completely known. Other control systems have also been proposed for
However, in many practical situations, the parameters of the dynamical systems with norm bounded uncertainty in system
system are either poorly known or operate in environments parameters and input disturbances. Selected references in this
where unpredictable large system parameter variations and area are [7] and [8]. Reference [12] extends the fundamentals
unexpected disturbances are possible. Underwater vehicles, of robust nonlinear control strategies with applications to
robot manipulators, and autonomous systems are a few ex- robot manipulators.
amples. In such situations, the usual fixed-gain controller will This paper proposes another control method, Time Delay
be inadequate to achieve satisfactory performance in the entire Control (TDC) [17, 18], which depends on neither estimation
range over which the characteristics of the system may vary of specific parameters, discontinuous control, nor repetitive
[10]. actions. Rather, it depends on the direct estimation of a func-
Several advanced control techniques have been developed tion representing the effect of uncertainties. This is ac-
for such systems. One of the primary methods is Adaptive complished using time delay. The gathered information is used
Control [3, 6, 11, 14], In Adaptive Control, the structure of to cancel the unknown dynamics and the unexpected distur-
the controller is selected a priori, usually PD or PID type con- bances simultaneously. Then, the controller inserts the desired
troller. The controller gains are then updated using a recur- dynamics into the plant. In other words, the TDC uses past
sively estimated parameters of the plant so that the plant out- observation of the system's response and the control input to
put closely follows the desired response. As stated in [5, 9], directly modify the control actions rather than adjusting the
this method considers slowly varying parameters, linear controller gains or identifying system parameters thereby
dynamics equations and/or bounded uncertainty. leading to a model independent controller. This algorithm can
Sliding Mode Control [13, 16, 22] is another powerful deal with large unpredictable system parameter variations and
method which can deal with nonlinear systems. Based on disturbances. Yet, the system's performance is very
Lyapunov's method, the control scheme is characterized by satisfactory.
discontinuous function with high frequency chattering. The In Section 2, the control problem is defined and the Time
plant parameter variations and disturbances are assumed Delay Control algorithm is presented. Section 3 discusses the
bounded. system's structure, stability analysis and design issues for
Learning Control [1, 15] is an approach which is based on linear time-invariant and single-input-single-output systems.
trial and error. Each time the system performs the same task, Section 4 is an evaluation part of the paper. Effectiveness of
data is collected and used to update the control action. By the TDC system is evaluated through both simulations and ex-
repeating this process several times, betterment in perform- periments. Finally, in Section 5, the results are summarized
and the directions of the future research are suggested.
Contributed by the Dynamic Systems and Control Division of THE AMERICAN
SOCIETY OF MECHANICAL ENGINEERS and presented at the American Control
Conference, 1987, Minneapolis, Minn. Manuscript received by the Dynamic 2 Derivation of the Control Law
Systems and Control Division July 1987; revised manuscript received July 1988.
Associate Editor: G. E. Young 2.1 Error Dynamics and Structural Constraint. The
Journal of Dynamic Systems, Measurement, and Control MARCH 1990, Vol. 112/133
9
0
x= ; f(x,/) = h(x,0 =
Xr fr(x,0 h r (x,0
OIL 0 0
B,„ = K=
(2.17)
3 Analysis and Design for LTI-SISO Systems Substituting equations (3.2) through (3.4) into equation
(2.18), the TDC control law can be obtained as follows.
If the plant is a linear time-invariant (LTI) and single-input-
single-output (SISO) system, discussion of a Time Delay Con- u(t) = u(t-L) + (l/b){-xn(t-L)-nmnx„(t)-
trol system can be considerably simpler and more intuitive us-
ing classic frequency domain technique. In order to describe - a m l x , ( 0 + b,„r(0 + k „ ( x m „ - x „ ) + . . . + k , ( x , „ , - x , ) )
the main feature of this controller, this section will focus on
(3.5)
the analysis and design for such systems.
From the above equations (3.1), (3.3) and (3.5), the following
3.1 Control System Structure. Consider the following Laplace transformations are obtained for the plant, the model
LTI-SISO plant with unknown system matrix A.
and the control action as G(s), G,„{s), and U{s), respectively.
x = Ax + bw G{s)=Y{s)/U(s) = b/P(s) (3.6)
i
y=xi (3.1) v/hereP(s) = s"+a„s"- + . . .+a,
Journal of Dynamic Systems, Measurement, and Control MARCH 1990, Vol. 112/135
f
fr
b Pis)
P(s)
m ^\ Ls
model
e: P(s)
m : y^
P(s)
?: L 3 Pis) Fig. 4 Root locus of TDC for a LTI-SISO plant
model
Fig. 3 Reduced block diagram #2 of TDC for a LTI-SISO plant r b 1
y y
m m
P(s) 6 s* 1
Gm(s)=Y,„(s)/R(s) = bm/Pm(s) (3.7) m
where Pm(s) = s" +a,ms"~'+ . . .+aml Fig. 5 Reduced block diagram #3 of TDC for a LTI-SISO plant
Ls
U(s)= {Pk(s)Ym(s) + b„Ms)-Pmk(s)Y(.s)}/{b(l-e~ )} order system with a large bandwidth. The resultant block
(3.8) diagram is shown in Fig. 5. Since the second block in Fig. 5
can be regarded as almost unity, the whole system behaves as
where Pk(s) = k„s"~1 +. . . +k1s + kl the reference model. Performance properties of such control
systems for SISO systems which include stability, disturbance
Pmk(s) = e-Lss" + (a,„„+ kn)s"-> + . . . +(amX+k{)
rejection properties, and uncertainties in the control distribu-
Now we can describe the main feature of the TDC system in tion matrix could be found in reference [21].
frequency domain by analyzing the LTI-SISO systems defined From the above analysis, a Time Delay Control system can
above. be interpreted as one where the command r is prefiltered by
Suppose that the desired error dynamics is governed by the reference model into ym and then the error ym — y is forced
to zero by the high gain integrator l/Ls with pole/zero
e = A„,e (3.9) cancellation.
which simplifies the algebra by making all the error feedback 3.2 Stability Analysis. It is also possible to discuss the
gains to be zero. stability issues of this controller using the above formulation.
*,=. =0 (3.10) By combining equations (3.6) through (3.8), one obtains
In addition, for a small time delay L, the following approx- Y(s) _ b„ Pm(.s) + Pk(s)
L
imation holds. R(s) Pm(.s)\l-e- s)P(.s) + Pmk(.s)
x„(/-L)sx„(0 (3.11) Therefore, the characteristic equation of the closed loop
system is
Now combining equations (3.8), (3.10) and (3.11) leads to
Pm(s) {(1 - e-L°)P(,s) + Pmk(s) 1 = 0 (3.17)
U(s)= ibmR(s)-Pm(s)Y(s)}/{b(l-e-L°)} (3.12)
Usually, the model's characteristic equation Pm(s) is chosen to
or using equation (3.7), be stable, thus one is to examine:
U(s)= {Pm(s)Yn!(s)-Pm(s)Y(s))/{b(l-e-^)} (3.13) (l-e~L°)P(s) + Pmk(s) = 0 (3.18)
This equation can be reduced to or equivalently,
Ls
U(s) = e- U(s)+ [Pm(?)/b) I Ym(s)- Y(s)} (3.14) l + {-e-Ls + (Pmk(s)/P(s))}=0 (3.19)
A block diagram for the whole system can then be obtained as The stability condition of equation (3.18) or (3.19) can be
shown in Fig. 1. With the following approximation of solved by using existing techniques such as the Nyquist stabili-
ty criterion, or Routh-Hurwitz criterion using the Pade ap-
A-Ls (3.15)
proximation, and it gives the upper limit of the time delay L to
when L is sufficiently small, the block diagram of Fig. 1 can maintain the stability. Strictly speaking, the above stability
first be reduced to the one of Fig. 2 and then to the one of Fig. test. should be done for the entire possible range of the
3. unknown polynominal P(s). However, a good result can
A typical root locus of the closed loop part of the system usually be obtained using the "worst" P(s), which can be
(inside the dotted line of Fig. 3) is shown in Fig. 4. If the delay defined as the polynominal that corresponds to the "fastest"
time L is small enough, in other words, if the effective con- plant. Stability can also be interpreted intuitively using equa-
troller gain \/L is sufficiently large, one closed loop pole goes tion (3.18). If the time delay is infinitely small, L —0, then the
to negative infinity whereas all the other poles approach the overall stability will depend only on Pmk(s) and P(s).
fixed closed loop zeros. Therefore, pole/zero cancellation oc- As seen in equation (3.16), the stability condition depends
curs, which makes the whole closed loop part look like a first on the time delay, the error feedback gains, and the
Journal of Dynamic Systems, Measurement, and Control MARCH 1990, Vol. 112/137
Motor 2 .
Motor 1 *
limelKC)
Fig. 7(a) Response of the plant and the model
°l'°ml i. 8, IPDI
(rad)
9
* : (TDC), e m l
D
2'0ra2 l
B
(rad) 2 , T D U ' um2 v ^ - ^ ~ ~
timelKCl ^ ^ ** 82(PD)
Fig. 7(b) Response of the error
2'. 4'. 6'. •3
time (sec)
Fig. 9(a) Response of the manipulator and the model
e, ITDC)
(rad)
* e, IPD)
-«.»4
9'
e
2
(rad) e 2 (PD)
timelsei
Fig. 7(c) Response of the control input
time (set
In order to determine the effectiveness of the controller, the Fig. 9(b) Response of the error
Coriolis and centrifugal force vector h is treated to be com-
pletely unknown. The reference model was selected as a set of T
' ,
second order systems described by
^8,(TDC).8 m l
^^ ^^Sw^ ^^^***^M(v
time (sec)
8
2- e .n2 i
(rad/secN
(rad)
• i * ^ ^ " ^ ^ W .^jy****^^ *• e2 (PD)
, e , (TDC)
From, equations (4.5) through (4.8) and (2.18), the Time (rad)
Delay Control law becomes: /
/« e( IPD)
-9.14
T
i = H,*{H II
a. 12
e
2
T2 t-L t-L (rad)
« .as / \ *• e 2 <PD)
' e2(TDC) \
S,
+ A„ + B„
'i i
The following PD controller was derived and used to compare
the results. Note that this PD controller does not compensate
for the Coriolis centrifugal force. *»MV4,+i"A'
= H/*{A„ + B„ (4.10) T
2 «,
7"2 'A
Journal of Dynamic Systems, Measurement, and Control MARCH 1990, Vol. 112/139
r - n/8 rad
e 8 r • ,,/8 rad
t
ro com pensated
spring torque
Top lrace: e <0.22 rad/div. ) Top trace: 8 ( 022 rad/div. )
Bouom lrace;t m< 0.068 Nm/div. )
Bottom lrace' ta/, 0.068 Nm/div )
HoriZ.lrace: lime ( O.S sec.ldiv. )
HorLt. trace: time ( O.S sec./div. )
Flg.13 Experiment #1 (PO controller)
Fig. 15 Experiment #3 (TOC with a soft spring)
2' sellling ti me
- 0.8 sec 2' sell ling lime
- 0.8 sec
compensated
spring torque
Fig.14 Experiment #2 (Time Delay Controller)
Flg.16 Experiment #4 (TOC with a hard spring)
d
dt
o
- (k s / J)*f) - (b/J)*O - dssgn(O)
J In the first experiment, the following PD controller was
used in order to see how large the unidentified viscous damp-
+ [0 J
1/J
7
m
ing coefficient bs and dry friction d s are,
7 m(t)=J{ -w,,2f}(t)-2sw ll 0(t) + w 1l 2r(t)} (4.16)
(4.11)
If the external spring is not attached and both b s and d s are
where f} is motor rotation angle, 7 m is motor torque, and k s is small enough, the above controller will make the plant behave
an unknown spring constant, bs is an unknown viscous fric- the same as the model. However, the actual response showed a
tion coefficient, d s is unknown dry friction and J( = 4.0* 10 large steady state error as seen in Fig. 13, which implies that
-4kg_m 2 = 0.53 oz-in-sec 2 ) is the total inertia. The reference the unknown viscous damping coefficient and/or dry friction
model was chosen to be a second order system defined by cannot be ignored and some compensation technique is
necessary. In order to compensate for these undesired and
d
unknown dynamics, the Time Delay Control law of equation
dt [ :: J [ (4.15) was used. This time, as shown in Fig. 14, the plant
showed no steady state error, 0 percent overshoot and 0.8 s for
where the natural frequency W n , damping ratio Sand reference the 2 percent settling time, which means that the plant closely
command rare 5 rad/s, 1 and 7l'/8 rad = 22.5 deg, respectively. followed the model. An external spring was then attached to
Let the error dynamics to be e=Ame thus making error feed- simulate additional unknown dynamics. Figure 15 (with a soft
back gains to be zero. Then, combining equations (4.11), spring of k s = 0.17 Nm/rad) and Fig. 16 (with a hard spring of
(4.12) and (2.18), one obtains the TDC control law: k s = 0.23 Nm/rad) show that the response remained almost the
same despite the unexpected external spring torque. Note that
7 m(t) =7 m(t- L) + J{ - O(t- L) - w2nf}(t) - 2sw lI O(t) + w2lI r(t) J
the controller does not depend on the unknown system
(4.13) parameters of k s ' d s and b s ' Thus, even if they change over
time, the same performance will be guaranteed. Additional ex-
Since an angular accleration signal was not available in the ex- periments with a reference model having a damping ratio of
perimental hardware, the following approximation was used, 0.5 can be found in [20]. All of these responses verify the good
O(t - L) =. (O(t) - O(t - L) J/ L (4.14) mocj.el following property of this controller. Since the
distinguishing feature of the Time Delay Controller is the
The final control law can be obtained by substituting equation angular acceleration term, the last experiment was done to see
(4.14) into equation (4.13). what would happen when the acceleration term is taken out of
the control law
7 m(t)=7 m(t-L)+J{ - [O(t)-O(t-L)]lL
(4.17)
- w2nf}(t) - 2sw n0(t) + w211 r(t) J
The above PID controller was applied to the system without
where the delay time Lis 5 ms. an external spring. As seen in Fig. 17, the system went into an
Journal of Dynamic Systems, Measurement, and Control MARCH 1990, Vol. 1121141