Liu 2010
Liu 2010
DOI 10.1007/s11768-010-8038-x
Abstract: In this paper, an adaptive neural network control scheme for robot manipulators with actuator nonlinearities
is presented. The control scheme consists of an adaptive neural network controller and an actuator nonlinearities compen-
sator. Since the actuator nonlinearities are usually included in the robot driving motor, a compensator using radial basis
function (RBF) network is proposed to estimate the actuator nonlinearities and eliminate their effects. Subsequently, an
adaptive neural network controller that neither requires the evaluation of inverse dynamical model nor the time-consuming
training process is given. In addition, GL matrix and its product operator are introduced to help prove the stability of the
closed control system. Considering the adaptive neural network controller and the RBF network compensator as the whole
control scheme, the closed-loop system is proved to be uniformly ultimately bounded (UUB). The whole scheme provides a
general procedure to control the robot manipulators with actuator nonlinearities. Simulation results verify the effectiveness
of the designed scheme and the theoretical discussion.
Keywords: Adaptive control; RBF neural network; Actuator nonlinearity; Robot manipulator; Deadzone
where q(t) ∈ Rn is a vector of the link positions, M (q) difficult. Here we assume that the functions h(u) and g(u)
is the symmetric positive definite inertia matrix of robots, are monotonically increasing, smooth and invertible. There-
Vm (q, q̇) represents coriolis and centrifugal forces, G(q) is fore, there exists a deadzone pre-inverse D−1 (w),
a vector that represents the gravitational forces, and τ is a ⎧ −1
⎨g (w) < 0, u < 0,
vector of the torques (or forces) acted on the joints. −1
D (w) = 0, u = 0, (3)
2.2 Deadzone nonlinearity ⎩ −1
h (w) > 0, u > 0
The deadzone usually appears as one form of the actua-
tor nonlinearities. A mathematical model for the deadzone such that
characteristics is given by: D(D−1 (w)) = w. (4)
⎧ To offset the deleterious effects of the unknown deadzone
⎨g(u) < 0, u d− ,
D(u) = 0, nonlinearity, a neural network compensator may be placed
d− < d + , (2)
⎩ as illustrated in Fig.1, where w and u are the output of the
h(u) > 0, u d+ . NN controller and NN compensator, respectively. We hope
Function h(u) and g(u) are smooth, nonlinear functions, that the compensator can cause the composite throughput
so this describes a very general class of deadzone. All of from w to τ to be unity. This is the reason why the deadzone
h(u), g(u), d+ and d− are unknown, so compensation is compensator using RBF networks is proposed in Section 4.
q̈r = q̈d + Λė. (11) and Wi are the ideal target weights. σ and σi are both the
−(x−c)2
Based on the Gaussian RBF networks approximation radial basis function as σ(x) = e p2 .
property [8], one can emulate M (q), Vm (q, q̇) and G(q),
respectively: By defining Ŵ and Ŵi as estimates of the ideal NN
weights, two RBF networks are used to emulate D(u) and
M (q) = [{Wm }T ◦ {σM (q)}] + εM (q), (12) wN N :
Vm (q, q̇) = [{Wm }T ◦ {σV (z)}] + εV (q), (13) D̂(u) = {Ŵ }T ◦ {σ(u)}, (26)
G(q) = [{WG }T ◦ {σG (q)}] + εG (q), (14) ŵN N = {Ŵi }T ◦ {σi (w)}. (27)
where {◦} is GL matrix and “◦” is the GL product oper-
We define W̃ = W − Ŵ and W̃i = Wi − Ŵi as the
ator defined in Section 3.1. εv , εv and εG are the NN re-
weight estimation error.
construction errors. WM , WV and WG are the ideal target
weights. σM , σV and σG are all the radial basis function as Theorem 1 Given the RBF network compensator (23)
−(x−c)2 and (27), the throughput of the compensator plus the dead-
σ(x) = e p2 . Then we can get zone is given by
M (q)q̈ + Vm (q, q̇)q̇ + G(q) τ = w − [{Ŵ }T ◦ {σ (u)}] ◦ [{Ŵi }T ◦ {σi (w)}]
= [{WM }T ◦ {σM (q)}]q̈ + [{WV }T ◦ {σV (z)}]q̇
+[{W̃ }T ◦ {σ (u)}] ◦ [{W̃i }T ◦ {σi (w)}]
+[{WG }T ◦ {σG (q)}] + εM (q)q̈ + εV (z)q̇ + εG (q) +d(t), (28)
= [{WM }T ◦ {σM (q)}]q̈r + [{WV }T ◦ {σV (z)}]q̇r where the modeling mismatch term is
+[{WG }T ◦ {σG (q)}] − M ṙ − Vm (q, q̇)r + E, (15)
d(t) = −[{W̃ }T ◦ {σ (u)}] ◦ [{Wi }T ◦ {σi (w)}]
where E = εM (q)q̈r + εV (z)q̇r + εG (q). −b(t) + ε(u), (29)
By defining ŴM , ŴV and ŴG as estimates of the ideal b(t) = {W }T ◦ {[σ (w + {Ŵi }T ) ◦ {σi w}]
NN weights, three RBF networks with parameter adapta-
tion law can be used to emulate M (q), Vm (q, q̇) and G(q), ◦[εi (w)]} + {W }T ◦ {R1 (W̃i , w)}
respectively: +ε(w + wN N ). (30)
M̂ (q) = {ŴM } ◦ {σM (q)}, (16) Proof From (23) and (24), one has
τk = WkT σ(uk ) + ε(uk )
V̂m (q, q̇) = {ŴV } ◦ {σV (z)}, (17)
= WkT σ(wk + ŵN N k ) + ε(wk + ŵN N k ), (31)
Ĝ(q) = {ŴG } ◦ {σG (q)}. (18)
where τk is the kth throughput of the compensator plus the
We choose the controller given by Ge et al. in [8] and [9]: deadzone.
w = M̂ (q)q̈r + V̂m (q, q̇)q̇r + Ĝ(q) + KV r + Kr sgn(r) From (4) and (25), it follows that
= {ŴM }T ◦ {σM (q)}q̈r + {ŴV }T wk = WkT σ(wk + wN N k ) + ε(wk + wN N k )
◦{σV (z)}q̇r + {ŴG }T ◦ {σG (q)} = WkT σ(wk + Wik T
σi (wk ) + εi (wk ))
+KV r + Kr sgn(r). (19) +ε(wk + wN N k )
The adaptation laws are = WkT σ(wk + Wik T
σi (wk ) + W̃ikT
σi (wk ) + εi (wk ))
⎧
⎨ ŴM k = ΓM k ◦ σM k (q)q̈r rk ,
⎪ +σ(wk + wN N k ). (32)
˙ Using the Taylor series expansion, one has
ŴV k = ΓV k ◦ σV k (z)q̇r rk , (20)
⎪
⎩ ˙
ŴGk = ΓGk ◦ σGk (q)rk . wk = WkT σ(wk + Ŵik T
σi (wk ))
+WkT σ (wk + Ŵik
T T
σi (wk ))(W̃ik σi (wk )
T
4 NN deadzone compensator design +εi (wk ))+Wk R1 (W̃ik , wk )+ε(wk +wN N k ), (33)
Based on the work of Lewis et al. [6], the deadzone in- where R1 (W̃ik , wk ) is the remainder of the first Taylor
verse can be expressed in equivalent form as polynomial. Regrouping the terms, one has
D−1 (w) = w + wN N , (21) wk = WkT σ(wk + Ŵik T
σi (wk )) + WkT σ
tion. The NN tuning algorithm is chosen as Using (40) and (41), one has
⎧
⎪ ˙
T L̇ = rT [−KV r + [{Ŵ }T ◦ {σ (u)}]
⎨ Ŵk = −Sσ (uk )Ŵik σi (wk )rk − K1 SrŴk ,
˙
Ŵik = T σi (wk )rk ŴkT σ (uk ) − K1 T rŴik (37) ◦ [{W̃i }T ◦ {σi (w)}] − [{Ŵ }T ◦ {σ (u)}]
⎪
⎩
−K2 T rŴi F Ŵik ◦ [{W̃i }T ◦ {σi (w)}] − d(t)
with any positive constant matrices S = S T , T = T T and +{W̃V }T ◦ {σV (q)}q̈r + {W̃M }T ◦ {σM (q)}q̇r
K1 , K2 > 0. Ŵk and Ŵik are the weights of the kth com- +{W˜G }T ◦ {σG (q)} + E − Kr sgn(r)]
pensator and 1 k n. n
˙ + n
˙
+ W̃ T S −1 W̃
k kW̃ T T −1 W̃ ik ik
k=1 k=1
5 Adaptive NN controller with deadzone n
−1 ˙
n
˙
compensator + T
W̃M k ΓM k W̃M k + W̃VTk ΓV−1
k W̃V k
k=1 k=1
Based on the dynamics of robot manipulator equation (1) n
T −1 ˙
and the definitions in equation (5)∼(8) in Section 3, there + W̃Gk ΓGk W̃Gk . (44)
exists k=1
7 Conclusions
In this paper, an adaptive neural network control scheme
based on RBF neural network for a robot manipulator with
deadzone is proposed. The control scheme consists of a new
Fig. 7 Position tracking errors when using the adaptive controller and the adaptive RBF neural network controller and an improved
compensator. RBF compensator. Unlike the NN control proposed in [6],
three RBF networks with parameter adaptive law are used
to emulate the inertia matrix, centripetal matrix and grav-
itational vector of robot directly, nolinear robot function
f (x) estimate is not necessary, and chattering caused by
switch gain can be decreased effectively. Compared to the
multilayer BP-network in [6], RBF-network used in the im-
proved control system has fewer parameters to be tuned and
is much easier to realize in real time control engineering.
For example, for BP neural network in [6], 20 hidden-layer
nodes with sigmoid function and 4 jump function are cho-
sen; in this paper, we only use 5 hidden layer nodes. GL
matrix and operator are introduced to help prove the sta-
bility of the n-link robot manipulator system. The closed-
Fig. 8 Control signals τ . loop system is proved to be uniformly ultimately bounded
(UUB). The whole scheme can not only be used for dead-
zone compensation, but also for general actuator nonlineari-
ties compensation, like backlash, saturation, etc. Simulation
results verify the high accuracy-tracking performance of the
designed scheme and the theoretical discussion.
In addition, the similar analysis and Matlab programs de-
sign were given in the book [16].
References
[1] B. Friedl. Advanced Control System Design[M]. Englewood Cliffs:
Prentice-Hall, 1996.
[2] G. Tao, P. V. Kokotovic. Adaptive control of systems with unknown
output backlash[J]. IEEE Transactions on Automatic Control, 1995,
Fig. 9 Compensator signals ŴN N . 40(2): 326 – 330.
256 J. LIU et al. / J Control Theory Appl 2010 8 (2) 249–256
[3] G. Tao, P. V. Kokotovic. Continuous-time adaptive control of systems [13] X. Wu, X. Zhu, G. Cao, et al. Predictive control of SOFC based on
with unknown backlash[J]. IEEE Transactions on Automatic Control, a GA-RBF neural network model[J].Journal of Power Sources, 2008,
1995, 40(6): 1083 – 1087. 179(1): 232 – 239.
[4] F. L. Lewis, W. K. Tim, L. Wang, et al. Deadzone compensation in [14] J. E. Slotine, W. Li. Applied Nonlinear Control[M]. Englewood Cliffs:
motion control systems using adaptive fuzzy logic control[J]. IEEE Prentice-Hall, 1991.
Transactions on Control Systems and Technology, 1999, 7(6): 731 –
742. [15] Y. Lu, J. Liu, F. Sun. Actuator nonlinearities compensation using rbf
[5] G. Tao, P. V. Kokotovic. Adaptive Control of Systems with Actuator neural networks in robot control system[C]//IMACS Multiconference
and Sensor Nonlinearities[M]. New York: John Wiley & Sons, 1996. on Computational Engineering in Systems Applications. Beijing:
Tsinghua University Press, 2006: 231 – 238.
[6] R. R. Selmic, F. L. Lewis. Deadzone compensation in motion control
systems using neural networks[J]. IEEE Transactions on Automatic [16] J. Liu. Robot Control System Design and Matlab Simulation[M].
Control, 2000, 45(4): 602 – 612. Beijing: Tsinghua University Press, 2008.
[7] M. Bianchini, P. Frasconi, M. Gori. Learning without local minima
Jinkun LIU was born in Heilongjiang Province,
in radial basis function networks[J]. IEEE Trans. Neural Networks,
1965. He received his Ph.D. degree in Department
1995, 6(3): 749 – 755.
of Automatic Control from Northeastern Univer-
[8] S. S. Ge, T. H. Lee, C. J. Harris. Adaptive Neural Network Control sity, China, in 1997. After a two-year post-doctor in
of Robotic Manipulators[M]. Singapore: World Scientific Publishing Zhejiang University from 1997 to 1999, he joined
Co., 1998. School of Automation Science and Electrical Engi-
[9] S. S. Ge, C. C. Hang, L. C. Woon. Adaptive neural network control of neering, Beijing University of Aeronautics and As-
robot manipulators in task space[J]. IEEE Trans. Ind. Electron, 1997, tronautics, and he is currently a professor. His re-
44(6): 746 – 752. search concerns motion control, intelligent control,
[10] S. S. Ge, C. C. Hang. Direct adaptive neural network control of and robust control. E-mail: [email protected].
robots[J]. International Journal of System Science, 1996, 27(6): 533
Yu LU received the B.E. and M.E. degrees in Elec-
– 542.
trical Engineering from the Beijing University of
[11] Q. Zhu, S. Fei, T. Zhang, et al. Adaptive RBF neural-networks control Aeronautics and Astronautics, China. Since 2008, he
for a class of time-delay nonlinear systems [J]. Neurocomputing, has been pursuing the Ph.D. degree at the National
2008, 71(16/18): 3617 – 3624. University of Singapore. His current research inter-
[12] H. Peng, J. Wu, G. Inoussa, et al. Nonlinear system modeling ests are in the fields of computer networks, human-
and predictive control using the RBF nets-based quasi-linear ARX computer interaction and applications of control the-
model[J]. Control Engineering Practice, 2009, 17(1): 59 – 66. ory.