Habtom-Litz1997 Chapter NeurocontrolOfNonlinearDynamic
Habtom-Litz1997 Chapter NeurocontrolOfNonlinearDynamic
1. Introduction
Due to their strong representational and mapping capabilities, recurrent neural
networks have been widely employed for the modeling and control of nonlinear
dynamic systems. Recently, there has been another dimension of interest in exploring
recurrent networks for state estimation. This is due to the feature that appropriately
structured recurrent networks have the potential to estimate not only the output of the
plant but also the internal states of the plant.
The Extended Kalman Filter (EKF), which is an optimal state and parameter
estimation tool, has been applied for the training of neural networks. The use of EKF
as a recurrent network learning algorithm is elaborated in [1] and its application for
the training of a neural controller is discussed in [2]. In our work [3], a method is
proposed, where the EKF and the recurrent neural model of the unknown system are
employed for the estimation of unmeasured inputs. In this paper, we apply this
approach to estimate unmeasured disturbance inputs in a closed-loop. An off-line
trained neurocontroller employs the estimated disturbance inputs and other measured
variables to control a dynamic system subject to unmeasured disturbance inputs.
identification model for a nonlinear dynamic system is depicted in Fig. 1. The RMLP
network in this figure can be represented mathematically as
x(k + 1) = f(WYy(k) + WUu(k) + Whx(k) + b u) (1)
where x(k) and y(k) denote the output vector of the hidden units and the output vector
of the network at time k, respectively; W y and W e represent the weight matrices
connecting the hidden units to the previous output of the network and the input vector
u, respectively; W x is the weight between the hidden units and the output units; W h
and W ° are weight matrices representing the local and lateral connections for the
hidden units and the output units, respectively; and b e, b x are the input and output
biases, respectively. The vectors of activation functions for the hidden units and the
output units are denoted as f(.) and g(.), respectively.
u(k)
where x y represents the output vector of the output units at time k. To use the discrete
EKF algorithm, the linearization of the states should be carried out on every step of
training and we have to associate plant and measurement noises.
where wl(k), w2(k), w3(k) are vectors of plant noise and v(k) is that of measurement
noise, at time k. Now, the discrete EKF can be applied to estimate the unmeasured
input vector z together with the states of the network (x and xY). The estimation is
based on the error between the output of the plant and the identification network with
the weights of the identification being kept unchanged.
For the purpose of training the neurocontroller, the reference vector and r and the
disturbance input vector Ud are generated randomly. The dynamic backpropagation
(DBP) algorithm involves a sensitivity model considering both the internal and
external recurrences. An off-line training can be carried out by using an efficient
training algorithm like the Levenberg-Marquardt method. If an on-line adaptation of
the controller is demanded, then the EKF algorithm can be preferably applied [2].
~1 +
- < _ ~ -'--~e(k+l)
94 I
95
93 1
90
9t
90
85
89
RM LP (y) ~ plant's output (yp)
80 20; 40;
~0; SO; looo s s ,oo ,ooo l,oo
[timin.] [ t / m in.]
Fig. 4 Measured and predicted values of y. Fig. 5 Performance of the neurocontroller.
Finally, the neurocontroller is implemented in the overall control structure shown
in Fig. 3. Now, the unmeasured disturbance input (u2) of the drum is estimated using
the EKF algorithm, while the other disturbance input (Ul) is applied to both the
identification and controller networks. In Fig. 6, the input u2, which is applied to the
mathematical model, and the estimated unmeasured input z are shown. Fig. 7 shows
the tracking ability of the neurocontroller, when the reference input changes.
860
48 95
% %
94
46
44
92 i
42
91
4,
90
38 ~ ~ reference(r)
-- measured(u 2) 89
plant's output (yp)
estimated (z)
36 I i i
20'° ,0; ,.o 800 1000 ~8 20o 400 600 800 lOO~l
[timin.] [t/rain.]
Fig. 6 Measured and estimated values of u2. Fig. 7 The controller tracking the reference.
6. C o n c l u s i o n s
The simulation results have demonstrated the effectiveness of the proposed
approach not only in estimating the unmeasured disturbance input of the drying drum
but also in driving the unknown system to the desired output. The fact that the inherent
characteristics of the mathematical model is complex and that we employed two
strongly varying disturbance inputs of which one is estimated, makes the control
aspect rather difficult. In view of this, we believe that the performance of the neural
controller is quite satisfactory. Even with the assumption that we could measure all the
disturbance inputs, the control aspect is not easy. As already pointed out in Section 4,
the neurocontroller can be theoretically adapted on-line with the aim of minimizing
the error between the output of the plant and that of the reference model by applying
the identification model as a channel for backpropagation. On this regard, a further
study will be carried out on the perspective of combining our approach to an adaptive
control law to improve the overall performance.
References
1. R.J. Williams. Training recurrent networks using the extended Kalman filter, in Proc. of
the International Joint Conference on Neural Networks, Baltimore 1992, vol. IV, pp. 241-
246.
2. G. V. Puskorius and L. A. Feldkamp. Neurocontrol of nonlinear dynamical systems with
Kalman filter trained recurrent networks, IEEE Trans. on Neural Networks, vol. 5, pp. 279-
297, 1994.
3. R. Habtom and L. Litz. Estimation of unmeasured inputs using recurrent neural networks
and the extended Kalman filter, in Proc. of International Conference on Neural Networks,
Houston 1997, vol. IV, pp. 2067-2071.
4. B. Fernandez, A. G. Parlos, and W. K. Tsai, Nonlinear dynamic system identification using
artificial neural networks (ANNs), in Proc. of International Joint Conference on Neural
Networks, San Diego 1990, vol. II, pp. 131-141.
5. K.S. Narendra and K. Parthasarathy. Identification and control of dynamical systems using
neural networks, IEEE Trans. on Neural Networks, vol. 1, pp. 4-27, 1990.
6. R.J. Williams and D. Zipser. A learning algorithm for continually running l)ally recurrent
neural networks, Neural Computation, vol. 1, pp. 270-280, 1989.
7. R. Habtom and L. Litz. Neural network-based control of nonlinear dynamic systems with
unmeasured inputs, in Proc. of Twelfth International Conference on Systems Engineering,
Coventry, UK, 1997.