Kernel Learning Adaptive One-Step-Ahead Predictive Control For Nonlinear Processes
Kernel Learning Adaptive One-Step-Ahead Predictive Control For Nonlinear Processes
ABSTRACT: A Kernel learning adaptive one-step-ahead Predictive Control (KPC) algorithm is proposed for the general
unknown nonlinear processes. The main structure of the KPC law is twofold. A one-step-ahead predictive model is
first obtained by using the kernel learning (KL) identification framework. An analytical control law is then derived
from Taylor linearization method, resulting in an efficient computation for on-line implementation. The convergence
analysis of the KPC control strategy is presented, meanwhile a new concept of adaptive modification index is proposed
to improve the tracking ability of KPC and reject the unknown disturbance. This simple KPC scheme has few parameters
to be chosen and small computation scale, which make it very suitable for real-time control. Numerical simulations
compared with a well-tuned proportional-integral-derivative (PID) controller on a nonlinear chemical process show
the new KPC algorithm exhibits much better performance and more satisfactory robustness to both additive noise and
unknown process disturbance. 2008 Curtin University of Technology and John Wiley & Sons, Ltd.
Control Strategy; meanwhile, a new concept of adap- point u(k − 1), meanwhile neglecting the higher-order
tive modification index (AMI) is obtained to achieve a terms.[2] Then we have
good tracking performance. This simple KPC scheme
has few parameters to be chosen beforehand including y(k + 1) = f [x̃ (k )]
a small computation scale, which make it very suitable
for nonlinear real-time control. Application of the pro- ∂f
+ u(k ) + O(u(k ))
posed KPC algorithm to a nonlinear chemical process ∂u(k ) u(k )=u(k −1)
is illustrated in the section on Simulation Results and
∂f
the conclusions are drawn in the final section. ≈ f [x̃ (k )] + u(k ) (5)
∂u(k ) u(k )=u(k −1)
(SVR) or a least squares SVR is adopted, the KL iden- where h is the error correction coefficient, and in
tification model can be expressed as a uniform formula most cases h = 1. Consequently, the KPC law can be
ultimately formulated as
ym (k + 1) = KL[x (k ), α(k )]
∂KL
NSV µ(k )
= αi K x (i ), x (k ) + b (8) ∂u(k ) u(k )=u(k −1)
u(k ) = u(k − 1) + 2
i =1
∂KL
λ+
where αi denote general Lagrange multipliers, which ∂u(k ) u(k )=u(k −1)
are the linear combinations of Lagrange multipliers;
NSV is the number of support vectors; ·, · denotes the × E (k + 1) (12)
dot product and K x (i ), x (k ) is a kernel function that
handles the inner product in the feature space and hence where E (k + 1) = yr (k + 1) − e(k ) − KL[x̃ (k )] is the
the explicit form of nonlinear mapping does not need total error of KL predictive model at time k .
to be known; b is the bias term.[10,11] Then substituting
the KL model into Eqn (6) yields the original KPC law
CONVERGENCE ANALYSIS AND
∂KL CORRESPONDING ADAPTIVE CONTROL
∂u(k ) u(k )=u(k −1) STRATEGY
u(k ) = u(k − 1) + 2
∂KL It is well known that it is extremely important to guaran-
λ+
∂u(k ) u(k )=u(k −1) tee a control law convergent. Thus, we investigate the
proposed KPC law detailed and obtain the following
× {yr (k + 1) − KL[x̃ (k )]} (9) theorem.
Theorem 1. There exists suitable µ(k ) such that the
To compensate both the Taylor approximation and control algorithm given in Eqn (12) will be convergent.
the identification error, an adaptive modification item Proof: Let
(AMI) µ(k ) is introduced to the control law in Eqn (9).
Then the KPC law can be rewritten as (k + 1) = yr (k + 1) − y(k + 1) (13)
∂KL
µ(k ) and
∂u(k ) u(k )=u(k −1)
u(k ) = u(k − 1) + 2 ∂KL
∂KL µ(k )
∂u(k ) u(k )=u(k −1)
λ+
∂u(k ) u(k )=u(k −1) δ(k ) = 2 E (k + 1). (14)
∂KL
× {yr (k + 1) − KL[x̃ (k )]} λ+
∂u(k ) u(k )=u(k −1)
(10)
Thus, there exists suitable AMI when 1. It is easy to obtain an accurate nonlinear predictive
model with good generalization by KL identification
2 methodology.
∂KL
λ+ 2. The control law can be modified adaptively by AMI
∂u(k ) u(k )=u(k −1) µ(k ) to keep high tracking performance, especially
µ(k ) = (17)
∂KL ∂KL when the process is time varying or suffers from
∂u(k ) u(k )=u(k −1) ∂u(k ) u(k )=u(k −1)
unknown disturbances.
3. A simple analytical control law makes real-time
computation effective.
one can always make ε(k + 1) = 0. That is to say the
KPC law given in Eqn (12) is convergent. Two common kernel functions, Gaussian and poly-
To obtain a reliable estimate of µ(k ), we design nomial, are usually utilized in KL methods.[10] When
a simple recursive efficient method. According to a Gaussian kernel function is adopted: K (xi , xj ) =
Eqns (9) and (10), µ(0) is set to 1 and then µ(k ) is exp(−||xi − xj ||2 /σ 2 ), where σ is the Gaussian kernel
replaced by µ(k − 1) in Eqn (14) at time k to obtain width, the KL identification model can be formulated
the estimation of δ(k ). Let u(k − 1) = [u(k − 1) + as
u(k )]/2 for more precision; at last the actual AMI
at time k is calculated in Eqn (17). By substituting
Eqn (17) into Eqn (12), the convergent and adaptive ym (k + 1) = KL[x (k )]
control law at time k can be deduced
NSV
= αi exp(−||x (i )
u(k ) = u(k − 1) + E (k + 1) i =1
− x (k )||2 /σ 2 ) + b (19)
∂KL
(18)
∂u(k ) u(k )=u(k −1) From Eqn (19) we can obtain
∂KL 2
Figure 1 shows the whole KPC strategy. TDL denotes NSV
= αi exp(−||x (i ) − x (k )||2
∂u(k ) u(k )=u(k −1)
the common time delay and GTDL is defined as a gen- 2
eral time delay, through which x̃ (k ) = [Y (k ), u(k − σ i =1
1), U (k − 1)] is obtained. KPC is composed of two pri- 2
σ )[xny +1 (i ) − u(k − 1)] (20)
mary modules: a KL predictive model and an adaptive
controller. The flowchart of this simple strategy is as
follows: at time k , a corrected KL predictive model where x (k ) = [Y (k ), u(k − 1), U (k − 1)], xny +1 (i ) is
is obtained by adding the latest error e(k ) to the quasi- the ny + 1 item of x (i ) vector. Thus, a Gaussian kernel
one-step-ahead predictive output KL[x̃ (k )], and then the function based KPC law is expressed below
total error E (k + 1) and the AMI µ(k ) are both intro-
u(k ) = u(k − 1)+
duced into the controller to compute the process control
law u(k ). E (k + 1)
. (21)
There are three main advantages in our proposed
NSV
αi exp(−||x (i ) − x (k )|| 2
σ )[xny +1 (i ) − u(k − 1)]
2
KPC law: i =1
NSV
KL[x (k )] = αi (x (i ), x (k ) + τ )p + b (22)
i =1 qc ,Tc 0
∂KL
∂u(k ) u(k )=u(k −1) Figure 2. Schematic of the CSTR. This
figure is available in colour online at
NSV
www.apjChemEng.com.
=p αi (x (i ), x (k ) + τ )p−1 xny +1 (i ) (23)
i =1
u(k ) = u(k − 1) 103.41 l min−1 . The other meanings and nominal val-
E (k + 1) ues of the variables in the above equations can be
+ (24) referred to Nahas et al .[5] Under the input constraint,
NSV
90 l min−1 ≤ qc ≤ 110 l min−1 , the control objective
p αi (x (i ), x (k ) + τ )p−1 xny +1 (i )
is to regulate Ca by manipulating qc .
i =1
The sampling period of all process measurements
is 6 s. For the identification procedure a sequence of
only 500 samples, which is much less than NN based
SIMULATION RESULTS identification method, [7]
is generated to form the iden-
tification set S = {x (i ), y(i + 1)}Ni=1 , x ∈ Rn , y ∈ R ,
As an example to illustrate the validity of the proposed where n = nu + ny is the order of input vector, which
KPC algorithm, we consider a highly nonlinear continu- is chosen as x (k ) = [y(k ), y(k − 1), y(k − 2), u(k ),
ous stirred tank reactor (CSTR) process, which is known u(k − 1), u(k − 2)] according to Lightbody and Irwin[7]
for its significant nonlinear behavior and exhibits multi- and Iplikci.[16] The simulation environment is Matlab
ple steady states and poses a difficult control problem.[5] V7.1 with CPU main frequency 2.4 GHz and 256 M
This complex CSTR process has been studied with memory. Without lost generality, an offline LSSVR
other control strategies, e.g. NN based nonlinear internal identification model is used with polynomial kernel
model control[7] and SVM based generalized predic- function, and the regularization parameter γ = 103 and
tive control.[16] Figure 2 is the schematic of this CSTR polynomial degree p = 3 are chosen by using cross-
process, in which an exothermic irreversible first-order validation approach. It only takes us several minutes to
reaction takes place. obtain this satisfactory KL identification model. Com-
The concentration Ca inside the reactor is controlled pared with the other methods, the KL identification is
by manipulating the coolant flow qc through the jacket. much easier to implement.
Under the mass and energy balance, the dynamics of
this CSTR process can be described as follows:
KPC is 0.039 which is obviously much smaller than KPC is more robust against additive noise and unknown
the PID controller (IAE = 0.164). When λ = 1−4 or disturbance. For details about the IAE comparison is
λ = 1, KPC can adaptively obtain the same control tabulated in Table 1. All the simulation results are the
performance due to the effect of AMI (In both cases average of 20 running times.
IAE = 0.098). The KPC controller with different λ can
track the set-point quickly and with little overshoot.
Moreover, the running time of KPC in one sample
Case 3: a ‘stair’ reference tracking
is about 0.05 s which is much less than the sampling
period of this CSTR process (6 s) and almost equals to
Furthermore, when the proposed controller is working
PID (0.03 s). So we can conclude that the KPC strat-
in a large operating range, is it still valid? Fig. 5 shows
egy presents much better performance than the PID
the system response of a ‘stair’ reference starting on
controller.
0.085 mol/l and ending at 0.12 mol/l; each step has a
duration of 10 min and amplitude of 0.005 mol/l. The
value of λ is unchanged and kept the same as in Case
Case 2: noise and disturbance rejection 2 to validate its robustness. As can be appreciated, the
In order to mimic a realistic situation the system is
subjected to additive noise and unmeasured disturbance. 0.14
Different magnitudes of the super-imposed Gaussian
noise and disturbance are discussed. The satisfactory
0.13
performance is achieved when λ is varying in a wide
range just as analyzed aforementioned. And we choose
λ = 1−5 in this case. It is shown in Fig. 4 that the Ca, mol⋅l-1
0.12
0.11
0.12
0.1
Ca, mol⋅l-1
0.11
yr
0.09
PID (0.164) Reference Trajectory
l=1-8 (0.039) KPC
0.1 PID
-4
l=1 (0.098) 0.08
l=1 (0.039) 0 20 40 60 80 100 120 140 160 180 200
110
Sample Number
qc, l⋅min-1
100
Figure 4. System response with both noise and dis-
90 turbance. This figure is available in colour online at
www.apjChemEng.com.
105
AMI
100
0 20 40 60 80 100 0.125
Reference Trajectory
Sample Number 0.12 KPC
PID
Figure 3. System response to the set-point tracking with 0.115
different weighting factors. This figure is available in colour 0.11
online at www.apjChemEng.com.
Ca, mol⋅l-1
0.105
0.1
Table 1. IAE comparison of KPC and PID with different
noises and disturbances. 0.095
0.09
Simulation Control Low Normal High
conditions strategy noise noise noise 0.085
Acknowledgements
2008 Curtin University of Technology and John Wiley & Sons, Ltd. Asia-Pac. J. Chem. Eng. 2008; 3: 673–679
DOI: 10.1002/apj