Clustering Safe Process 09
Clustering Safe Process 09
* Control and Intelligence Processing Center of Excellence, School of ECE, University of Tehran, Iran
(phone: +98 912 246 23 27; e-mail: [email protected])..
** Petroleum University of Technology, Iran (e-mail: [email protected]).
*** Control and Intelligence Processing Center of Excellence, School of ECE,
University of Tehran, Iran (e-mail: [email protected]).
Abstract: This paper presents a set of new fault detection and isolation (FDI) approaches based on a
Modified Gath-Geva (MGG) Clustering approach. The proposed approaches are formulated in the forms
of a combined principal component analysis (PCA)-MGG or nonlinear PCA (NPCA)-MGG schemes.
The PCA and NPCA perform as a data preprocessing step on the measured industrial multivariate time
series data to enhance their informative richness, capsulated in a more compact form. Then, the MGG
clustering approach is used to detect and isolate the faults by organizing the PCA/NPCA-transformed
data in different clusters. A distributed-MGG scheme is also proposed in this paper as a new FDI
approach which is based on a distributed monitoring configuration. The main idea is to divide the overall
monitoring task into a series of local distributed monitoring sub-tasks so as to track and capture the
process faults locally. The diagnostic performances of the proposed FDI approaches are evaluated by a
set of comparative test studies on the Tennessee Eastman process plant as a large-scale benchmark
problem.
858
This paper is organized as follows. In the next section, the where µ i,k represents the degree of membership of the
proposed FDI approaches are introduced. Section III observation, Z k is the ith cluster (i=1,…,c) and m ∈ [1, ∞ ) is
demonstrates the experimental simulation test study to
provide a comparative performance evaluation of the a weighting exponent that determines the fuzziness of the
proposed FDI approaches on the TE process plant as a large- resulting clusters, usually chosen as m=2. The data are
scale benchmark problem. Finally, some concluding remarks assumed to be normally distributed random variables with
are summarized in the last section. expected value Vi and covariance matrix Fi . The GG
clustering algorithm is equivalent to the identification of a
2. DEVELOPMENT OF THE PROPOSED FDI mixture of Gaussians that represents the P (Z k η i )
APPROACHES probability density function expanded in a sum over the c
clusters,
In this section, first a powerful method for time-series
segmentation, introduced by Abonyi et al., 2005, is briefly
c
reviewed and then different modifications of the algorithm
are proposed for an enhanced FDI performance. P (Z k η ) = ∑ P(Z k η i )P(η i ), (2)
i =1
∑µ
i =1
i,k =1 (4)
c N
J= ∑∑ (µ
i =1 k =1
i,k )m D 2 (Z k , ηi ), (1)
to get a fuzzy partitioning space. Detailed description of the
optimization approach is out of the scope of this paper and
interested readers are referred to (Abonyi et al., 2005) for the
859
recursive formulation of solving minimization problem as 2.3 Variable selections in a distributed FDI approach
well as the detail description of the bottom-up algorithm for
merging similar clusters. When dealing with high-dimensional datasets, the cluster
structure of the observation is often confined to a small
2.2 PCA and nonlinear PCA application in FDI algorithm subset of variables. As pointed out by several authors (Ding
PCA is used wherever large and complicated multivariate et al., 2002 and Raftery et al., 2006), the inclusion of
data sets have to be reduced to a simpler form. NPCA, like unrelated variables could complicate or even mask the
PCA, is used to identify and remove correlations among recovery of the clusters. Common approaches for mitigating
problem variables as an aid to dimensionality reduction, the effect of noisy variables or identifying those that define
visualization, and exploratory data analysis. However, NPCA true cluster structure involve differentially weighting the
uncovers both linear and nonlinear correlations. It works as a covariates or selecting the discriminating ones.
feature extraction algorithm, producing features that retain We propose the concept of distributed FDI for proper
the maximum possible amount of information from the clustering of various faults. Since different variables are
original data set, for a given degree of data compression. In responsible for different faults, using all process variables for
this paper, we investigate using PCA and NPCA as data identification of all faults is time-consuming and non-
reduction techniques and a preprocessing step for the efficient. In the proposed method, process is divided into
application of FDI in large scale processes. The main goal of subsystems and only the related variables of each subsystem
this paper in this section is to incorporate statistical PCA and are considered for fault identification purposes corresponding
NPCA data preprocessing to improving FDI performances. to that subsystem. This idea can be very helpful for proper
For the sake of brevity, implementations steps of these FDI of industrial large scale processes.
algorithms are briefly summarized. Interested readers are
referred to the relevant references. Fig. 1 shows the different approaches employed in this paper
for solving FDI problem in large scale processes. These
PCA or NPCA seeks for a linear or nonlinear transformation approaches are implemented and compared in the next
k ×m
G ∈ℜ such that: section for FDI of the TE process plant as an industrial
benchmark problem.
T = G (Y ) (5)
G and H are selected to minimize loss of information 3. EXPERIMENTAL SIMULATIONS OF THE PROPOSED
E where FDI APPROACHES ON THE TE PROCESS PLANT
∑∑ (Y )
n m
1 i The process simulator for the Tennessee Eastman (TE)
E= j − Y j′i (7)
mn i =1 j =1 industrial Challenge Problem was created by the Eastman
Chemical Company to provide a realistic industrial process
for evaluating process control and monitoring methods
PCA can be implemented by singular value decomposition of (Downs et al., 2002). The TE process simulator has been
the covariance matrix of the data or by a three layer neural widely used by the process monitoring community as a
network (Baldi et al., 1989). A five layer neural network, as source of data for comparing various approaches (Chiang, et
showed by Kramer (1991), can be employed for al., 2001 and Lin et al., 2000). The control structure is shown
implementing NPCA. Here, we employ technique introduced schematically in Fig. 2. There are five major unit operations
by Scholz et al (2002) to enforce hierarchical order of the in the process: a reactor, a condenser, a recycle compressor, a
principal components which yields essentially uncorrelated separator, and a stripper. The four reactants A, C, D, and E
features. and the inert B are fed to the reactor where the products G
and H are formed and a by-product F is also produced. The
process has 22 continuous process measurements, 12
860
Fig. 2. Different Process layout of the Tennessee Eastman process.
manipulated variables, and 19 composition measurements The proposed FDI approaches, including PCA-based, NPCA-
sampled less frequently. Details on the process description base and distributed-based techniques, are evaluated in this
are well explained by (Chiange et al., 2001). A total of 33 section to demonstrate their diagnostic capabilities. The
variables, listed in Table 1, are used for monitoring in the obtained results will be compared with the original MGG
simulation studies. We excluded all composition clustering approach. Two sets of simulation experiments
measurements because they are unreliable. A sampling were conducted to investigate the comparative performances.
interval of 3 min was used to collect the simulated data for 1) Simulation Experiment 1
the training and testing sets. Some of the preprogrammed TE In this experiment, the faults corresponding to the TE Reactor
faults which are used in this section for FDI purpose have cooling water process, i.e. faults 4, 9 and 11, were
been listed in Table 2. A set of 100 data samples considered. To implement the experiment, a time series data
corresponding to each fault training data sets has been used in with a length of 300 data samples was constructed by
test study due to the unsupervised nature of the classification cascading a set of 100 data samples for each of the mentioned
technique employed in the proposed FDI approaches.. The TE Faults, comprising of all the 33 variables listed in Table
data have been generated and can be downloaded from 1. Fig. 3(a) illustrates the diagnostic performance of the
http://brahms.scs.uiuc.edu. original MGG clustering approach to detect and isolate the
3.2. Simulation Results and discussion
861
Table 2. Process Faults For The Tennessee
Eastman Process
NO. Description Type
4 Reactor cooling water inlet temperature Step
5 Condenser cooling water inlet temperature Step
9 D feed temperature (stream 2) Random variation
11 Reactor cooling water inlet temperature Random variation
12 Condenser cooling water inlet temperature Random variation
15 Condenser cooling water valve Sticking
a)
a)
b)
b)
c)
862
distributed-MGG, have been illustrated in Figs .5(a, b, c, d),
respectively. The experimental observations are quite similar
to those recorded in section 1. For the sake of brevity, the
supporting scatter plots have not been presented.
4. CONCLUSION
A number of new monitoring approaches were presented in
this paper based on the MGG clustering approach. It was
shown that the original MGG is unable to perform FDI task
a)
in the TE process plant as a large-scale benchmark problem.
It was shown that the proposed PCA-MGG and NPCA-MGG
approaches could improve the diagnostic monitoring
performances with respect to the original MGG approach.
However, they were unable to exactly perform the FDI task.
The proposed distributed-MGG approach, based on the
monitoring of local subsystems, proved to be a promising
diagnostic approach with superb FDI performances to
discriminate the TE faults 4, 9 ,11 5, 12 and 15. It is noted
b) that the approach could manage to diagnose the TE faults 9
and 15 which are known as the TE most difficult faults. To
the best of our knowledge, these faults have not been
diagnosed yet.
REFERENCES
Abonyi, J., B. Feil, S. Nemeth and P. Arva (2005), Modified
Gath–Geva clustering for fuzzy segmentation of
multivariate time-series. Fuzzy sets and systems J., vol.
c) 140, pp. 39–56.
Baldi, P., and K. Hornic (1989), Neural network and
principal component analysis: learning from example
without Local Minima. Neural network, vol. 2, pp. 53.
Chiang, L. H., E. L. Russell and R. D. Braatz (2000), Fault
diagnosis in chemical processes using Fisher
discriminant analysis, partial least squares, and principal
component analysis. Chemometrics and Intelligent
Laboratory Systems J., vol. 50, pp. 243–252.
Chiang, L.H., E.L. Russell and R.D. Braatz (2001), Fault
d) Detetction and Diagnosis in industrial systems. Springer
Verlag, London.
Fig. 5. FDI results for faults 5, 12 and 15. Ding, C., X. He, H. Zha and H. D. Simon (2002), Adaptive
dimension reduction for clustering high-dimensional
As a consequence, the proposed distributed-MGG monitoring data. Proceeding of the IEEE International conference
approach, based on the locally selected process variables, was on Data Mining, pp. 147–154.
tested. The obtained result shown in Fig. 3(d) demonstrates Downs, J. J., E.F. Vogel (1993), A plant-wide industrial-
diagnostic performances of the proposed approach. In this process control problem. Computers & Chemical
distributed monitoring scheme, the local process variables Engineering J., vol. 17, pp. 245–255.
corresponding to the Reactor, i.e. variables 9, 21 and 32 in Kramer, M. (1991), Nonlinear principal component analysis
Table I, have been employed as the input fault data set. The using auto-associative neural networks. AIChE Journal,
scatter plot illustrated in Fig. 4(c) verifies the validity of this vol. 37, pp. 233–243.
approach. As shown, the three TE Reactor faults have Lin, W., Y. Qian and X. Li (2000), Nonlinear dynamic
properly been separated, making their clustering an easy task principal component analysis for on-line process
to do. monitoring and diagnosis. Computers & Chemical
Engineering J., vol. 24, pp. 423–429.
2) Simulation Experiment 2 Raftery, A. E. and N. Dean (2006), Variable selection for
The same experimental scenario was repeated for the faults model-based clustering. Journal of the American
corresponding to the Condenser cooling water process, i.e. Statistical Association, vol. 101, No. 473, pp. 168–178.
faults 5, 12 and 15. The obtained results for the different Scholz, M., and R.Vigario, (2002), Nonlinear PCA: A new
monitoring approaches, i.e. the original MGG clustering, the hierarchical approach. Proc.European Symposium on
combined PCA-MGG, the combined NPCA-MGG and the Artificial Neural Networks, pp. 439–444.
863