Paper Template Saucedo - Vfinal2
Paper Template Saucedo - Vfinal2
Paper Template Saucedo - Vfinal2
Abstract – Condition monitoring schemes are essential for electrical stresses associated to problems in the power supply
increasing the reliability and ensuring the equipment efficiency cause stator faults like short circuits in the stator winding [3].
in industrial processes. The feature extraction and dimensionality Thus, the related condition monitoring plays a key role in the
reduction are useful preprocessing steps to obtain high
reliability and safety strategies of several industry applications
performance in condition monitoring schemes. To address this
issue, this work presents a novel diagnosis methodology based on [4]-[6]. Although different physical magnitudes have been
high-dimensional feature reduction applied to detect multiple investigated for IM condition monitoring [3], [7]-[9]; vibration
faults in an induction motor linked to a kinematic chain. The analysis remains as the most industrially accepted approach.
proposed methodology involves a hybrid feature reduction that The vibration analysis is a useful and reliable tool to assess the
ensures a good processing of the acquired vibration signals. The IM condition since the characteristic vibration modes of any
method is performed sequentially; first, signal decomposition is
rotating machine changes in presence of faults [10]-[13]. Yet,
carried out by means of Empirical Mode Decomposition. Second,
statistical-time based features are estimated from the resulting although several methodologies applied to diagnose faults in
decompositions. Third, a feature optimization is performed to electric motors have been presented during the last decades,
preserve the data variance by a Genetic Algorithm in conjunction most of these methodologies are focused on the analysis of a
with the Principal Component Analysis. Fourth, a feature specific fault mode [7], [10], [14]-[15]. Indeed, the application
selection is done by means of Fisher score analysis. Fifth, a feature of such health monitoring schemes to industrial scenarios
extraction is performed through Linear Discriminant Analysis.
presents new challenges that must be addressed, where
And, finally, sixth, the different considered faults are diagnosed
by a Neural Network-based classifier. The performance and the different faults may appear hiding or overlapping the expected
effectiveness of the proposed diagnosis methodology is validated characteristic fault patterns.
experimentally and compared with classical feature reduction Typically, it is estimated the root mean square (RMS) from
strategies, making the proposed methodology suitable for the vibration signal as a numerical indicator to assess the
industry applications. general condition of the machine [16]-[18]. In order to
consider improved characterization of the vibration signal, the
Index Terms— Induction Motor, Condition Monitoring;
numerical set of features is extended to additional statistical
Multiple Faults; Feature Reduction; Vibrations.
time-domain, frequency domain, and also time-frequency
domain [19]-[21], [22]. Yet, although fast Fourier transform
I. INTRODUCTION and Cohen's class time-frequency distributions have been
Induction motors (IM) represent the most common rotating successfully applied [22]-[23], the simplicity and low
electrical machines used in industry due to its robustness and computational cost of the statistical time-domain features
competitive cost [1]-[2]. However, unexpected faults may exhibit a high characterization potential dealing with regular
occur during the useful life of the IM, causing unscheduled stationary speed cycles in the industry [24].
downtimes of the whole components associated to the Condition monitoring strategies that use a high-dimensional
kinematic chain. Typical faults in IM may be due to set of features to characterize the properties of faults inevitably
mechanical and electrical stresses. Mechanical stresses caused contain redundant and non-significant information. Recently,
by overloads can produce bearing defects, rotor bar breakage, approaches of signal decomposition are widely used in
rotor unbalance and misalignment in couplings, whereas condition monitoring schemes. Different decomposition
techniques can be used; and the signal decomposition approach
This research was partially supported by CONACYT, Mexico, under by means of Empirical Mode Decomposition (EMD) has being
doctoral scholarship number 278033, and MINECO, Spain, under the
Project CICYT TRA2013- 46757-R. applied due to its self-adaptive capabilities to extract a set of
J. J. Saucedo-Dorantes and R. A. Osornio-Rios are with the HSPdigital CA- Intrinsic Mode Functions (IMF) from the raw signal. The
Mecatronica, Engineering Faculty, Autonomous University of Queretaro, estimation of numerical features from each IMF represents a
San Juan del Rio 76806, Mexico (e-mail: [email protected],
[email protected]). good opportunity to obtain a potential high-dimensional set of
M. Delgado-Prieto is with the MCIA Research Center, Department of features for diagnosis purposes [20]. Yet, dimensionality
Electronic Engineering, Technical University of Catalonia (UPC), Spain (e- reduction procedures must be applied to avoid low fault
mail: [email protected]).
R. J. Romero-Troncoso is with the HSPdigital CA-Telematica, DICIS,
diagnosis performances and overfitting responses of the
University of Guanajuato, Salamanca 36885, Mexico (e-mail: classification algorithm [21], [25]. In this regard, classical
[email protected]). techniques of dimensionality reduction have been integrated in
condition monitoring schemes; for instance, Principal include the validation that the application of hybrid feature
Component Analysis (PCA) [20], [26], and Linear reduction strategies (selection and extraction), represents a
Discriminant Analysis (LDA) [27], are the main techniques high-performance information analysis procedure, which
used for reducing high-dimensional sets of features. However, improves the classification capabilities compared with the use
each dimensionality reduction approach is based on a specific of classical approaches, such as PCA and LDA, as a unique
objective function; that is, PCA aims to identify orthogonal technique to high-dimensional feature reduction. Notice that
components aligned with the maximum data dispersion this proposed hybrid feature reduction methodology has not
direction, whereas LDA aims to maximize the distance among been study in multi-fault diagnosis so far and the results are
different data sets [28]. Such difference of criteria leads to promising.
multiple works in which the selection of the dimensionality This paper is structured as follows. Section II describes the
reduction approach is carried out by a performing ratio when theoretical aspects of the proposed method and section III
combined with the classification algorithm [29]-[30]. describes the diagnosis methodology. The experimental test
Moreover, dealing with multiple faults, such classical bench used to assess and validate the method is presented and
dimensionality reduction approaches are usually combined discussed in sections IV and V, respectively. Conclusions and
with complex hierarchical classification structures in order to future work are summarized in Section VI.
compensate the loss of performance. In this sense, in [23], a
set of features, estimated by means of wavelet decomposition II. FEATURE REDUCTION
from vibration signals, is used in a hierarchical deep belief The feature set is a critical aspect that compromises the
based network to classify different bearing defects. Although performance of classification algorithms; thereby, a reduced
this methodology exhibits good results, the proposed multi- number of features will not contain enough information to
stage network implies multiple trainings, one for each specific describe and to characterize the machine working conditions.
condition to be solved. In [22], a bi-spectrum set of features Therefore, the addition of new features is an option to increase
estimated from vibration measurements is reduced through the capability of discrimination, and it is commonly believed
PCA technique and then, used by a hierarchical classifier based that the classification performance will improve. Yet, an
on Support Vector Machine (SVM). Although this scheme increase of the number of features may not offer additional
assesses different bearing condition, the proposed approach information to the machine condition, and the performance of
involves the use of the same number of SVMs as the same classification will be degraded instead of improved. Thus,
number of considered faults. misclassifications can be obtained because of the redundant
Thereby, the contribution of this work lies on a novel multi- and useless information contained in large sets of features.
fault diagnosis methodology, and the verification of the Working with a high-dimensional set of features complicates
proposed hybrid high-dimensional feature reduction method to the fault identification task of the multi-class classification
increase the diagnosis performance dealing with multiple methods. Besides, it is required a high computational cost and
faults in an induction motor linked to a kinematic chain. the use of redundant and useless information could
Originality of the work includes the empirical mode compromise the proper convergence of the algorithms [21].
decomposition of the available vibration signals, the For that reason, procedures of feature reduction are
estimation of statistical-time-features, and the validation of the implemented in condition monitoring schemes [14]. Mainly, it
proposed hybrid high-dimensional feature reduction method. is possible to remove redundant or non-discriminative features
Indeed, the resulting high-dimensional set of features is by means of two reduction strategies: feature selection and
analyzed by means of a novel multi-stage dimensionality feature extraction.
reduction approach, in which, an optimization is performed by Regarding feature selection, it is a filtering strategy in
a Genetic Algorithm (GA) in conjunction with the PCA to seek which all the features are independently evaluated by
an optimal set of IMFs that best preserve the data variance, considering only their individual descriptive capabilities; thus,
afterwards, a selection of the best discriminative statistical the features are ranked in terms of their relevance, and even
features is carried out by means of the Fisher score, and then, though a specific feature cannot be useful by itself, it can be
the select features are compressed and transformed into a 2- very useful when it is combined with others. Filtering
dimensional space through LDA based feature extraction. strategies do not require a particular learning algorithm,
Such multi-stage dimensionality reduction, allows using a making, them effective and easy to compute. Most of these
simple Neural Network (NN) -based classification algorithm algorithms are based on general characteristics of the data such
as diagnosis estimator, including class identification and as distance, dependence, and consistency among others [17].
membership probability. Consequently, the implementation of feature selection
The proposed diagnosis methodology is validated under a strategies in condition monitoring schemes is used to preserve
complete set of experimental vibrations acquired from an the most discriminative features; in this sense, the filtered
electromechanical system, where five different mechanical features are those that best described the machine working
faults are considered. In this context, novelties of this work condition [20].
On the other hand, feature extraction differs in the question bearing defect (BD), half-broken rotor bar (1/2 BRB), one
of whether a technique is supervised or unsupervised. The broken rotor bar (1 BRB), unbalance (UNB) and misalignment
main difference between both techniques is the availability of (MAL). For each considered condition, ninety axial vibration
labels to distinguish the different classes. measurements have been acquired. Each measurement
PCA is a well-known and the most common used technique corresponds to one second of the machine operation.
for unsupervised dimensionality reduction and feature
A. Signal decomposition and features calculation
extraction [15]. This technique projects a high-dimensional
data set into a new uncorrelated set of features; therefore, no The decomposition of the acquired vibration signal is
redundant information is present. These projections, named performed by means of EMD; such decomposition is applied
principal components, are linear combinations in which the to each considered condition and allows obtaining a set of
variability of the data is better captured. PCA is based on IMFs which are automatically adapted to the corresponding
statistical analysis and even though it does not concern in the vibrational pattern.
separation of different classes, it has advantage in feature Afterwards, each resulting set of IMFs is characterized by
extraction due to preserving the variability of the data. estimating 15 statistical time-based features: mean, maximum
Therefore, the consideration of PCA analysis is helpful in value, RMS, square root mean, standard deviation, variance,
condition monitoring schemes to discard redundant RMS shape factor, square root mean shape factor, crest factor,
information that is not required to detect faults in a system. latitude factor, impulse factor, skewness, kurtosis, and
LDA is one of the most well-known supervised techniques normalized fifth and sixth moments. Therefore, a resulting
used in multi-class problems for linear dimensionality number of 150 numerical features are estimated for each
reduction and feature extraction [20]. LDA aims to find a considered condition. The proposed set of statistical features is
projection into a low-dimensional representation in which it is shown in Table I. These statistical-time features have been
contained the most discriminant information attempting to successfully used for fault detection in electrical motor due to
maximize the linear separation between data points belonging their high-performance source of information and their
to different classes. LDA is a suitable feature extraction capabilities to analyze general trends of the signal [19].
technique to be considered in condition monitoring schemes
because it pays attention to differences of known classes; thus, TABLE I
1
STATISTICAL TIME-BASED FEATURES
̅ = ∙ | |
through the proper application of this technique it is possible
Mean (1)
to obtain the parameters that correctly indicate the machine
working condition. Maximum value = (2)
1
= ∙
Feature selection and feature extraction approaches provide
Root mean square (3)
complementary feature reduction effects; therefore, there is not
1
= ∙ | |
a clear criterion for choosing a specific technique: the
Square root mean (4)
reduction stage is typically implemented in order to fulfill with
= ∙ − ̅
a required data processing.
Standard deviation (5)
1
= ∙ − ̅
III. DIAGNOSIS METHODOLOGY
Variance (6)
The proposed multi-fault diagnosis methodology is
=
1
∙ ∑ | |
composed by six steps as depicted in Fig. 1. First, the signal
RMS Shape factor (7)
decomposition with the estimation of the IMFs from the
=
1
∙ ∑ | |
vibration signal is done by the EMD. Second, the calculation
SRM Shape factor (8)
of a set of statistical-time based features from each IMFs is
" =
done. The proposed hybrid high-dimensional feature reduction Crest factor (9)
# =
method follows. Thus, third, a feature optimization approach
Latitude factor (10)
of the available set of features is done by selecting the most
$ =
1
0.2
0
0
-0.1
-0.2
-5
-0.3
-0.4 -10
-1 -0.8 -0.6 -0.4 -0.2 0 0.2 0.4 -20 0 20 40 60 80
Feature 1 1st Principal component
Fig. 7. Projection of the extracted set of features resulting from the Fig. 8. Projection of the extracted set of features resulting from the classical
application of the proposed hybrid feature reduction strategy. approach PCA.
TABLE IV
HLT BD 1/2BRB 1BRB UNB MAL
DETAIL OF THE TRANSFORMATION MATRIX COMPUTED BY THE LDA TO
OBTAIN A 2-DIMENSIONAL PROJECTION OF THE EXTRACTED FEATURES 0.15
Statistical-time feature Optimized IMF Column 1 Column 2
Root mean square IMF1 0.570 0.505 0.1
Root mean square IMF1 0.021 0.026
Standard deviation IMF1 0.551 0.481 0.05
RMS Shape factor IMF1 0.095 0.055
Feature 2
TABLE V
CONFUSION MATRIX RESULTING FROM THE EVALUATION OF ALL CONSIDERED
CONDITIONS USING THE CLASSICAL PCA
Assigned True Class
Class HLT BD ½ BRB 1 BRB UNB MAL
HLT 8 0 2 0 2 0
BD 0 18 0 0 0 0
½ BRB 1 0 14 1 0 0
1 BRB 3 0 2 17 4 0
UNB 6 0 0 0 12 0
MAL 0 0 0 0 0 18
TABLE VI
CONFUSION MATRIX RESULTING FROM THE EVALUATION OF ALL CONSIDERED
CONDITIONS USING THE CLASSICAL LDA
Assigned True Class
Class HLT BD ½ BRB 1 BRB UNB MAL
HLT 13 0 1 3 1 0
BD 0 18 0 0 0 0
½ BRB 1 0 13 13 1 0
1 BRB 2 0 4 2 2 0
Fig. 11. Projection of the decision regions for the multiple fault classification UNB 2 0 0 0 14 0
corresponding to the test of the first cross validation computed by the MAL 0 0 0 0 0 18
proposed NN-based classifier.