Two-Dimensional Optimization Framework of Online Interpretable Time-Frequency Feature Learning For Practical Machine Health Monitoring
Two-Dimensional Optimization Framework of Online Interpretable Time-Frequency Feature Learning For Practical Machine Health Monitoring
Abstract—Data-driven feature extraction for machine health nearing failure. Consequently, it is essential to keep assessing the
monitoring has garnered significant attention, yet two key limi- operational performance of machines on a regular basis. Ongo-
tations remain unaddressed: lack of interpretability and the need
ing monitoring of machine health helps in preventing unexpected
for extensive historical fault data. To overcome these problems,
an online two-dimensional optimization framework is proposed breakdowns and devising cost-effective maintenance strategies.
that enables interpretable time-frequency feature extraction and The availability of extensive monitoring data has spurred
health index (HI) construction without requiring faulty samples advancements in data-driven continuous health monitoring tech-
for model training. Our approach introduces a convex hull-based nologies. For instance, Xu et al. [4] combined a stacked
closest point optimization model for estimating time-frequency
auto-encoder with an exponential function to produce highly
instances and learning interpretable time-frequency features. By
leveraging a small set of baseline vibration samples and recent on- monotonic health indices (HIs). Ma et al. [5] applied wavelet
line data, rapid fault diagnosis can be achieved based on optimized packages, statistical features, and Grassmann manifold tech-
interpretable time-frequency features. This method also facilitates niques for machine performance evaluation through locally lin-
long-term degradation tracking by constructing and updating an ear embedding. Wu et al. [6] incorporated an inverse hyperbolic
HI from collected time-frequency spectrograms. Once machine
tangent function in a multiscale convolutional neural network
faults appear, updated time-frequency features can show apparent
and interpretable fault signatures for prompt fault alarming. More- to construct HIs for trend prediction, using frequency signals
over, the proposed framework allows continuous HI updates for as input. Duong et al. [7] used frequency amplitudes around
incipient fault detection and degradation tracking. The proposed fault characteristic frequencies in a deep convolutional neural
framework is validated by using two run-to-failure datasets and network for bearing fault detection. Qin et al. [8] implemented
ablation experiments are conducted to demonstrate its superiority.
a knowledge-based loss function in a variational autoencoder to
Index Terms—Closest point optimization model, convex hull, create HIs for continuous monitoring. To overcome the challenge
health index (HI), interpretable time-frequency feature, machine of manual feature extraction, several end-to-end training frame-
health monitoring. works have been introduced for continuous health monitoring.
For example, Wen et al. [9] used multilevel fusion and genetic
programming to create an end-to-end system for HI development
I. INTRODUCTION
and trend prediction. Shi and Chehade [10] implemented a two-
ERFORMANCE deterioration can happen at any time dur-
P ing the operational phase [1], [2]. Given that the decline
in machine performance typically unfolds over an extended
stage continuous monitoring approach using long short-term
memory networks.
While purely data-driven approaches offer significant poten-
period, it is crucial to implement ongoing health monitoring of tial to improve model performance and reduce reliance on expert
machinery [3]. This process involves gathering condition data knowledge, they often lack transparency and reliability in prac-
from machines ranging from those in optimal condition to those tical applications. Consequently, integrating data-driven meth-
ods with signal-processing techniques is frequently necessary.
Signal processing features are more interpretable and practical.
Received 22 November 2023; revised 27 May 2024 and 27 August 2024; However, traditional time-domain and frequency-domain data
accepted 27 October 2024. This work was supported by the National Natural processing methods fall short in handling nonstationary and
Science Foundation of China under Grant 52475112. Associate Editor: E. Pohl.
(Corresponding author: Dong Wang.) nonlinear signals for weak fault signature enhancement [11].
Tongtong Yan and Min Xia are with the Department of Mechanical and Time-frequency methods, such as short-time Fourier transform
Materials Engineering, The University of Western Ontario, London, ON N6A (STFT) and wavelet transform (WT), are more effective for these
3K7, Canada (e-mail: [email protected]; [email protected]).
Dong Wang, Tangbin Xia, and Lifeng Xi are with the Department of Industrial types of signals. Numerous studies have combined these tech-
Engineering and Management, School of Mechanical Engineering, and the niques with data-driven models for machine health monitoring.
State Key Laboratory of Mechanical System and Vibration, Shanghai Jiao For instance, Cao et al. [12] used time-frequency representations
Tong University, Shanghai 200240, China (e-mail: [email protected];
[email protected]; [email protected]). of vibration signals with marginal spectra and applied them
Digital Object Identifier 10.1109/TR.2024.3489589 to a temporal broad learning system for health assessment.
1558-1721 © 2024 IEEE. Personal use is permitted, but republication/redistribution requires IEEE permission.
See https://www.ieee.org/publications/rights/index.html for more information.
Authorized licensed use limited to: The University of Hong Kong Libraries. Downloaded on January 12,2025 at 12:57:43 UTC from IEEE Xplore. Restrictions apply.
This article has been accepted for inclusion in a future issue of this journal. Content is final as presented, with the exception of pagination.
Du et al. [13] converted data into two-dimensional images using framework in detecting incipient machine faults and tracking
continuous WT and employed these images in a convolutional degradation by identifying informative frequency bands and
neural network with global attention for trend evaluation. Hu providing progressively updated HIs.
et al. [14] fed STFT-derived grey images into a deep convolu- This article is structured as follows. Section II explores the
tional network for assessing performance degradation. Time- signatures and characteristics of faulty samples using simulation
frequency techniques have shown significant advantages in signals and introduces convex hulls to analyze time-frequency
machine health monitoring across various applications. How- maps. Section III presents a convex hull-based closest point
ever, practical implementation faces several challenges. Data- optimization model and proposes an online two-dimensional
driven methods often lack interpretability, leaving engineers optimization framework for practical health monitoring. Sec-
unable to fully understand how these models make decisions. tion IV demonstrates the application of this framework using two
In addition, these models typically require faulty samples for run-to-failure datasets, illustrating how it continuously extracts
training, which are rarely available, particularly for new ma- physics-informed features from the time-frequency domain and
chines still in their early stages. Although time-frequency maps constructs HIs for real-time monitoring. In addition, perfor-
are widely used, their integration into opaque models limits the mance comparisons and ablation studies are provided to evaluate
understanding of critical frequency bands and fault characteris- the proposed optimization model and HIs.
tics. Consequently, these approaches remain largely inscrutable.
The convex hull is a fundamental geometric concept, defined II. FAULT CHARACTERISTICS IN THE TIME-FREQUENCY
as the smallest convex set enclosing a dataset. Recently, opti- DOMAIN AND ITS GEOMETRIC REPRESENTATION BASED
mization models based on convex hulls have found applications ON CONVEX HULLS
in machine health monitoring, primarily for fault diagnosis, This section introduces the mathematical basis of STFT, a
where they serve as classification tools. These models estimate famous time-frequency analysis technique. It then discusses the
the distribution of different fault classes based on convex hulls time-frequency domain characteristics of fault signals based on
and use distance metrics to classify new samples. For instance, STFT. Following this, convex hulls are utilized to represent
Zeng et al. [15] proposed a flexible convex hull based classifi- time-frequency maps in matrix form, allowing for a compact
cation model and applied this model to machine fault diagnosis expression and characterization of geometric features across
[16]. Li et al. [17] enhanced this approach by integrating the various health states.
Riemannian manifold into the model, while He et al. [18] intro-
duced a kernel-based, flexible convex hull for fault classification. A. Fault Signatures in Time-Frequency Representation
Another study [19] proposed a convex hull-based degradation
model for analyzing spectral lines, though it was limited to Vibration signals from machines often exhibit nonlinear
frequency amplitudes and could not handle time-varying signals. and nonstationary characteristics, especially in harsh operat-
In addition, this approach requires faulty samples for training, ing conditions. Traditional spectral analysis methods such as
which are often not available. the Fourier transform are inadequate for such signals, whereas
In summary, existing data-driven models for continuous time-frequency techniques, including STFT and WT, are more
health monitoring face two key problems. First, despite the suitable. This study emphasizes fault features within the time-
integration of signal processing algorithms, these models still frequency domain, using STFT as an example to convert
suffer from interpretability problems. Second, they rely on fault time-domain vibration signals into time-frequency spectro-
data for model training. To overcome these limitations, this grams. STFT operates by applying a windowed Fourier trans-
study proposes a two-dimensional optimization framework for form to capture local spectral information. For a continuous
online, interpretable time-frequency feature extraction and HI signal s(t), the STFT of the s(t) can be mathematically written
construction, removing the reliance on faulty data for model as follows [20]:
∞
training. The key contributions of this article are summarized
as follows. First, it thoroughly examines the time-frequency TFs (t, f ) = [s (τ ) g ∗ (τ − t)] e−j2πf τ dτ (1)
−∞
characteristics of fault vibration signals using simulation data. A
convex hull-based closest point optimization model is proposed where TFs means the time-frequency map of s(t); g(τ − t) is a
to extract interpretable time-frequency features by approximat- window function with a short time length; ∗ means the operation
ing two-dimensional maps with convex hulls. Second, a novel of complex conjugate. It is observed from (1) that the STFT
online two-dimensional optimization framework is introduced of the s(t) at time t can be regarded as the Fourier transform
for practical health monitoring. This framework can identify performed by multiplying the signal s with the moving window
informative time-frequency bands without requiring faulty sam- g; for a discrete signal s(k), the STFT of the s(k) can be obtained
ples, enabling rapid fault diagnosis. Long-term monitoring data by discretizing the STFT in (1) as follows:
∞
of time-frequency spectra and their optimized features are used
to update an HI for incipient fault detection and degradation TFs (μ, υ) = s (k) g ∗ (kT − μT )e−j2π(υF )k (2)
tracking. Finally, the study demonstrates the proposed method- k=−∞
ology with two run-to-failure datasets, showcasing continuous where T > 0 and F > 0 are respectively sampling periods for
health monitoring without faulty samples for model training. time and frequency; μ and υ are integers and they means
Validation results confirm the effectiveness of the proposed that TFs (t, f ) is sampled at equidistant time-frequency grid
Authorized licensed use limited to: The University of Hong Kong Libraries. Downloaded on January 12,2025 at 12:57:43 UTC from IEEE Xplore. Restrictions apply.
This article has been accepted for inclusion in a future issue of this journal. Content is final as presented, with the exception of pagination.
YAN et al.: TWO-DIMENSIONAL OPTIMIZATION FRAMEWORK OF ONLINE INTERPRETABLE TIME-FREQUENCY FEATURE LEARNING 3
Authorized licensed use limited to: The University of Hong Kong Libraries. Downloaded on January 12,2025 at 12:57:43 UTC from IEEE Xplore. Restrictions apply.
This article has been accepted for inclusion in a future issue of this journal. Content is final as presented, with the exception of pagination.
covC fault can be formulated as follows: further investigate the impact of these two objectives on the
m l
physics-informed features. Combining these objectives helps in
1
effectively identifying fault characteristics in the time-frequency
min ai TFH
i − bj TFF
j
ai ,bj 2 i=1 j=1
domain. Specifically, the key differences between ideal healthy
2
and faulty time-frequency maps often involve recurring fault
m
transients within a relevant frequency range, exhibiting spar-
s.t. ai = 1, 0 ≤ ai ≤ 1 sity. By optimizing contrastively, the proposed features should
i=1
enhance the detection of important frequency components and
l
reduce noise in the time-frequency domain.
bj = 1, 0 ≤ bj ≤ 1 In essence, If the proposed physics-informed features φ
j=1 have the same coordinate with TFF H
j and TFi in the time-
i = 1, 2, . . . , m; j = 1, 2, . . . , l. (7) frequency domain, then φ can be understood as an optimized
time-frequency feature and it can be capable of filtering out
Next, considering the group sparsity of fault signatures in the noise automatically. To mitigate reliance on faulty samples, an
time-frequency domain, a norm regulation can be added to (7) online two-dimensional optimization framework is presented.
to enhance the sparsity of extracted time-frequency features as This framework enables continuous and interpretable learning
follows: of time-frequency features and HI construction. The updated
m
l
time-frequency features are derived from baseline and recent
min ai TFH
i − bj TFF
j samples, facilitating prompt fault diagnosis through a convex
ai ,bj
i=1 j=1
2
optimization model in (8). All collected data and their cor-
responding optimized features are recorded, allowing for the
m
l
continuous updating of HI for monitoring emerging faults and
+λ ai TFH
i − bj TFF
j tracking degradation trends.
i=1 j=1
1
m
l
s.t. ai = 1, 0 ≤ ai ≤ 1, bj = 1, 0 ≤ bj ≤ 1 B. Online Two-Dimensional Optimization Framework Without
i=1 j=1 Needing Faulty Samples
i = 1, 2, . . . , m; j = 1, 2, . . . , l (8) In practical scenarios, especially with newly introduced ma-
chines, there is often a shortage of faulty samples and extensive
where λ is a hyperparameter to balance two terms in the objective
historical data. Addressing machine health monitoring under
functions. It can be observed from (8) that the optimization
these conditions remains a significant challenge. This section
model is convex, as each term in the objective function and con-
proposes a practical two-dimensional optimization framework
straints is convex. If a feasible solution exists, global optimal so-
for ongoing health monitoring, leveraging the optimization
lutions a∗ and b∗ are guaranteed. Herein, a∗ = [a∗1 , a∗2 , . . . , a∗m ]
model in (8). When a new machine Ω is first monitored, healthy
and b∗ = [b∗1 , b∗2 , . . . , b∗l ] . Based on this, the proposed inter-
vibration data is collected. To conserve storage space, high-
pretable time-frequency features φ are defined as follows:
frequency data is not recorded continuously, as performance
l
m
degradation typically occurs gradually. Instead, data is gathered
φ= b∗j TFF
j − a∗i TFH
i (9) in fixed-length segments at regular intervals. These intervals
j=1 i=1 can be adjusted according to specific conditions and needs,
where φ ∈ Rq×w is the proposed physics-informed fea- providing insight into changes in machine conditions. Assume
tures in the time-frequency that the time interval for data collection is set to T so that online
m domain. Based on (9), the first
l
objective function 12 i=1 ai Hi − j=1 bj Nj 2 in (8) is
collection vibration data can be denoted as vT , v2T , . . . ,vsT .
equivalent to 12 φ2 while the second objective function Once they are acquired, they are immediately transformed into
m l 1 time-frequency maps TFT , TF2T , . . . ,TFsT based on STFT.
i=1 ai Hi − j=1 bj Nj 1 is equivalent to 2 φ1 . Geomet- Since the machine Ω can generally be regarded as being in a
rically, the first objective function is to maximize the sep-
completely normal state when it is first used, the first collected
aration between time-frequency maps of healthy and faulty
small number of time-frequency maps TFmT i=T (m s) are used
states estimated by convex hulls and 2 norm is used to
as healthy time-frequency maps to establish a baseline convex
penalize large elements in the proposed physics-informed
hull. Herein, TFmTi=T (m s) means the time-frequency maps
features, aiming to reduce overfitting. The second objective
of collected vibration signals from T to mT . For subsequent
function employs a 1 norm to promote sparsity in these 2mT
obtained time-frequency maps T Fi=(m+1)T (m s), they are
features, which improvesm their robustness
l to outliers. There-
subjectively classified as “faulty time-frequency maps,” allow-
fore, the first term i=1 ai Hi − j=1 bj Nj 2 in the ob- ing the optimization model in (8) to be applied. The optimization
jective function aims to highlight the significant differences
model in (8) can be simplified as follows:
between healthy and
faulty time-frequency maps. The second
m l
term i=1 ai Hi − j=1 bj Nj 1 seeks to enforce sparsity
in these differences. Ablation experiments in case studies will (a∗ , b∗ , φ) = Φ C normal , C fault (10)
Authorized licensed use limited to: The University of Hong Kong Libraries. Downloaded on January 12,2025 at 12:57:43 UTC from IEEE Xplore. Restrictions apply.
This article has been accepted for inclusion in a future issue of this journal. Content is final as presented, with the exception of pagination.
YAN et al.: TWO-DIMENSIONAL OPTIMIZATION FRAMEWORK OF ONLINE INTERPRETABLE TIME-FREQUENCY FEATURE LEARNING 5
where C normal and C fault are the inputs of the optimization model. Algorithm 1: Iterative Optimization Framework for Online
a∗ and b∗ are the outputs of the optimization model and they are Machine Health Monitoring.
the coefficients of convex hulls corresponding to C normal and
Input: baseline normal dataset {T F mT i=T } with STFT,
C fault . φ is the interpretable time-frequency features. Therefore,
online collected dataset {T F sT i=(m+1)T } with STFT,
the interpretable time-frequency features can be updated for
sample of baseline normal data and online data for the
quick fault monitoring as follows:
optimization model m, λ
⎧ mT fault 2mT
⎪
⎪ Φ C normal
= TF , C = TF Output: a, b, φ
⎪
⎪ i=T i=(m+1)T
1. For k = 1, 2, . . . , (s − 2m + 1) do
⎪
⎪
⎪t ∈ (0, 2mT ) 2. baseline normal dataset T F H mT
⎪
⎪
⎪ fault (2m+1)T i = {T F i=T }
⎪
⎨Φ C normal = TFmT i=T , C = TFi=(m+2)T F (k+2m−1)T
3. online update dataset T F j = {T F j=(k+m)T }
φ = t ∈ ((m + 2) T, (2m + 1)T ) . 4. #optimization formulation for CVX solver
⎪
⎪
⎪
⎪ . 5. cvx begin
⎪
⎪..
⎪
⎪ fault sT 6. variablesa(m)b(m)
⎪
⎪Φ C normal = TFmT = TFi=sT −mT +1
⎪ i=T , C
⎩ 7.
t ∈ (sT − mT + 1, sT )
m
l
(11)
It can be observed from (11) that initial interpretable time- objective f unction = ai T F H
i − bj T F F
j
mT i=1 j=1
frequency features are obtained based on C normal = {Pi=T } 2
2mT
and C fault
= {Pi=(m+1)T }. Subsequently, the first updated m
l
interpretable time-frequency features are obtained based on +λ ai T F H
i − bj T F F
j
mT (2m+1)T
C normal = {Pi=T } and C fault = {Pi=(m+2)T }. The assumed i=1 j=1 1
fault time-frequency maps are truncated from (m + 1)T to 8. minimize(objectivef unction)
sT through a window with a length of mT and the window
moves step by step based on collection time T . With this subject to
approach, interpretable time-frequency features can be updated m
l
online with each data collection, enabling swift fault diagnosis. ai = 1, ai ≥ 0, bj = 1, bj ≥ 0
In addition, the collected time-frequency maps and updated i=1 j=1
features facilitate long-term monitoring of degradation. Online 9. cvx end
updated interpretable time-frequency features are denoted as 10. calculate the time-frequency feature φ based on (9)
φ2mT , φ(2m+1)T , φ(2m+2)T , . . . ,φsT . The update frequency of 11. end for
interpretable time-frequency features can match the data collec-
tion interval. These features allow for real-time fault diagnosis
and identification of informative frequency bands. While these
features indicate the current health status, an HI is necessary maps with transparent construction processes, and it can identify
for tracking long-term degradation. Combining interpretable incipient degradation and reflect ongoing deterioration. Besides,
time-frequency features with an HI provides a comprehen- the interpretable features enhance the HI’s sensitivity to early
sive approach to monitoring machine health over its lifecycle. faults and improve its effectiveness in monitoring degradation
The HI is updated based on the latest interpretable features to trends.
offer a clear method for detecting incipient faults and tracking To calibrate HI scales across different collection times, each
degradation and it is defined as follows: HI is normalized, allowing for the detection of incipient faults
⎧ based on trending changes. This study introduces an adaptive
⎪
⎪ sum (TF i ◦ φ2mT ) , i ∈ (0, 2mT ) threshold method, using a three-sigma rule [23], to identify fault
⎪
⎨sum TFi ◦ φ(2m+1)T , i ∈ (0, (2m + 1) T ) initiation. The threshold is updated online with each new HI.
HIi = .. (12) To address noise and outliers, a fault alert is triggered only if
⎪
⎪ .
⎪
⎩ multiple consecutive HI values exceed the adaptive threshold.
sum (TFi ◦ φsT ) , i ∈ (0, sT )
This approach ensures precise detection of incipient faults and
where ◦ is the operation of Hadamard product; Since matrices enables the assessment of degradation trends and future predic-
TFi and φ have the same dimension, the result of TFi ◦ φ is tions.
a matrix with the same dimension with TFi and φ. Its element The practical two-dimensional optimization framework for
is equal to the multiplication between the elements of TFi and online machine health monitoring is summarized in Fig. 2
φ in the corresponding place; sum(·) is the operation of sum and detailed in Algorithm 1. Initially, a small set of baselined
and its result is to add all elements in a matrix. As shown in and online time-frequency matrices are collected. The convex
(12), the HI at each collection time integrates time-frequency optimization model in (8) can be addressed by using some
maps of vibration signals with updated interpretable features in established algorithms such as the interior point method and
a data-level fusion way. This HI can be continuously updated for tools such as CVX. This study uses CVX based on a MATLAB
tracking degradation. The validity of the HI is supported by two environment [24] as a demonstration to solve the optimization
factors. It is built from physically interpretable time-frequency model in (8). Solutions provide updated interpretable features for
Authorized licensed use limited to: The University of Hong Kong Libraries. Downloaded on January 12,2025 at 12:57:43 UTC from IEEE Xplore. Restrictions apply.
This article has been accepted for inclusion in a future issue of this journal. Content is final as presented, with the exception of pagination.
IV. REAL EXPERIMENTAL CASE STUDY collection point, 20 480 vibration samples were recorded to
assess the bearings’ health at that particular time. In total, the
This section illustrates the application of the proposed
NASA bearings underwent 984 data collection intervals before
methodology for online continuous health monitoring through
one of them completely failed. Fig. 4 illustrates raw vibration
two endurance tests of bearings, demonstrating their perfor-
data from various collection points, showing that the ampli-
mance from initial new conditions to eventual failure.
tude of vibration samples increases with time. However, the
vibration energy recorded at collection time 900 is lower than
A. Developed Practical Methodology to Realize Online
that at collection time 703, suggesting that despite worsening
Incipient Fault Detection and Degradation Tracking of NASA bearing performance, the vibration responses might diminish.
Bearings
Furthermore, fault characteristics observed at the early stages of
The experimental setup for assessing bearing life cycles failure are notably weak, as demonstrated in Fig. 4(c) and (d),
is illustrated in Fig. 3. As depicted, the setup includes four which display patterns similar to those in Fig. 4(a) and (b). This
bearings, labeled bearings 1–4, mounted on a single shaft. A issue presents significant challenges for detecting early faults
motor positioned on the shaft’s left side maintains a constant and accurately assessing the degradation process.
rotational speed of 2000 r/min. To monitor vibrations during the The proposed online approach employs a convex hull opti-
endurance test of the bearings, accelerometers were affixed to mization model combined with time-frequency maps to enable
the bearing housings. The test was terminated once the bearings continuous health monitoring of the NASA bearings without
surpassed their designed life span. The specifics of the vibration needing prior fault sample knowledge. In this section, param-
data collection process are outlined as follows: a sampling rate eters λ and m are defined as 1 and 10, respectively. This
of 20 000 Hz was employed, with data being gathered every implies that ten healthy time-frequency maps and ten faulty
10 min. Consequently, T is 10 min for this study. At each time-frequency maps are utilized to extract interpretable features
Authorized licensed use limited to: The University of Hong Kong Libraries. Downloaded on January 12,2025 at 12:57:43 UTC from IEEE Xplore. Restrictions apply.
This article has been accepted for inclusion in a future issue of this journal. Content is final as presented, with the exception of pagination.
YAN et al.: TWO-DIMENSIONAL OPTIMIZATION FRAMEWORK OF ONLINE INTERPRETABLE TIME-FREQUENCY FEATURE LEARNING 7
from the time-frequency data. As vibration data are collected at online monitoring of machine health. Furthermore, this approach
each time point, they are promptly converted into time-frequency allows for adaptive updates to pinpoint informative frequency
maps using STFT. Consequently, a new time-frequency map is bands accurately.
produced at each collection interval. Thus, initial interpretable Subsequently, the HIs derived from the proposed method are
features are derived from time-frequency maps collected be- used to assess fault signatures within the interpretable time-
tween times 1 and 20. Specifically, the maps from times 1 frequency features, enhancing the precision of early fault de-
through 10 are considered healthy, whereas those from times tection. Once optimized time-frequency features are obtained at
11 through 20 are treated as faulty. These features are then each collection interval, the HI trajectory is updated accordingly.
updated using time-frequency maps from collection periods Fig. 6 displays the HIs at various collection times for online
1–10 and 12–21. In this case, maps from times 1–10 are still early fault detection, based on (12). Fig. 6 demonstrates that the
used as healthy references, and maps from times 12–21 are proposed HIs effectively monitor the degradation process of the
assumed to represent faults. Given that 984 collection points NASA bearing. During the healthy state, the HIs derived from
were recorded for NASA bearings, this method allows for the the interpretable time-frequency features show random fluctu-
update of 965 time-frequency features. Fig. 5 illustrates some ations and follow a Gaussian distribution, with no observable
selectively chosen interpretable time-frequency features from degradation trend, as shown in Fig. 6(a)–(e). However, when an
various collection times, demonstrating how the proposed fea- incipient fault develops, the HIs begin to exhibit a noticeable
tures evolve as the collection times progress. increasing trend, signaling the onset of the fault, as depicted in
During the healthy phase of the bearing, neither the ini- Fig. 6(f).
tial nor the updated time-frequency features before collec- According to the threshold determination method outlined
tion time 519 display any distinct signatures, with their in [23], an early fault in the NASA bearing was identified at
energy distributions appearing relatively uniform, as illustrated collection time 533. Typically, because degradation is usually
in Fig. 5(a)–(d). Following collection time 519, the NASA an irreversible process, one would expect the HI to show a
bearing transitions into a faulty phase, with a gradual decline consistent monotonic trend throughout a bearing’s lifecycle.
in performance. Concurrently, the energy within the proposed Nonetheless, after collection time 533, some fluctuations are ob-
interpretable time-frequency features begins to focus on specific served in the proposed HIs. This behavior has been examined in
informative frequency bands, as depicted in Fig. 5(e)–(i). This various studies, which have applied the concept of self-healing to
indicates that the proposed features are informed by physical account for these fluctuations [25]. Despite this, such variations
principles and effectively highlight the informative frequency do not affect the precise detection of early faults. The HI values
bands crucial for subsequent fault diagnosis. The methodology consistently surpass the threshold during the lifecycle, indicating
applied here employs a data-driven approach to derive inter- its sensitivity to early faults and its capability for online updates
pretable features within the time-frequency domain, facilitating to monitor the degradation process. In addition, the efficacy
Authorized licensed use limited to: The University of Hong Kong Libraries. Downloaded on January 12,2025 at 12:57:43 UTC from IEEE Xplore. Restrictions apply.
This article has been accepted for inclusion in a future issue of this journal. Content is final as presented, with the exception of pagination.
Fig. 7. Degradation trends of RMS and kurtosis for continuous health moni-
toring of the NASA bearing. (a) RMS. (b) Kurtosis.
Authorized licensed use limited to: The University of Hong Kong Libraries. Downloaded on January 12,2025 at 12:57:43 UTC from IEEE Xplore. Restrictions apply.
This article has been accepted for inclusion in a future issue of this journal. Content is final as presented, with the exception of pagination.
YAN et al.: TWO-DIMENSIONAL OPTIMIZATION FRAMEWORK OF ONLINE INTERPRETABLE TIME-FREQUENCY FEATURE LEARNING 9
Fig. 12. Square envelope spectra at incipient faults times 533, 534, and 535 by
using the indicated informative frequency band based on proposed interpretable
time-frequency features. (a) Collection time 533. (b) Collection time 534.
(c) Collection time 535.
Fig. 10. Indicated informative frequency bands at incipient faults times 533,
534, and 535 by using the fast kurtogram. (a) Collection time 533. (b) Collection
time 534. (b) Collection time 535.
Fig. 13. Bearing endurance test for life cycle vibration data collection [30].
Fig. 11. Square envelope spectra at incipient faults times 533, 534, and 535
by using the fast kurtogram. (a) Collection time 533. (b) Collection time 534.
(c) Collection time 535.
Authorized licensed use limited to: The University of Hong Kong Libraries. Downloaded on January 12,2025 at 12:57:43 UTC from IEEE Xplore. Restrictions apply.
This article has been accepted for inclusion in a future issue of this journal. Content is final as presented, with the exception of pagination.
Authorized licensed use limited to: The University of Hong Kong Libraries. Downloaded on January 12,2025 at 12:57:43 UTC from IEEE Xplore. Restrictions apply.
This article has been accepted for inclusion in a future issue of this journal. Content is final as presented, with the exception of pagination.
YAN et al.: TWO-DIMENSIONAL OPTIMIZATION FRAMEWORK OF ONLINE INTERPRETABLE TIME-FREQUENCY FEATURE LEARNING 11
C. Ablation Experiments
In this section, ablation experiments of the proposed two-
dimensional optimization model in (8) are conducted to show the
functions of different objective functions. The results between
the first objective function based solely on 2 norm, the second
objective function based solely on 1 norm, and their combi-
nation are compared. This section uses the XJTU-SY bearing
dataset as an illustrative example.
By using the same online optimization strategy, the optimized
interpretable time-frequency features and their fused HI at dif- Fig. 21. Square envelope spectra of the XJTU-SY bearing at incipient fault
ferent collection times based solely on 2 norm are respectively times 78, 79, and 80 by using the Fast Kurtogram. (a) Collection time 78.
(b) Collection time 79. (c) Collection time 80.
depicted in Figs. 23 and 24. And the optimized interpretable
Authorized licensed use limited to: The University of Hong Kong Libraries. Downloaded on January 12,2025 at 12:57:43 UTC from IEEE Xplore. Restrictions apply.
This article has been accepted for inclusion in a future issue of this journal. Content is final as presented, with the exception of pagination.
Fig. 24. Updated HIs for incipient fault detection during continuous health
monitoring of the XJTU-SY bearing at different collection times by solely using
2 norm. (a) HI during collection time 11–59. (b) HI during collection time
Fig. 22. Square envelope spectra of the XJTU-SY bearing at incipient fault
11–64. (c) HI during collection time 11–69. (d) HI during collection time 11–74.
times 78, 79, and 80 by using the indicated informative frequency band
(e) HI during collection time 11–79. (f) HI during collection time 11–84. (g) HI
of proposed interpretable time-frequency features. (a) Collection time 78.
during collection time 11–89. (h) HI during collection time 11–94. (i) HI during
(b) Collection time 79. (c) Collection time 80.
collection time 11–99.
Authorized licensed use limited to: The University of Hong Kong Libraries. Downloaded on January 12,2025 at 12:57:43 UTC from IEEE Xplore. Restrictions apply.
This article has been accepted for inclusion in a future issue of this journal. Content is final as presented, with the exception of pagination.
YAN et al.: TWO-DIMENSIONAL OPTIMIZATION FRAMEWORK OF ONLINE INTERPRETABLE TIME-FREQUENCY FEATURE LEARNING 13
Fig. 26. Updated HIs for incipient fault detection during continuous health Fig. 29. Proposed interpretable time-frequency features at different collection
monitoring of the XJTU-SY bearing at different collection times by solely using times for continuous health monitoring of the XJTU-SY bearing (λ = 0.01). (a)
1 norm. (a) HI during collection time 11–59. (b) HI during collection time Initial time-frequency feature at collection time 20. (b) Updated time-frequency
11–64. (c) HI during collection time 11–69. (d) HI during collection time 11–74. feature at collection time 49. (c) Updated time-frequency feature in collection
(e) HI during collection time 11–79. (f) HI during collection time 11–84. (g) HI time 69. (d) Updated time-frequency feature in collection time 90. (e) Updated
during collection time 11–89. (h) HI during collection time 11–94. (i) HI during time-frequency feature in collection time 101. (f) Updated time-frequency
collection time 11–99. feature in collection time 109. (g) Updated time-frequency feature in collection
time 114. (h) Updated time-frequency feature in collection time 119. (i) Updated
time-frequency feature in collection time 123.
Fig. 27. Comparison between fused HIs for the XJTU-SY bearing by using
different objective functions. (a) 2 norm. (b) 1 norm. (c) Combination of 2
norm and1 norm.
Fig. 30. Updated HIs for incipient fault detection during continuous health
monitoring of the XJTU-SY bearing at different collection times (λ = 0.01).
(a) HI during collection time 11–59. (b) HI during collection time 11–64. (c)
HI during collection time 11–69. (d) HI during collection time 11–74. (e)
HI during collection time 11–79. (f) HI during collection time 11–84. (g) HI
during collection time 11–89. (h) HI during collection time 11–94. (i) HI during
collection time 11–99.
D. Discussion of Parameter λ
This section presents additional experiments to examine the
effect of the model parameter λ on time-frequency features and
their associated HIs. Due to space constraints, the XJTU-SY
bearing is also chosen as a case study to illustrate the influence
of λ. Various representative values for λ, including λ = 0.01,
Fig. 28. Comparison between interpretable time-frequency features at collec-
tion time 78 for continuous health monitoring of the XJTU-SY bearing by using λ = 0.1, and λ = 100, are evaluated. The time-varying features
different objective functions. (a) 2 norm. (b) 1 norm. (c) Combination of 2 and corresponding HIs for different values of λ are depicted in
norm and1 norm. Figs. 29–34. It can be observed that λ does not significantly affect
the performance of the proposed interpretable time-frequency
Authorized licensed use limited to: The University of Hong Kong Libraries. Downloaded on January 12,2025 at 12:57:43 UTC from IEEE Xplore. Restrictions apply.
This article has been accepted for inclusion in a future issue of this journal. Content is final as presented, with the exception of pagination.
Fig. 31. Proposed interpretable time-frequency features at different collection Fig. 33. Proposed interpretable time-frequency features at different collection
times for continuous health monitoring of the XJTU-SY bearing (λ = 0.1). (a) times for continuous health monitoring of the XJTU-SY bearing (λ = 100) (a)
Initial time-frequency feature at collection time 20. (b) Updated time-frequency Initial time-frequency feature at collection time 20. (b) Updated time-frequency
feature at collection time 49. (c) Updated time-frequency feature in collection feature at collection time 49. (c) Updated time-frequency feature in collection
time 69. (d) Updated time-frequency feature in collection time 90. (e) Updated time 69. (d) Updated time-frequency feature in collection time 90. (e) Updated
time-frequency feature in collection time 101. (f) Updated time-frequency time-frequency feature in collection time 101. (f) Updated time-frequency
feature in collection time 109. (g) Updated time-frequency feature in collection feature in collection time 109. (g) Updated time-frequency feature in collection
time 114. (h) Updated time-frequency feature in collection time 119. (i) Updated time 114. (h) Updated time-frequency feature in collection time 119. (i) Updated
time-frequency feature in collection time 123. time-frequency feature in collection time 123..
Fig. 32. Updated HIs for incipient fault detection during continuous health
monitoring of the XJTU-SY bearing at different collection times (λ = 0.1). Fig. 34. Updated HIs for incipient fault detection during continuous health
(a) HI during collection time 11–59. (b) HI during collection time 11–64. monitoring of the XJTU-SY bearing at different collection times (λ = 100).
(c) HI during collection time 11–69. (d) HI during collection time 11–74. (e) (a) HI during collection time 11–59. (b) HI during collection time 11–64.
HI during collection time 11–79. (f) HI during collection time 11–84. (g) HI (c) HI during collection time 11–69. (d) HI during collection time 11–74. (e)
during collection time 11–89. (h) HI during collection time 11–94. (i) HI during HI during collection time 11–79. (f) HI during collection time 11–84. (g) HI
collection time 11–99. during collection time 11–89. (h) HI during collection time 11–94. (i) HI during
collection time 11–99.
Authorized licensed use limited to: The University of Hong Kong Libraries. Downloaded on January 12,2025 at 12:57:43 UTC from IEEE Xplore. Restrictions apply.
This article has been accepted for inclusion in a future issue of this journal. Content is final as presented, with the exception of pagination.
YAN et al.: TWO-DIMENSIONAL OPTIMIZATION FRAMEWORK OF ONLINE INTERPRETABLE TIME-FREQUENCY FEATURE LEARNING 15
Second, if a feature can reflect fault signatures, a data-level [12] Y. Cao, M. Jia, X. Zhao, X. Yan, and Z. Liu, “Semi-supervised machinery
fusion scheme can be used to develop an intuitive and phys- health assessment framework via temporal broad learning system em-
bedding manifold regularization with unlabeled data,” Expert Syst. Appl.,
ical HI for incipient fault detection. Third, the parameter λ vol. 222, 2023, Art. no. 119824, doi: 10.1016/j.eswa.2023.119824.
does not have a significant impact on the optimization of the [13] X. Du, W. Jia, P. Yu, “RUL prediction based on GAM–CNN for rotating
model. For online deployment, according to our tests, due to machinery,” J. Braz. Soc. Mech. Sci. Eng., vol. 45, no. 3, Mar. 2023,
doi: 10.1007/s40430-023-04062-8.
the use of matrices as model inputs and norm optimization in [14] M. Hu, G. Wang, K. Ma, Z. Cao, and S. Yang, “Bearing
the proposed optimization model, the inference time of inter- performance degradation assessment based on optimized EWT
pretable time-frequency features requires over ten minutes while and CNN,” Measurement, vol. 172, 2021, Art. no. 108868,
doi: 10.1016/j.measurement.2020.108868.
they show slight differences in a degradation stage and they [15] M. Zeng, Y. Yang, J. Zheng, and J. Cheng, “Maximum margin classification
mainly concentrate on some informative frequency bands. The based on flexible convex hulls,” Neurocomputing, vol. 149, pp. 957–965,
computation of a HI took within one second. Therefore, the 2015, doi: 10.1016/j.neucom.2014.07.038.
[16] M. Zeng, Y. Yang, J. Zheng, and J. Cheng, “Maximum margin clas-
interpretable time-frequency features can be updated according sification based on flexible convex hulls for fault diagnosis of roller
to an application demand, such as 15 min while the HI can be bearings,” Mech. Syst. Signal Process., vol. 66, pp. 533–545, Jan. 2016,
updated once new data are collected. Finally, some limitations doi: 10.1016/j.ymssp.2015.06.006.
[17] X. Li, Y. Yang, W. Ping, W. Jian, and J. Cheng, “A bearing fault diagno-
of the proposed methodology are discussed as follows. First, sis scheme with statistical-enhanced covariance matrix and Riemannian
more advanced and efficient algorithms are needed to shorten the maximum margin flexible convex hull classifier,” ISA Trans., vol. 111,
inference time of interpretable time-frequency features, which pp. 323–336, May 2021, doi: 10.1016/j.isatra.2020.11.018.
[18] Z. He, H. Shao, J. Cheng, Y. Yang, and J. Xiang, “Kernel flexible and
will be investigated in the future. Second, the applications of displaceable convex hull based tensor machine for gearbox fault intelli-
the proposed online optimization framework mainly focus on gent diagnosis with multi-source signals,” Measurement, vol. 163, 2020,
bearings in this study, which can be extended to other mechanical Art. no. 107965, doi: 10.1016/j.measurement.2020.107965.
[19] T. Yan, Y. Wang, T. Xia, B. Hou, L. Xi, and D. Wang, “Sparse
components in future studies. Finally, effective strategies for and flexible convex-hull representation for machine degradation mod-
updating the baseline normal dataset and further enhancing eling,” IEEE Trans. Rel., vol. 72, no. 1, pp. 27–36, Mar. 2023,
method performances will be considered. doi: 10.1109/tr.2022.3164976.
[20] J. Allen, “Short term spectral analysis, synthesis, and modification by
discrete Fourier transform,” IEEE Trans. Acoust., Speech Signal Process.,
vol. 25, no. 3, pp. 235–238, Jun. 1977.
REFERENCES [21] M. Zhao, J. Lin, X. Xu, and Y. Lei, “Tacholess envelope order analysis
and its application to fault detection of rolling element bearings with
[1] A. K. S. Jardine, D. Lin, and D. Banjevic, “A review on machin-
varying speeds,” Sensors, vol. 13, no. 8, pp. 10856–10875, Aug. 2013,
ery diagnostics and prognostics implementing condition-based mainte-
doi: 10.3390/s130810856.
nance,” Mech. Syst. Signal Process., vol. 20, no. 7, pp. 1483–1510, 2006,
[22] M. Zeng, Y. Yang, S. Luo, and J. Cheng, “One-class clas-
doi: 10.1016/j.ymssp.2005.09.012.
sification based on the convex hull for bearing fault detec-
[2] H. Zhu, C. Shen, J. Wang, B. Chen, D. Wang, and Z. Zhu, “Few-shot class-
tion,” Mech. Syst. Signal Process., vol. 81, pp. 274–293, 2016,
incremental learning with adjustable pseudo-incremental sessions for bear-
doi: 10.1016/j.ymssp.2016.04.001.
ing fault diagnosis,” IEEE Sensors J., vol. 24, no. 12, pp. 19543–19552,
[23] N. Li, Y. Lei, J. Lin, and S. X. Ding, “An improved exponential
Jun. 2024, doi: 10.1109/jsen.2024.3395515.
model for predicting remaining useful life of rolling element bearings,”
[3] T. Yan, D. Wang, T. Xia, E. Pan, Z. Peng, and L. Xi, “Online piecewise
IEEE Trans. Ind. Electron., vol. 62, no. 12, pp. 7762–7773, Dec. 2015,
convex-optimization interpretable weight learning for machine life cycle
doi: 10.1109/tie.2015.2455055.
performance assessment,” IEEE Trans. Neural Netw. Learn. Syst., vol. 35,
[24] M. Grant and S. Boyd, “CVX: MATLAB software for disciplined convex
no. 5, pp. 6048–6060, May 2024, doi: 10.1109/TNNLS.2022.3183123.
programming,” version 2.0 beta, Sep. 2013. [Online]. Available: http://
[4] F. Xu, Z. Huang, F. Yang, D. Wang, and K. L. Tsui, “Constructing a
cvxr.com/cvx
health indicator for roller bearings by using a stacked auto-encoder with an
[25] N. Yang, Z. Wang, W. Cai, and Y. Li, “Data regeneration based
exponential function to eliminate concussion,” Appl. Soft Comput., vol. 89,
on multiple degradation processes for remaining useful life esti-
2020, Art. no. 106119, doi: 10.1016/j.asoc.2020.106119.
mation,” Rel. Eng. Syst. Saf., vol. 229, 2023, Art. no. 108867,
[5] M. Ma, X. Chen, X. Zhang, B. Ding, and S. Wang, “Locally linear em-
doi: 10.1016/j.ress.2022.108867.
bedding on grassmann manifold for performance degradation assessment
[26] A. Klausen, H. van Khang, and K. G. Robbersmyr, “RMS based health
of bearings,” IEEE Trans. Rel., vol. 66, no. 2, pp. 467–477, Jun. 2017,
indicators for remaining useful lifetime estimation of bearings,” Model.,
doi: 10.1109/tr.2017.2691730.
Identification Control, Norwegian Res. Bull., vol. 43, no. 1, pp. 21–38,
[6] C. Wu, F. Feng, S. Wu, P. Jiang, and J. Wang, “A method for constructing
2022, doi: 10.4173/mic.2022.1.3.
rolling bearing lifetime health indicator based on multi-scale convolutional
[27] J. Meng, C. Yan, G. Chen, Y. Liu, and L. Wu, “Health indicator of
neural networks,” J. Braz. Soc. Mech. Sci. Eng., vol. 41, no. 11, 2019,
bearing constructed by rms-CUMSUM and GRRMD-CUMSUM with
doi: 10.1007/s40430-019-2010-6.
multifeatures of envelope spectrum,” IEEE Trans. Instrum. Meas., vol. 70,
[7] B. Duong et al., “A reliable health indicator for fault prognosis of bearings,”
Jan. 2021, Art. no. 3510216, doi: 10.1109/tim.2021.3054000.
Sensors, vol. 18, no. 11, 2018, Art. no. 3740, doi: 10.3390/s18113740.
[28] L. Wang, J. Pan, Y. Shao, Q. Zeng, and X. Ding, “Two new kurtosis-
[8] Y. Qin, J. Zhou, and D. Chen, “Unsupervised health Indicator construc-
based similarity evaluation indicators for grinding chatter diagnosis un-
tion by a novel degradation-trend-constrained variational autoencoder
der non-stationary working conditions,” Measurement, vol. 176, 2021,
and its applications,” IEEE/ASME Trans. Mechatron., vol. 27, no. 3,
Art. no. 109215, doi: 10.1016/j.measurement.2021.109215.
pp. 1447–1456, Jun. 2022, doi: 10.1109/tmech.2021.3098737.
[29] D. Wang, Z. Peng, and L. Xi, “The sum of weighted normal-
[9] P. Wen, Y. Li, S. Chen, and S. Zhao, “Remaining useful life prediction of
ized square envelope: A unified framework for kurtosis, negative en-
IIoT-enabled complex industrial systems with hybrid fusion of multiple in-
tropy, Gini index and smoothness index for machine health monitor-
formation sources,” IEEE Internet Things J., vol. 8, no. 11, pp. 9045–9058,
ing,” Mech. Syst. Signal Process., vol. 140, 2020, Art. no. 106725,
Jun. 2021, doi: 10.1109/jiot.2021.3055977.
doi: 10.1016/j.ymssp.2020.106725.
[10] Z. Shi and A. Chehade, “A dual-LSTM framework combining change
[30] Y. Jun, Y. A. N. Tao, L. I. Naipeng, W. Biao, H. A. N. Tianyu, and
point detection and remaining useful life prediction,” Rel. Eng. Syst. Saf.,
L. E. I. Yaguo, “XJTU-SY rolling element bearing accelerated life test
vol. 205, 2021, Art. no. 107257, doi: 10.1016/j.ress.2020.107257.
datasets: A tutorial,” J. Mech. Eng., vol. 55, no. 16, pp. 1–6, 2019,
[11] H. Qiu, J. Lee, J. Lin, and G. Yu, “Wavelet filter-based weak signa-
doi: 10.3901/jme.2019.16.001.
ture detection method and its application on rolling element bearing
prognostics,” J. Sound Vib., vol. 289, no. 4/5, pp. 1066–1090, 2006,
doi: 10.1016/j.jsv.2005.03.007.
Authorized licensed use limited to: The University of Hong Kong Libraries. Downloaded on January 12,2025 at 12:57:43 UTC from IEEE Xplore. Restrictions apply.