Zheng 2020
Zheng 2020
a
School of Mechanical Engineering, Southeast University, Nanjing 211189, China
b
School of Mechanical and Aerospace Engineering, Jilin University, Changchun 130022, China
Keywords: The hazard rate of many practical systems depends not only on age but also on a diagnostic covariate process.
Condition-based maintenance Effective maintenance decisions for such systems need to combine both age information and the covariate in-
Dynamic thresholds formation obtained from condition monitoring. This paper proposes a condition-based maintenance (CBM)
Multiple maintenance actions policy with dynamic thresholds and multiple maintenance actions for such a system subject to periodic in-
Proportional hazards model
spection. The hazard rate is described by the proportional hazards model with a continuous-state covariate
Policy-iteration algorithm
process. At each inspection epoch, appropriate action is selected from no maintenance, imperfect maintenance,
and preventive replacement based on two dynamic thresholds. Over an inspection interval, the system may
experience minor failure or catastrophic failure that can be addressed by minimal repair and corrective re-
placement, respectively. The objective is to determine the optimal thresholds that minimize the long-run average
cost rate. A modified policy-iteration algorithm is developed to solve the optimization problem in the semi-
Markov decision process (SMDP) framework. The effectiveness of the proposed approach is illustrated by a
practical numerical example. The comparison with the other CBM policies confirms the outstanding performance
of the proposed policy.
⁎
Corresponding author.
E-mail address: [email protected] (R. Zheng).
https://doi.org/10.1016/j.ress.2020.107123
Received 10 January 2020; Received in revised form 6 July 2020; Accepted 11 July 2020
Available online 15 July 2020
0951-8320/ © 2020 Elsevier Ltd. All rights reserved.
R. Zheng, et al. Reliability Engineering and System Safety 204 (2020) 107123
models where the PHM covariate was characterized as a Wiener pro- guild practical maintenance implementation.
cess. The optimal hazard-rate replacement threshold was obtained in The contributions of this paper are summarized as follows: (1)
two scenarios where the parameters of the covariate distribution were Development of a CBM model considering dynamic thresholds and
known and unknown in advance respectively. Duan et al. [17] devel- multiple maintenance actions; (2) Consideration of both stationary and
oped a CBM policy using the PHM with a gamma process as its cov- non-stationary covariate processes of a PHM model; (3) Employment of
ariate. Besides the replacement threshold, an age threshold was also a discretization method for the calculation of SMDP quantities; (4)
considered for selecting maintenance actions. Aforementioned CBM Development of a modified policy-iteration algorithm for the optimi-
policies involved a constant hazard-rate threshold which was con- zation problem.
venient to be optimized in computation. The remainder of this paper is organized as follows. Section 2 de-
However, making a constant decision throughout the lifecycle of a scribes the hazard function and the CBM policy and then gives the
system is not cost-effective in many situations. One way to extend the optimization problem. Section 3 develops the computational algorithm
traditional replacement policy is to consider dynamic thresholds. For a in the SMDP framework. In Section 4, a numerical example is used to
system subject to dependent soft and hard failures, Tang and Yu [18] demonstrate the effectiveness of the proposed approach. Section 5
proposed a CBM policy with a dynamic replacement threshold, which summarizes the conclusions and discusses possible directions in future
was proven to be monotonically non-increasing with the inspection research.
number. Yahyatabar and Najafi [19] studied a CBM policy with a dy-
namic replacement threshold for a single-unit system. A multi-stage 2. Problem description
stochastic programming approach was developed to optimize the
threshold by minimizing the cost rate in a finite horizon. An extension The following notation is adopted in this paper:
of this study to series-parallel systems was presented in [20]. However,
there still remains a gap in the computation algorithm for finding the
t system age
optimal policy which involves more than one dynamic threshold.
ξ system age at failure
Moreover, the replacement policy only involves preventive and q probability of a failure to be catastrophic
corrective replacements which cannot be performed in engineering X covariate process
practice unless a system is severely deteriorated or under a catastrophic I state space of the covariate process
failure. Therefore, another way to extend the traditional policy is to M absorbing covariate state
h(t, Xt) hazard function at age t
consider imperfect maintenance and/or minimal repair. Mousavi et al. h0(∙) baseline hazard function
[21] proposed a CBM policy where imperfect maintenance or replace- Ψ(∙) link function
ment was selected at each inspection epoch according to a constant v(t) shape parameter of a gamma process
hazard-rate threshold. An algorithm was developed to optimize the u scale parameter of a gamma process
δ a sufficiently small time interval
threshold and the imperfect maintenance policy simultaneously. In
Δ inspection interval
Yahyatabar and Najafi [19,20], the proposed policies excluded cor- K discretization level of Δ, i.e., Δ = Kδ
rective replacement, and all failures were addressed by minimal repair (n, i) system state of age nΔ and covariate state i
irrespective of their failure types. However, further efforts are still re- S system state space
quired to make CBM decisions considering multiple maintenance ac- α improvement factor of imperfect maintenance
CI inspection cost
tions and different failure types.
Cmr cost of minimal repair
Motivated by these requirements, this paper investigates the opti- Cpm cost of imperfect maintenance
mization problem of a CBM policy with dynamic thresholds and mul- Cpr cost of preventive replacement
tiple maintenance actions for a system subject to periodic inspections. Ccr cost of corrective replacement
Tmr time of minimal repair
The hazard rate is modeled by the PHM with a continuous-state cov-
Tpm time of imperfect maintenance
ariate process. Unlike the previous PHM that mostly incorporated sta- Tpr time of preventive replacement
tionary covariate processes, this paper also examines the non-stationary Tcr time of corrective replacement
case which is suitable for many practical deterioration processes (e.g., p(n,i),(n , i ) transition probability from state (n , i ) S to state (n , i ) S
[22,23]). At each inspection epoch where the covariate value is avail- τ(n, i) expected sojourn time for state (n , i ) S
able, appropriate action is selected from no maintenance, imperfect c(n, i) expected cost for state (n , i ) S
v(n, i) relative value for (n , i ) S
maintenance, and preventive replacement based on two dynamic
G maintenance policy
thresholds. During an inspection interval, the system is subject to thresholds for imperfect maintenance at the nth inspection
L1n
random failure that triggers a corrective maintenance action im- L2n thresholds for preventive replacement at the nth inspection
mediately. Two failure types are considered in this paper. If a cata- g long-run average cost rate
strophic failure occurs, corrective replacement is carried out to restore
the system to the brand-new state. If a minor failure occurs, minimal
repair is used to restore the system to the operational state. The ob- 2.1. The hazard function
jective is to determine the optimal thresholds that minimize the long-
run average cost rate. The optimization problem is formulated in the This paper proposes a CBM policy for a system whose hazard rate
semi-Markov decision process (SMDP) framework. depends both on its age t and on a covariate process X = {Xt | t > 0}.
The formulation process requires the calculation of conditional re- According to Cox's PHM, the hazard function of the system is expressed
liability between two consecutive inspection epochs, which is quite as
difficult due to the “random jump” of the continuous-state covariate h (t , Xt ) = h 0 (t ) (Xt ) (1)
process [24,25]. To address this uncertainty properly, this paper de-
velops a discretization method which is distinguished from the previous where h0(t) is the baseline hazard function, and Ψ(Xt) is a positive link
methods [17,26] by using a contract transition matrix to reduce the function describing the effect of the covariate process on failure time.
memory requirement in computation. On that basis, the SMDP quan- Its survivor function is given by
tities, i.e., transition probabilities, expected sojourn time, and expected t
cost, are derived. A modified policy-iteration algorithm which is proved Pr( > t|Xs , 0 < s < t ) = exp h 0 (s ) (Xs ) ds
0
(2)
to be effective is developed to find the optimal maintenance policy. The
graphical representation of the obtained policy is then presented to The process X can be either external environmental factors (e.g.,
2
R. Zheng, et al. Reliability Engineering and System Safety 204 (2020) 107123
temperature, humidity, and work load) [27,28] or internal diagnostic catastrophic and minor are assumed to be q (q ∈ [0, 1]) and (1 – q),
variables (e.g., crack, corrosion, and wears) [29]. In this paper, the respectively. If a catastrophic failure occurs, corrective replacement is
covariate process is assumed to be monotonically increasing and performed immediately to return the system to the state (0, 0). If a
modeled by a gamma process. This assumption is realistic in many minor failure occurs, minimal repair is performed immediately to re-
practical situations such as the crack growth of a mechanical system. store the system state to its previous known system state. This as-
The process starts from zero, i.e., X0 = 0. The increments between time sumption implies that the effect of minimal repair is a little more than
points t and t + Δt are independent and follow a gamma distribution as “as bad as old” since both the system age and the covariate state are
follows non-decreasing over timer. However, this small rejuvenation can be
neglected and, following the assumption, the subsequent computation
u v (t + t ) v (t ) x v (t + t ) v (t ) 1e ux
f (x|t , t ) = Pr(Xt + t Xt = x ) = , x 0 procedure will be significantly simplified.
(v (t + t ) v (t ))
The proposed maintenance policy can then be expressed in the
(3) matrix form as
where v(t) ≥ 0 is the shape parameter; u > 0 is the scale parameter;
L10 L 20
(x ) = 0 s x 1e sds is the gamma function [30]. If v(t) is a linear
function of time t, the process X reduces to a stationary process; L11 L21
G=
otherwise the process is non-stationary.
For the convenience of both computation and the subsequent policy L1N L 2N (6)
description, the covariate process X is discretized into a multi-state
where [L10 , L11, and
...,L1N ]T are the dynamic thresholds
[L 20 , L 21 , ...,L2N ]T
Markov chain with state space I = {0, 1,…, M}, where M is the ab-
for imperfect maintenance and preventive replacement, respectively.
sorbing state. Letting d denote the discretization interval, then Xt is
The objective is to determine the optimal policy that minimizes the
supposed to be in state i {0, ...,M 1} if Xt [id , (i + 1) d) , and in
long-run average cost rate.
state M if Xt [Md , + ) . Similar discretization methods can be found
in [31]. Using the mid-point (i + 0.5)d as the covariate value of state
3. Computational algorithm in the SMDP framework
i ∈ I approximately, the probability that the process X transits from
state i ∈ I to state j ∈ I, where i ≤ j, between time points t and t + Δt is
This section formulates and solves the maintenance optimization
given by
problem in the SMDP framework. First, a method is developed to cal-
P(t , i),(t + t,j) = Pr(Xt + t =j|Xt = i ) culate the conditional reliability between two discrete time points.
ub Second, we derive the SMDP quantities including transition prob-
= f (x|t , t ) dx
lb abilities, expected sojourn time, and expected cost. A modified policy-
(v (t + t ) v (t ), lb ·u ) (v (t + t ) v (t ), ub ·u ) algorithm is later presented to solve the optimization problem.
=
(v (t + t ) v (t ))
(4) 3.1. Conditional reliability
3
R. Zheng, et al. Reliability Engineering and System Safety 204 (2020) 107123
In Fig. 1, the left notation below each transition path denotes its
3.2. SMDP quantities sojourn time. If the system survives to the next inspection epoch, the
sojourn time is Δ. Otherwise, the sojourn time is associated with failure
The SMDP is determined by the following quantities [35]: time and corresponding repair time. The expected sojourn time of the
current state is given by
• i′)p ∈ S at=thethenext
(n, i),(n , i ) probability that the system will be in the state (n′,
decision epoch if the current state is (n, i) ∈ S; (n, i)
• τ = the expected sojourn time until the next decision epoch if the
(n, i) = · p(n, i),(n + 1, j) +
0
[(1 q)(t + Tmr ) + q (t + Tcr )] dF (t|n , i)
current state is (n, i) ∈ S; j I
• = the expected cost incurred until the next epoch if the current
c (n, i)
= · R ( |n , i) + [(1 q)(t + Tmr ) + q (t + Tcr )] dF (t|n , i )
state is (n, i) ∈ S. 0
The notation above each transition path denotes its transition (14)
probability. The first path denotes the situation where the system sur-
where F (t|n , i) = 1 R (t|n , i ) is the probability that the system fails
vives to the next inspection epoch with the covariate state j ∈ I.
before ξ = nΔ + t; Tmr denotes the time of minimal repair; and Tcr
According to Eq. (8), the transition probability is calculated as
represents the time of corrective replacement.
p(n, i ),(n + 1, j) = Pr (X(n+ 1) = j , > (n + 1) Xn = i , >n ) The right notation below a transition path denotes its cost. If the
K 1
system survives to the next inspection epoch, an inspection is required
to reveal the covariate value which incurs inspection cost CI. If cata-
= ei· H (n + k )·eTj
k=0 (11) strophic or minor failure occurs, corresponding repair cost is incurred.
Since the next system state after minimal repair or corrective replace-
The second path denotes the situation where a minor failure occurs ment is known, it is unnecessary to carry out an inspection and thus no
before the next inspection epoch. Minimal repair is then carried out and inspection cost is incurred. The expected cost for the current state can
restores the system state to (n, i). The transition probability is given by be calculated as
4
R. Zheng, et al. Reliability Engineering and System Safety 204 (2020) 107123
c (n, i ) = CI · p(n, i),(n + 1, j) + Cmr ·p(n, i ),(n, i) + Ccr ·p(n, i),(0,0) 0) after performing corrective replacement. The transition probability is
j I given by
= CI · R ( |n , i) + Cmr (1 q)[1 R ( |n , i )] p(n, i),(0,0) = p(n, i),(n, i ) · p(n, i ),(0,0)
+ Ccr q [1 R ( |n , i)]
= 1·[q· Pr ( < (n + 1) > n , Xn = i )]
= [CI (1 q) Cmr qCcr ]· R ( |n , i) + (1 q) Cmr + qCcr
= q·[1 R ( |n , i )]
K 1
K 1
= [CI (1 q) Cmr qCcr ]· ei · H (n + k )· 1
k=0
= q· 1 ei · H (n + k )· 1
+ (1 q) Cmr + qCcr k=0 (18)
(15) The expected sojourn time of the current state consists of two parts:
(1) the sojourn time between the current state and the intermediate
where CI, Cmr and Ccr are costs associated with inspection, minimal
system state Tpm; and (2) the expected sojourn time between the in-
repair, and corrective replacement, respectively.
termediate state and the possible next states. The calculation of the
Scenario 2: The current system state is (n, i), where L1n i < L2n
second part is similar to Eq. (14). In sum, the expected sojourn time of
and n < N.
the current state is calculated as
For the current state, imperfect maintenance is carried out to restore
the system to state to (n, ⌊iα⌋) which requires no maintenance and thus (n, i) = Tpm + ·R ( n , i )
is treated as an intermediate state. Starting with the intermediate state,
there are three possible transition paths which are similar to those in + [(1 q)(t + Tmr ) + q (t + Tcr )] dF (t n , i )
0
Fig. 1. The transition paths of the current state are presented in Fig. 2.
= Tpm + R (t n, i ) dt
The first path denotes that the system successfully transits to next 0
inspection epoch with covariate state j ∈ I. The transition probability is + [1 R ( n , i )][(1 q) Tmr + qTcr ]
given by K m 1
= Tpm + · ei · H (n + k )· 1
p(n, i),(n+ 1, j ) = p(n, i),(n, i ) ·p(n, i ),(n + 1, j) m=1 k=0
= 1·Pr (X(n+ 1) = j , > (n + 1) Xn = i , >n ) K 1
K 1 + 1 ei · H (n + k )·1 [(1 q) Tmr + qTcr ]
= ei · H (n + k )·eTj k=0 (19)
k=0
The expected cost of the current state consists of two parts: the cost
(16)
of imperfect maintenance Cpm, and the expected cost incurred after the
The second path denotes that a minor failure occurs in next in- intermediate state is similar to Eq. (15). Therefore, the expected cost is
spection interval. Minimal repair is incurred to restore the system state
c (n, i)
to (n, ⌊iα⌋). The transition probability is
= Cpm + CI · p(n, i ),(n + 1, j) + Cmr ·p(n, i ),(n, i ) + Ccr ·p(n, i ),(0,0)
p(n, i),(n, i ) = p(n, i ),(n, i ) ·p (n, i ),(n, i ) j I
= 1·[(1 q)·Pr ( < (n + 1) > n , Xn = i )] = Cpm + [CI (1 q) Cmr qCcr ]· R ( n , i ) + (1 q) Cmr + qCcr
= (1 q)·[1 R ( n , i )] K 1
K 1 = Cpm + [CI (1 q) Cmr qCcr ]· e i · H (n + k )· 1
= (1 q)· 1 ei · H (n + k )· 1 k=0
k=0 (17) + (1 q) Cmr + qCcr
The last path represents the situation where a catastrophic failure (20)
occurs in the next inspection interval. The system state will transit to (0, Scenario 3: The current system state is (n, i), where i L 2n or
n = N.
5
R. Zheng, et al. Reliability Engineering and System Safety 204 (2020) 107123
6
R. Zheng, et al. Reliability Engineering and System Safety 204 (2020) 107123
7
R. Zheng, et al. Reliability Engineering and System Safety 204 (2020) 107123
Table 1
Average cost rate for parameters b, q, and α.
b Average cost rate q Average cost rate α Average cost rate
8
R. Zheng, et al. Reliability Engineering and System Safety 204 (2020) 107123
5. Conclusions
This paper has proposed a CBM policy with dynamic thresholds and
multiple maintenance actions for a periodically inspected system sub-
ject to both aging and a continuous-state covariate process. Two failure
types are considered and the failure time is modeled by the PHM. The
proposed policy involves various maintenance actions such as no
maintenance, imperfect maintenance, preventive replacement, minimal
repair, and corrective replacement. Two dynamic thresholds are set to
select actions at inspection epochs. The optimization of the thresholds is
conducted in the SMDP framework. The SMDP quantities are derived
Fig. 11. Optimal thresholds of Policy 1 and Policy 2.
based on a discretization method. A modified policy-iteration algorithm
is developed to find the optimal maintenance policy that minimizes the
long-run average cost rate. A practical numerical example is provided
Table 2 to illustrate the proposed method. Some important conclusions are
Comparison of different maintenance policies summarized as follows: (1) The thresholds for imperfect maintenance
Maintenance policy Control limits Average cost rate and preventive replacement are monotonically non-increasing over
system age; (2) It is necessary to consider multiple maintenance actions
Proposed policy See Figs. 4 or 5 272.3
for effective CBM decision-making; (3) The obtained graphical policy
Policy 1 See Fig. 11 273.4
Policy 2 See Fig. 11 301.5 presentation is supportive in practical maintenance activities; (4) The
Policy 3 L = 1.16 × 10−3 302.9 proposed policy outperforms the other existing CBM policies.
There are still some interesting topics that can be pursued in future
research: (1) The improvement factor of imperfect maintenance can be
random rather than deterministic; (2) The parameters of the covariate
replacement policy with a constant hazard-rate threshold L. As men- process can be unknown in advance and updated by condition mon-
tioned above, such a policy has been widely considered; see for instance itoring [36]; (3) Another interesting extension is to jointly consider the
[9,17]. proposed CBM policy and the lot-sizing for a production system [37].
The proposed algorithm can be used for solving the optimization
problem of Policy 1 and Policy 2. The obtained thresholds are presented CRediT authorship contribution statement
in Fig. 11, and average cost rates are listed in Table 2. Policy 3 can be
optimized by a procedure that includes the following three steps: (1) Rui Zheng: Conceptualization, Methodology, Data curation,
Discretization of a suitable hazard rate region (e.g., [0, 0.02] in this Software, Writing - original draft. Bingkun Chen: Investigation,
paper) to generate candidates of L; (2) Computation of the long-run Writing - review & editing. Liudong Gu: Visualization, Software,
average cost rate according to Eq. (27) for each candidate; and (3) Validation.
Section of the candidate with the minimum of average cost rate. In-
terested readers can refer to [15,17] for more information about the
Declaration of Competing Interest
optimization procedure. The obtained threshold and average cost rate
are shown in Table 2.
It is noted that policy 1 is a special case of the proposed policy by There is no conflict of interest. The manuscript has not been sub-
removing imperfect maintenance. The comparison between the pro- mitted to any other journal or a conference.
posed policy and policy 1 proves that imperfect maintenance is effective
in reducing maintenance cost. Similarly, policy 2 acts as a special case Acknowledgments
of policy 1 by excluding minimal repair. Consequently, the comparison
between policy 1 and policy 2 shows that minimal repair should also be This work was supported by China Scholarship Council (CSC) under
considered while making CBM decisions. Policy 2 and policy 3 are Grant No. 201806090017.
9
R. Zheng, et al. Reliability Engineering and System Safety 204 (2020) 107123
For a sufficiently small time interval δ, it is reasonable to assume that the covariate state does not change between t and t + δ. Thus, according to
Eq. (2), the second component in Eq. (A.1) is given by
Pr( > t + | > t ,Xt = i)
t+
= exp h 0 (s ) (Xt + s ) ds
t
t+
exp h 0 (s ) (Xt ) ds
t
t+
= exp [(i + 0.5) d]· h 0 (s ) ds
t
(A.2)
Substituting Eq. (A.2) into Eq. (A.1), we can obtain that
t+
hij (t )=P(t , i),(t + , j)·exp [(i + 0.5) d]· h 0 (s ) ds
t
(A.3)
which completes the proof.
Proof. Start the proof from considering the transition between t and t + δ. Using Eq. (7), it is easy to obtain the probability of each covariate state at
the time point t + δ through
o1 = ei · H (n ) (B.1)
where the (j+1)th ( j = 0, 1, ...,M ) element in o1 is the transition probability from i ∈ I to j ∈ I.
Using o1 as the initial state of the time interval between t + δ and t + 2δ, the probability of each covariate state at the end of this interval is
o2 = o1·H (t + ) = ei · H (t )· H (t + ) (B.2)
Recursively, we can obtain the probability of each covariate state at the time point of t + rδ (r = 1, 2, ...) as follows
or = or 1·H (t + (r 1) )
= or 2 ·H (t + (r 2) )· H (t + (r 1) )
= ...
r 1
= ei · H (t + k )
k=0 (B.3)
Then we can obtain the probability that the system survivals to state j ∈ I at t + rδ (r = 1,2,…) given the state i ∈ I at t as follows
r 1
Pr(Xt + r = j, > t + r |Xn = i , > t ) = or · eTj = ei · H (t + k )·eTj
k=0 (B.4)
It is noted that if r = 0, the probability equals to 1 if j = i, and equals to 0 otherwise. Eq. (B.4) holds for this situation which completes the proof.
Proof. First, the classical policy-iteration algorithm is considered. Since the actions of state (0, 0) and (N, i), where i ∈ I, are deterministic, we just
need to consider the action-selection for the other states. The action set for state (n, i) ∈ S, where n = 1, 2, ...,N 1 and i ∈ I, can be defined as
a(n, i) A = {NM , IM , PR} , where NM, IM, and PR denote no maintenance, imperfect maintenance, and preventive replacement, respectively. Then,
v(n, i) (G, a) = c(n, i) (a) g (G)· (n, i) (a ) + p(n, i ),(n , i ) (a)· v(n , i ) (G)
(n , i ) S (C.1)
defines the relative value of selecting a ∈ A for state (n, i) and thereafter following the existing policy G. In the policy-improvement step, the action
for state (n, i) is determined by
min {v(n, i) (G, a (n , i ))}
a(n, i) A (C.2)
According the policy improvement theorem, a new policy constructed with the actions obtained in Eq. (C.2) is not worse than the current policy G
[35].
Second, consider the policy proposed in this paper. Assume that in the current policy G L1n = k (k = 0, 1, ...,M ). Consider the situation where L2n is
fixed and L1n is moved from k to k + 1. It indicates that only the action of state (n, k) is changed from IM to NM, and the actions of the other states
keep the same as G. Therefore, according to Eq. (28), we have
10
R. Zheng, et al. Reliability Engineering and System Safety 204 (2020) 107123
11
R. Zheng, et al. Reliability Engineering and System Safety 204 (2020) 107123
system subject to degradation and external shocks. Comput Ind Eng imperfect information and the proportional hazards model. Int J Prod Res
2016;102:21–32. 2007;45(4):989–1012.
[35] Tijms HC. Stochastic models: an algorithmic approach. New York: John Wiley & [37] Jafari L, Makis V. Joint optimal lot sizing and preventive maintenance policy for a
Sons; 1994. production facility subject to condition monitoring. Int J Prod Econ
[36] Ghasemi A, Yacout S, Ouali MS. Optimal condition based maintenance with 2015;169:156–68.
12