Introduction - To - Reliability Analysis
Introduction - To - Reliability Analysis
Factors of dependability
Reliability, Availability, Maintainability, Safety (RAMS)
Notes:
Avizienis, A., Laprie, J. C., Randell, B., & Landwehr, C. (2004). Basic concepts and taxonomy of dependable and secure
computing. IEEE transactions on dependable and secure computing, 1(1), 11-33.
1 – The ability of an item to perform a required function under stated conditions for a
specified period of time. [Oxford Dictionary 2022].
1 – The ability of an item to perform a required function under stated conditions for a
specified period of time. [Oxford Dictionary 2022].
Failure Rate (λ) - A Reliability index that represents the rate at which your product
fails.
Mean Time To Failure (MTTF) – The reliability index for non-repairable units
represents the mean time to failure.
Mean Time Between Failure (MTBF) – The reliability index for repairable units
represents the mean time between failure.
Notes:
• Failure Rate can also be in per operating hour, per km, per cycle, per solicitation.
• For systems equipped with sensors for condition monitoring, there is a parameter
called “Remaining Useful Life” that can be considered as a live version of MTTF.
Koorosh Aslansefat - University of Hull 8
Definitions
Examples of Repairable Safety-Critical Systems
https://en.wikipedia.org/wiki/Space_Shuttle_Challenger_disaster
Koorosh Aslansefat - University of Hull 11
Definitions
Chernobyl Disaster
https://en.wikipedia.org/wiki/Chernobyl_disaster
Koorosh Aslansefat - University of Hull 12
Definitions
Reliability Indices
Failure Rate (λ) - A Reliability index that represents the rate at which your product
fails.
Mean Time To Failure (MTTF) – The reliability index for non-repairable units
represents the mean time to failure.
https://www.collidu.com/presentation-failure-rate-curve
Koorosh Aslansefat - University of Hull 14
Definitions
Failure Rate
Failure Rate (λ) - A Reliability index that represents the rate at which your product
fails.
EXAMPLE 1: 30 laptops are put on test and run at their normal operating
condition for 1,000 hours. If 6 of those laptops fail during the operating time,
what is the failure rate of the product?
https://www.youtube.com/watch?v=BQXnKpP2lrI
Koorosh Aslansefat - University of Hull 15
Definitions
Failure Rate
Failure Rate (λ) - A Reliability index that represents the rate at which your product
fails.
EXAMPLE 1: 30 laptops are put on test and run at their normal operating
condition for 1,000 hours. If 6 of those laptops fail during the operating time,
what is the failure rate of the product?
Failure Rate (λ) - A Reliability index that represents the rate at which your product
fails.
EXAMPLE 2: 20 laptops are put on test and run at their normal operating
condition for 1,000 hours. If 6 of those laptops fail at the following hours (550,
480, 680, 790, 860, 620), what is the failure rate of the product?
Failure Rate (λ) - A Reliability index that represents the rate at which your product
fails.
EXAMPLE 2: 20 laptops are put on test and run at their normal operating
condition for 1,000 hours. If 6 of those laptops fail at the following hours (550,
480, 680, 790, 860, 620), what is the failure rate of the product?
Failure Rate (λ) - A Reliability index that represents the rate at which your product
fails.
EXAMPLE 2: 20 laptops are put on test and run at their normal operating
condition for 1,000 hours. If 6 of those laptops fail at the following times (550h,
20d, 680h, 790h, 860h, 620h), what is the failure rate (per hour) of the product?
𝟔
𝝀= =𝟎 . 𝟎𝟎𝟎𝟑𝟑𝟑𝟕𝟎𝟒𝟏𝟏
𝟓𝟓𝟎+ 𝟒𝟖𝟎+𝟔𝟖𝟎 +𝟕𝟗𝟎 +𝟖𝟔𝟎+ 𝟔𝟐𝟎+(𝟏𝟒 ∗ 𝟏 , 𝟎𝟎𝟎)
Mean Time To Failure (MTTF) – The reliability index for non-repairable units
represents the mean time to failure.
EXAMPLE 2: 20 laptops are put on test and run at their normal operating
condition for 1,000 hours. If 6 of those laptops fail at the following times (550h,
20d, 680h, 790h, 860h, 620h), what is the MTTF of the product?
−( 𝜆 𝑡 )
− ( 𝑀𝑇𝑇𝐹
1
)𝑡
𝑅 ( 𝑡 )=𝑒 =𝑒
Deif, D., & Gadallah, Y. (2017). A comprehensive wireless sensor network reliability metric for critical Internet of Things applications. EURASIP Journal on
Wireless Communications and Networking, 2017, 1-18.
− ( 4000
1
) 5000
𝑅 ( 5000 ) =𝑒 =0 . 2865
The probability that our product will perform successfully past the 5000 hours mark is
approximately 28.65%.
Koorosh Aslansefat - University of Hull 22
Questions
1- An industrial machine compresses natural gas into an interstate gas pipeline. The
compressor is on line 24 hours a day. (If the machine is down, a gas field has to be
shut down until the natural gas can be compressed, so down time is very expensive.)
The vendor knows that the compressor has a constant failure rate of 0.0000001
failures/hr. What is the operational reliability after 2500 hours of continuous service?
The compressor has a constant failure rate and therefore the lifetimes of these reliability is
given by: where Failure rate =0.0000001 failures/hr, operational time t = 2500 hours.
Reliability = =0.9975
The reliability of the product is given to be 0.98. The reliability of an exponential distribution
is given by: i.e., 0.98=
(a) Since the failure rate is constant, we will use the exponential distribution. Also, the MTTF
= 25 hours. We know, for an exponential distribution, MTTF = 1/. Therefore =1/25=0.04
a) λ=0.03∗10−4 failures/hour
MTTF = 1/λ=333,333 hours i.e., the average life of these seals is about 333,333 hours
Assuming the exponential model, the hazard rate is 1/MTTF = 0.001 So R(500) = e-
500*0.001 = e-0.5 = 0.61
As the hazard rate is a constant, we can use the exponential model with a hazard rate
of 0.04. R(25) = e -0.04*25 = e-1 = 0.37
What is the mean time to failure of the microwave ovens? (Note that the mean life of the
microwave is defined in terms of their mean time to failure because no maintenance is
performed on the ovens).
The MTTF is 2312 hours (simply the average).
Koorosh Aslansefat - University of Hull 29
Definitions
Availability
The probability that a system is operational and functioning correctly at any point in
time. It refers to the ability of the system to be in a state to perform its designated
function under stated conditions whenever required [IEC 60050-192].
Notes:
• The state of an item of being able to perform as required is the “up state” (also
called “working”).
• The state of an item of being unable to perform as required is the “down state,” (also
called “faulty” or “in maintenance”).
• An available item is not necessarily operating (e.g. “stand-by”), (being able to ≠
performing).
• In more complex systems we might have “degraded states”.
The probability that a system is operational and functioning correctly at any point in
time. It refers to the ability of the system to be in a state to perform its designated
function under stated conditions whenever required [IEC 60050-192].
The probability that a system is operational and functioning correctly at any point in
time. It refers to the ability of the system to be in a state to perform its designated
function under stated conditions whenever required [IEC 60050-192] .
Performance
Up State Failures Up State Up State
https://www.toucantoco.com/en/blog/understanding-failure-metrics
Repair Rate () - A availability index that represents the rate at which a product can be
repaired.
Mean Time To Repair (MTTR) – The availability index for repairable units represents
the mean time to repair.
Tracking the software updates, the supervisor finds that the IT department applies
modifications and bug repairs a total of 10 times during the same year. If the total
repair time for yearly updates is 41.5 hours, the supervisor determines the MTTR for
this task as:
Notes:
• Preventive maintenance acts on reliability (and, indirectly, on availability), while the
corrective maintenance only acts on availability.
• Preventive maintenance is “scheduled” when it is carried out in accordance with a
specified timetable and “condition-based” when it is performed upon the
assessment of physical conditions.
Koorosh Aslansefat - University of Hull 39
The Costs Associated with (O&M) of Wind Turbines
01 Onshore [1]
02 Offshore [2]
Of Life [3]
The availability takes the restoration to “up state” (i.e. repairs) into account,
unlike reliability.
When no restoration to “up state” is considered (i.e. items never repaired), the
reliability of an item is equal to its availability.
A poorly reliable item can be very available if the restoration to “up state” is
very fast after each failure.
The availability can increase, decrease, and/or stay constant according to time.
Note:
Overview of reliability indices: failure rate (λ), mean time to failure (MTTF), mean time
K. Pei, et al. K., Cao, Y., Yang, J., & Jana, S. (2017). Deepxplore: Automated whitebox testing of deep learning systems. In proceedings of the 26th
Symposium on Operating Systems Principles (pp. 1-18).
Deif, D., & Gadallah, Y. (2017). A comprehensive wireless sensor network reliability metric for critical
Internet of Things applications. EURASIP Journal on Wireless Communications and Networking, 2017,
1-18.