0% found this document useful (0 votes)

74 views24 pages

Presentation - 02 Reliability in Computer Systems

This document discusses reliability in computer systems. It defines reliability and outlines things that can cause systems to fail, such as hardware issues, software bugs, human error, and natural disasters. It also describes critical systems and different types, including safety-critical, mission-critical, business-critical, and security-critical systems. Methods to improve reliability like backups, redundancy, fault tolerance, and defensive programming are explained.

Uploaded by

victorwu.uk

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPTX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

74 views24 pages

Presentation - 02 Reliability in Computer Systems

Uploaded by

victorwu.uk

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPTX, PDF, TXT or read online on Scribd

You are on page 1/ 24

Teach Computer Science

Reliability in computer
systems

teachcomputerscience.com
2

Lesson Objectives

▪ Students will learn about things that can go wrong in a

computer system and how to avoid such situations.
▪ What are critical systems?
▪ How to protect systems from failure
▪ What to do if a computer system fails
▪ How to analyse reliability of a system

teachcomputerscience.com
1.
Content

teachcomputerscience.com
4

What is reliability?
▪ Reliability of any computer-related component is an attribute
that denotes its consistent performance according to the
specifications.

teachcomputerscience.com
5

Things that can go wrong

 Hardware might fail to operate.

 Software might contain bugs or Natural Hardware
errors. disasters failure
 Human errors can also make the
system inefficient. Security is very System
important to systems as there might Failure
even be a deliberate attack. Software
Human
 Natural disasters like power cuts, error error
flooding or an earthquake affect the
operation of systems too.

teachcomputerscience.com
6

What is a critical system?

▪ Critical systems are computer systems that must be highly
reliable, as their failure may have a great impact on human lives.
▪ Developed using a conservative technique rather than a new
technique.
▪ A new technique is only implemented after analysing its long-
term effects, even though it might seem to be more efficient.

teachcomputerscience.com
7

Types of critical systems

Critical system

Safety-critical Mission-critical Business-critical Security-critical

systems systems systems systems

teachcomputerscience.com
8

Types of critical systems

Safety-critical Mission-critical Business-critical Security-critical

systems systems systems systems
Designed to avoid Failure of these Designed to avoid Designed to protect
danger to human systems affects the loss of business, sensitive
lives and the overall economic loss and information that
environment. performance, as loss of reputation. can be misused
Example: they are responsible Example: Banking when in the wrong
Temperature for the goals in the systems hands.
control of nuclear system. Example: Defence
reactors. Example: navigation systems
systems of aircraft.
teachcomputerscience.com
9

What is backup?
▪ Duplicate data and files stored in a separate server or storage
drive to improve the reliability of a system are called backups.
▪ This protects the data from being lost due to failure.
▪ Backup is also useful when data is accidentally overwritten.

teachcomputerscience.com
10

Backup procedure

 The team responsible for the backup

procedure performs the backup
according to a well-defined schedule.
 Backup disks are to be stored in a
secure location. Back-up
Safe Scheduled
 Disks and tapes secured in a
fireproof location are called an off-
site backup.
 The data can also be backed-up over
Fire-safe
the Internet using cloud technology. Cloud-
technology
teachcomputerscience.com
11

Disaster recovery

 Disaster recovery is the process of getting back lost data from the backup after a
system failure.
 Let us consider the example of hardware failure. To recover from this failure, the
hardware is repaired or replaced with new hardware. The data is recovered from
the backup and copied to the hardware.
 Examples of precautionary measures taken by an organisation to avoid disaster
are use of uninterruptible power supply (UPS), surge protectors (to minimise the
power surges in electronic equipment), fire prevention and anti-virus software.

teachcomputerscience.com
12

Redundancy

 Redundancy is the duplication of Hardware

critical parts of a computer system to redundancy
improve reliability.
 If the primary system fails, the
backup or reserve system steps in.
 Redundancy is very important in Redundancy
critical systems like aircraft systems. Data
If any hardware or software fails Software
redundancy
during a flight, the redundant system redundancy
steps in to avoid failure.

teachcomputerscience.com
13

Types of redundancy
Hardware redundancy Software Data redundancy
Computer systems have an extra redundancy Redundant data in
critical hardware device to avoid Redundant software the backup can
failure. is used to replace the replace the original
Example: A system is provided original program in data in case the
with two power supplies in a case it fails. original data is lost or
parallel set up so that they can be overwritten
easily switched if one of them accidentally.
fails.
Redundant array of independent
disks (RAID): multiple physical
disk drives are used to store
redundant data. teachcomputerscience.com
14

What is fault-tolerance?
▪ Fault tolerance is a property that enables a system to operate
properly even if the system undergoes one or more failures.
▪ Essential for life-critical systems.
▪ This design enables a system to continue its operation, might
be at a reduced level, rather than failing completely, even when
some parts of the system fails.
▪ Data is protected from damage, intrusion or disclosure.

teachcomputerscience.com
15

What is fail-soft system?

▪ When a system gracefully fails, that is, operates at a reduced
level after some component failures, is called a fail-soft system.
▪ For example: a building may operate with reduced lighting and
elevators in case the power fails.

teachcomputerscience.com
16

Defensive programming

 Software can be made more reliable by adding extra checks.

 These checkpoints will warn the user in case the program is not working in the
desired manner. This is called defensive programming.
 This enables the user to take action.
 In the absence of these extra checks, the program would crash without any
warning.

teachcomputerscience.com
17

Measuring reliability
Time between failures

Time to repair Time to failure

Reliability of a system is measured using
various statistical parameters that are
used to predict how reliable the system is.

System Resumes normal System

failure operation failure

teachcomputerscience.com
18

Reliability factors

 Percentage of time:
The percentage of time denotes the percentage of time for which the service was
available and operational during a particular month.
 Number of hours:
Number of hours denotes the amount of time the system has operated without
reporting any problems.

teachcomputerscience.com
19

Reliability factors

 Downtime:
The period during which a system breaks down or spends out of action. Zero
downtime refers to a system that is available all the time.
 Mean time between failures (MTBF):
Meantime between failures is calculated by taking the average of the time
between failures of a system.
 Meantime to failure (MTTF):
Mean time to failure is the time duration in which the system is expected to
continue its operation before system failure.

teachcomputerscience.com
20

Let’s review some concepts

Reliability Critical systems Backup

Reliability of any computer- Critical systems are computer Duplicate data and files stored in
related component is an systems that must be highly a separate server or storage
attribute that denotes its reliable as their failure may have drive to improve the reliability of
consistent performance a great impact on human lives. a system are called backup.
according to the specifications.

Redundancy Fault-tolerance Statistical parameters to

measure reliability
Redundancy is the duplication of Fault tolerance is a property that
critical parts of a computer enables a system to operate Percentage of time
system to improve reliability. properly even if the system
Number of hours
undergoes one or more failures.
(Hardware, software and data)
Downtime
Mean time between failures and
Mean time to failure
teachcomputerscience.com
2.
Activity

teachcomputerscience.com
22

Activity-1
Duration: 15 minutes

You are a programmer developing a banking system.

A. What are the important parts of this system? In what ways
could these parts fail?
B. How can you protect the system from possible failures?

teachcomputerscience.com
3.
End of topic questions

teachcomputerscience.com
24

End of topic questions

1. Where is backup stored?
2. What are the different types of redundancy? How are they
useful in improving the reliability of systems?
3. What is a fault-tolerant system?
4. What is a fail-soft system?
5. How can the reliability of a system be measured? Write down
the different parameters with a line of explanation.

teachcomputerscience.com

Lesson 3 Classification of Drugs 9learners
No ratings yet
Lesson 3 Classification of Drugs 9learners
50 pages
3.1 Tuple Relational Calculus
No ratings yet
3.1 Tuple Relational Calculus
11 pages
Signals and Systems
No ratings yet
Signals and Systems
29 pages
Answer Sheet - 01 Introduction To Computers
No ratings yet
Answer Sheet - 01 Introduction To Computers
7 pages
Schema de Principe Electrical Schematic
No ratings yet
Schema de Principe Electrical Schematic
78 pages
Engr213 Chapter 4 Homework Solutions
No ratings yet
Engr213 Chapter 4 Homework Solutions
18 pages
DC-6 Om
100% (4)
DC-6 Om
522 pages
OCR PGOnline Full A-Level Textbook
No ratings yet
OCR PGOnline Full A-Level Textbook
378 pages
Manual HON 370 20 GB
No ratings yet
Manual HON 370 20 GB
51 pages
HPE Smart Choice Gen 11 - Supplemental QuickSpecs-a50009219enw
No ratings yet
HPE Smart Choice Gen 11 - Supplemental QuickSpecs-a50009219enw
51 pages
Computer Science
No ratings yet
Computer Science
3 pages
Genes 2 - (Variation and Adaptation, Inheritance)
No ratings yet
Genes 2 - (Variation and Adaptation, Inheritance)
8 pages
4.1.1.5 GCSE Biology AQA OCR EXDECEL. Microscopy Answers
No ratings yet
4.1.1.5 GCSE Biology AQA OCR EXDECEL. Microscopy Answers
4 pages
Traumatic Care DR - GOLDEN
No ratings yet
Traumatic Care DR - GOLDEN
34 pages
Cie Igcse Computer Science 0478 Practical Znotes
No ratings yet
Cie Igcse Computer Science 0478 Practical Znotes
7 pages
8 Powerful Icon Libraries
No ratings yet
8 Powerful Icon Libraries
10 pages
Worksheet 3 LS6 - MIANO, REYMARK
No ratings yet
Worksheet 3 LS6 - MIANO, REYMARK
1 page
Paper I Telugu 8th Jan 2025 Shift 1
No ratings yet
Paper I Telugu 8th Jan 2025 Shift 1
88 pages
MACA - A New Channel Access Method For Packet Radio: Phil Karn, KA9Q
100% (1)
MACA - A New Channel Access Method For Packet Radio: Phil Karn, KA9Q
5 pages
Hazard Identification: 2. Risk Analysis/Evaluation 3. Risk Control
No ratings yet
Hazard Identification: 2. Risk Analysis/Evaluation 3. Risk Control
2 pages
Year 8 Key Topic 4 Markscheme
No ratings yet
Year 8 Key Topic 4 Markscheme
3 pages
SSV 2018 DPS (MAVERICK TRAIL) Shop 219100905-050
No ratings yet
SSV 2018 DPS (MAVERICK TRAIL) Shop 219100905-050
11 pages
CS Nipple 21K-62-71310
No ratings yet
CS Nipple 21K-62-71310
1 page
Flashcards - 01 Introduction To Computers
No ratings yet
Flashcards - 01 Introduction To Computers
5 pages
Quiz - 02 Reliability in Computer Systems
No ratings yet
Quiz - 02 Reliability in Computer Systems
7 pages
Quiz - 01 Introduction To Computers
No ratings yet
Quiz - 01 Introduction To Computers
9 pages
4as Tle7 LC4
No ratings yet
4as Tle7 LC4
5 pages
My NoteBook
No ratings yet
My NoteBook
17 pages
Availability Bloomfield's 14 Mar 24
100% (1)
Availability Bloomfield's 14 Mar 24
43 pages
H 0010-20-43061 2 10 0 Pds Protocol Programmer S Guide
No ratings yet
H 0010-20-43061 2 10 0 Pds Protocol Programmer S Guide
172 pages
CS Executive Sbec MCQ Questions With Answers
No ratings yet
CS Executive Sbec MCQ Questions With Answers
20 pages
Tropical Soils
No ratings yet
Tropical Soils
5 pages
Microscopy Questions and Revision - MME2
No ratings yet
Microscopy Questions and Revision - MME2
15 pages
Sport
No ratings yet
Sport
1 page
PCB Ibm PDF
No ratings yet
PCB Ibm PDF
408 pages
Revision Acid and Alkali and Simple Reactions
100% (1)
Revision Acid and Alkali and Simple Reactions
10 pages
Recurrence Tree Example PDF
No ratings yet
Recurrence Tree Example PDF
10 pages
Python Programming PDF Myanmar PDF Files Download
No ratings yet
Python Programming PDF Myanmar PDF Files Download
6 pages
Higher Computing Science - HCOMPPEP - Alford, David - 2017 - London - Hodder Education Group - 9781510413788 - Anna's Archive
No ratings yet
Higher Computing Science - HCOMPPEP - Alford, David - 2017 - London - Hodder Education Group - 9781510413788 - Anna's Archive
108 pages
Automobile Road Test
No ratings yet
Automobile Road Test
2 pages
Embedded Systems IGCSE Comp Handout Free PDF
No ratings yet
Embedded Systems IGCSE Comp Handout Free PDF
16 pages
Defects
No ratings yet
Defects
51 pages
Career Hackers: Hack Your Career Today Python in Finance
100% (1)
Career Hackers: Hack Your Career Today Python in Finance
26 pages
Project Charter Template
No ratings yet
Project Charter Template
9 pages
Answer Sheet - 38 Functions and Procedures
No ratings yet
Answer Sheet - 38 Functions and Procedures
7 pages
CompSci A2 Paper 3
No ratings yet
CompSci A2 Paper 3
42 pages
Revision Notes - 01 Introduction To Computers
No ratings yet
Revision Notes - 01 Introduction To Computers
18 pages
Computer Science Made Simple - V Anton
No ratings yet
Computer Science Made Simple - V Anton
1 page
World Religions Week 3
100% (1)
World Religions Week 3
24 pages
GE8151 Problem Solving and Python Programming MCQ
No ratings yet
GE8151 Problem Solving and Python Programming MCQ
135 pages
A-Level Glossary - 01 Computer Architecture
No ratings yet
A-Level Glossary - 01 Computer Architecture
3 pages
Types and Components of Computer Systems: 2016 © Provided by Anthi Aristotelous
No ratings yet
Types and Components of Computer Systems: 2016 © Provided by Anthi Aristotelous
55 pages
AP CSA Java Notes
No ratings yet
AP CSA Java Notes
27 pages
Computational Problem Solving: Chapter 1, Sections 1.5-1.7
100% (1)
Computational Problem Solving: Chapter 1, Sections 1.5-1.7
39 pages
t7 2009 Dec Q
No ratings yet
t7 2009 Dec Q
8 pages
Informed Search Algorithms
No ratings yet
Informed Search Algorithms
12 pages
SCBA Pre-Use Inspection
No ratings yet
SCBA Pre-Use Inspection
2 pages
Digital Electronics 1
No ratings yet
Digital Electronics 1
8 pages
Employee Welfare
No ratings yet
Employee Welfare
44 pages
A-Level 14 Presentation - Compression, Encryption and Hashing
No ratings yet
A-Level 14 Presentation - Compression, Encryption and Hashing
61 pages
Caie Igcse Computer Science 0478 Theory 66544355e686c60de794fa98 643
No ratings yet
Caie Igcse Computer Science 0478 Theory 66544355e686c60de794fa98 643
22 pages
Network Theory Question Paper
No ratings yet
Network Theory Question Paper
4 pages
Best Books For Programmers (The Ultimate List)
No ratings yet
Best Books For Programmers (The Ultimate List)
13 pages
Teachers Handbook
No ratings yet
Teachers Handbook
16 pages
A-Level Answer Sheet - 01 Computer Architecture
No ratings yet
A-Level Answer Sheet - 01 Computer Architecture
9 pages
Representation of Data - Revision
No ratings yet
Representation of Data - Revision
20 pages
Professional Practices: Course Code: ITEC4112
No ratings yet
Professional Practices: Course Code: ITEC4112
16 pages
Computer Architecture Module Hand Book-Ca - June 6th (Sent ToRam Sir On 6th)
No ratings yet
Computer Architecture Module Hand Book-Ca - June 6th (Sent ToRam Sir On 6th)
23 pages
GCSE CS (2210) / IGCSE CS (0478) P1 NOTES Chapter 1.1: Data Representation 1.1.2 Hexadecimal
No ratings yet
GCSE CS (2210) / IGCSE CS (0478) P1 NOTES Chapter 1.1: Data Representation 1.1.2 Hexadecimal
6 pages
Push Down Automata
No ratings yet
Push Down Automata
41 pages
A New Technology Used in Sports.: Hawk-Eye
No ratings yet
A New Technology Used in Sports.: Hawk-Eye
23 pages
Software Engineering and Project Management - Unit 4
No ratings yet
Software Engineering and Project Management - Unit 4
14 pages
Computer Fundamentals Tutorial
No ratings yet
Computer Fundamentals Tutorial
93 pages
About The Presentations: An Introduction To Programming With C++, Eighth Edition 1
No ratings yet
About The Presentations: An Introduction To Programming With C++, Eighth Edition 1
27 pages
Post WW Ii Latin American Boom: 21 Century Literature From The Philippines and The World Week 4 Topic
No ratings yet
Post WW Ii Latin American Boom: 21 Century Literature From The Philippines and The World Week 4 Topic
2 pages
Course Outline ADC
No ratings yet
Course Outline ADC
3 pages
ICT2632 - ++oct+2017 - +information - PDF Digital Logic
No ratings yet
ICT2632 - ++oct+2017 - +information - PDF Digital Logic
2 pages
Puzzles As Programmer Interview Question
No ratings yet
Puzzles As Programmer Interview Question
32 pages
A-Level Presentation - 01 Computer Architecture
No ratings yet
A-Level Presentation - 01 Computer Architecture
40 pages
Data Base PDF
No ratings yet
Data Base PDF
95 pages
9608 Example Candidate Responses Paper 3 (For Examination From 2016)
No ratings yet
9608 Example Candidate Responses Paper 3 (For Examination From 2016)
71 pages
2021 2023 Syllabus
No ratings yet
2021 2023 Syllabus
31 pages
KS3 Answer Sheet - 01 Introduction To Computers
No ratings yet
KS3 Answer Sheet - 01 Introduction To Computers
7 pages
N5 Web Design Development Notes
No ratings yet
N5 Web Design Development Notes
12 pages
IOT Smart Energy Grid
No ratings yet
IOT Smart Energy Grid
10 pages
KS3 Presentation - 01 Introduction To Computers
No ratings yet
KS3 Presentation - 01 Introduction To Computers
28 pages
01 Problem Solving and Algorithm Design
No ratings yet
01 Problem Solving and Algorithm Design
27 pages
Design and Analysis of Algorithms
No ratings yet
Design and Analysis of Algorithms
64 pages
Congestion Control
No ratings yet
Congestion Control
26 pages
Hufnagel Transcript
No ratings yet
Hufnagel Transcript
3 pages
Revision Notes - 23 Data Transmission Technologies
No ratings yet
Revision Notes - 23 Data Transmission Technologies
15 pages
Olevel Computer Science Notes 2210 PDF
No ratings yet
Olevel Computer Science Notes 2210 PDF
18 pages
What Is Computer Programming? Basics To Learn Coding
No ratings yet
What Is Computer Programming? Basics To Learn Coding
5 pages
AQA Computing A2 Exam Style Qs
No ratings yet
AQA Computing A2 Exam Style Qs
40 pages
Matlab Manual
No ratings yet
Matlab Manual
70 pages
Advanced Computational Mathematics Syllabus
No ratings yet
Advanced Computational Mathematics Syllabus
1 page
Chapter One: Introduction To Computer Programs
No ratings yet
Chapter One: Introduction To Computer Programs
12 pages
Write An Algorithm in The Form of Pseudocode and Draw A Flowchart To
No ratings yet
Write An Algorithm in The Form of Pseudocode and Draw A Flowchart To
1 page
“Exploring Computer Systems: From Fundamentals to Advanced Concepts”: GoodMan, #1
From Everand
“Exploring Computer Systems: From Fundamentals to Advanced Concepts”: GoodMan, #1
Patrick Mukosha
No ratings yet
Connectivity Prediction in Mobile Ad Hoc Networks for Real-Time Control
From Everand
Connectivity Prediction in Mobile Ad Hoc Networks for Real-Time Control
Sebastian Thelen
5/5 (1)
Java Reflection Complete Self-Assessment Guide
From Everand
Java Reflection Complete Self-Assessment Guide
Gerardus Blokdyk
No ratings yet

Presentation - 02 Reliability in Computer Systems

Uploaded by

Presentation - 02 Reliability in Computer Systems

Uploaded by

Teach Computer Science

▪ Students will learn about things that can go wrong in a

Things that can go wrong

 Hardware might fail to operate.

What is a critical system?

Types of critical systems

Safety-critical Mission-critical Business-critical Security-critical

Types of critical systems

Safety-critical Mission-critical Business-critical Security-critical

 The team responsible for the backup

 Redundancy is the duplication of Hardware

What is fail-soft system?

 Software can be made more reliable by adding extra checks.

Time to repair Time to failure

System Resumes normal System

Let’s review some concepts

Reliability Critical systems Backup

Redundancy Fault-tolerance Statistical parameters to

You are a programmer developing a banking system.

End of topic questions

You might also like