Assertive Testing Reliable Code

The document discusses methods for measuring software reliability and improving software testing. It notes that while generally accepted methods exist to improve reliability, no method can accurately predict it. The document explores defining reliability through failure probability and outlines techniques such as prioritizing the testing of high-impact, low-probability defects and augmenting standard testing with more structure and diversity.

RELIABLE CODE
Editor: Gerard J. Holzmann, NASA/JPL, [email protected]

Assertive Testing
Gerard J. Holzmann

A COLLEAGUE ASKED me recently, “Are there any generally accepted methods for accurately predicting software reliability?” Sadly, the honest answer is no. Surely there are generally accepted, and practiced, methods, but no one would claim that they can make accurate predictions. And if the predictions aren’t accurate, how useful are they really?

If that sounds overly pessimistic, it’s because the question was phrased more or less as an absolute. Instead of asking whether methods exist that can predict reliability accurately, it’s perhaps more helpful to ask whether methods exist that can improve reliability. Here we’re on firmer ground. Indeed, generally accepted methods exist that can measurably improve reliability. Software testing is an obvious example of such a method, but not the only, and perhaps not even the best, such method. Here, I look at simple, effective ways to augment standard software testing.

Measuring Reliability

How can we measure software reliability? Does a generally accepted metric exist? A familiar dictum is “If you can’t measure it, you can’t manage it.”

Reliability clearly has something to do with the absence of failures. A common approach is therefore to define reliability by measuring its opposite: the probability of failure. This is similar to trying to define health as the absence of illness. If you’re healthy, the probability that you’ll get sick in some interval of time should be small, although it likely will never be zero. So it is for software.

To measure a software application’s reliability, then, we can try to express the rate of discovery of defects that might lead to failure as a probability.

For instance, if the long-term probability of an application exhibiting a failure is p, that application’s reliability (the probability of failure-free operation) is 1 - p. If p is 10^-9 per hour of operation, we shouldn’t expect to see more than one failure per 100,000 years of operation on average, which should satisfy even the most demanding applications.

Reaching that target of 10^-9 failures per hour can be extraordinarily difficult. For instance, a recent government report specified the required period of failure-free operation for conventional takeoffs and landings of the F-35 Joint Strike Fighter not as 100,000 years but as six hours. [1] This corresponds to an average probability of failure about eight orders of magnitude larger than 10^-9. The report also noted that this target hadn’t yet been realized.

Latent Defects

Software failures are caused by coding or design defects that could have been caught if the right type of check had been performed before an application was released for general use. For a commercial company it’s often not cost-effective to chase down every last bug before a product is shipped. This means that in a fixed time period and with a fixed testing budget, only the more likely types of defects are typically caught. The remaining bugs are commonly called latent defects.

It won’t surprise anyone to learn that the number of latent defects in any nontrivial application typically outnumbers the number of discovered defects by a large margin, no matter how long the application has been in use. Of course, the more users there are and the longer an application is used, the more latent defects will be found.

Probability and Impact

We can categorize software defects by their probability of occurrence or potential impact (see Figure 1).

[Figure 1: a two-by-two grid with columns “Negligible impact” and “Major impact” and rows “Likely” and “Unlikely”; the legend marks the strength of formal methods and the strength of standard testing.]

FIGURE 1. The probability and impact of software defects. An uncomfortably large proportion of the major software failures that we regularly learn about tends to fall into the lower-right quadrant.

Most defects are minor glitches that don’t significantly affect users, although they can of course negatively affect the users’ perception of code quality. Those defects fall on the left of the vertical line in Figure 1. The most likely glitches, on the upper left, are reliably caught in a standard software test regimen.

The more problematic software defects are those that do have a significant impact. Again, the ones that will likely strike, in the upper right of Figure 1, can be expected to be caught early. That leaves the set of lower-probability defects with potentially significant impact, in the lower right of Figure 1.

An uncomfortably large proportion of the major software failures that we learn about with some regularity tends to fall into this lower-right quadrant. Often, such failures are caused by unexpected combinations of low-probability events that can push a system beyond its design limits. For instance, the failure of a hardware component can occur during the execution of a fault-handling procedure for some unrelated off-nominal event. All of a sudden, the system can then enter a failure mode that was never tested.

It’s generally not a good idea to ignore potential failures simply because their probability of occurrence is deemed low. As C. Michael Holloway, a researcher at NASA Langley Research Center, said, “To a first approximation, we can say that accidents are almost always the result of incorrect estimates of the likelihood of one or more things.” [2] We’re good at estimating consequences, but we’re bad at estimating probabilities.

Formal methods target the discovery of these low-probability but major-impact defects. Compared to standard software testing methods, though, they can be harder to use. For critical systems, therefore, the use of formal methods is often restricted to a relatively small number of critical modules. But is there then no middle ground between a pure formal-methods approach that leaves no stone unturned, but requires more skill, and a more routine approach to software testing? There is, and that’s what I talk about next.

Getting Testy

Let’s first consider how to make standard software-testing approaches more thorough simply by providing a little more structure and diversity. I’ll mention just some of the many possible techniques of this type that can improve a test suite’s effectiveness.

You can structure a software test beyond the familiar phases of unit, system, and acceptance testing. A more structured approach consists of five additional steps that you can use in each of the standard testing phases:

1. Ideal conditions. Test the code under ideal conditions, to ensure that at the very least it can behave as designed.
2. Nominal execution. If the code passes step 1, test it under nominal conditions: the conditions it should encounter in normal day-to-day use.
3. Boundary cases. Test the code for the correct handling of boundary conditions, where the code is exercised at the edge of its operational profile.
4. Stress testing. Test the code under stress or overload conditions.
5. Error handling. Test the code for the correct handling of all conceivable error conditions, such as invalid inputs, and ideally for different combinations of component failures.

Error-handling code is often the least thoroughly tested part of any software system and therefore the most likely to contain latent defects. This is precisely the part of the system you want to be the most robust, but it rarely is. An effective technique in this stage is to use test randomization, also called fuzz testing, which has proven remarkably effective in finding unsuspected breaking points.

Another way to improve the rigor of software testing is to use model-based testing. First, the system engineer or software developer constructs a high-level model of how the software should work. This high-level model can then be used to derive, often automatically, a suite of test cases. The model should encapsulate as many software requirements as possible, which means that the tests can check that the requirements are met. If the tests generated from the high-level model don’t cover all of the code, the model is incomplete and should be extended. It’s also possible that the software contains too many parts that are unrelated to the software requirements. This can mean that you should delete them to slim the code base down to a more manageable (and testable) size.

In running the tests, look for cases in which the results differ from the model’s predictions. The problem can be with the model, the software, or the requirements. Model-based testing can also make it easier for formal-methods types like me to apply more rigorous forms of software verification, for instance with the help of logic-model checkers.

Assert Yourself

Another way to improve the thoroughness of a software test, and with it the reliability of the target application, is relatively simple: use assertions. As a rule of thumb, aim for an average assertion density of one to two percent across all your code. If you follow this rule, you won’t be alone: Microsoft follows it in the Office software suite, [3] and NASA’s Jet Propulsion Laboratory (JPL) uses it in the development of its mission-critical flight code.

Using assertions can ensure that you catch defects at the earliest possible point in an execution, not only during normal system test phases but also later, when your code has reached the end user.

For instance, you can place an assertion in the body of every loop in the code, to ensure that a reasonable maximum number of iterations is never exceeded. You’d be surprised how many bugs this one measure can catch early in software development. If you’re unsure about what upper bound to use, multiply your most generous guess by a thousand or more. The real problem you’re defending against is an execution getting stuck in an infinite loop, for instance when a linked list accidentally becomes circular.

Another good strategy is to place an assertion before every division operation, to ensure you’re not accidentally dividing by zero or a number very close to zero. Similarly, place an assertion before pointer dereference operations, to check that they can’t cause a crash. You can use assertions similarly to check that parameters passed to a function are in a safe range or that the result returned to a caller passes a sanity check. If you’re worried that in a time-critical system, you can’t afford the cost of evaluating a few extra Boolean expressions, you’re operating too close to the margin. You should take this as an indication that it’s time to refactor the code. No policeman will be persuaded either if you claim that you had no time to stop at a red traffic light.

Statement Coverage

A common goal in testing, inspired by guidelines such as DO-178B/C (which deals with software safety for airborne systems), is to ensure that all your tests combined secure full statement and branch coverage. This means that each statement in your code must be exercised by at least one test, and every clause in every conditional test must independently evaluate to true and to false in at least one test. What’s sometimes forgotten is that it’s not enough to merely execute a statement; a test must also actually check something. This is where assertions can again prove their value: they provide some additional independent checks of an execution’s sanity.

The insight that assertions can help make systems more reliable isn’t new, of course. The familiar include file <assert.h>, with the definition of a few macros to support the use of assertions in C code, was added to the Unix C compilers as early as 1978. Mike Lesk (also responsible for the Unix tools lex and uucp) first added this file as one of several improvements he made to the C preprocessor.

An assert keyword appeared earlier in the 1972 definition of Algol W. The language report on Algol 68, to which Algol W was in many ways a response, also contained a notation for defining inline assertions. They were called “pragmats” in the Revised Report on the Algorithmic Language Algol 68. [4] Like modern pragmas in C code, though, they were technically outside the language definition and could freely be ignored by the compiler. Earlier still, we find references to the importance of assertions in the writings of both Alan Turing and John von Neumann, as Lori Clarke and David Rosenblum noted. [5]

So now it’s your turn again. Does your regression test suite (you do have one, don’t you?) have any tests that fail to execute assertions? You can strengthen your tests by ensuring that they all do.

And, oh yeah, don’t disable those carefully crafted assertions when you ship a product to your customers. Microsoft doesn’t do so in Office, and neither does JPL when its embedded software hitches a ride to Mars. The assertions can help you detect, diagnose, and fix the latent defects in your code before they can do harm. In a sense, removing or disabling software assertions before shipping a system to customers would make as much sense as a car maker removing the seatbelts and airbags from a car after all crash tests have been completed.

References

1. F-35 Joint Strike Fighter: Problems Completing Software Testing May Hinder Delivery of Expected Warfighting Capabilities, GAO-14-322, US Government Accountability Office, Mar. 2014, p. 18; www.gao.gov/assets/670/661842.pdf.
2. C.M. Holloway, “Why You Should Read Accident Reports,” presentation at the Software and Complex Electronic Hardware Standardization Conf., 2005.
3. C.A.R. Hoare, “Assertions: A Personal Perspective,” IEEE Annals of the History of Computing, vol. 25, no. 2, 2003, pp. 14-25.
4. A. Van Wijngaarden, B.J. Mailloux, and J.E.L. Peck, Revised Report on the Algorithmic Language Algol 68, Springer, 1976.
5. L.A. Clarke and D.S. Rosenblum, “A Historical Perspective on Runtime Assertion Checking in Software Development,” ACM SIGSOFT Software Eng. Notes, vol. 31, no. 3, 2006, pp. 25-37.

GERARD J. HOLZMANN works at the Jet Propulsion Laboratory on developing stronger methods for software analysis, code review, and testing. Contact him at [email protected].
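
The assertion strategies described under “Assert Yourself” might look as follows in C. This is a minimal sketch, not code from the column: the node type, the functions list_length and mean_of, and the bounds MAX_LIST_STEPS and MAX_SAMPLES are all assumptions made for illustration. It shows a loop-bound assertion, a check before a pointer dereference, a parameter range check that also guards the division, and a sanity check on the returned result.

```c
#include <assert.h>
#include <stddef.h>

/* Hypothetical singly linked list node, used only for this illustration. */
struct node {
    int value;
    struct node *next;
};

/* Assumed bounds: a generous guess at the largest realistic size,
 * multiplied by a thousand, as the text suggests. */
#define MAX_LIST_STEPS ((size_t)1000000)
#define MAX_SAMPLES    ((size_t)1000000)

/* Count the nodes in a list, asserting a loop bound on every iteration. */
size_t list_length(const struct node *head)
{
    size_t steps = 0;

    for (const struct node *p = head; p != NULL; p = p->next) {
        steps++;
        /* Fires if the list has accidentally become circular. */
        assert(steps <= MAX_LIST_STEPS);
    }
    return steps;
}

/* Integer mean of count samples, truncated toward zero. */
int mean_of(const int *samples, size_t count)
{
    /* Check the pointer before it is dereferenced. */
    assert(samples != NULL);
    /* Check the parameter range; count > 0 also guards the division. */
    assert(count > 0 && count <= MAX_SAMPLES);

    long long total = 0;
    int lowest = samples[0];
    int highest = samples[0];

    for (size_t i = 0; i < count; i++) {
        total += samples[i];
        if (samples[i] < lowest)  lowest = samples[i];
        if (samples[i] > highest) highest = samples[i];
    }

    int mean = (int)(total / (long long)count);

    /* Sanity check on the result: a mean cannot lie outside its inputs. */
    assert(mean >= lowest && mean <= highest);
    return mean;
}
```

The standard assert macro expands to nothing when NDEBUG is defined, so keeping these checks active in a shipped build, as the column recommends, means compiling without that definition.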

MAY/JUNE 2015 | IEEE SOFTWARE

Authorized licensed use limited to: University of London: Online Library. Downloaded on July 07,2023 at 18:37:08 UTC from IEEE Xplore. Restrictions apply.
