Discrete Time NHPP Models For Software Reliability Growth Phenomenon
Discrete Time NHPP Models For Software Reliability Growth Phenomenon
Discrete Time NHPP Models For Software Reliability Growth Phenomenon
The International Arab Journal of Information Technology, Vol. 6, No. 2, April 2009
1. Introduction
Software reliability assessment is important to evaluate
and predict the reliability and performance of software
system. The models applicable to the assessment of
software reliability are called Software Reliability
Growth Models (SRGMs). An SRGM provides a
mathematical relationship between time span of testing
or using the software and the cumulative number of
faults detected. It is used to assess the reliability of the
software during testing and operational phases.
An important class of SRGM that has been widely
studied is NonHomogeneous Poisson Process (NHPP).
It forms one of the main classes of the existing SRGM;
due to it is mathematical tractability and wide
applicability. NHPP models are useful in describing
failure processes, providing trends such as reliability
growth and fault-content. SRGM consider the
debugging process as a counting process characterized
by the mean value function of a NHPP. Software
reliability can be estimated once the mean value
function is determined. Model parameters are usually
determined using either Maximum Likelihood
Estimate (MLE) or least-square estimation methods [6,
13, 18].
NHPP based SRGM are generally classified into
two groups [2, 6, 17]. The first group contains models,
which use the execution time (i.e., CPU time) or
125
m ( n + 1) m ( n ) = b ( n + 1)(a m ( n ) )
(1)
126
The International Arab Journal of Information Technology, Vol. 6, No. 2, April 2009
m(n + 1)
b(n + 1) = bi + (b f bi )
a
(2)
a 1 (1 b f ) n
(3)
m (n) =
b f bi
n
1+
(1 b f )
bi
The structure of the model is flexible. The shape of
the growth curve is determined by the parameters bi
and bf and can be both exponential and S-shaped for
the four cases discussed above. In case of constant
fault detection rate, equation 3 can be written as
m(n) = a 1 (1 b)n
(4)
bi
(a m(n +1))(a m(n))
a
(5)
ab i n
1 + bi n
(6)
(8)
and
b( n + 1) =
bf p
b f bi
1+
(1 b f p ) n +1
bi
m (n) =
b f bi
n
1 +
(1 b f p )
bi
[(1 (1 b
(9)
p) n
(10)
) b bp p + n
3. Parameter Estimation
MLE method is used to estimate the unknown
parameters of the developed framework. Since all data
sets used are given in the form of pairs
(ni,xi)(i=1,2,,f), where xi is the cumulative number of
faults detected by ni test cases (0<n1<n2<<nf) and ni
is the accumulated number of test run executed to
detect xi faults.
The likelihood function L for the unknown
parameters with the superposed mean value function is
given as
f
L( parmaters| (ni , xi )) =
( xi xi 1 )!
i =1
(7)
127
i 1
(11)
ln L = ( xi xi 1 ) ln[m(ni ) m(ni 1 )]
i =1
(12)
128
The International Arab Journal of Information Technology, Vol. 6, No. 2, April 2009
The first DS-I was collected from test of a networkmanagement system at AT&T Bell Laboratories, after
it was tested for 20 weeks in which 100 faults were
detected [13]. The second DS-II was collected during
21 days of testing, 46 faults were detected [12]. The
third DS-III was for a radar system of size 124 KLOC
after it was tested for 35 months in which 1301 faults
were detected [2]. The fourth DS-IV had been
collected during 25 days of testing in which 136 faults
were detected [13].
RPF =
(m ( n
) xi )
(13)
i =1
(16)
xf
where xf is the cumulative number of faults removed
after the execution of the last test run nf and m^(nf), is
the estimated value of the SRGM m(nf), which
determined using the actually observed data up to an
arbitrary test case ne(nf).
If the RPF value is negative/positive the model is
said to underestimate/ overestimate the fault removal
process. A value close to zero indicates more accurate
prediction, thus more confidence in the model. The
value is said to be acceptable if it is within (10%) [6].
(14)
Data
Set
Proposed
Basic
DS-I
DSII
residual SS
corrected SS
(15)
Parameter Estimation
a
bi
bf
111
.0717
.1581
59
.0167
.1550
C u m u la tiv e F a u lts
R2 = 1
m ( n f ) x f
SSE =
84
56
28
0
0
10
15
20
Actual Data
Estimated Values
48
36
0.2
24
0.1
12
RPE
Cumulative Faults
129
0
50%
0
0
14
21
60%
70%
80%
90%
100%
-0.1
Estimated Values
-0.2
Models under
Comparison
Comparison
Criteria
0.1
RPE
Comparison of the proposed model and welldocumented discrete time NHPP based SRGMs in
terms of goodness of fit is given in Tables 2 and 3 for
DS-I and DS-II respectively. Note that during the
estimation process of models under comparison it is
observed that the exponential model [17] fails to give
any plausible result as it over estimates the faultcontent and no estimates were obtained for DS-II. It is
clearly seen from both the Tables 2 and 3 that the
proposed model is the best among the models under
comparison in terms of SSE, AIC, and R2 metrics
values, which is very encouraging.
0
50%
60%
70%
80%
90%
-0.1
-0.2
SSE
AIC
R2
Exponential [17]
130
.0798
232
92
.9857
106
.2165
357
99
.9781
Proposed Basic
See Table 1.
180
90
.9890
Models under
Comparison
a
Exponential [17]
Delayed S-shaped [6]
Proposed Basic
SSE
b
*
84
Comparison
Criteria
.0831
See Table 1.
R2
AIC
*
28
79
.9938
25
77
.9944
100%
Model
Proposed
Extended
Parameter Estimation
Data
Set
bi
bf
DS-III
1352
.0087
.1832
.9922
DS-IV
156
.1525
.0005
.9965
.0036
130
The International Arab Journal of Information Technology, Vol. 6, No. 2, April 2009
1312
984
656
328
0
0
14
21
28
35
Estimated Values
136
102
0.1
RPE
68
34
0
50%
60%
70%
8 0%
90%
100%
-0.1
0
0
10
15
20
25
A ctual Data
-0.2
Estimated Values
Models under
Comparison
a
Exponential [17]
Comparison
Criteria
SSE
b
*
1735
.0814
Proposed Extended
See Table 5.
AIC R2
*
*
107324
518
.9856
7133
342
.9990
Parameter
Estimation
Comparison
Criteria
R2
SSE
AIC
Exponential [17]
136
.1291
766
119 .9664
126
.2763
2426
176 .8936
Proposed Extended
See Table 5.
306
116 .9866
0.1
RPE
Comparison of the proposed model and welldocumented discrete time SRGM based on NHPP in
terms of goodness of fit is given in Tables 5 and 6 for
DS-III and DS-IV respectively. Note that during the
estimation process of models under comparison it is
observed that the exponential model [17] fails to give
any plausible result as it over estimates the faultcontent (a) and no estimates were obtained for DS-III.
It is clearly seen from both the Tables 5 and 6 that the
proposed model is the best among the models under
comparison in terms of SSE, AIC, and R2 metrics
values, which is very encouraging. Hence, the
proposed extended discrete time model fits better than
existing models on both DS-III and DS-IV.
0
50%
60%
70%
80%
90%
100%
-0.1
-0.2
6. Conclusion
In this paper, newly developed discrete time SRGM
based on NHPP to describe a variety of reliability
growth and the increased skill (efficiency) of the
testing team or a modification of the testing strategy
during testing phase, are proposed.
The proposed discrete time models have been
validated and evaluated on actual software reliability
data cited from real software development projects and
compared with existing discrete time NHPP based
models. The results are encouraging in terms of
goodness of fit and predictive validity due to their
applicability and flexibility. Hence, we conclude that
the two proposed discrete time models not only fit the
past well but also predict the future reasonably well.
Acknowledgements
I take this opportunity to thank Prof. P. K. Kapur, Dr.
A. K Bardhan, and Dr. P. C. Jha of Delhi University
for their support every moment I sought. The
suggestion, comments, and criticisms of these people
have greatly improved this manuscript.
References
[1]
131