Burr Type III Software Reliability Growth Model
Burr Type III Software Reliability Growth Model
Burr Type III Software Reliability Growth Model
e-ISSN: 2278-0661,p-ISSN: 2278-8727, Volume 17, Issue 1, Ver. IV (Jan Feb. 2015), PP 49-54
www.iosrjournals.org
Abstract : Software Reliability Growth Model (SRGM) is used to assess software reliability quantitatively for
tracking and measuring the growth of reliability. The potentiality of SRGM is judged by its capability to fit the
software failure data. In this paper we propose Burr type III software reliability growth model based on Non
Homogeneous Poisson Process (NHPP) with time domain data. The Maximum Likelihood (ML) estimation
method is used for finding unknown parameters in the model on ungrouped data. How good does a
mathematical model fit to the data is also being calculated. To assess the performance of the considered SRGM,
we have carried out the parameter estimation on real software failure data sets. We also present an analysis of
goodness of fit and reliability for given failure data sets.
Keywords: Burr type III, Goodness of fit, NHPP, ML estimation, Software Reliability, Time domain data.
I.
Introduction
II.
NHPP Model
Software reliability models can be classified according to probabilistic assumptions based on the type
of failure process. First one is, when a Markov process represents the failure process, the resultant model is
Markovian Model. Second one is fault counting model that describes the failure phenomenon by stochastic
process like Homogeneous Poisson Process (HPP), Non Homogeneous Poisson Process (NHPP) and Compound
Poisson Process (CPP) etc. Most of the failure count models are based upon NHPP and are described in the
following lines.
A software system is subject to failures at random times which are caused by errors present in the
system. Suppose {N(t), t >0} is a counting process, then the cumulative number of failures are represented by
time t. As there were no failures at t=0 we have
DOI: 10.9790/0661-17144954
www.iosrjournals.org
49 | Page
0, t 0
m(t )
a, t
Here the above mentioned a is meant for expected number of software errors to be eventually
detected. Let N(t) be known to have a Poisson probability mass function with parameters m(t) i.e.
PN t n
mt n e m(t )
n!
, n=0, 1, 2,
Then the stochastic behavior of software failure phenomenon can be described through the N(t) process. In this
paper we consider m(t) as given by
mt a 1 t c
---------------- (2.1)
Where [m(t)/a] is the cumulative distribution function of Burr type III distribution for the present
choice.
PN t n
mt n e m(t )
n!
lim n PN t n
e a .a n
n!
This is also a Poisson model with mean a. Let the number of errors remaining in the system at timet be N(t).
= a - a 1 t -c
Suppose Sk be the time between (k-1)th and kth failure of the software product and Xk be the time up to
the k failure, then to find out the probability time between (k-1)th and kth failures the software reliability
function is given below.
th
RS k X k 1 ( s / x) e m ( x s ) m ( s ) --------- (2.2)
III.
Background Theory
This section presents the theory that underlines Burr Type III based NHPP model. The Burr type XII
uses a wide range of skewness and kurtosis, which may be used to fit any given set of unimodal data [11]. The
reciprocal Burr (Burr Type III) covers a wide region that includes the region covered by Burr Type XII. Burr
has suggested a number of forms of cumulative distribution functions (cdf) that would be useful for fitting data
[12]. The Burr Type XII distribution F(x) is given as
F(x) =1-(1+xc)-b, x>0, c>0, b>0. ------ (3.1)
Here both c and b are shape parameters.
Let X be the random variable with cdf given by equation (3.1) and consider the transformation t=1/X.
F(t)=(1+t-c)-b
--------- (3.2)
Which is one of the many forms of distribution functions (Burr Type III) given by Burr.
DOI: 10.9790/0661-17144954
www.iosrjournals.org
50 | Page
Burr [12] had introduced twelve different forms of cumulative distribution functions for modeling data.
The task of building a mathematical model is incomplete until the unknown parameters i.e. the model
parameters are estimated and validated on actual software failure data sets. In this section we develop
expressions to estimate the parameters of the Burr type III model based on time domain data. Parameter
estimation is given primary importance for software reliability prediction. Parameter estimation can be achieved
by applying a technique of MLE which is the most important and widely used estimation technique. A set of
failure data is usually collected in one of two common ways, time domain data and interval domain data. Here
the failure data is collected through time domain data.
The mean value function of Burr type III model as given in equation (2.1) is
mt a 1 t c
t>0 a, b, c > 0
To assess the software reliability, a, b and c are to be known or else they can be estimated from
software failure data. For estimating a, b and c for the Burr type III model expressions are derived as
mentioned below. Assuming that the data are given for the occurrence times of the failures or the times of
successive failures, i.e., the realization of random variables Tj for j = 1, 2,..n. Given that the data provide n
successive times of observed failures Tj for 0 t1t2...tn, we can convert these data into the time between failures
xi where xi = ti-ti-1 for i = 1, 2,, n. Given the recorded data on the time of failures, the log likelihood function
(LLF) takes on the following form:
n
--------- (4.1)
i 1
abc
a
LogL log c 1
c b 1
c
i 1
ti (1 ti ) 1 tn
LogL
1 t
c b
n
-------- (4.2)
----- (4.3)
i 1
LogL
0
a
LogL
0
b
n
log 1 t n log 1 t
n
i 1
1
i
----------- (4.5)
1
n
g c
DOI: 10.9790/0661-17144954
g ci
where g(c) and
g ' ci
LogL
0
c
n log tn n n
2
log ti 1
c
1 tn
c i 1
1 tic
www.iosrjournals.org
------ (4.6)
51 | Page
g ' c
nlog tn tnc
2
c
n
2 LogL
0
c 2
n
n
2tic log ti
2
2
c
i 1
tic 1
------------ (4.7)
The value of c in the above equations (4.6) (4.7) can be obtained using Newton-Raphson iterative
method. Solving the above equation yields the point estimate of the parameter c.
V.
The set of software errors analyzed here is borrowed from software development project as published
in Pham [13, 14]. Data set is truncated into different proportions and used for estimating the parameters of the
proposed basic discrete time model. Table 1 shows the time between failures for different software products
presenting as per the size of the data set.
NTDS Data
The data set consists of 26 failures in 250 days. During the production phase 26 software errors are
found and during the test phase five additional errors are found. During the user phase one error is observed and
two more errors are noticed in a subsequent test phase indicating that a network of the module has taken place
after the user error is found. In this paper, a numerical conversion of data (Failure Time (hours)*0.01) is done in
order to facilitate the parameter estimation [15] [16] [17].
Table-1: NTDS Data Set
Failure
Time between
Number n
Failures Sk days
Production (Checkout) Phase
1
9
2
12
3
11
4
4
5
7
6
2
7
5
8
8
9
5
10
7
11
1
12
6
13
1
14
9
15
4
16
1
17
3
18
3
19
6
20
1
21
11
22
33
23
7
24
91
25
2
26
1
Test Phase
27
87
28
47
29
12
30
9
31
135
User Phase
32
258
Test Phase
33
16
34
35
DOI: 10.9790/0661-17144954
Cumulative Time
Xn = days
Failure Time(hours)*0.01
9
21
32
36
43
45
50
58
63
70
71
77
78
87
91
92
95
98
104
105
116
149
156
247
249
250
0.09
0.21
0.32
0.36
0.43
0.45
0.5
0.58
0.63
0.7
0.71
0.77
0.78
0.87
0.91
0.92
0.95
0.98
1.04
1.05
1.16
1.49
1.56
2.47
2.49
2.5
337
384
396
405
540
3.37
3.84
3.96
4.05
5.4
798
7.98
814
849
8.14
8.49
www.iosrjournals.org
52 | Page
a 34.465706
b 1.763647
c 1.810222
Table-2: Parameters Estimated through MLE
Data set (no)
Number of samples
NTDS
AT&T
SONATA
XIE
IBM
26
22
30
30
15
A
34.465706
26.839829
79.831359
33.310426
20.624785
Estimated Parameter
b
1.763647
1.658692
6.742810
2.270095
1.711630
C
1.810222
1.000000
0.602440
1.371974
1.447815
Here, these three values can be accepted as MLEs of a, b and c. The estimator of the reliability
function from the equation (2.2) at any time x beyond 250 days is given by
RS k X k 1 ( s / x) e m ( x s ) m ( s )
= [ (50+250) (250)]
=0.999221
VI.
The potentiality of SRGM is judged by its capability to fit the software failure data, where the term
goodness of fit denotes the question of How good does a mathematical model fit to the data? Experiments on
a set of actual software failure data have been performed to validate the model under study and to assess its
performance. The considered model fits more to the data set whose Log Likelihood is most negative. The
application of the considered distribution function and its Log Likelihood on different datasets collected from
real world failure data is given below in table 3.
Table-3: Log likelihood on different data sets
Data set (no)
Log L (MLE)
NTDS
AT&T
SONATA
XIE
IBM
-168.070414
-160.264155
-218.086332
-256.630664
-109.246261
VII.
Reliability tn+50
(MLE)
0.999221
0.995543
0.917342
0.999247
0.998116
Conclusion
In this paper we propose a Burr type III software reliability growth model. This model is useful
primarily for estimating and monitoring software reliability that is viewed as a measure of software quality.
Equations are developed to obtain the maximum likelihood estimates of the parameters based on time domain
data. The proposed discrete time models have been validated and evaluated on actual software failure data cited
from real software development projects and compared with existing discrete time NHPP based model. The
results are encouraging in terms of goodness of fit and predictive validity due to their applicability and
flexibility. To validate the proposed approach, the parameter estimation is carried out on the data sets collected
[18][19]. The data set Xie has the best fit among all datasets as it is having the highest negative value for the log
likelihood. The reliability of all the data sets are given in Table 3.The reliability of the model over Xie data is
the highest among the data sets which are considered.
References
[1].
[2].
[3].
[4].
DOI: 10.9790/0661-17144954
www.iosrjournals.org
New
53 | Page
[16].
[17].
[18].
[19].
Pham. H (2005) A Generalized Logistic Software Reliability Growth Model, Opsearch, Vol.42, No.4, 332-331.MC Graw Hill,
New York.
Goel, A.L., Okumoto, K., 1979. Time- dependent error-detection rate model for software reliability and other performance
measures. IEEE Trans. Reliab. R-28, 206-211.
Satya Prasad, R (2007) Half logistic Software reliability growth model, Ph.D Thesis of ANU, India.
Satya Prasad, R and Geetha Rani, N (2011), Pareto type II software reliability growth model, International Journal of Software
Engineering, Volume 2, Issue(4) 81-86
Quadri, S.M.K and Ahmad, N., (2010). Software Reliability Growth modelling with new modified Weibull testing-effort and
optimal release policy, International Journal of Computer Applications, Vol.6, No.12.
Xie, M., Yang, B. and Gaudoin, O. (2001), Regression goodness-of-fit Test for Software reliability model validation, ISSRE and
Chillarege Corp.
Tadikamalla, Pandu R. A Look at the Burr and Related Distributions. International Statistical Review / Revue Internationale de
Statistique 48(3)(Dec., 1980): 337344.
Burr (1942), Cumulative Frequency Functions, Annals of Mathematical Statistics, 13, pp. 215-232.
Pham. H., 2003. Handbook of Reliability Engineering Springer.
Pham. H., 2006. System software reliability Springer.
N. R. Barraza., Parameter Estimation for the Compound Poisson Software Reliability Model, International Journal of Software
Engineering and Its Applications, http://www.sersc.org/journals/IJSEIA/vol7_no1_2013/11.pdf, vol. 7, no. 1, (2013) January, p p.
137-148.
Inayat, M. Asim Noor and Z. Inayat, Parameter Successful Product-based Agile Software Development without Onsite Customer:
An
Industrial
Case
Study,
International
Journal
of
Software
Engineering
and
Its
Applications,http://www.sersc.org/journals/IJSEIA/vol6_no2_2012/1.pdf, vol. 6, no. 2, (2012) April, pp. 1-14.
Hassan Najadat and Izzat Alsmadi., Enhance Rule Based Detection for Software Fault Prone Modules, International Journal of
Software
Engineering
and
Its
Applications,
Vol.
6,
No.1,
pp.
75-86,
January
(2012)http://www.serc.org/journals/IJSEIA/vol6_no1_2012/6.pdf.
Xie, M., Goh. T.N., Ranjan.P, (2002) Some effective control chart procedures for reliability monitoring -Reliability engineering
and System Safety 77 143 -150 2002.
Ashoka, M., (2010), Sonata Software Limited data set, Bangalore.
DOI: 10.9790/0661-17144954
www.iosrjournals.org
54 | Page