Software Testing Defect Prediction Model - A Practical Approach
Software Testing Defect Prediction Model - A Practical Approach
ISSN: 2319-1163
Abstract
Software defects prediction aims to reduce software testing efforts by guiding the testers through the defect classification of software
systems. Defect predictors are widely used in many organizations to predict software defects in order to save time, improve quality,
testing and for better planning of the resources to meet the timelines. The application of statistical software testing defect prediction
model in a real life setting is extremely difficult because it requires more number of data variables and metrics and also historical
defect data to predict the next releases or new similar type of projects. This paper explains our statistical model, how it will accurately
predict the defects for upcoming software releases or projects. We have used 20 past release data points of software project, 5
parameters and build a model by applying descriptive statistics, correlation and multiple linear regression models with 95%
confidence intervals (CI). In this appropriate multiple linear regression model the R-square value was 0.91 and its Standard Error is
5.90%. The Software testing defect prediction model is now being used to predict defects at various testing projects and operational
releases. We have found 90.76% precision between actual and predicted defects.
__________________________________________________________________________________________
Volume: 02 Issue: 05 | May-2013, Available @ http://www.ijret.org
741
ISSN: 2319-1163
__________________________________________________________________________________________
Volume: 02 Issue: 05 | May-2013, Available @ http://www.ijret.org
742
ISSN: 2319-1163
Multiple Linear
Regression (MLR)
Coefficients
values
Standard
Error
(SE)
Significant
P-value
-5.36813
5.641756
p>0.05
0.019694
0.013713
p>0.05
6.238706
0.91904
p<0.01
-0.02611
0.007791
p<0.01
-0.07895
0.013876
p<0.01
-0.15725
0.324758
p>0.05
CONCLUSION
We analyzed an extensive historical dataset of software
releases to identify factors influencing parameters of defect
prediction model. We found strong correlation between
defects and test team size, total number of test cases executed,
total number of components delivered. In this analysis we use
multiple regression analysis for estimating software defects.
These models is account for the typical problems for
identifying and influence parameters as well as metrics data,
such as inconsistent and heterogeneity of data. Overall these
models indicate good prediction for upcoming software
defects as well as quality improvement. By using the
predictive model identified project managers key parameters
that require controlling and improve both the development and
testing process. The R-square value was 0.91 (91%) which is
highly significant at 0.01 level and low standard error (SE)
was 5.90%.
We calibrated our model based on its prediction accuracy and
discovered that it is possible to detect on 84% of number of
defectives using a defect predictors.
We will continue this work by:
Extending the analysis by analyzing the impact of
different combinations of factors on proportion of defect
types.
Extending the model to predict the Defect Severity Index
(DSI).
Extending the model to predict the defect leakage into the
production or next phases.
Extending the model to take a decision like GO or No-GO
into the production.
Extending the model with the results of newer analysis
using advanced statistical models.
Validating the model against historical data and
assumptions in software engineering.
Intercept
Total number of Test
Cases executed (#)
Test Team size (#)
Allocated Unit
testing effort (%)
Test case execution
Effort (%)
Total number of
components delivered
(#)
Standard
Deviation
(SD)
Correlation
pvalue
Total
number of
Test Cases
executed (#)
264
169
0.5425
p<0.05
Test Team
size (#)
10
0.7665
p<0.01
Allocated
Unit testing
effort (Hrs)
433
226
0.2441
p>0.05
Test case
execution
Effort (Hrs)
252
188
0.0593
p>0.01
Total
number of
components
delivered (#)
11
11
0.6485
p<0.01
Number of
defects (#)
28
16
Parameter
__________________________________________________________________________________________
Volume: 02 Issue: 05 | May-2013, Available @ http://www.ijret.org
743
ACKNOWLEDGEMENT
I want to thank Narendra singh Rajput, Lead Associate
Manager, for his valuable feedback and guidance and Prof
Balasiddamuni, Department of Statistics, S V University,
Tirupati
and Raghav Beeram, Global Head - Test
Management Consulting, Independent Testing Services, CSC,
Hyderabad for guiding me and supporting me in my research
paper.
I would like to express my love affection to my mother Smt.
Shaik Zaibunnisa, My Father Sri. Late S. Mahaboob Basha,
Brothers and Sisters, My spouse Ballary Ameen, Childrens
Shaik Mohammed Wafeeq, Shaik Waneeya, my Niece
Samreen Aaleia for enthusing me throughout the work
REFERENCES
[1] M. Boraso, C. Montangero, and H. Sedehi, Software cost
estimation: An experimental study of model performances,
tech. report, 1996.
[2] T. Menzies, D. Port, Z. Chen, J. Hihn, and S. Stukes,
Validation methods for calibrating software effort models,
in ICSE 05: Proceedings of the 27th international conference
on Software engineering, (New York, NY, USA), pp. 587
595, ACM Press, 2005.
[3] O. Benediktsson, D. Dalcher, K. Reed, and M. Woodman,
COCOMO based effort estimation for iterative and
incremental software development, Software Quality Journal,
vol. 11, pp. 265281, 2003.
[4] Fenton, N. E. and Neil, M. A Critique of Software Defect
Prediction Models, IEEE Transactions on Software
Engineering, 25(5), 675-689, 1999.
[5] Brooks, A. The Mythical Man-Month: Essays on Software
Engineering. Addison-Wesley. Eds. 1995.
[6] Neil, M., Krause, P., Fenton, N. E., Software Quality
Prediction Using Bayesian Networks in Software Engineering
ISSN: 2319-1163
__________________________________________________________________________________________
Volume: 02 Issue: 05 | May-2013, Available @ http://www.ijret.org
744
ISSN: 2319-1163
BIOGRAPHIES
Working as Lead Industry Consultant in
Test Management Consulting group,
AppLabs a Computer Science
Corporation Company, Hyderabad, India.
He has 9 years in IT experience as
Statistician and Statistical modeller. He has
published
14
papers
in
National/International journals. He is expert in Statistical and
Econometric modelling
__________________________________________________________________________________________
Volume: 02 Issue: 05 | May-2013, Available @ http://www.ijret.org
745