Comprehensive Study On Machine Learning
Abstract—Software bugs are defects or faults in computer programs or systems that cause incorrect or unexpected operations. They negatively affect software quality, reliability, and maintenance cost; therefore, many researchers have built and developed models for software bug prediction. To date, relatively few works have applied machine learning techniques to software bug prediction. The aim of this paper is to present a comprehensive study on machine learning techniques that have been used successfully to predict software bugs. The paper also presents a software bug prediction model based on four supervised machine learning algorithms, namely Decision Tree (DT), Naïve Bayes (NB), Random Forest (RF), and Logistic Regression (LR), applied to four datasets. We compared the results of our proposed models with those of other studies. The results of this study demonstrate that our proposed models performed better than other models that used the same datasets. The evaluation process and the results of the study show that machine learning algorithms can be used effectively for bug prediction.

Keywords—Static code analysis; software bug prediction; software metrics; machine learning techniques

I. INTRODUCTION

Due to the increasing size and complexity of software products and inadequate software testing, no system or software can claim to be bug free. There are many activities related to software testing, such as implementing processes, procedures, and standards that must be carried out in a specific sequence to ensure that quality objectives are achieved, and testing a product for issues such as software bugs. Bugs are classified in software testing by severity: a major defect causes an observable product failure or deviation from functional requirements; a minor defect does not cause a failure in execution of the product; a fatal defect causes the application or system to crash or close abruptly. Bugs can also be classified into functional defects, performance defects, usability defects, compatibility defects, security defects, etc.

The use of analytical methods to check and review source code is standard development practice. This process can be accomplished manually or automatically using static code analysis tools, dynamic code analysis tools, etc. Recently, many tools have evolved for static code analysis to provide a truly practical, value-added solution to many of the problems that software development organizations face. However, they produce numerous false positive and false negative results, which make these tools hard to use in practice. Another methodology or approach is therefore needed for static code analysis, such as Machine Learning (ML) algorithms [1], [9], [12].

Software bugs usually appear during the software development process. They are often difficult to detect or identify, and developers spend a large amount of time locating and fixing them; moreover, some bugs cannot be detected at an early phase of development. To relieve the burden of bug fixing, researchers have conducted many extensive studies on bug prediction, and many machine-learning-driven prediction models have been built and tested on various bases. Software bug reporting is an important part of software maintenance, but assigning bug reports can be very expensive in large software development projects, so many studies suggest automating bug assignment using machine learning in open-source software.

Software Bug Prediction (SBP) plays a vital role in improving software product quality. SBP is the process of generating machine learning models (classifiers) that predict software (code) defects based on historical data. The most recent methodologies used to predict software bugs are supervised (classification) machine learning models, and with recent advances in machine learning techniques, new models have emerged with enhanced performance and capabilities for predicting software bugs [2]. Classification is a major task of data analysis using machine learning algorithms that allow the machine to learn associations between instances and decision labels, from which an algorithm builds a model to predict the labels of new instances in a given sample. In machine learning, classification can be categorized into three types: binary (yes or no), multi-class, and multi-label classification [5], [25].

To build a dataset containing useful buggy code element characterization information, we chose the PROMISE Repository, which stores software metrics along with bug information for many projects; these datasets were collected from real software projects by NASA [26]. The objective of this study is to investigate the previous studies that used the most effective machine learning techniques for software bug prediction. In this paper, four supervised machine learning models are identified and applied to four different datasets to evaluate the capabilities of machine learning algorithms in software bug prediction. The paper compares the proposed models using various performance measures, such as accuracy, precision, recall, F1-score, and ROC curves.
726 | P a g e
www.ijacsa.thesai.org
(IJACSA) International Journal of Advanced Computer Science and Applications,
Vol. 12, No. 8, 2021
The structure of this study is organized as follows. Section 2 presents a discussion on software bug prediction by analyzing static code. An overview of machine learning techniques is presented in Section 3, followed by the literature review in Section 4. Section 5 presents our research methodology, and Section 6 presents the software metrics and datasets. An overview of the selected machine learning classifiers and their evaluation is presented in Sections 7 and 8. Section 9 presents the experimental results and discussion, followed by conclusions and future work in Section 10.

II. SOFTWARE BUG PREDICTION BY ANALYZING STATIC CODE

Static code analysis is a method of analyzing software code without executing it, in order to find potential problems such as defects or bugs that might arise at runtime, to check the quality of the source code, and to address weaknesses in the program by evaluating and correcting the source code with respect to factors such as structure, content, and documentation. Many commercial and open-source tools have been developed for static code analysis [3], [24]. These tools remove the unnecessary fuzz from source code and perform automated checks to improve and ensure a certain level of quality. This can be performed very early in the development process, during which the code must pass many formal tests to be considered bug free. There are several ways of analyzing static code, for example by exploiting the natural language found within a program's text or by checking compliance with different coding standards. These types of analysis may be manual, which is usually very time consuming (e.g., code inspections), or automated using one or more tools.

Software Bug Prediction (SBP) is considered a vital activity during software development and maintenance. SBP is a methodology for locating bugs in a software module by considering software metrics as parameters [4]. Numerous studies have confirmed that machine learning techniques are suitable for predicting software bugs and identifying defective code [5], [6], [9]. Bug reports are basic software development tools that describe software bugs, especially in open-source software [7], [30]. To guarantee software quality, many projects use bug reports to gather and record the bugs reported [8]. Bugs are classified into two classes: intrinsic bugs, which were introduced by one or more specific changes to the source code, and extrinsic bugs, which were introduced by changes not recorded in the version control system [5], [18]. Several techniques have been developed over the years to automatically detect bugs in source code; often, these techniques depend on formal methods and program analysis. Many studies in the literature use code features as input for machine learning algorithms to perform bug prediction, and the machine learning algorithms most used to detect software bugs are classification techniques [10].

III. MACHINE LEARNING TECHNIQUES

Machine learning is an area of research in which computer programs learn and get better at performing specific tasks by training on historical data [2]. Machine learning algorithms can be applied to analyze data from different perspectives and allow developers to obtain useful information [10], [38]. High quantities of data are needed to develop machine-learning-based prediction models [11], [31], [33]. Machine learning algorithms build models from training examples, which are then used to make predictions when faced with new examples. Supervised learning is a type of machine learning algorithm that builds a prediction model by training on labeled data to execute the prediction task. The goal of supervised machine learning algorithms is to develop an inference function by learning relationships between the independent variables (inputs) and dependent variables (outputs) of the training datasets [5], [27]. Classification is a method that uses a data mining or machine learning approach to classify data. Classification techniques deal with a software component, called a classifier, that is invoked with inputs (features). Features are extracted from the training data examples as text, numbers, or nominal values. Bug prediction is one application of machine learning that aims to identify critical pieces of source code that potentially contain defects. This process can be used in software projects to gain insight into how and where bugs happen, in order to enhance software quality.

IV. LITERATURE REVIEW

Software bug prediction is one of the most popular research areas in software engineering. The major aim of software bug prediction is to detect bugs in software modules by considering software metrics as input (parameters). The research described in this paper presents a comprehensive study on machine learning techniques for software bug prediction. The following covers the recent literature related to bug prediction; considerable research has been performed on software bug prediction using machine learning techniques. For example, Wang et al. in [1] proposed an approach combining code contexts and neural networks to detect bugs. The results show that the tool can achieve a relative improvement of up to 160% on F-score, and that it can detect 48 true bugs in the list of the top 100 reported bugs. Jonsson et al. in [2] evaluated automated bug assignment techniques based on machine learning classification; their results show prediction accuracies between 50% and 90% when large training sets are used. Chappell et al. in [3] reported on using machine learning techniques for finding bugs in C programs. Hammouri et al. in [5] presented a machine learning model for software bug prediction; the experiment was conducted with three supervised machine learning algorithms, Naïve Bayes, Decision Tree, and Artificial Neural Networks, to predict future software bugs based on historical data. The results show that the use of machine learning algorithms is effective and leads to a high rate of accuracy, and the comparison showed that the Decision Tree (DT) classifier has the best results over the others. Kumar Pandey et al. in [6] compared various Bayesian network classifiers with random forest and examined how useful they are for bug prediction; the experimental results revealed that the Bayesian network is better than random forest. Meenakshi et al. in [7] proposed various ML models for software bug prediction; the experimental results demonstrated that machine learning techniques are efficient and suitable approaches to predict future software bugs, and the comparison again showed that the DT classifier has the best results over the others.
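Returning to the supervised workflow described in Section III, the train-then-predict loop can be illustrated with a deliberately simple one-feature model. This is an illustrative sketch on synthetic data, not code or data from the paper; real studies use the full metric sets of Section VI and far richer models:

```python
import random

# Synthetic (feature, label) pairs: one code metric (lines of code) per module,
# label 1 = defective, 0 = clean.  Illustrative only, not data from the paper.
random.seed(0)
data = [(random.randint(10, 200), 0) for _ in range(40)] + \
       [(random.randint(150, 400), 1) for _ in range(40)]
random.shuffle(data)

train, test = data[:60], data[60:]

def fit_threshold(examples):
    """'Training': pick the LOC threshold that best separates the labeled examples."""
    best_t, best_acc = None, -1.0
    for t in sorted({loc for loc, _ in examples}):
        acc = sum((loc > t) == bool(y) for loc, y in examples) / len(examples)
        if acc > best_acc:
            best_t, best_acc = t, acc
    return best_t

threshold = fit_threshold(train)

# "Prediction": apply the learned rule to unseen modules.
predictions = [int(loc > threshold) for loc, _ in test]
accuracy = sum(p == y for p, (_, y) in zip(predictions, test)) / len(test)
print(f"learned threshold={threshold}, test accuracy={accuracy:.2f}")
```

The point of the sketch is only the structure: a model is fitted on labeled training data, then evaluated on examples it has never seen.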
1) RQ1: Which ML models have been used for software bug prediction?: To answer this research question, this study identified the machine learning models commonly used for software bug prediction in previous studies, as shown in Fig. 2. These models are:

• Decision Tree (DT): a popular learning method used in data mining and machine learning for both regression and classification. It is a hierarchical model, a tree with decision nodes that have more than one branch and leaf nodes that represent the decision. Each node in a decision tree represents a feature of an instance to be classified, and each branch represents a value threshold the node can assume. Instances are categorized beginning at the root node and sorted based on their attribute values [5], [29].

• Naïve Bayes (NB): a simple and efficient supervised probabilistic classifier based on Bayes' theorem with an independence assumption between the features; the Naive Bayes classifier estimates the probabilities of the unobserved node from the observed probabilities [5], [22].

• Artificial Neural Networks (ANNs): machine learning models, or nonlinear classifiers, used to model complex relationships between inputs and outputs for classification purposes. An ANN model contains multiple information-processing units known as neurons, arranged in layers typically named the input layer, hidden layer, and output layer [5]. When implementing a neural network, a set of consistent training values must be available to set up the expected operation of the network, along with a set of validation values to validate the training process [14].

• Random Forest (RF): one of the most widely used models, owing to its simplicity and the fact that it can be used for both classification and regression tasks. It is a flexible, easy-to-use machine learning algorithm that performs well even without hyper-parameter tuning [23].

• Support Vector Machine (SVM): a supervised machine learning model and a comparatively novel learning approach used for binary classification. Its primary role is to discover a hyperplane that completely divides the dimensional data into two categories [15], [32].

• Deep Learning (DL): an artificial intelligence approach that mimics the workings of the human brain. It allows and helps to solve complex problems using data that are very diverse, unstructured, and interconnected [40].

• K-Nearest Neighbor (KNN): a simple supervised classification algorithm in which an object is classified by looking at the K nearest objects and choosing the most frequently occurring class among them [28].

• Logistic Regression (LR): a statistical classification technique based on maximum likelihood estimation, meant for predicting the likelihood that an entity belongs to one class or another [16], [28], [37], [39].

Fig. 2. Number of Studies across ML Techniques based on Classifications.

2) RQ2: How have these models been trained and what languages have been used? To answer this research question: an essential issue of software bug prediction with machine learning techniques is how to train and test the model [17]. A large and representative dataset is the basis for training and testing machine learning models. Accordingly, in the literature review and in our experimental study, different large datasets and different programming languages, such as C, C++, and Java, have been used to train machine learning models.

3) RQ3: Which performance measures are used for software bug prediction? To answer this research question: several measures are used for gauging the performance of different machine learning models. These performance measures are used for comparing and evaluating models developed using various machine learning techniques. The number of studies using each performance measure is depicted in Fig. 3. The most used performance metric is accuracy, closely followed by recall, precision, and F1-score; less common metrics are H-measure, Area Under the Curve (AUC), and the Receiver Operating Characteristics (ROC) curve.
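The four classifiers proposed in this paper (DT, NB, RF, LR) can be instantiated in a few lines with scikit-learn, one common toolkit (the surveyed studies used various tools). This is a minimal sketch on a synthetic dataset standing in for the NASA metric data, not the paper's actual experiment:

```python
from sklearn.datasets import make_classification
from sklearn.ensemble import RandomForestClassifier
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import train_test_split
from sklearn.naive_bayes import GaussianNB
from sklearn.tree import DecisionTreeClassifier

# Synthetic stand-in for a metrics dataset: 200 modules, 8 numeric features.
X, y = make_classification(n_samples=200, n_features=8, random_state=42)
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.25, random_state=42)

models = {
    "DT": DecisionTreeClassifier(random_state=42),
    "NB": GaussianNB(),
    "RF": RandomForestClassifier(random_state=42),
    "LR": LogisticRegression(max_iter=1000),
}

scores = {}
for name, model in models.items():
    model.fit(X_train, y_train)                 # train on labeled examples
    scores[name] = model.score(X_test, y_test)  # accuracy on held-out examples
    print(f"{name}: accuracy = {scores[name]:.2f}")
```

The same loop extends naturally to the other surveyed models (SVM, KNN, etc.) by adding entries to the `models` dictionary.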
4) RQ4: What conclusions can we draw about the efficiency of machine learning techniques for software bug prediction from the results presented in the selected studies?: To answer this research question, this study evaluates the best machine learning techniques for developing an effective software bug prediction model by assessing the prediction models presented in previous studies. Different machine learning techniques have different characteristics, such as speed, accuracy, interpretability, and simplicity. This study focused on the studies that applied the most-used machine learning algorithms and performance measures. Looking at the results achieved in the literature review and the results achieved in our study, machine learning techniques are well applicable to static code analysis for software bug prediction.

VI. SOFTWARE METRICS (FEATURES) AND DATASETS

Software metrics are a quantitative, standard measure of some property of software that assigns numbers or symbols to attributes of the measured entity. Software metrics can be used to collect information regarding structural properties of a software design, which can be further statistically analyzed, interpreted, and linked to quality. Metrics related to complexity, cohesion, and coupling can be measured during development phases such as design or coding and are also used to assess the quality of software [19], [34], [36]. Software metrics can be classified into static code metrics and process metrics. Static code metrics can be directly extracted from source code, such as Lines of Code (LOC) and the Cyclomatic Complexity Number (CCN). Object-oriented metrics are a subcategory of static code metrics, such as Depth of Inheritance Tree (DIT), Coupling Between Objects (CBO), Number of Children (NOC), and Response for Class (RFC). Process metrics can be extracted from a source code management system based on historic changes to the source code over time. Metrics can also be classified by the development phase of the software life cycle into source code level metrics, detailed design level metrics, or test level metrics. Object-oriented metrics are often used to assess the testability, maintainability, or reusability of source code [20], [35].

A commonly used dataset for the software bug prediction domain is the PROMISE repository. To perform this experiment, the data were obtained from publicly available, published defect prediction datasets that store software metrics along with defect information for several projects; these datasets were collected from real software projects by NASA. These public-domain datasets are used in this experiment because doing so is a benchmarking procedure of defect prediction research [17], [21]. To perform machine learning on the available source code, it is necessary to establish a set of extractable features that contain the needed information. Many studies [4], [6], [7], [14] use software metrics as independent variables to measure the quality of software modules and build software bug prediction models. It is intuitive to think that the bug proneness of a module is correlated with its complexity; therefore, bug prediction studies usually employ product metrics to improve prediction accuracy. The projects used in this study were developed using different programming languages and include heterogeneous code metrics such as Object-Oriented (OO) metrics, Halstead metrics, Lines of Code (LoC), and McCabe complexity. Various defect detection methods (black box probing, automatic formal methods, etc.) and different machine learning models (linear regression, the M5' model tree learner, and the J48 decision tree learner) have been applied to these projects [10]. Table III and Table IV show the information about the datasets and software metrics (features).

TABLE III. DESCRIPTIONS OF DATASETS (PROJECTS) USED IN THIS STUDY

Project | # Modules | % Defects | Language | Description
JM1 | 10885 | 19% | C | Real-time predictive ground system: uses simulations to generate predictions.
PC1 | 1107 | 6.8% | C | Flight software for an earth-orbiting satellite.
KC1 | 2107 | 15.4% | C++ | Storage management for receiving and processing ground data.
KC2 | 523 | 20% | C++ | Software for science data processing.

TABLE IV. DESCRIPTIONS OF SOFTWARE METRICS (FEATURES) USED IN THIS STUDY

Metric | Type | Description
loc | McCabe | Count of the lines of code in the software module.
v(g) | McCabe | McCabe cyclomatic complexity.
ev(g) | McCabe | McCabe essential complexity.
iv(g) | McCabe | McCabe design complexity.
N | Derived Halstead | Total number of operators and operands.
V | Derived Halstead | Volume.
L | Derived Halstead | Program length.
D | Derived Halstead | Difficulty measure.
I | Derived Halstead | Intelligence measure.
E | Derived Halstead | Effort measure.
B | Derived Halstead | Effort estimate.
T | Derived Halstead | Time estimator.
LOCode | Line Count | Number of lines in the software module.
LOComment | Line Count | Number of comments.
LOBlank | Line Count | Number of blank lines.
LOCodeAndComment | Line Count | Number of lines of code and comments.
uniq_op | Basic Halstead | Unique operators.
uniq_opnd | Basic Halstead | Unique operands.
total_op | Basic Halstead | Total operators.
total_opnd | Basic Halstead | Total operands.
branchCount | Branch | Total number of branches.
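Datasets of this kind are typically distributed as CSV files whose columns carry the metric names of Table IV plus a defect label. The following stdlib-only sketch shows one way to split such rows into features (independent variables) and labels (dependent variable); the three data rows are made up for illustration, not real JM1/PC1 measurements:

```python
import csv
import io

# A tiny inline stand-in for a PROMISE-style metrics file.  Column names follow
# Table IV; the values are hypothetical.
raw = """loc,v(g),ev(g),iv(g),uniq_op,uniq_opnd,branchCount,defects
42,3,1,2,10,14,5,false
310,24,9,14,31,55,47,true
17,1,1,1,6,7,1,false
"""

features, labels = [], []
for row in csv.DictReader(io.StringIO(raw)):
    # The defect flag is the dependent variable (label)...
    labels.append(row.pop("defects") == "true")
    # ...and the remaining metric columns are the independent variables.
    features.append({k: float(v) for k, v in row.items()})

print(labels)              # one boolean per module
print(features[1]["loc"])  # metric value of the second module
```

For a real file, `io.StringIO(raw)` would simply be replaced by `open(...)` on the dataset path.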
Performance measures of the proposed models:

Proposed model | Accuracy | Precision | Recall | F1-score
DT | 0.99 | 0.99 | 1.00 | 0.99
NB | 0.80 | 0.81 | 0.97 | 0.89
RF | 0.99 | 0.99 | 1.00 | 0.99
LR | 0.81 | 0.82 | 0.99 | 0.89

Fig. 5. Average of Accuracy Measure of Models across the JM1 and PC1 Dataset.

Proposed model | Accuracy | Precision | Recall | F1-score
DT | 0.99 | 0.99 | 1.00 | 1.00
NB | 0.91 | 0.94 | 0.96 | 0.95
RF | 0.99 | 0.99 | 1.00 | 1.00
LR | 0.93 | 0.94 | 0.99 | 0.96

Proposed model | Accuracy | Precision | Recall | F1-score
DT | 0.99 | 0.99 | 1.00 | 0.99
NB | 0.85 | 0.88 | 0.96 | 0.92
RF | 0.99 | 0.99 | 1.00 | 0.99
LR | 0.85 | 0.87 | 0.96 | 0.92

Fig. 6. Average of Accuracy Measure of Models across the KC1 and KC2 Dataset.
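The measures reported above all derive from a model's confusion matrix. A plain-Python sketch of how they are computed (the counts below are illustrative, not those of the paper's models):

```python
def classification_report(tp, fp, fn, tn):
    """Compute the four measures from true/false positive/negative counts."""
    accuracy = (tp + tn) / (tp + fp + fn + tn)     # fraction predicted correctly
    precision = tp / (tp + fp)                     # correctness of "buggy" calls
    recall = tp / (tp + fn)                        # fraction of real bugs found
    f1 = 2 * precision * recall / (precision + recall)  # harmonic mean
    return accuracy, precision, recall, f1

# Hypothetical counts for a classifier on 200 modules.
acc, prec, rec, f1 = classification_report(tp=90, fp=10, fn=5, tn=95)
print(f"accuracy={acc:.2f} precision={prec:.2f} recall={rec:.2f} F1={f1:.2f}")
```

Note how accuracy alone can mislead on imbalanced defect data (e.g., PC1 at 6.8% defects), which is why precision, recall, and F1 are reported alongside it.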
Fig. 7. Comparison of ROC Curves for Models across the JM1 Dataset.
Fig. 8. Comparison of ROC Curves for Models across the PC1 Dataset.
Fig. 9. Comparison of ROC Curves for Models across the KC1 Dataset.
Fig. 10. Comparison of ROC Curves for Models across the KC2 Dataset.
TABLE X. COMPARING THE RESULTS OF OUR STUDY WITH THE RESULTS OF STUDIES WHICH USE THE SAME DATASETS AND ALGORITHMS ACROSS THE JM1 AND PC1 DATASETS

JM1 dataset:
Performance measure | ML model | First study | Second study | Third study | Our study
Accuracy | DT | - | - | 0.81 | 0.99
Accuracy | NB | - | - | 0.81 | 0.80
Accuracy | RF | - | - | 0.82 | 0.99
F1-score | DT | - | - | 0.90 | 0.99
F1-score | NB | 0.75 | - | 0.89 | 0.89
F1-score | RF | 0.76 | - | 0.90 | 0.99
F1-score | LR | 0.74 | - | - | 0.89

PC1 dataset:
Performance measure | ML model | First study | Second study | Third study | Our study
Accuracy | DT | - | - | 0.93 | 0.99
Accuracy | NB | - | - | 0.88 | 0.91
Accuracy | RF | - | - | 0.93 | 0.99
F1-score | DT | - | - | 0.97 | 1.00
F1-score | NB | 0.89 | - | 0.94 | 0.95
F1-score | RF | 0.91 | - | 0.97 | 1.00
F1-score | LR | 0.91 | - | - | 0.96

TABLE XI. COMPARING THE RESULTS OF OUR STUDY WITH THE RESULTS OF STUDIES WHICH USE THE SAME DATASETS AND ALGORITHMS ACROSS THE KC1 AND KC2 DATASETS

KC1 dataset:
Performance measure | ML model | First study | Second study | Third study | Our study
Accuracy | DT | - | - | 0.84 | 0.99
Accuracy | NB | - | 0.82 | 0.82 | 0.85
Accuracy | RF | - | - | 0.85 | 0.99
Precision | NB | - | 0.80 | - | 0.88
Recall | NB | - | 0.83 | - | 0.96
F1-score | DT | - | - | 0.92 | 0.99
F1-score | NB | 0.82 | 0.81 | 0.90 | 0.92
F1-score | RF | 0.82 | - | 0.92 | 0.99
F1-score | LR | 0.81 | - | - | 0.92

KC2 dataset:
Performance measure | ML model | First study | Second study | Third study | Our study
Accuracy | DT | - | - | 0.82 | 0.98
Accuracy | NB | - | - | 0.84 | 0.83
Accuracy | RF | - | - | 0.82 | 0.98
F1-score | DT | - | - | 0.89 | 0.99
F1-score | NB | 0.80 | - | 0.90 | 0.90
F1-score | RF | 0.76 | - | 0.89 | 0.99
F1-score | LR | 0.79 | - | - | 0.91

X. CONCLUSION

Software bug prediction is a very important field in static code analysis for improving software quality and reliability. It is an approach in which a prediction model is constructed for the purpose of predicting future software defects from historical data using software metrics. Many approaches have been presented using various datasets, various metrics, and various performance measures. The aims of this study were successfully achieved: to evaluate and present a comprehensive study of the machine learning techniques used for software bug prediction in recent years, and to apply the best of these techniques for software bug prediction in this study. To compare and evaluate the performance of the proposed models, we used different performance measures. The results show that ML techniques are gaining interest in software bug prediction as a way to improve the efficiency of bug detection. Four NASA public datasets were chosen for this experiment to analyze the performance of the models, and the experimental results revealed that the DT and RF classifiers perform better than the other classifiers. Static code analysis requires further research on identifying and detecting software bugs, and several machine learning techniques can be used to improve the results. As future work, we plan to introduce other machine learning techniques with data balancing techniques to improve the accuracy of software bug prediction.

ACKNOWLEDGMENT

The authors gratefully acknowledge the financial assistance from the Institute of Information Science, Faculty of Mechanical Engineering and Informatics, University of Miskolc.

REFERENCES
[1] Y. Li, S. Wang, T. N. Nguyen, and S. V. Nguyen, "Improving bug detection via context-based code representation learning and attention-based neural networks", Proceedings of the ACM on Programming Languages, vol. 3, OOPSLA, paper no. 162, pp. 1-30, 2019.
[2] L. Jonsson, M. Borg, D. Broman, K. Sandahl, S. Eldh, and P. Runeson, "Automated bug assignment: Ensemble-based machine learning in large scale industrial contexts", Empirical Software Engineering, vol. 21, pp. 1533-1578, 2016.
[3] T. Chappelly, C. Cifuentes, P. Krishnan, and S. Gevay, "Machine learning for finding bugs: An initial report", in IEEE Workshop on Machine Learning Techniques for Software Quality Evaluation, Klagenfurt, Austria, 21 February 2017, pp. 21-26.
[4] S. K. Pandey, R. B. Mishra, and A. K. Tripathi, "BPDET: An effective software bug prediction model using deep representation and ensemble learning techniques", Expert Systems with Applications, vol. 144, paper no. 113085, 2020.
[5] A. Hammouri, M. Hammad, M. Alnabhan, and F. Alsarayrah, "Software bug prediction using machine learning approach", International Journal of Advanced Computer Science and Applications, vol. 9, no. 2, pp. 78-83, 2018.
[6] S. K. Pandey, R. B. Mishra, and A. K. Triphathi, "Software bug prediction prototype using Bayesian network classifier: A comprehensive model", Procedia Computer Science, vol. 132, pp. 1412-1421, 2018.
[7] S. S. Meenakshi, "Software bug prediction using machine learning approach", International Research Journal of Engineering and Technology, vol. 6, no. 4, pp. 4968-4971, 2019.
[8] I. U. N. Uqaili and S. N. Ahsan, "Machine learning based prediction of complex bugs in source code", The International Arab Journal of Information Technology, vol. 17, no. 1, pp. 26-37, 2020.
[9] Károly, Nehéz, and Khleel Nasraldeen Alnor Adam. "Tools, processes and factors influencing of code review." Multidiszciplináris Tudományok 10.3 (2020): 277-284.
[10] Aleem, Saiqa, Luiz Fernando Capretz, and Faheem Ahmed. "Comparative performance analysis of machine learning techniques for software bug detection." ITCS, CST, JSE, SIP, ARIA, DMS (2015): 71-79.
[11] M. J. Islam, P. Pan, G. Nguyen, and H. Rajan, "A comprehensive study on deep learning bug characteristics", in Proceedings of the 2019 27th ACM Joint Meeting on European Software Engineering Conference and Symposium on the Foundations of Software Engineering, Tallinn, Estonia, 26-30 August 2019, pp. 1-11, 2019.
[12] S. Gitika, S. Sharma, and S. Gujral, "A novel way of assessing software bug severity using dictionary of critical terms", Procedia Computer Science, vol. 70, pp. 632-639, 2015.
[13] P. Maltare and V. Sharma, "Implementation advance technique for prediction bug using machine learning", International Journal of Computer Science and Information Technologies, vol. 8, no. 1, pp. 16-19, 2017.
[14] S. D. Immaculate, M. F. Begam, and M. Floramary, "Software bug prediction using supervised machine learning algorithms", in International Conference on Data Science and Communication, Bangalore, India, 1-2 March 2019, pp. 1-7, 2019.
[15] G. Rodríguez-Pérez, A. Serebrenik, A. Zaidman, D. M. Germán, and J. M. Gonzalez-Barahona, "How bugs are born: a model to identify how bugs are introduced in software components", Empirical Software Engineering, vol. 25, pp. 1294-1340, 2020.
[16] M. Sharma, P. Bedi, K. K. Chaturvedi, and V. B. Singh, "Predicting the priority of a reported bug using machine learning techniques and cross project validation", in 12th International Conference on Intelligent Systems Design and Applications, Kochi, India, 27-29 November 2012, pp. 539-545, 2012.
[17] Shirabad, J. Sayyad, and Tim J. Menzies. "The PROMISE repository of software engineering databases." School of Information Technology and Engineering, University of Ottawa, Canada 24 (2005).
[18] M. Efendioglu, A. Sen, and Y. Koroglu, "Bug prediction of SystemC models using machine learning", IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems, vol. 38, no. 3, pp. 419-429, 2019.
[19] Rajkumar, V., and V. Venkatesh. "Hybrid Approach for Fault Prediction in Object-Oriented Systems." (2017).
[20] Meiliana, Syaeful Karim, et al. "Software Metrics for Fault Prediction Using Machine Learning Approaches." IEEE (2017).
[21] Iqbal, Ahmed, et al. "Performance analysis of machine learning techniques on software defect prediction using NASA datasets." Int. J. Adv. Comput. Sci. Appl. 10.5 (2019): 300-308.
[22] Baarah, Aladdin, et al. "Machine learning approaches for predicting the severity level of software bug reports in closed source projects." Mach Learn (2019).
[23] Kukkar, Ashima, et al. "A novel deep-learning-based bug severity classification technique using convolutional neural networks and random forest with boosting." Sensors 19.13 (2019): 2964.
[24] Moustafa, Sammar, et al. "Software bug prediction using weighted majority voting techniques." Alexandria Engineering Journal 57.4 (2018): 2763-2774.
[25] Öztürk, Elife, Kökten Ulaş Birant, and Derya Birant. "An Ordinal Classification Approach for Software Bug Prediction." Dokuz Eylül Üniversitesi Mühendislik Fakültesi Fen ve Mühendislik Dergisi 21.62 (2019): 533-544.
[26] Ferenc, Rudolf, et al. "An automatically created novel bug dataset and its validation in bug prediction." Journal of Systems and Software 169 (2020): 110691.
[27] Pecorelli, Fabiano, and Dario Di Nucci. "Adaptive selection of classifiers for bug prediction: A large-scale empirical analysis of its performances and a benchmark study." Science of Computer Programming 205 (2021): 102611.
[28] Sharma, Shubham, and Sandeep Kumar. "Analysis of Ensemble Models for Aging Related Bug Prediction in Software Systems." ICSOFT, 2018.
[29] Kumar, Raj. "Multiclass Software Bug Severity Classification using Decision Tree, Naive Bayes and Bagging." Turkish Journal of Computer and Mathematics Education (TURCOMAT) 12.2 (2021): 1859-1865.
[30] Ferenc, Rudolf, et al. "Deep learning in static, metric-based bug prediction." Array 6 (2020): 100021.
[31] Ye, Xin, et al. "Bug Report Classification using LSTM architecture for more accurate software defect locating." 2018 17th IEEE International Conference on Machine Learning and Applications (ICMLA), IEEE, 2018.
[32] Bani-Salameh, Hani, and Mohammed Sallam. "A Deep-Learning-Based Bug Priority Prediction Using RNN-LSTM Neural Networks." e-Informatica Software Engineering Journal 15.1 (2021).
[33] Pascarella, Luca, Fabio Palomba, and Alberto Bacchelli. "Re-evaluating method-level bug prediction." 2018 IEEE 25th International Conference on Software Analysis, Evolution and Reengineering (SANER), IEEE, 2018.
[34] Puranik, Shruthi, Pranav Deshpande, and K. Chandrasekaran. "A novel machine learning approach for bug prediction." Procedia Computer Science 93 (2016): 924-930.
[35] Saharudin, S. N., K. T. Wei, and K. S. Na. "Machine Learning Techniques for Software Bug Prediction: A Systematic Review." Journal of Computer Science 16.11 (2020): 1558-1569.
[36] Gupta, Varuna, N. Ganeshan, and Tarun K. Singhal. "Developing software bug prediction models using various software metrics as the bug indicators." International Journal of Advanced Computer Science and Applications (IJACSA) 6.2 (2015).
[37] Baarah, Aladdin, et al. "Machine learning approaches for predicting the severity level of software bug reports in closed source projects." International Journal of Advanced Computer Science and Applications 10.10 (2019).
[38] Qin, Fangyun, Xiaohui Wan, and Beibei Yin. "An empirical study of factors affecting cross-project aging-related bug prediction with TLAP." Software Quality Journal 28.1 (2020): 107-134.
[39] Qin, Fangyun, et al. "Studying aging-related bug prediction using cross-project models." IEEE Transactions on Reliability 68.3 (2018): 1134-1153.
[40] Gupta, Som, and Sanjai Kumar Gupta. "A Systematic Study of Duplicate Bug Report Detection." International Journal of Advanced Computer Science and Applications (IJACSA) 12.1 (2021).