Heart Disease Prediction Using Frequent Item Set Mining and Classification Technique
Article in International Journal of Information Engineering and Electronic Business · November 2019
DOI: 10.5815/ijieeb.2019.06.02
All content following this page was uploaded by Sinkon Nayak on 11 February 2021.
Abstract: The heart is the most important part of the human body. Any abnormality in the heart results in heart-related illness, in which blood vessels are obstructed, causing heart attack, chest pain or stroke. The main goal is the care and improvement of health through the identification, prevention and care of any kind of disease. For this purpose, various predictive analysis methods are used, whose job is to identify the illness at a preliminary phase so that heart disease can be prevented and treated. This paper emphasizes the care of heart disease at an early phase so that it leads to a successful cure. In this paper, diverse data mining classification methods, namely Decision Tree, Naive Bayes, Support Vector Machine and k-NN classification, are used for the detection and prevention of the disease.

Index Terms: Heart Disease, Frequent Itemset, Classification, Performance Measurement Parameter.

... are used to detect and prevent the disease at an early stage. For the prediction of heart-related illness, a dataset with 14 attributes and 303 instances is used. Various performance measurement parameters are used, such as accuracy, sensitivity, specificity, positive predictive value, negative predictive value and the area under the curve.

This paper is organized into sections as follows. Section II gives an overview of heart disease. Section III provides a brief literature survey of heart-related disease. The workflow steps are discussed in Section IV. Section V covers the preprocessing of the data and Section VI describes the attribute filtration. Section VII gives a concise discussion of the classification techniques: Naive Bayes, Decision Tree, SVM and k-NN. The dataset collection, the description of its attributes, the comparison study and the result analysis are discussed in Section VIII. Section IX is the conclusion, which summarizes the content.
Copyright © 2019 MECS I.J. Information Engineering and Electronic Business, 2019, 6, 9-15
various diseases. There are numerous symptoms observed in a patient for a particular disease, which define the patient's clinical condition. For filtering the attributes of the heart data, Algorithm 1 is used, in which the frequent item set is calculated.
Table 2. Pros and Cons of Decision Tree Classification Techniques
Pros Cons

Modeling is not expensive. Classifying unknown data is very expensive.
C. SVM
A Support Vector Machine can be described by a hyperplane that separates the data into two parts, one lying on either side. It can be used for classification as well as regression. It is typically applied to data that are noisy and tangled in quality [10,19].
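The hyperplane description above can be illustrated with the decision rule of a linear SVM. This is a sketch only: the weight vector `w` and bias `b` are assumed to have been learned already, their values and the sample points are hypothetical, and the training step (margin maximization) is not shown.

```python
def decision_value(w, b, x):
    """Signed score w.x + b; its sign tells on which side of the hyperplane x lies."""
    return sum(wi * xi for wi, xi in zip(w, x)) + b


def classify(w, b, x):
    """Return +1 for points on the non-negative side of the hyperplane, else -1."""
    return 1 if decision_value(w, b, x) >= 0 else -1


def distance_to_hyperplane(w, b, x):
    """Geometric distance from x to the hyperplane w.x + b = 0."""
    norm = sum(wi * wi for wi in w) ** 0.5
    return abs(decision_value(w, b, x)) / norm


# Hypothetical hyperplane x1 - x2 = 0 in two dimensions.
w = [1.0, -1.0]
b = 0.0
points = [(3.0, 1.0), (1.0, 3.0), (2.0, 2.0)]

labels = [classify(w, b, p) for p in points]
print(labels)
print(distance_to_hyperplane(w, b, points[0]))
```

Points with a positive score fall on one side and points with a negative score on the other; a point exactly on the hyperplane is assigned to the positive class here by convention.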
Table 4. Pros and Cons of SVM Classification Techniques
Pros                                      Cons
Scales well for high-dimensional data.    Sensitive to noisy data.

D. k-NN
The k-NN classifier is an instance-based method for classifying data. k-NN stores all available records and classifies new records on the basis of similarity measures [20].

Fig.5. Detailed Description of Dataset

The computation of accuracy, sensitivity, specificity, area under the curve and the ROC curve uses the confusion matrix shown in Table 6.
Table 7 gives the comparison of the data mining classification algorithms on the basis of various performance parameters without attribute filtration.
Sensitivity, P(+|1), is the percentage of true positives, that is, of patients who have the illness and are correctly predicted to have it:
Sensitivity = TP/(TP+FN) (1)
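As a worked illustration of Equation (1) and the other measures named above, the sketch below derives them from the four cells of a confusion matrix. Only the sensitivity formula appears in this text; the remaining formulas are the standard definitions, and the counts used are hypothetical, not results from the paper.

```python
def confusion_metrics(tp, fp, tn, fn):
    """Compute standard performance measures from confusion-matrix counts."""
    total = tp + fp + tn + fn
    return {
        "accuracy": (tp + tn) / total,   # correct predictions over all cases
        "sensitivity": tp / (tp + fn),   # Equation (1): true positive rate
        "specificity": tn / (tn + fp),   # true negative rate
        "ppv": tp / (tp + fp),           # positive predictive value
        "npv": tn / (tn + fn),           # negative predictive value
    }


# Hypothetical counts for illustration only.
m = confusion_metrics(tp=40, fp=10, tn=35, fn=15)
for name, value in m.items():
    print(f"{name}: {value:.4f}")
```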
k-NN 58.49 50 48.49 44.80 54.7 .6283
SVM 81.13 41.30 56.96 44.53 53.71 .8080
Table 8 gives the comparison of the data mining classification algorithms on the basis of various performance parameters with attribute filtration.

Fig.8. ROC Curve of Various Classifiers with Attribute Filtration

Fig.9. Performance Graph of Various Classifiers with Attribute Filtration

Figure 8 represents the ROC curves of the different classifiers. With attribute filtration, the area under the curve is maximum for the Naive Bayes classifier as compared to the others; in terms of performance, however, k-NN improves while the other classification methods decline. Figure 9 represents the performance of the classification methods with respect to accuracy, sensitivity and specificity in graphical form.

IX. CONCLUSION AND FUTURE SCOPE

This paper focuses on the early prediction of heart disease on the basis of the symptoms observed in a particular patient, so that the patient can get appropriate care and treatment for recovery. Better medical service is needed so that every patient is able to recover, whatever the illness. The key challenge is therefore to provide better care and medical support at a reduced cost. To this end, various predictive analysis methods are used to achieve the required result. This paper describes the detection and prevention of heart disease by diverse classification methods, implemented using the R analytical tool. For the early prediction of heart disease, the accuracy of Naive Bayes is higher than that of the other classifiers. The findings show that prediction accuracy differs from one classifier to another and also depends on the platform. With the R data analytical tool, the accuracy and the area under the curve are highest for the Naive Bayes classifier both with and without attribute filtration; with attribute filtration, the performance of k-NN increases while the performance of the other classifiers decreases. As future work, we will try ensemble techniques to optimize the proposed model and compare the result with the existing one.

REFERENCES
[1] Sundar, N. Aditya, P. Pushpa Latha, and M. Rama Chandra. "Performance analysis of classification data mining techniques over heart disease database." International journal of engineering science & advanced technology 2.3 (2012): 470-478.
[2] Palaniappan, Sellappan, and Rafiah Awang. "Intelligent heart disease prediction system using data mining techniques." 2008 IEEE/ACS international conference on computer systems and applications. IEEE, 2008.
[3] Dangare, Chaitrali S., and Sulabha S. Apte. "Improved study of heart disease prediction system using data mining classification techniques." International Journal of Computer Applications 47.10 (2012): 44-48.
[4] Thomas, J., and R. Theresa Princy. "Human heart disease prediction system using data mining techniques." 2016 International Conference on Circuit, Power and Computing Technologies (ICCPCT). IEEE, 2016.
[5] Wilson, Aswathy, et al. "Data Mining Techniques For Heart Disease Prediction." (2014).
[6] Banu, MA Nishara, and B. Gomathy. "Disease forecasting system using data mining methods." 2014 International conference on intelligent computing applications. IEEE, 2014.
[7] Waghulde, Nilakshi P., and Nilima P. Patil. "Genetic neural approach for heart disease prediction." International Journal of Advanced Computer Research 4.3 (2014): 778.
[8] Database: http://archive.ics.uci.edu/ml/datasets/Heart+Disease
[9] Wu, Xindong, et al. "Data mining with big data." IEEE transactions on knowledge and data engineering 26.1 (2014): 97-107.
[10] Umadevi, S., and KS Jeen Marseline. "A survey on data mining classification algorithms." 2017 International Conference on Signal Processing and Communication (ICSPC). IEEE, 2017.
[11] Tomar, Divya, and Sonali Agarwal. "A survey on Data Mining approaches for Healthcare." International Journal of Bio-Science and Bio-Technology 5.5 (2013): 241-266.
[12] Krishnapuram, B., et al. "A Bayesian approach to joint feature selection and classifier design." IEEE Transactions on Pattern Analysis and Machine Intelligence 26.9 (2004): 1105-1111.
[13] "Heart disease" from http://wikipedia.org
[14] Frawley and Piatetsky-Shapiro, 1996. Knowledge Discovery in Databases: An Overview. The AAAI/MIT Press, Menlo Park, C.A.
[15] "Hospitalization for Heart Attack, Stroke, or Congestive Heart Failure among Persons with Diabetes", Special report: 2001-2003, New Mexico.
[16] "ROC curve" from https://en.wikipedia.org
[17] "Decision Tree" from https://en.wikipedia.org
[18] "Naive Bayes" from https://en.wikipedia.org
[19] "Support Vector Machine" from https://en.wikipedia.org
[20] "K Nearest Neighbour" from https://en.wikipedia.org

Authors' Profiles

Sinkon Nayak is a student currently pursuing an M. Tech (Computer Science and Engineering) at the School of Computer Engineering, KIIT University, Bhubaneswar. Areas of interest include Data Analytics and Data Mining. Contact: [email protected].

Manjusha Pandey, Ph.D (Computer Science), Member of IEEE, is a Professor at the School of Computer Engineering, KIIT University, Bhubaneswar. She has more than a decade of teaching and research experience. Dr. Pandey has published a number of research papers in peer-reviewed international journals and conferences. Her areas of interest are WSN, Data Analytics, etc. She can be reached at [email protected]

Siddharth Swarup Rautaray, Ph.D (Computer Science), Member of IEEE, is a Professor at the School of Computer Engineering, KIIT University, Bhubaneswar. He has more than a decade of teaching and research experience. Dr. Rautaray has published a number of research papers in peer-reviewed international journals and conferences. His areas of interest are Image Processing, Data Analytics and Human Computer Interaction. He can be reached at [email protected]
How to cite this paper: Sinkon Nayak, Mahendra Kumar Gourisaria, Manjusha Pandey, Siddharth Swarup Rautaray, "Heart Disease Prediction Using Frequent Item Set Mining and Classification Technique", International Journal of Information Engineering and Electronic Business (IJIEEB), Vol.11, No.6, pp. 9-15, 2019. DOI: 10.5815/ijieeb.2019.06.02