Paper 2
Paper 2
net/publication/352840718
CITATIONS READS
2 2,928
4 authors, including:
2 PUBLICATIONS 3 CITATIONS
Bharatividyapeeth's college of Engineering, New Delhi
10 PUBLICATIONS 9 CITATIONS
SEE PROFILE
SEE PROFILE
All content following this page was uploaded by Aadar Pandita on 30 June 2021.
Abstract: Heart disease has been one of the ruling causes for death for quite some time now. About 31% of all deaths every year
in the world take place as a result of cardiovascular diseases [1].
A majority of the patients remain uninformed of their symptoms until quite late while others find it difficult to minimise the
effects of risk factors that cause heart diseases.
Machine Learning Algorithms have been quite efficacious in producing results with a high level of correctness thereby
preventing the onset of heart diseases in many patients and reducing the impact in the ones that are already affected by such
diseases. It has helped medical researchers and doctors all over the world in recognising patterns in the patients resulting in
early detections of heart diseases.
Keywords: Cardiovascular Diseases (CVDs); Support Vector Machine (SVM); K- Nearest Neighbour (KNN); Naive Bayes (NB);
Random Forest (RF); Logistic Regression (LR); Machine Learning (ML); Prediction Model
I. INTRODUCTION
Machine Learning being a subset of Artificial Intelligence has been one of the most vital tools in recent history. It has helped further
the progress of the healthcare industry manifolds. With such a tool, medical professionals and researchers have been able to
diagnose and detect diseases with much accurate precision. It has contributed to saving many lives.
Majority of the heart related diseases can be averted if people focus on their physical activity, have a balanced and nutritious diet
and avoid consumption of products such as tobacco and alcohol that have a damaging effect on the heart and their overall health.
Therefore, it becomes paramount that these diseases are detected as early, so that its effects can be managed with medical advice
and medicines. In this paper, we are going to review various advancements and recent works that have been done using Machine
learning in the prediction of heart diseases. Heart diseases are a result of a multitude of aspects that can influence the cardiovascular
health of an individual such as age, blood sugar, blood pressure, cholesterol etc.
Harshit Jindal et al. [6] implemented a system that uses three different classification algorithms, KNN, RF, LR and results in an
accuracy of 87.5%. In this EHDPS i.e, effective heart disease prediction system, Logistic Regression and KNN outperform RF with
KNN providing an accuracy of 88.52% which is highest amongst the three techniques used.
Aadar Pandita et al. [7] proposed a predictive model that implements 5 machine learning algorithms and uses the technique with the
highest accuracy to build a web application that takes in patient’s medical details and predicts if they have a heart disease or not.
The web application is built using HTML/CSS and Flask based framework. The maximum accuracy obtained was obtained using
KNN, i.e, 89.06% while Logistic Regression contributed with least accuracy of 84.38%.
N. Saranya et al. [8] proposed a time and money efficient model of predicting heart disease using a web application. The model
works on two different methods : Random Forest and KNN. The dataset has been taken from one of Coimbatore’s hospitals which
produces an accuracy of 100% using Random Forest and 91.36% using KNN after cleaning and pre-processing of the dataset. An
ensemble model with and without Logistic Regression is also used to predict the chances with an accuracy of 98.77% and 95.06%
respectively.
Aravind Akella et al. [9] applied six predictive models on the UCI dataset and achieved a maximum accuracy of 93.03% with
Neural Networks with a recall of 93.8 indicating low chances of false negatives and therefore extremely precise result, while the
other five had an accuracy of almost 80% and more.
Ravindhar NV et al. [10] implemented five algorithms: Logistic Regression, Naive Bayes, Fuzzy KNN, K-Means Clustering and
back propagation Neural-Network. A 10-fold cross validation method is used in the experimental analysis of heart conditions. The
maximum accuracy was gathered using back propagation Neural Network with 98.2% accuracy and 87.64% recall and 89.65%
precision.
2020 Devansh Shah Heart disease prediction using Naive Bayes 88.157%
et al. [5] machine learning techniques
KNN 90.789%
SVM 87.50%
SVM 86.52%
KNN 84.27%
III. CONCLUSION
Majority of researchers have used the Cleveland Heart Disease Dataset available from the UCI repository containing 76 attributes
and 303 instances, of which only 14 attributes are used due to missing values [11]. There are huge benefits to having feature
selection methods so as to minimise the number of attributes that one has to use in order to build an accurate model by checking the
correlation between various attributes and their impact on the accuracy of the models. It can be seen from various research papers in
the field that KNN and Neural Network works quite accurately in most cases for the prediction of heart diseases.