0% found this document useful (0 votes)
57 views8 pages

Comprehensive Review of Machine Learning Applications in Heart Disease Prediction

Heart infections are responsible, for deaths and nowthey are a major contributor to depression in many individuals. To prevent fatalities, regular monitoring and early identification of heart conditions can significantly reduce the number of deaths.Detecting heart disease has become a task in the analysis ofdata. While accurately predicting heart infections may pose challenges employing advanced machine learning techniques can make it easier.
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
57 views8 pages

Comprehensive Review of Machine Learning Applications in Heart Disease Prediction

Heart infections are responsible, for deaths and nowthey are a major contributor to depression in many individuals. To prevent fatalities, regular monitoring and early identification of heart conditions can significantly reduce the number of deaths.Detecting heart disease has become a task in the analysis ofdata. While accurately predicting heart infections may pose challenges employing advanced machine learning techniques can make it easier.
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 8

Volume 9, Issue 7, July – 2024 International Journal of Innovative Science and Research Technology

ISSN No:-2456-2165 https://doi.org/10.38124/ijisrt/IJISRT24JUL1871

Comprehensive Review of Machine Learning


Applications in Heart Disease Prediction
Yogesh Kumar1; Geet Kiran Kaur2; Ranjit Singh3
Department of CSE Chandigarh UniversityChandigarh, INDIA

Abstract:- Heart infections are responsible, for deaths and heart disease treatment, as there is advanced development in
now they are a major contributor to depression in many the diagnosis of heart diseases and well informed population.
individuals. To prevent fatalities, regular monitoring and Early detection of heart disease is crucial as it can
early identification of heart conditions can significantly significantly improve the chancesof effective treatment and
reduce the number of deaths. Detecting heart disease has reduce mortality rates (see Figure1). The factors that cause
become a task in the analysis of data. While accurately heart diseases are also grouped into the reversible and fixed
predicting heart infections may pose challenges employing components. These are factors that cannot change
advanced machine learning techniques can make it easier. biologically status and encompasses age, gender, genes
Studies have shown that machine learning methods can among others. Again, whereas non-modifiable risk factors are
effectively predict heart disease enabling detection and beyond one’s control, modifiable risk factors refer to the
assessment of its severity. This approach aims to lower lifestyle factors. These include being over-weight, smoking,
mortality rates decrease the severity of the illness and no exercise, poor diet, high blood pressure and high levels
facilitate diagnosis. The field of therapy is undergoing of cholesterol respectively. It is thus advisable to work on the
advancements through the integration of machine alterable risk factors as a way of decreasing the prevalence
learning techniques leading to enhanced accuracy in and severity of heart diseases. Another difficulty that is linked
interpreting analyses. These techniques play a role, in intimately with heart diseases is the extent of public ignorance
identifying indicators for predicting cardiac diseases with about these factors. This lack of knowledge, most of the time,
precision. The presentation is put together using leads to late presentation and hence delayed intervention,
categorization techniques, such, as Decision Tree (DT) which is followed by severe health consequences, or even
K Nearest Neighbors (K NN) Random Forest (RF) and mortality. It is extensive knowledge among the medical
Support Vector Machine (SVM). The performance of communities and the providers in the health care industry that
these four algorithms is assessed from angles, including early detection and preventive measures are optimal when it
specificity, recall, accuracy and precision. While precision comes to heart diseases. Heart diseases if diagnosed on time
varies SVM appears to deliver the results in this approach are usually manageable; however, if diagnosed at a senior
for calculations, in many instances. stage are fatal. Better diagnosis enhanced by improved
technology and knowledgeable populace will encourage early
Keywords:- Heart Disease, Machine Learning, Prediction, health care and intervention, better results.
Supervised Learning, Unsupervised Learning, Deep
Learning.

I. INTRODUCTION

Cardiovascular disease as a whole including different


forms, is among the principal causes of death in the global
population; it comprises of coronary artery diseases,
congestive heart failure, cardiac arrhythmias and valvular
heart diseases. Thisis why, greater emphasis should be
placed on detection ofthis disease in its early stages, so
that an influx of patientsdoes not overwhelm the medical
practitioners and mortality rates soar. Though certain triggers
such as heredity and age remain fixed, actual preventive
measures include the use of healthy food, regular exercise,
gaining control of weight and averting high blood pressure. It
is vital for the public to familiarize itself with these factors in Fig 1: Early Detection of Heart Disease
order to detect their presence and seek needed treatment on
time. Preventive check and early detection forms the basis of

IJISRT24JUL1871 www.ijisrt.com 2805


Volume 9, Issue 7, July – 2024 International Journal of Innovative Science and Research Technology
ISSN No:-2456-2165 https://doi.org/10.38124/ijisrt/IJISRT24JUL1871

Often, the disease is only recognized in its final research paper attained the highest accuracy of 99. 05%
stages or after passing. This phenomenon has prompted achieved by using the Random Forest Bagging Method and
therapeutic organizations to emphasize the importance of Relief feature selection.
early disease detection.
S. Mondal et al. [3] which aimed at proposing a new two
II. LITERATURE SURVEY tiered stacked predictive model for analysis of heart diseases
risks with the use of machine learning. This model leveraged
Cardiovascular Diseases or CVD which includes heart a dataset containing eleven significant characteristics from
dis- eases is a global problem and early diagnostic tools are 1190 patients sourced from five distinct datasets: Some of the
easier to implement. Thus, the work conducted on this area available datasets are the Hungarian dataset, the Cleveland
with the help of Machine learning (ML) has shown the dataset, the Switzerland dataset, Long Beach dataset, and
improvement of prediction accuracy and efficiency of heart the Statlog dataset. Several other ML algorithms, such as
diseases. This review focuses on the updated developments Stochastic Gradient Descent, K-Nearest Neighbor, Logistic
(last decade) in the application of methods based on the ML Regression, and Random Forest have been used in the
algorithm for diagnosing heart disease. Understanding the previousworks to predict HD. In the context of the research
efficiency of thesemethods, comparing their results to each paper, the proposed stacking model outperforms the works
other, and investigat-ing the new trends in this perspective, we reviewed in this study in accuracy, recall, and ROC-AUC
only consider studies based on ML. Because in the current values getting toan accuracy rate of 96% and a recall of 0.
academic database search (IEEE Xplore, PubMed, Google 98.
Scholar) we focus on the most significant work regarding
predictive models. The goal of this review is to provide those A. Lakshmanarao et al. [4] concerned with superior
involved in research or practicing healthcare management predic- tion of heart diseases using the state of the art machine
with useful information. Relatedly, researchers will find future learning integrated with better feature engineering. The
research directions and prospects of oil exploration and experiments concerning the prediction of heart diseases were
clinicians will identify ML applications and best practices for done using heart disease dataset obtained from Kaggle. This
applying them to clinical practice. Thus, this review helps was done through association and correlation of the original
the ongoing processes of improving people’s health and dataset and consequently buildup of a fused available dataset
decreasing the frequency of heart diseases by pointing out contain 29 features and 918 samples. The algorithms used
those approaches that are more promising while indicating were: Thus, there are K-nearest neighbours (KNN), Support
the fields that require further investigationand development. vector machine (SVM), Decision tree, Random forest and
XGBoost. All of these classifiers achieved a good accuracy
T. Ullah et al. [1] focused on increasing the efficiency rate for heart disease detection, ranging from 84% to 96%
ofthe CVD diagnosis by choosing the best features of ECG across different algorithms.
signals which are widely employed for the automatic detec-
tion of CVD using machine learning methods. The research Gorapalli Srinivasa Rao and G Muneeswari [5] looked
utilized two main datasets to detect: Another one is the at how IoT, Data Mining, Deep Learning, and Machine
Hungarian Heart Disease Dataset (HHDD) and the other one Learning help predict heart disease. They used public
is the Behavioural Risk Factor Surveillance System (BRFSS) datasets like UCI open dataset, Heart Disease Dataset,
Dataset. The algorithms used by them were: Gradient Boost- Cleveland heart disease dataset, and Cardiovascular Disease
ing, Logistic Regression, Extra Tree, Random Forest, Support Dataset, plus localdatasets. To predict heart disease, they tried
Vector Machines (SVM), Comparison with State-of the-Art- feature selection methods such as GRAE, CAE, Lasso, IGAE,
Algorithms and Machine Learning-Based Architecture. and ETC. Then, it used classifiers like Random Forest, Naive
Lever-aging such techniques, the study achieved impressive Bayes, and Gradient Boost to check if patients had heart
accuracyrates in cardiovascular disease detection tasks. disease. The study wrapped up by saying we need more
complex systemsto better spot heart disease. It stressed the
P. Ghosh et al. [2] introduced a framework for accurate value of combining different approaches.
Heart Disease prediction by the different methods stressing
on efficient Data Collection, Data Pre-processing and Data C. M. Bhatt et al. [6] set out to examine how well
Transformation for model training. Using various sources of different machine learning methods work to predict heart
data namely Cleveland, Long Beach VA, Switzerland, disease. The research looked at random forest, decision tree
Hungar- ian, and Stat log the study establishes suitable classifier, multilayer perceptron, and XGBoost algorithms.
features through applying Relief and LASSO improving on The teamused a dataset of 70,000 patient records. Each record
the heart disease prediction accuracy. By applying the had 12 key features such as age, gender, and blood pressure
Decision Tree, Bayesian classifier, neural network, readings. The models they came up with showed high
Association law, SVM, KNN and others, the heart diseases accuracy: decision trees got 86.37% with cross-validation,
are detectable with good results.The proposed model in the XGBoost reached 86.87% random forest hit 87.05%, and

IJISRT24JUL1871 www.ijisrt.com 2806


Volume 9, Issue 7, July – 2024 International Journal of Innovative Science and Research Technology
ISSN No:-2456-2165 https://doi.org/10.38124/ijisrt/IJISRT24JUL1871

multilayer perceptron topped out at 87.28%. Going forward, N. Lutimath et al. [10] focused on the machine learning
the scientists want to check how reliable and useful these techniques to predict heart disease using biomedical data. The
findings are. They also aimto make the results easy for people data -set on proof is taken to his website of heart diseases
to grasp. What’s more, theythink it’s a good idea to explore dataset, Diabetes Mellitus and Liver disease COSTNTAX
how gaps in data and weird outliers might mess with how (video interpretation): Dataset: 600 records were part of this,
well the models work. having a high accuracy rate to about 98% accurately pre-
dicted). These classification methods are used in this paper:
A. M. Qadri et al. [7] conducted an in-depth analysis of Decision Tree, Support Vector Machines and Random Forest
heart-related dataset features, contributing to the model. This paper compares the random forest regression
understanding of key factors influencing heart disease model to decision tree regression based on classification,
prediction. The dataset used for the study contained 499 feature engineering, and performance measures. From this
healthy patients and 526 patients with heart failure disease, study we find that of decision tree regression model and
with a distribution of 300 males and 226 females diagnosed random forest regression model the results are more accurate
with heart failure. Various machine learning algorithms, in case of prediction heart disease by using Random Forest as
developed using the Python programming language-based compared to Decision Tree model.
scikit-learn library module, such as SVM, random forest,
decision tree, logistic regression, and na¨ıve Bayes classifier N. Alageel at al. [11] aimed to increase the accuracy of
were used to predict heart failure. The decision tree method predicting a stroke in the medical field due to Machine and
emerged as the top-performing model, achieving an deep learning technologies have been made. Datasets from
impressive accuracy score of 100% in heart failure prediction, kaggle, EHRs were used to predict stroke with many features
surpassing other machine learning algorithms and without and with age, BMI, glucose level, and smoking
highlighting the success of the proposed feature engineering status. The following machine learning algorithms have been
approach. employed in this research, among others: Na¨ıve Bayes,
SVM, Decision Tree, Random Forest, K-Nearest Neighbours
P. Rahman et al. [8] attempted to develop a machine (KNN), and Stacking. Thus the application showcases the
learning tool based on various machine learning algorithms importance of benefiting simple algorithms, which
together with artificial neural networks (ANNs) for the demonstrate high ac- curacy with explainable outcomes, than
prediction of heart failure risk which can ensure early complex algorithms.
diagnosis and cost effective treatment. The age of dataset
from research was an average 65 years old, showing real M. K. Joshi et al. [12] intended to compare the
patient demographics affected by cardiovascular diseases performance of different Machine learning algorithms in
with a minimum recorded as low as 42 and maximum even predicting cardio- vascular indices, emphasizing on using data
higher than that at the astonishing number of over 95. In the driven methods for early detection and ultimatum of disease.
present study, a combi- nation of various supervised learning The used datasets such as Cleveland dataset from the UCI ML
classifiers such as KNN, SVM, DT, RF and repository and PIMA dataset to train and test their machine
xgBoost/RandomForest, xgBosst/CatBoose was used to learning model for prediction of cardiovascular disease. The
develop a highly stable accurate model. The results methodologies involve implementing deep neural networks,
highlighted the importance of a predictive modeling early KNN and SVM- based models, decision trees with random
detection for heart failure to control effectively the forest classifiers, logistic regression model along with
cardiovascular diseases world epidemic. Naive Bayes classi erand support vector classifier. For heart
disease prediction novel method like MSSO-ANFIS has been
J. Rashid et al. [9] aimed at identifying relevant input introduced here, and therefore, feature selection methods such
features based on the brute-force algorithm and by as LCSA show higher accuracy with lower error rate.
introducing machine learning techniques to improve heart
disease Classi- fier. The research paper uses three datasets of  Machine Learning in Healthcare
heart disease named the Cleveland, Statlog and Hungary Machine learning (ML) involves algorithms that allow
datsets which are extracted from University Of California- computers to learn from and make predictions on data. In
Irvine ML repository. The heart disease prediction can be healthcare, ML can analyze large datasets to uncover patterns
improved using machine learning techniques like Support and make predictions. ML methods are increasingly being
Vector Machine (SVM), Random Forest, K Nearest Neighbor usedfor disease prediction, including heart disease.
(KNN) and Naive Bayes. Relative to the results of split
validation, the accuracy with Naive Bayes reached 97%
whereas that caused by Random Forest via cross-validation
was about 95%.

IJISRT24JUL1871 www.ijisrt.com 2807


Volume 9, Issue 7, July – 2024 International Journal of Innovative Science and Research Technology
ISSN No:-2456-2165 https://doi.org/10.38124/ijisrt/IJISRT24JUL1871

III. TECHNIQUES FOR HEART DISEASE IV. PROPOSED METHODOLOGY FOR HEART
PREDICTION DISEASE PREDICTION

A. Supervised Learning Methods This section outlines the methodology used to develop a
prediction model for identifying individuals at risk of heart
 Logistic Regression: Logistic regression is used for binary disease. The approach encompasses a series of steps for data
classification problems. It is simple and interpretable, collection, preparation, analysis, and evaluation to construct
making it a common choice for medical predictions. an effective prediction model. The methodology provides a
 Decision Trees: Decision trees partition data into subsets comprehensive framework for conducting a thorough and
based on feature values. They are easy to interpret but productive study as shown in Figure 2.
proneto overfitting.
 Random Forest: Random forest is an ensemble method A. Data Collection
that builds multiple decision trees and merges their results
to improve accuracy and control overfitting.  Effective Heart Disease Prediction Begins with the
 Support Vector Machines: Support vector machines Collectionof Relevant Patient Data. This Data Typically
(SVM) find the hyperplane that best separates different Includes:
classes in the feature space. They are effective in high-
dimensional spaces.  Medical Histories: Patient health records detailing past
 k-Nearest Neighbors: k-Nearest Neighbors (k-NN) clas- medical conditions and treatments.
sify data points based on their proximity to other points. It  Demographic Information: Age, gender, ethnicity, and
is simple but can be computationally intensive. other demographic factors.
 Neural Networks: Neural networks consist of layers of  Clinical Measurements: Blood pressure, cholesterol lev-
nodes that mimic the human brain. They can model els, blood sugar levels, and body mass index.
complex relationships but require large datasets and  Lifestyle Factors: Smoking status, physical activity lev-
computational power. els, dietary habits, and family history of heart disease.

B. Unsupervised Learning Methods B. Preprocessing

 Clustering: Clustering algorithms like k-means and hier-  Preprocessing is Essential to Ensure the Quality and
archical clustering group data points based on similarity, Consis-tency of the Dataset. The Steps Involved are:
whichcan reveal patterns in heart disease data.
 Data Cleaning: Address missing values, correct incon-
C. Ensemble Methods sistencies, and handle outliers.
 Normalization: Scale features to ensure uniformity
 Bagging: Bagging, or Bootstrap Aggregating, improves across different attributes.
the stability and accuracy of machine learning algorithms  Data Splitting: Divide the dataset into training andtesting
by combining the predictions of multiple models. subsets, typically using an 80:20 ratio.
 Boosting: Boosting sequentially applies models to im-
prove weak predictions, commonly used algorithms C. Feature Selection
include AdaBoost and Gradient Boosting. Feature selection involves choosing the most relevant
fea-tures for the predictive model. Techniques used for
D. Deep Learning Methods feature relevance based on their relationship with the target
variable.
 Convolutional Neural Networks: Convolutional Neural
Networks (CNNs) are effective for image data, useful in  Information Gain: Measures how much a feature
analyzing medical imaging for heart disease. improves the prediction of the target variable.
 Recurrent Neural Networks: Recurrent Neural Networks
(RNNs) handle sequential data, such as time series of  Wrapper Methods:
patient vitals, to predict heart disease trends.
 Forward Selection: Starts with an empty set andadds
features based on model performance.
 Backward Elimination: Begins with all features and
removes them iteratively based on performance.
 Recursive Feature Elimination (RFE): Removes features
iteratively to find the best subset.

IJISRT24JUL1871 www.ijisrt.com 2808


Volume 9, Issue 7, July – 2024 International Journal of Innovative Science and Research Technology
ISSN No:-2456-2165 https://doi.org/10.38124/ijisrt/IJISRT24JUL1871

 Embedded Methods: Where:

 Lasso Regression: Uses L1 regularization to shrink some  TP = True Positives


coefficients to zero, performing feature selec- tion during
training.
 Decision Trees: Provides feature importance scores that
help in selecting relevant features.

 Ensemble Methods:
 Feature Importance from Ensemble Models: Al- gorithms
like Random Forest and Gradient Boosting provide scores
based on feature contribution to pre- dictions.

D. Model Testing

 Testing Ensures the Model’s Robustness and Applicability


by:

 Evaluation on Test Data: Assessing model performance on


unseen data to verify its predictive accuracy.
 Performance Metrics: Evaluating metrics such as accu-
racy, precision, recall, F1-score, and AUC-ROC to gauge
model effectiveness.

E. Results and Analysis

 Researchers Analyze the Model’s Performance to:

 Performance Evaluation: Determine how well the model


predicts heart disease and classifies patients.
 Comparative Analysis: Compare results across different
models and techniques to identify the most effective
approach.
 Visualization: Use graphical tools and flowcharts to
illustrate methodology and results.

V. MODEL EVALUATION AND PERFORMANCE


METRICS
Fig 2: Flowchart Depicting the Methodology for Heart
Model evaluation is crucial for assessing the Diseaseprediction
performance of predictive models. Various metrics are used
to measure the effectiveness of models in predicting heart
 TN = True Negatives
disease. Here, we discuss several key metrics and their
 FP = False Positives
formulas.
 FN = False Negatives
A. Accuracy
Accuracy measures the proportion of correctly B. Precision
classifiedinstances out of the total instances. It is Precision, also known as Positive Predictive Value,
calculated as: measures the proportion of true positives out of the total
predicted positives. It is given by:

(1)
(2)

IJISRT24JUL1871 www.ijisrt.com 2809


Volume 9, Issue 7, July – 2024 International Journal of Innovative Science and Research Technology
ISSN No:-2456-2165 https://doi.org/10.38124/ijisrt/IJISRT24JUL1871

C. Recall  One-Size-Fits-All Risk Assessment: Traditional car-diac


Recall, also known as Sensitivity or True Positive Rate, risk models are fundamentally flawed since they areusing
measures the proportion of true positives out of the total actual generic data yet to examine the billions of possi- bilities.
positives. It is calculated as: This ignores factors such as personal lifestyle habits (diet,
activity levels, stress), genetic influences and even
socioeconomic status - all of which also playa large role
in cardiovascular health. To combat this, individualized
(3) risk models are currently being tailored. By taking
personal data into account, these models offer more
D. F1-Score individualized calculations of risks and can deliver risk
The F1-score is the harmonic mean of precision and targeted approaches for patients identified with higher
recall, providing a single metric that balances both. It is levels of genetic predisposition to the disease.
defined as:  Untapped Potential of Remote Monitoring: Despite being
promising, wearable technology & smartphoneapps have
a long way to go for effective accomplishment of
continuous heart health tracking. These are devices that
can measure heart rate, blood pressure and oxygen levels
(4) live as well as data points regarding the way a person
is living day to day. Efforts are underway toimprove
E. AUC-ROC the data that can be provided by these wearable - ideally
The Area Under the Receiver Operating Characteristic in a way people could stand wearing them all day. This
Curve (AUC-ROC) measures the performance of a classifi- ongoing monitoring can help people manage the Area
cation model by evaluating its ability to distinguish between Under the Receiver Operating Characteristic Curve
classes. The ROC curve plots the True Positive Rate (Recall) (AUC-ROC) measures the performance of a classifi-
against the False Positive Rate (1 Specificity) at various cation model by evaluating its ability to distinguish
thresholds. The AUC is the area under this curve, with a value between their heart health before issues occur, the
between 0 and 1. Higher AUC values indicate better model company said. In conclusion, remote monitoring has the
performance: potential to enhance external access and convenience of
cardiac care.
 Limited Prediction Accuracy: The trouble with data
analysis today is that while technology can analyse the
(5)
needs of manufacturing lines, it will struggle when it
comes to making sense out of masses complex and varied
F. Cross-Validation
patient information. Which either then leads to the
Cross-validation is a technique used to ensure the
missing of high-risk people or raising a bunch of false
robustness and generalizability of the model. It involves
alarms. One answer lies in deep learning enabling the
partitioning the data into training and validation sets multiple
efficient processing of complex data sets. Deep learning
times to assess model performance consistently. The most
can provide substantial performance gains by recognizing
common method isk-fold cross-validation, where the data is
subtle patterns and narrowing the field down to select key
split into k subsets, and the model is trained k times, each time
data points for better prediction centres.
using a different subset as the validation set and the remaining
data as thetraining set. The overall performance is averaged
VII. CHALLENGES AND LIMITATIONS
over all foldsto obtain a robust estimate.
 Data Obstacles: The datasets tend to be imbalanced with a
VI. RESEARCH GAPS
higher number of records from healthy patient than those
with heart diseases. This leads to situations where the AI
 Inaccurate Early Detection: The current diagnostic tools,
models are extreamly good at prediction of healthy
such as ECGs and stress tests are great so farbut they
people but entirely miss out on heart disease. HE often
frequently overlook the subtle manifestationsof early-
entails missing and sometimes conflicting or incorrect
stage heart problems causing underdiagnosis where
medical data. This causes what is referred toas ‘noisy’
potential conditions go undetected or overdiagno-sis that
data to be generated and this makes the AI predictions to
can spawn more unnecessary testing and inter- ventions.
be wrong. Cleaning data is very impor- tant for any
This emphasises the requirement of improved and
machine learning and artificial intelligence models.
nuanced methods that integrate machine learning/AIwith
large datasets for increased sensitivity, specificity, and
consequently early diagnosis without biopsy may assist in
earlier patient outcomes.

IJISRT24JUL1871 www.ijisrt.com 2810


Volume 9, Issue 7, July – 2024 International Journal of Innovative Science and Research Technology
ISSN No:-2456-2165 https://doi.org/10.38124/ijisrt/IJISRT24JUL1871

 Ethical Considerations: The protection of patients’ lifestyle data to create a more comprehensive picture of
information as well as the effectiveness of results derived patient health, resulting in more accurate predictions. To build
from big data and machine learning in the health sector is trust and transparency, researchers are developing methods
challenging. Despite its usefulness for developing like SHAP and LIME that provide insights into how models
accurate AI models, patient data is not readily available reach their conclusions. Finally, hybrid models that combine
due to regulatory restrains such as HIPAA and GDPR. traditional machine learning with deep learning leverage the
Others are the ethical issues that include, informed strengths of both approaches to enhance prediction accuracy
consent and the clear statement on how data will be used. and reliability.
Patient data is the main ingredient used in designing and
implementing an AI system in healthcare, and such data Looking ahead, the future of heart disease prediction
needs to be trusted to ensure proper usage. with machine learning is bright. Continuous monitoring
 Model Development & Integration: Applying of ma- through real-time data streams holds promise for even better
chine learning models to practical clinical work means predic- tions and enabling proactive interventions. Exploring
multiple stages of work. First, real-world testing of advanced deep learning techniques remains a key area of
interventions makes it possible to enhance the safety interest, with the potential to further improve feature
of ventures and their efficiency. But before that, the extraction and model performance. Personalized prediction
most appropriate method of adopting machine learningis models that consider individual variations in genetics,
selected depending on the problem and data type. lifestyle, and environmental factors are another exciting
Clinicians must comprehend why the model’s prediction avenue for research. Ultimately, the success of these models
are accurate in assessing the patient; lastly, integration hinges on their interpretability and usability in clinical
with the existing health-care concept and electronic settings. Developing user-friendly tools and making models
medical records greatly enhances the chances of the more transparent will be crucial for their seamless integration
model’s successful implementation. into healthcare workflows.
 Long-Term Challenges: Despite the successful devel-
opment, the long-term problems of applying machine IX. CONCLUSION
learning models to cardiac diagnosis remain topical. This
flexibility is required due to the constant changes with the This review paper aims at giving a systematic review of
patient demographics, the methods of treatments, and several machine learning models that are applied in predicting
interpreting changing rules and laws. This ensures models the odds of cardiac ailments and noting notable developments
are not outdated, invalid or may infringe on patient’s rights for early detection of the ailments and treatment performance.
for long term sustainable managementof diseases. The use of machine learning approaches has been of great
 Scalability & Sustainability: The effectiveness and safety help in improving the chance of predicting cases of heart
of the algorithms to use in diagnosing heart diseases, disease through proper sorting of the risk factors with the
depend on their applicability for real-life use. This makes help of big data. The ability of ML algorithms to sift through
it necessary to carry out tests to establishthe efficacy large amounts of data from different sources can uncover
particularly in the diverse patients and scenarios. Thirdly, potentially unseen risk factors where conventional means
the models require constant updating of data to be fail, thus detecting and inhibiting before an event occurs
effective; this can prove cumbersome at times. Last but probably saves lives. The research emphasizes the critical
not the least, it is necessary to invest financial resources need for ML in addressing this growing epidemic of heart
and personnel time in the long term to routinely check the disease.
models’ validity and update and fine-tune the models.
Some of the promising directions of further
VIII. RECENT ADVANCES AND FUTURE development of ML for improving the assessment of the risk
DIRECTIONS of cardiac diseases are described in the paper. The potential to
further enhance precision and clinical relevance lies in real-
time monitoring via wearables, advanced deep learning
Machine learning is revolutionizing heart disease
algorithms trained across clusters of patients, as well as
prediction. Real-time data from wearables and internet-
personalizedmodels. But broader adoption requires resolving
connected devicesallows for continuous patient monitoring,
issues includ-ing interpretability for clinicians and integration
enabling models to adapt to changing health conditions and
into existing healthcare workflows. The paper also argues that
make dynamic predic- tions. Deep learning techniques like
rigorousinnovation of machine learning models, transparency
Convolutional Neural Net- works and Recurrent Neural
about their predictions and integration into the clinical
Networks excel at handling com- plex medical data such as
workflowis critical going forward. Altogether, these advances
images and time-series information. This leads to improved
have considerable promise for improved diagnostics that are
feature extraction and overall model performance. finer and more accurate- potentially saving lives worldwide
Additionally, data fusion techniques are bringing together by helping to alleviate the burden of cardiovascular diseases.
information from electronic health records, genetics, and

IJISRT24JUL1871 www.ijisrt.com 2811


Volume 9, Issue 7, July – 2024 International Journal of Innovative Science and Research Technology
ISSN No:-2456-2165 https://doi.org/10.38124/ijisrt/IJISRT24JUL1871

REFERENCES [9]. J. Rashid, S. Kanwal, J. Kim, M. Wasif Nisar,


U. Naseem, and A. Hussain, “Heart disease diagnosis
[1]. T. Ullah, S. I. Ullah, K. Ullah, M. Ishaq, A. Khan, Y. using the brute force algorithm and machine learning
Y. Ghadi, and A. Algarni, “Machine learning-based techniques,” Computers, Materials amp; Continua,
cardiovascular disease detection using optimal feature vol. 72, no. 2, p. 3195–3211, 2022. [Online].
selection,” IEEE Access, vol. 12, p. 16431–16446, Available:
2024. [Online]. Available: http://dx.doi.org/10.1109/ http://dx.doi.org/10.32604/cmc.2022.026064.
ACCESS.2024.3359910 [10]. N. Lutimath, N. Sharma, and B. K, “Prediction of heart
[2]. P. Ghosh, S. Azam, M. Jonkman, A. Karim, F. M. J. disease using biomedical data through machine
M. Shamrat, E. Ignatious, S. Shultana, A. R. learning techniques,” EAI Endorsed Transactions on
Beeravolu, and F. De Boer, “Efficient prediction of Pervasive Health and Technology, p. 170881, Jul.
cardiovascular disease using machine learning 2018. [Online]. Available:
algorithms with relief and lasso feature selection http://dx.doi.org/10.4108/eai.30-8-2021.170881
techniques,” IEEE Access, vol. 9, p. 19304–19326, [11]. N. Alageel, R. Alharbi, R. Alharbi, M. Alsayil, and L.
2021. [Online]. Available: A. Alharbi, “Using machine learning algorithm as a
http://dx.doi.org/10.1109/ACCESS.2021.3053759 method for improving stroke prediction,” International
[3]. S. Mondal, R. Maity, Y. Omo, S. Ghosh, and A. Nag, Journal of Advanced Computer Science and
“An efficient computational risk prediction model of Applications, vol. 14, no. 4, 2023. [Online]. Available:
heart diseases based on dual-stage stacked machine http://dx.doi.org/10.14569/IJACSA.2023.0140481
learning approaches,” IEEE Access, vol. 12, p. 7255– [12]. M. K. Joshi, D. Dembla, and S. Bhatia, “Prediction of
7270, 2024. [Online]. Available: http: cardiovascular disease using machine learning
//dx.doi.org/10.1109/ACCESS.2024.3350996 algorithms.” International Journal of Advanced
[4]. Lakshmanarao, T. V. Sai Krishna, T. S. Ravi Kiran, C. Computer Science & Applications, vol. 15, no. 3,
V. Murali krishna, S. Ushanag, and N. Supriya, “Heart 2024.
disease prediction using ml through enhanced feature
engineering with association and correlation analysis,”
Indonesian Journal of Electrical Engineering and
Computer Science, vol. 34, no. 2, p. 1122, May 2024.
[Online]. Available:
http://dx.doi.org/10.11591/ijeecs.v34.i2.pp1122-1130
[5]. Srinivasa Rao and G. Muneeswari, “A review:
Machine learning and data mining approaches for
cardiovascular disease diagnosis and prediction,” EAI
Endorsed Transactions on Pervasive Health and
Technology, vol. 10, Mar. 2024. [Online]. Available:
http://dx.doi.org/10.4108/eetpht.10.5411
[6]. C. M. Bhatt, P. Patel, T. Ghetia, and P. L. Mazzeo,
“Effective heart disease prediction using machine
learning techniques,” Algorithms, vol. 16, no. 2, p. 88,
Feb. 2023. [Online]. Available:
http://dx.doi.org/10.3390/a16020088
[7]. M. Qadri, A. Raza, K. Munir, and M. S. Almutairi,
“Effective feature engineering technique for heart
disease prediction with machine learning,” IEEE
Access, vol. 11, p. 56214–56224, 2023. [Online].
Available:
http://dx.doi.org/10.1109/ACCESS.2023.3281484
[8]. P. Rahman, A. Rifat, M. IftehadAmjad Chy, M.
Monirujjaman Khan, M. Masud, and S. Aljahdali,
“Machine learning and artificial neural network for
predicting heart failure risk,” Computer Systems
Science and Engineering, vol. 44, no. 1, p. 757–775,
2023. [Online]. Available:
http://dx.doi.org/10.32604/csse.2023.021469

IJISRT24JUL1871 www.ijisrt.com 2812

You might also like