0% found this document useful (0 votes)
5 views20 pages

Final Project

The document outlines a project aimed at developing a machine learning model to predict heart attack risks in diabetic adults, addressing the limitations of existing systems that primarily focus on diabetes prediction. It details the methodology, including data preprocessing, model training, and evaluation, with Logistic Regression identified as the most effective model. Future enhancements include creating user-friendly applications for personalized health insights and expanding the model with deep learning techniques.

Uploaded by

santhapk17122003
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PPTX, PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
5 views20 pages

Final Project

The document outlines a project aimed at developing a machine learning model to predict heart attack risks in diabetic adults, addressing the limitations of existing systems that primarily focus on diabetes prediction. It details the methodology, including data preprocessing, model training, and evaluation, with Logistic Regression identified as the most effective model. Future enhancements include creating user-friendly applications for personalized health insights and expanding the model with deep learning techniques.

Uploaded by

santhapk17122003
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PPTX, PDF, TXT or read online on Scribd
You are on page 1/ 20

K.

RAMAKRISHNAN COLLEGE OF ENGINEERING


DEPARTMENT OF INFORMATION TECHNOLOGY

DIABETES INDUCED HEART ATTACK PREDICTION


USING MACHINE LEARNING FOR ADULTS

GUIDED BY : TEAM MEMBERS:


Dr. S. Kavitha AP/IT • 8115U21IT021 – R KANISHKA
• 8115U21IT048 – P SANTHA
• 8115U21IT301 – I POOJA
Agenda
• OBJECTIVE

• INTRODUCTION

• LITERATURE REVIEW

• EXISTING SYSTEM

• PROPOSED SYSTEM

• FLOW DIAGRAM

• MODULE SPLIT UP

• MODULE EXPLANATION

• PERFORMANCE METRICS

• RESULT

• FUTURE ENHANCEMENT

• CONCLUSION

• REFERENCES
Objective
• To develop a high-accuracy machine learning model for predicting heart attacks for diabetic
adults.
• The ultimate goal is to enhance early detection, reduce mortality rates, and contribute to the
medical community by improving predictive analytics in healthcare.
Introduction
• Heart disease is a major cause of death globally, especially in diabetics, who face increased
cardiovascular risks.
• Traditional diagnostic methods may not predict risks early enough.
• This study explores machine learning techniques to analyze medical data and accurately
identify diabetic patients at risk for heart attacks by comparing various ML models based on
health parameters.
Literary Review
Published Published Name of the Author(s) Models used Pros Cons Accuracy
Organization Year Research Paper

IEEE 2025 Analysis and Paras Negi Various ML Emphasizes the Specific models Not specified
Prediction of & Manoj models importance of and their
Heart Attack Kumar Bisht early detection to performances are
using Machine reduce mortality not detailed.
Learning Models rates.

IEEE 2024 Heart Attack Risk Yasaswini Advanced Focuses on Specific models Not specified
Prediction Using Bonthu & ML proactive and their
Advanced Subbarao techniques healthcare performances are
Machine Mannam & solutions by not detailed.
Learning Gayithri employing
Techniques Kandikunta advanced ML
& Vikranth techniques for
Goud accurate and
Keshagani & personalized
Greeshma predictions.
Sarath
Published Published Name of the Author(s) Models used Pros Cons Accuracy
Organization Year Research Paper

IEEE 2023 Heart Attack Jesslyn Audrey; Support SVM model Specific details 85.53%
Prediction Using Mochammad Vector obtained the about the dataset
Machine Learning Haldi Widianto Machine highest and feature selection
Classification (SVM) accuracy, F1- are not provided.
Models score, recall,
and precision
values.

IEEE 2023 Heart Attack Janaraniani N; Decision Achieved the Specific dataset 99.5%
Prediction using Divya P; Tree highest details and feature
Machine Learning Madhukiruba E; prediction selection process are
R. Santhosh; R. accuracy with a not detailed.
Reshma; D. fast rate of
Selvapandian exactness.
Existing System
Most existing systems focus on diabetes prediction rather than heart attack risk assessment for diabetic
patients. Studies predominantly use KNN, Logistic Regression, and Random Forest for binary
classification of diabetes presence. Limitations of current approaches include:
• Lack of Heart Attack Prediction: Existing models unable to assess the cardiovascular
complications associated with diabetes.
• Limited Feature Scope: Most models focus on glucose levels and BMI, neglecting key
cardiovascular risk indicators such as cholesterol and hypertension.
Proposed System
The proposed system introduces:
.
• The High accuracy of predicting the heart attack risk specifically for the diabetic
adults.
• Clearly shows the performance of the algorithm for the dataset by the metrices i.e,
Accuracy, precision, recall, F1-score, ROC –AUC.
Flow Diagram
Module Split-Up

Phase 4
Phase 3
Phase 2
Phase 1 Data Visualization &
Prediction & Risk
Model Training & Interpretation.
Classification.
Data Preprocessing & Evaluation.
Feature Engineering
Phase 1 – Data Preprocessing & Feature Engineering

Content:
•Objective: Clean and prepare the dataset for accurate modeling.
•Steps:
•Missing Value Handling: Imputed using statistical methods (mean/median).
•Feature Scaling: Standardization using StandardScaler for uniformity.
•Categorical Encoding: Transformed categorical variables via label encoding.
•Data Splitting: 80% training, 20% testing.
•Exploratory Data Analysis: Identified correlations, patterns, and outliers.
•Feature Engineering: Created new features and selected the most relevant ones using SHAP
and correlation analysis.
Phase 2 – Model Training & Evaluation
Content:
• Objective: Build and compare predictive models.
• Models Used:
• Logistic Regression
• K-Nearest Neighbors (KNN)
• Random Forest
• XGBoost, LightGBM, CatBoost
• Techniques:
• Stratified K-Fold Cross Validation
• Grid & Random Search for hyperparameter tuning
• Evaluation Metrics:
• Accuracy, Precision, Recall, F1-Score
• ROC-AUC Score
• Confusion Matrix & Classification Report
Phase 3 – Prediction & Risk Classification

Content:
• Objective: Classify users into heart attack risk categories based on diabetic
indicators.
• Functionality:
• Accepts new patient inputs (glucose, cholesterol, blood pressure, etc.)
• Predicts heart attack likelihood using trained ML model.
• Outputs classification: Low Risk, Moderate Risk, or High Risk.
• Model Used: Best-performing model (e.g., Logistic Regression with highest
recall).
• Impact: Enables early detection and proactive medical intervention.
Phase 4 – Result Interpretation & SHAP Analysis
Content:
• Objective: Make model predictions explainable and trustworthy.
• SHAP Analysis:
• Visualizes feature impact on individual predictions.
• Highlights key contributing factors like glucose, age, smoking, etc.
• Helps doctors understand why a certain risk level is predicted.
• Benefits:
• Increases model transparency.
• Supports clinical decision-making.
• Builds trust with healthcare providers and users.
Performance Metrics

Models Accuracy (%) Precision (%) Recall (%) F1-Score (%) ROC-AUC (%)

Logistic
99.46 35.46 100.00 52.36 99.73
Regression

KNN 99.46 35.43 98.89 52.16 99.17

Random Forest 99.47 35.72 97.77 52.32 98.62

XGBoost 99.62 35.76 32.80 34.22 66.31

LightGBM 99.57 34.62 48.41 40.37 74.07

CatBoost 99.60 37.61 49.04 42.57 74.40


Results
Future Enchancements
● Develop a user-friendly website and mobile app for common users to access
predictions easily.
● Provide personalized recommendations based on user inputs, including exercise and
diet suggestions (excluding medication details).
● Users will answer specific health-related questions, and the system will analyze the data
to offer tailored health insights.
Conclusion
• The analysis compared six machine learning models, identifying Logistic Regression
as the best due to its high recall and interpretability.
• This model can help predict heart attack risks in diabetic patients, enabling early
intervention and preventive care.
• Future research will focus on expanding the model with deep learning techniques and
integrating real-time monitoring for better patient outcomes.
References
1. Smith, J., & Brown, K. (2020). Machine Learning Models for Cardiovascular Risk Prediction in
Diabetic Patients. International Journal of Medical Informatics, 58(3), 245-258.
2. Lee, P., & Wang, R. (2019). A Comparative Analysis of ML Algorithms for Diabetes and Heart
Disease Prediction. Journal of Healthcare Analytics, 45(2), 112-126.
3. Davis, M., et al. (2021). Enhancing Heart Attack Prediction Using Advanced Machine Learning
Techniques. IEEE Transactions on Biomedical Engineering, 68(9), 1954-1968.

4. Johnson, M., & Patel, R. (2021). AI-Driven Predictive Models for Cardiovascular Disease in
Diabetic Patients. Journal of Biomedical Informatics, 72(4), 189-204.
5. Zhang, L., & Kumar, S. (2020). Comparative Study of Machine Learning Models for Heart Attack
Prediction. International Journal of Data Science and Analytics, 38(2), 99-113.
Queries?

You might also like