Final PPT Heart Disease

This document presents a project on predicting heart disease using machine learning techniques. It discusses collecting heart disease data from an online source, understanding and preprocessing the data which includes checking data types, missing values, and duplicates. Various machine learning models - logistic regression, random forest, and neural networks - are built and their performance is compared using confusion matrices and accuracy scores. The models achieved around 93% accuracy. The document concludes the machine learning approaches were effective for heart disease prediction and discusses potential applications and future extensions of the project.

Uploaded by

nithish Kumar


UNIVERSITY COLLEGE OF SCIENCE

OSMANIA UNIVERSITY
DEPARTMENT OF STATISTICS

DATA MODELING USING MACHINE LEARNING TECHNIQUES
HEART DISEASE PREDICTION
PRESENTED BY
Mr. PUDATHU NITHISH KUMAR
Mr. POLOJU SATHEESH
Mr. B RAMESH

Under the Supervision of

Prof. N.Ch. Bhatracharyulu
Agenda
• Introduction
• Data Collection
• Data Understanding
• Data Preprocessing
• Data Visualization
• Model Building
• Applications
• Future Scope
• Conclusion
Introduction
• This project concerns heart disease prediction.
• Heart disease has been the leading cause of death worldwide over the past 10 years.
• Several different symptoms are associated with heart disease, which makes it difficult to diagnose quickly and accurately.
• Machine learning techniques can help address this problem.

Problem Statement

• Instruments that can predict heart disease are available, but they are either expensive or not efficient at estimating a person's chance of heart disease.
Problem solution
Early detection of cardiac disease can decrease the mortality rate and overall complications. Using machine learning algorithms, we can predict heart disease or heart attack from different symptoms.
Required packages and libraries
Data collection
• We collected data on heart disease.

• We extracted the data from the website below:

https://www.kaggle.com/datasets/alexteboul/heart-disease-health-indicators-dataset
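Once the file is downloaded, it can be read into memory before any further step. The following is a minimal sketch using only the Python standard library; the file name `heart_disease.csv` and the column names in the stand-in sample are assumptions for illustration, not taken from the slides.

```python
import csv
import io


def load_rows(handle):
    """Parse CSV text into a list of dicts, converting every value to float."""
    reader = csv.DictReader(handle)
    return [{name: float(value) for name, value in row.items()} for row in reader]


# Stand-in for the downloaded file; these column names are illustrative assumptions.
sample = io.StringIO("HeartDisease,HighBP,BMI\n0,1,27.5\n1,1,31.0\n")
rows = load_rows(sample)

# With a real download (hypothetical file name):
# with open("heart_disease.csv", newline="") as f:
#     rows = load_rows(f)
```

Each row becomes a dictionary keyed by column name, which keeps the later checks on types, missing values, and duplicates easy to write.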

Dataset
Data Understanding
• The data understanding process includes collecting and exploring the data.

• The data we collected consists of 1000 observations with 22 attributes.

The attributes are:

• High Blood Pressure
• Diabetes
• High Cholesterol
• Body Mass Index (BMI)
• Smoking
• Age
• Sex
• Physical Activity
• Diet
• Alcohol Consumption
• Stroke
• Health Care
• General Health and Mental Health
• Education
• Heart disease (label): 1 = presence of heart disease, 0 = absence of heart disease
Understanding the data using descriptive statistics:
• Measures of central tendency and measures of dispersion
• Visual description
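As a small illustration of these measures, the sketch below computes the mean, median, and mode (central tendency) and the standard deviation and range (dispersion) for one column. The BMI values are hypothetical and are not taken from the project dataset.

```python
import statistics


def describe(values):
    """Return basic measures of central tendency and dispersion for one column."""
    ordered = sorted(values)
    return {
        "mean": statistics.mean(ordered),
        "median": statistics.median(ordered),
        "mode": statistics.mode(ordered),
        "std": statistics.stdev(ordered),  # sample standard deviation
        "range": ordered[-1] - ordered[0],
    }


bmi = [22.0, 27.0, 27.0, 31.0, 38.0]  # hypothetical values, for illustration only
summary = describe(bmi)
```

Here the mean (29.0) is larger than the median (27.0), a hint that the single high value pulls the distribution to the right.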
Data pre-processing

Checking the data type

We need to check whether the data is in numerical form. If it is not, we need to transform it into numerical form.
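One possible way to run this check is sketched below: every value is tested for whether it can be read as a number, and any column that fails is then encoded. The column names and the yes/no mapping are assumptions made for the example.

```python
def non_numeric_columns(rows):
    """Return the names of columns holding at least one value that is not a number."""
    bad = set()
    for row in rows:
        for name, value in row.items():
            try:
                float(value)
            except (TypeError, ValueError):
                bad.add(name)
    return sorted(bad)


# Hypothetical records; "Smoker" is stored as text and needs encoding.
rows = [
    {"BMI": "27.0", "Sex": "1.0", "Smoker": "yes"},
    {"BMI": "31.5", "Sex": "0.0", "Smoker": "no"},
]
flagged = non_numeric_columns(rows)

# Transform the text categories into numbers (assumed mapping).
mapping = {"yes": 1.0, "no": 0.0}
encoded = [{**row, "Smoker": mapping[row["Smoker"]]} for row in rows]
```

After encoding, running `non_numeric_columns(encoded)` returns an empty list, so the data is ready for modelling.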

Checking for missing values:

We need to check whether the data contains any missing values.

If any missing values are found in the dataset, we have to handle them by imputation, deletion, or prediction.

Missing values can be detected visually, as shown in the figure. In the figure there are no missing values.
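Besides the visual check, missing values can be counted directly. The sketch below counts them per column and shows mean imputation, one of the options named above; it assumes missing entries are stored as `None`, and the records are hypothetical.

```python
import statistics


def count_missing(rows, columns):
    """Count missing (None) entries in each column."""
    return {c: sum(1 for r in rows if r.get(c) is None) for c in columns}


def impute_mean(rows, column):
    """Replace missing entries in one column with the mean of the observed values."""
    observed = [r[column] for r in rows if r.get(column) is not None]
    fill = statistics.mean(observed)
    return [{**r, column: fill if r.get(column) is None else r[column]} for r in rows]


# Hypothetical records with one missing BMI value.
rows = [
    {"BMI": 20.0, "Age": 5.0},
    {"BMI": None, "Age": 7.0},
    {"BMI": 30.0, "Age": 9.0},
]
missing = count_missing(rows, ["BMI", "Age"])
imputed = impute_mean(rows, "BMI")
```

Deletion would instead drop the second record, which is simpler but discards its valid Age value.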
Checking the data type: the data is of type float, i.e. in numerical form, so we do not need to transform the data type.
Checking the missing values: there are no missing values in the dataset.
Checking and dropping duplicate records

We need to check the dataset for duplicate records. If any are detected, we need to drop them.

Before

After
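The before/after comparison can be reproduced with a short routine such as the one below. It is a sketch that treats two records as duplicates only when every attribute matches, and it keeps the first occurrence; the records are hypothetical.

```python
def drop_duplicates(rows):
    """Keep the first occurrence of each record and drop exact repeats."""
    seen = set()
    unique = []
    for row in rows:
        key = tuple(sorted(row.items()))  # order-independent fingerprint of the record
        if key not in seen:
            seen.add(key)
            unique.append(row)
    return unique


# Hypothetical records; the third repeats the first.
rows = [
    {"BMI": 27.0, "HighBP": 1.0},
    {"BMI": 31.0, "HighBP": 0.0},
    {"BMI": 27.0, "HighBP": 1.0},
]
before = len(rows)
deduplicated = drop_duplicates(rows)
after = len(deduplicated)
```

Comparing `before` and `after` gives the number of duplicate records that were removed.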
Data visualization

The data can be understood visually by plotting graphs such as box plots, count plots, bar charts, and pie charts.
This is the heat map (correlation plot), which shows the pairwise relationships between the variables.
Here the plot shows no multicollinearity among the variables.
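The numbers behind a correlation heat map are pairwise Pearson correlation coefficients. The sketch below computes them and lists the pairs whose absolute correlation reaches a cut-off; the 0.8 cut-off and the example columns are assumptions for illustration, not values from the slides.

```python
import math


def pearson(x, y):
    """Pearson correlation coefficient between two equally long columns."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    spread_x = math.sqrt(sum((a - mean_x) ** 2 for a in x))
    spread_y = math.sqrt(sum((b - mean_y) ** 2 for b in y))
    return cov / (spread_x * spread_y)


def highly_correlated(columns, threshold=0.8):
    """List column pairs whose absolute correlation is at or above the threshold."""
    names = list(columns)
    pairs = []
    for i, a in enumerate(names):
        for b in names[i + 1:]:
            r = pearson(columns[a], columns[b])
            if abs(r) >= threshold:
                pairs.append((a, b, round(r, 3)))
    return pairs


# Hypothetical columns; "Weight" is an exact multiple of "BMI" here.
columns = {
    "BMI": [20.0, 25.0, 30.0, 35.0],
    "Weight": [60.0, 75.0, 90.0, 105.0],
    "Age": [3.0, 9.0, 4.0, 8.0],
}
pairs = highly_correlated(columns)
```

An empty result would correspond to the situation described above, where no pair of variables is strongly correlated.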
Model Building
Models are trained and tested on the dataset using three methods:
1) Logistic Regression
2) Random Forest Classification
3) Neural Networks Classification

1. Logistic Regression/Classification:

Logistic regression falls under the category of supervised learning. It models the relationship between a categorical dependent variable and one or more independent variables by estimating probabilities using a logistic (sigmoid) function. Logistic regression is generally used when the dependent variable is binary (dichotomous), meaning it can take only two possible values, such as “Yes or No” or “Living or Dead”.
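The probability estimate described above can be sketched in a few lines. The sigmoid maps a weighted sum of the inputs to a value between 0 and 1, and a cut-off turns that probability into one of the two classes. The weights, bias, and 0.5 cut-off below are hypothetical; in practice the weights are learned from the training data.

```python
import math


def sigmoid(z):
    """Logistic function: maps any real number to the interval (0, 1)."""
    return 1.0 / (1.0 + math.exp(-z))


def predict_proba(features, weights, bias):
    """Estimated probability that the label is 1 (presence of heart disease)."""
    z = bias + sum(w * x for w, x in zip(weights, features))
    return sigmoid(z)


def predict(features, weights, bias, threshold=0.5):
    """Binary prediction: 1 if the estimated probability reaches the threshold."""
    return 1 if predict_proba(features, weights, bias) >= threshold else 0


# Hypothetical weights for two binary indicators (not fitted to any data).
weights, bias = [2.0, 1.0], -1.0
at_risk = predict([1.0, 1.0], weights, bias)   # weighted sum z = 2
low_risk = predict([0.0, 0.0], weights, bias)  # weighted sum z = -1
```

A weighted sum of exactly zero gives a probability of 0.5, which is why 0.5 is the usual default cut-off.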
Random Forest Classifier:
Random forest is a tree-based classification algorithm.
As the name indicates, the algorithm creates a forest with a large number of trees.
Each decision tree is built from a random sample of the training set.
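Two ingredients of this idea can be sketched without fitting any trees: drawing a random sample (with replacement) of the training set for each tree, and combining the trees' predictions by majority vote. The tree-fitting step itself is omitted, and the row identifiers and votes below are placeholders.

```python
import random
from collections import Counter


def bootstrap_sample(rows, rng):
    """Draw len(rows) records at random, with replacement, for one tree."""
    return [rng.choice(rows) for _ in rows]


def majority_vote(votes):
    """Combine the class predictions of individual trees into one prediction."""
    return Counter(votes).most_common(1)[0][0]


rng = random.Random(0)           # fixed seed so the sketch is repeatable
training_rows = list(range(10))  # placeholder identifiers for 10 training records
samples = [bootstrap_sample(training_rows, rng) for _ in range(3)]

# Placeholder votes from three trees for one patient: two say 1, one says 0.
forest_prediction = majority_vote([1, 0, 1])
```

Because each tree sees a different sample, the trees make different errors, and voting tends to cancel some of them out.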

Neural Networks Classifier

A method of computing based on the interaction of multiple connected processing elements.
A powerful technique for solving many real-world problems.
Able to deal with incomplete information.
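The "connected processing elements" can be pictured as small units that each compute a weighted sum followed by an activation. Below is a sketch of the forward pass of a network with one hidden layer and a single output; the layer sizes, the sigmoid activation, and all weights are assumptions chosen for illustration and are not the network used in the project.

```python
import math


def sigmoid(z):
    """Activation function squashing a weighted sum into (0, 1)."""
    return 1.0 / (1.0 + math.exp(-z))


def dense(inputs, weights, biases):
    """One layer: every unit takes a weighted sum of all inputs, then activates."""
    return [
        sigmoid(b + sum(w * x for w, x in zip(unit_weights, inputs)))
        for unit_weights, b in zip(weights, biases)
    ]


def forward(inputs, hidden_w, hidden_b, out_w, out_b):
    """Pass the inputs through the hidden layer, then through one output unit."""
    hidden = dense(inputs, hidden_w, hidden_b)
    return dense(hidden, [out_w], [out_b])[0]


# Hypothetical weights: two inputs, two hidden units, one output.
probability = forward(
    [1.0, 0.0],
    hidden_w=[[0.0, 0.0], [0.0, 0.0]],
    hidden_b=[0.0, 0.0],
    out_w=[2.0, 2.0],
    out_b=-2.0,
)
```

Training consists of adjusting these weights so that the output probability matches the known labels as closely as possible.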
Confusion matrices of all three models
Test scores and comparison of all models
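For reference, a confusion matrix and the accuracy score derived from it can be computed as sketched below. The labels are hypothetical and are not the project's results. Recall is included as well, because accuracy alone does not show how many actual heart disease cases were missed.

```python
def confusion_matrix(y_true, y_pred):
    """Count true/false positives and negatives for binary labels (1 = disease)."""
    counts = {"TN": 0, "FP": 0, "FN": 0, "TP": 0}
    for actual, predicted in zip(y_true, y_pred):
        if actual == 1:
            counts["TP" if predicted == 1 else "FN"] += 1
        else:
            counts["FP" if predicted == 1 else "TN"] += 1
    return counts


def accuracy(cm):
    """Share of all cases that were predicted correctly."""
    return (cm["TP"] + cm["TN"]) / sum(cm.values())


def recall(cm):
    """Share of actual disease cases that the model detected."""
    positives = cm["TP"] + cm["FN"]
    return cm["TP"] / positives if positives else 0.0


# Hypothetical test labels and predictions, for illustration only.
y_true = [0, 0, 0, 0, 0, 0, 0, 0, 1, 1]
y_pred = [0, 0, 0, 0, 0, 0, 0, 1, 1, 0]
cm = confusion_matrix(y_true, y_pred)
acc = accuracy(cm)
rec = recall(cm)
```

In this toy case the accuracy is 0.8 while the recall is only 0.5, so the two scores are worth reporting side by side.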
Applications

• Medical institutions
To teach medical students how heart attack or heart disease is assessed and how to identify whether a person is suffering from heart disease.

• Hospitals
To detect whether a person has heart disease.
Conclusion

The machine learning algorithms used in this project are logistic regression, random forest, and neural network models.
After training and testing, we calculated the confusion matrix and accuracy score of each algorithm. The accuracy score was the same for all the models we applied, 92.77%, which indicates that all three algorithms perform equally well on this project. Approximately 93% of the cases were predicted correctly, and the remaining 7% were predicted incorrectly.
Future scope of this project
Correct prediction of heart disease can prevent threats to life, while incorrect prediction can prove fatal.

In this project, different machine learning algorithms are applied in order to compare and analyze their results.

The heart disease dataset can help provide better outcomes and help health professionals predict heart disease efficiently and effectively.
Thank you
