Heart Disease Prediction Using KNN Algorithm-2
Heart Disease Prediction Using KNN Algorithm-2
Early prediction of heart disease is essential to reduce the mortality rate. Data
mining provides a user-oriented approach to extract novel and uncovered patterns
in the data set. Data mining is to extract useful knowledge within medical data for
medical diagnosis
Abstract
Heart disease commonly occurring disease and is the major cause of sudden death
nowadays. This disease attacks the persons instantly. Most of the people do not aware of the
symptoms of heart disease. Timely attention and proper diagnosis of heart disease will reduce the
mortality rate. Medical data mining is to explore hidden pattern from the data sets. Supervised
algorithms are used for the early prediction of heart disease. Nearest Neighbor (KNN) is the widely
used lazy classification algorithm. KNN is the most popular, effective and efficient algorithm used for
pattern recognition. Medical data sets contain a large number of features. The Performance of the
classifier will be reduced if the data sets contain noisy features. Feature subset selection is proposed
to solve this problem. Particle Swarm Optimization (PSO) is an Evolutionary Computation (EC)
technique used for feature selection. PSO are computationally inexpensive and converges quickly. In
this we apply KNN and PSO for prediction of heart disease. Experimental results show that the
algorithm performs very well with 100% accuracy with PSO as feature selection.
Literature survey
SI.NO Author Name Title of Paper Remarks
1 Vijeta Sharma , Heart Disease Prediction using Use of benchmark dataset which
Shrinkhala Yadav, Machine Learning Techniques consists of only 14 parameters.
Manjari Gupta [May 2020] Using random forest ,which
reduces the accuracy of prediction
for complex problems
4 Reldean Williams, Heart Disease Prediction using Use of Decision tree- That increases
Thokozani Machine Learning Techniques [Dec the processing time
Shongwe, 2021] Use of logistic regression – which
Ali N. Hasan, fails to predict a continuous
Vikash Rameshar outcome.
SI.NO Author Name Title Remarks
Sanchayita Dhar,Pritha A Hybrid Machine Learning Uses Random Forest ,decision tree and
5 Datta , Ankur Biswas, Approach for Prediction of Naïve Bayes that integrates a prediction
Tanusree Dey, Krishna Heart Diseases [2018] system that is capable to envisage heart
Roy diseases based on measurements extracted
from ERIC laboratory.
Problem Identification
The heart disease a major concern to be dealt with. But it is difficult to identify
heart disease because of several contributory risk factors such as diabetes, high
blood pressure, high cholesterol, abnormal pulse rate, and many other factors. Due
to such constraints, scientists have turned towards modern approaches like Data
Mining and Machine Learning for predicting the disease.
Machine learning (ML) proves to be effective in assisting in making decisions and
predictions from the large quantity of data produced by the healthcare industry.
System Architecture
KNN Algorithm ,Random
Forest algorithm
Test Set
Clinical Model Evaluation
Dataset
Accuracy , sensitivity,
Specification , Precision ,
MCC
Modules
1. Data Preprocessing
2. Feature Selection
4. Model evaluation
5. Deployment
1.Data Preprocessing
The first module involves preparing the dataset for the K-NN algorithm by :
In this module, we identify the most relevant features or variables that
can influence the prediction of heart diseases.
Techniques involved:
Correlation analysis
In this project , we developed a model using K-NN algorithm for the
prediction of cardiovascular disease.
The model was trained on a dataset of 70,000 patient records, which
were collected from various hospitals and clinics.
The dataset contained a total of 16 features, including age, gender,
blood pressure, cholesterol level and smoking status.
Publication