Heart Failure Prediction Using Hybrid Method
Heart Failure Prediction Using Hybrid Method
PROPOSED WORK
It is important to have the first aid at the time of
heart attack. The number of deaths due to heart
attack occurs because there is a lack of awareness
and first aid given to patients. As the living style of
the person all-round the globe has been changed
which is the fundamental establishment for the
reason for various heart intricacies, there is Fig. 1. Proposed working architecture
sufficient research done to about the prediction.
In the proposed architecture as we can see that there
Here we are aiming to provide a one step further that
are six different sections which are listed above.
is studying the complexity of the heart disease and
Here first we are selecting the database in which it
giving the medical and non-medical suggestions to
is important to work on the target database so we are
get rid of the heart disease. In the proposed work we
selecting the target database from the heart disease
are focusing on analysis, prediction, accuracy after
database after that samples needs to be selected
using many algorithms and comparison and
from the pool of dataset and also it is important to
providing the suggestions. It is also important to
remove the noisy data present the dataset and since
know whether the person needs to be diagnosed
there are lot of attributes present in the dataset so it
with heart disease or not. In our work we are also
is necessary to create the specific attributes required
examining if the person needs to be examined or not
for the training of dataset now, we have to extract
after training the dataset. Experimenting with the
the relevant attributes which are useful for the
various classification models and checking with
process. In the next section there is Modelling and
yield the greatest accuracy.
training of the selected dataset happens. Here, in
this section we are using seven different machine 8. (thalach) maximum heart rate achieved
learning models one after another so that the best (#)
model that is the model with highest accuracy can 9. (exang) exercise induced angina
be find out. After applying the different machine (binary) (1 = yes; 0 = no)
learning algorithms and models now it extracts 10. (oldpeak) = ST depression induced by
the knowledge and finally gives the medical exercise relative to rest (#)
and non-medical suggestions. We have used the 11. (slope) of the peak exercise ST segment
supervised learning models in the proposed (Ordinal) (Value 1: up sloping, Value
system. 2: flat, Value 3: down sloping)
12. (ca) number of major vessels (0–
Quantitative research needs numerical data that can 3, Ordinal) colored by fluoroscopy
come out either from the numerical data itself or (thal) maximum heart rate achieved — (Ordinal): 3
otherwise graphs. Statistical methods are applied on = normal; 6 = fixed defect; 7 = reversible defect
it to get usefulness from the data. Qualitative
research is in words and in thoughts. There must be
expert opinion that can bring useful information EVALUATION RESULTS
through the thoughts and feeling of the examinee.
The prediction models are developed using 13
Qualitative research is to understand concepts,
features and the accuracy is calculated for modeling
thoughts, experiences and feelings of the patients.
techniques. The best classification methods are
This research paper uses both quantitative and
given. This table compares the accuracy,
qualitative data. We have used the University of
classification error, precision, F-measure,
California Irvine (UCI) dataset for this paper. There
sensitivity and specificity. The highest accuracy is
are 3 types of data used in this paper which are:
achieved by this proposed hybrid classification
Continuous (#): which is quantitative data that can method in comparison with existing methods. Out
be measured of the 13 features we examined, the top 4 significant
features that helped us classify between a positive &
Ordinal Data: Categorical data that has an order to negative Diagnosis were chest pain type (cp),
it (0,1,2,3, etc) maximum heart rate achieved (thalach), number of
Binary Data: data whose unit can take on only two major vessels (ca), and ST depression induced by
exercise relative to rest (oldpeak) as shown in the
possible states (0 &1)
figure 2.
There are 13 feature attributes identified in the
dataset for the heart disease prediction and working
which are mentioned below:
1. age (#)
2. sex: 1= Male, 0= Female (Binary)
3. (cp) chest pain type (4 values -
Ordinal): Value 1: typical angina,
Value 2: atypical angina, Value 3: non-
anginal pain, Value 4: asymptomatic
4. (trestbps) resting blood pressure (#) Fig. 2. Feature importance graph
5. (chol) serum cholesterol in mg/dl (#)
6. (fbs)fasting blood sugar > 120 mg/dl
(Binary) (1 = true; 0 = false)
7. (restecg) resting electrocardiography
results (values 0,1,2)
CONCLUSION
Identifying the processing of raw healthcare data of
heart information will help in the long term saving
of human lives and early detection of abnormalities
in heart conditions. Ma chine learning techniques
were used in this work to process raw data and
provide a new and novel discernment towards heart
disease. Heart disease prediction is challenging and
very important in the medical field. However, the
mortality rate can be drastically controlled if the
disease is detected at the early stages and
preventative measures are adopted as soon as
Fig. 3. Correlation Matrix possible. Further extension of this study is highly
desirable to direct the investigations to real-world
datasets instead of just theoretical approaches and
There is a positive correlation between chest pain simulations. The proposed hybrid approach is used
(cp) & target (our predictor). This makes sense combining the characteristics of Random Forest
since, the greater amount of chest pain results in a (RF) and Linear Method (LM). This method proved
greater chance of having heart disease. Cp (chest to be quite accurate in the prediction of heart
pain), is an ordinal feature with 4 values: Value 1: disease. The future course of this research can be
typical angina, Value 2: atypical angina, Value 3: performed with diverse mixtures of machine
non-anginal pain, Value 4: asymptomatic. learning techniques to better prediction techniques.
Furthermore, new feature selection methods can be
In addition, we see a negative correlation between developed to get a broader perception of the
exercises induced angina & our predictor. This significant features to increase the performance of
makes sense because when you exercise, your heart heart disease prediction.
requires more blood, but narrowed arteries slow
down blood flow.
From comparing positive and negative heart disease REFERENCES
patients. There are vast differences in means for [1] M. S. Amin, Y. K. Chiam, K. D. Varathan,
many of our Features. From examine the details, we ‘‘Identification of significant features and data mining
can observe that positive patients experience techniques in predicting heart disease,’’ Telematics
heightened maximum heart rate achieved (thalach) Inform., vol. 36, pp. 82–93, Mar. 2019. [Online].
average. In addition, positive patients exhibit about Available:
1/3rd the amount of ST depression induced by
https://linkinghub.elsevier.com/retrieve/pii/S073658531
exercise relative to rest (oldpeak). 8308876
Our Hybrid machine learning algorithm can now [2] S. M. S. Shah, S. Batool, I. Khan, M. U. Ashraf, S. H.
classify patients with Heart Disease. Now we can Abbas, and S. A. Hussain, ‘‘Feature extraction through
properly diagnose patients, & get them the help they parallel probabilistic principal component analysis for
need to recover. By diagnosing detecting these heart disease diagnosis,’’ Phys. A, Stat. Mech. Appl., vol.
features early, we may prevent worse symptoms 482, pp. 796–807, 2017. doi:
from arising later. Our Random Forest algorithm 10.1016/j.physa.2017.04.113.
yields the highest accuracy, 80%. Any accuracy [3] Y. E. Shao, C.-D. Hou, and C.-C. Chiu, ‘‘Hybrid
above 70% is considered good, but be careful intelligent modelling schemes for heart disease
because if your accuracy is extremely high, it may classification,’’ Appl. Soft Comput. J., vol. 14, pp. 47–
be too good to be true (an example of Over fitting). 52, Jan. 2014. doi: 10.1016/j.asoc.2013.09.020.
Thus, 80% is the ideal accuracy.
[4] J. S. Sonawane and D. R. Patil, ‘‘Prediction of heart [14] W. Zhang and J. Han, ‘‘Towards heart sound
disease using multilayer perceptron neural network,’’ in classification without segmentation using convolutional
Proc. Int. Conf. Inf. Commun. Embed- neural network,’’ in Proc. Comput. Cardiol. (CinC), vol.
44, Sep. 2017, pp. 1–4.
ded Syst., Feb. 2014, pp. 1–6.
[15] Y. Meidan, M. Bohadana, A. Shabtai, J. D.
[5] C. Sowmiya and P. Sumitra, ‘‘Analytical study of Guarnizo, M. Ochoa, N. O. Tippenhauer, and Y. Elovici,
heart disease diagnosis using classification techniques,’’ ‘‘ProfilIoT: A machine learning approach for IoT device
in Proc. IEEE Int. Conf. Intell. Techn. Control, Optim. identification based on network traffic analysis,’’ in Proc.
Signal Process. (INCOS), Mar. 2017, pp. 1–5. Symp. Appl. Comput., Apr. 2017, pp. 506–509.
[6] B. Tarle and S. Jena, ‘‘An artificial neural network [16] J. Wu, S. Luo, S. Wang, and H. Wang, ‘‘NLES: A
based pattern classification algorithm for diagnosis of novel lifetime extension scheme for safety-critical cyber-
heart disease,’’ in Proc. Int. Conf. Comput.,Commun., physical systems using SDN and NFV,’’ IEEE Internet
Control Automat. (ICCUBEA), Aug. 2017, pp. 1–4. Things J., no. 6, no. 2, pp. 2463–2475, Apr. 2019.
[7] V. P. Tran and A. A. Al-Jumaily, ‘‘Non-contact [17] J. Wu, M. Dong, K. Ota, J. Li, and Z. Guan, ‘‘Big
Doppler radar based prediction of nocturnal body data analysis-based secure cluster management for
orientations using deep neural network for chronic heart optimized control plane in software-defined networks,
failure patients,’’ in Proc. Int. Conf. Elect. Comput. IEEE Trans. Netw. Service Manag., vol. 15, no. 1, pp.
Technol. Appl. (ICECTA), Nov. 2017, pp. 1–5. 27–38, Mar. 2018.
[8] K. Uyar and A. Ilhan, ‘‘Diagnosis of heart disease [18] J. Wu, M. Dong, K. Ota, J. Li, and Z. Guan, ‘‘FCSS:
using genetic algorithm based trained recurrent fuzzy Fog computing based content-aware filtering for security
neural networks,’’ Procedia Comput. Sci., vol. 120, pp. services in information centric social networks,’’ IEEE
588–593, 2017. Trans. Emerg. Topics Comput., to be published. doi:
[9] T. Vivekanandan and N. C. S. N. Iyengar, ‘‘Optimal 10.1109/TETC.2017.2747158.
feature selection using a modified differential evolution [20] G. Li, J. Wu, J. Li, K. Wang, and T. Ye, ‘‘Service
algorithm and its effectiveness for prediction of heart popularity-based smart resources partitioning for fog
disease,’’ Comput. Biol. Med., vol. 90, pp. 125–136, computing-enabled industrial Internet of things,’’ IEEE
Nov. 2017. Trans. Ind. Information., vol. 14, no. 10, pp. 4702–4711,
[10] S. Radhimeenakshi, ‘‘Classification and prediction Oct. 2018.
of heart disease risk using data mining techniques of
support vector machine and artificial neural network,’’ in [21] J. Wu, K. Ota, M. Dong, and C. Li, ‘‘A hierarchical
Proc. 3rd Int. Conf. Comput. Sustain. Global Develop. security framework for defending against sophisticated
(INDIACom), New Delhi, India, Mar. 2016, pp. 3107– attacks on wireless sensor networks in smart cities,’’
3111. IEEE Access, vol. 4, pp. 416–424, 2016.
[11] R. Wagh and S. S. Paygude, ‘‘CDSS for heart [22] H. Li, K. Ota, and M. Dong, ‘‘Learning IoT in edge:
disease prediction using risk factors,’’ Int. J. Innov. Res. Deep learning for the Internet of Things with edge
Comput., vol. 4, no. 6, pp. 12082–12089, Jun. 2016. computing,’’ IEEE Netw., vol. 32, no. 1, pp. 96–101,
Jan./Feb. 2018.
[12] O. W. Samuel, G. M. Asogbon, A. K. Sangaiah, P.
Fang, and G. Li, ‘‘An integrated decision support system
based on ANN and Fuzzy_AHP for heart failure risk
prediction,’’ Expert Syst. Appl., vol. 68, pp. 163–172,
Feb. 2017.