Cccccccccccccccs
Cccccccccccccccs
An Internship Report
On
HEART DISEASE PREDICTION
Submitted in Partial Fulfillment of the requirement for the award of the degree of
BACHELOR OF ENGINEERING
IN
COMPUTER SCIENCE AND ENGINEERING
Submitted By
ANNAPUREDDY PRANATHI 1SJ18CS005
Carried out at
QUANT MASTERS
(#812, 6th cross 3rd main, Rajajinagar, Bengaluru - 560021)
Under the guidance of
Internal Guide External Guide
Mr. Apoorva S Mr.Shashank
Assistant Professor, Technical Lead,
Dept. of CSE, SJCIT. QUANT MASTERS
S J C INSTITUTE OF TECHNOLOGY
DEPARTMENT OF COMPUTER SCIENCE AND ENGINEERING
CHIKKABALLAPUR-562101
2021-2022
COMPANY CERTIFICATE
i
DECLARATION
DATE:13/05/2022 USN:1SJ18CS005
ii
ABSTRACT
Heart plays significant role in living organisms. Heart disease is one of the most
significant causes of mortality in the world today. Prediction of cardio vascular disease is
a critical challenge in the area of clinical data analysis. Diagnosis and prediction of
heartrelated diseases requires more precision, perfection and correctness because a little
mistake can cause fatigue problem or death of the person, there are numerous death cases
related to heart and their counting is increasing exponentially day by day. Machine
learning has been shown to be effective in assisting in making decisions and predictions
from the large quantity of data produced by the health care industry. Various studies give
only a glimpse into predicting heart disease with ML techniques. Here, we design a model
that aims at finding significant features by applying machine learning techniques resulting
in improving the accuracy in the prediction of heart disease. There are many to do
accomplish this task successfully, but how effective are they? Our main aim is to produce
an enhanced performance level with the good accuracy level through the prediction model
for heart disease with the SVM, KNN, Naïve Bayes, logistic regression & Random Forest
algorithms. It is estimated that on an average about 17 million people die of
cardiovascular diseases each year, which is about one third of total deaths across the
globe. In this proposed project we designed a model to detect and predict the accuracy of
heart disease. This system can provide most of the important features required to detect
heart disease with different algorithms. As we know the population is been increased day
by day the diseases of the people are increasing day by day, so with that we have to
upgrade with the technology constantly, it is becoming easy to track the behaviour and
pattern diseases and get cure at the early stages. To come up with the solution one can
make use of technologies with the increase of machine learning so it becomes feasible to
automate this process and to save someone's life by detecting the disease at an early
stages. Initially, we will collect the data set by users and classify it as trained and testing
dataset using different type's algorithm and decision trees. By using the feasible
algorithm, we can analyze the larger data-set and user provided current data set. Then
augment the accuracy of the result data. Proceeded with the application of processing of
some of the attributes provided which can find weather the user is having heart disease in
viewing the graphical model of data visualization. The performance of the techniques is
gauged based on accuracy, sensitivity, and specificity, precision. The results are indicated
concerning the best accuracy for Random Forest is unit 80% respectively.
iii
ACKNOWLEDGEMENT
With reverential pranam, I express my sincere gratitude and salutations to the feet of his
holiness Byravaikya Padmabhushana Sri Sri Sri Dr. Balagangadharanatha Maha
Swamiji & his holiness Jagadguru Sri Sri Sri Dr. Nirmalanandanatha Swamiji of Sri
Adichunchanagiri Mutt for their unlimited blessings. First and foremost, I wish to express
my sincere gratitude to my institution, Sri Jagadguru Chandrashekaranathaswamiji
Institute of Technology for providing me an opportunity to complete my internship work
successfully.
I would like extend this deep sense of sincere gratitude to Dr. G T Raju,
Principal, S. J. C. Institute of Technology, Chickballapur, for providing everything
without which it would have been impossible to complete the Internship Work.
I extend special in-depth, heartfelt, and sincere gratitude to our HOD, Dr.
Manjunatha Kumar B H, Professor and Head of Department, Computer Science
and Engineering, S. J. C. Institute of Technology, Chickballapur, for his constant
support and valuable guidance of the Internship Work.
I convey our sincere thanks to Internship Internal Guide Prof. Swetha T,
Assistant Professor, Department of Computer Science and Engineering, S. J. C.
Institute of Technology, for her constant support, guidance and suggestions.
I also feel immense pleasure to express deep and profound gratitude to our
Internship Coordinator’s Prof. Swetha T and Prof. Shrihari M R, Assistant Professor,
Department of Computer Science and Engineering, S J C Institute of Technology,
for his guidance and suggestions of the Internship Work.
Finally, I would like to thank all faculty members of Department of Computer
Science and Engineering, S. J. C. Institute of Technology, Chickballapur for their
support.
I also thank all those who extended their support and co-operation while bringing
ANNAPUREDDY PRANATHI
iv
TABLE OF CONTENTS
Certificate i
Declaration ii
Abstract iii
Acknowledgement iv
Contents v
List Of Figures vi
1.1.1 Objectives 1
2.3 Testing 5
3.1 Introduction 8
4.1 Experience 10
4.2 Technical Outcomes 10
4.5 Implementation 14
4.5.1 Modules 14
5 Conclusion 21
Bibliography 22
Appendix 23
LIST OF FIGURES
4.6.8 SVC 19
4.6.9 Gaussian NB 19
1.1.1 Objectives
• The essential objective of QUANT MASTERS is to improve the quality of
training and enhance the learning process.
• Most importantly to create engaging and effective learning experiences and
provide a variety of technological information, ideas to encourage curiosity,
stimulate self-confidence through the knowledge and develop practical skills.
1
Heart Disease Prediction Introduction
1.1.2 Operation of the Organization
Our mission is to make learning truly interesting and make it easier as well as more
affordable for the students to prepare for their placements or competitive examinations
and help them make a perfect start to their career. There are about 20 people working in
the organization working as intern guides in various platform and trained above 700+
students enroll in our batches every month from all over the country and one of the
corporate institutions for providing internships.
On-going projects: We start a new placement training batch every 1.5 months.
Currently we are working towards giving quality training cum internships to the
students and give them the practical implications of the related projects. The training
provided by us is also helpful to various students preparing for competitive exams
from different branches- Engineering, Humanity, Commerce, Arts, Management etc.
We will soon be launching our services with regards to various new technological
advancements and certification courses.
1.4 Services Offered
• Quantitative Aptitude
• Technical Training
• Verbal Aptitude
• Logical training
• Soft skills/ Communication Skills
• Resume Building
• LinkedIn Networking
• AI and ML internship
• GD Preparation
4
Heart Disease Prediction About The Department
2.3 Testing
The various testing techniques used by the department can be summarized as follows:
1.Functionality Testing of a Website: it is a process that includes several testing
parameters like user interface, APIs, database testing, security testing, client and server
testing and basic website functionalities. Functional testing is very convenient and it
allows users to perform both manual and automated testing. It is performed to test the
functionalities of each feature on the website.
2.Usability Testing: This type of testing includes testing the site navigations and contents
of the website.
3.Interface Testing: Three areas to be tested here are Application,Web and Database
Server.
4.Database Testing: Database is one critical component of your web application and
stress must be laid to test it thoroughly Testing activities will include Test if any errors
are shown while executing queries, Data Integrity is maintained while creating, updating
or deleting data in database, Check response time of queries and fine tune them if
necessary, Test data retrieved from your database is shown accurately in your web
application.
5. Compatibility testing: Compatibility tests ensures that your web application displays
correctly across different devices. This would include-Browser Compatibility Test: Same
website in different browsers will display differently. You need to test if your web
application is being displayed correctly across browsers, JavaScript, AJAX and
authentication is working fine.
6. Pipeline testing: After compatibility testing it is the time to test all the micro services
in pipeline together to check their compatibility and message passing.
Thus all the services/functionalities are kept in pipeline and tested together. Afterwards
whole pipeline is pushed in the deployment server.
8
Heart Disease Prediction Task Performed
4.1 Experience
The internship has been a really useful experience for me that I can learn a lot of new
knowledge that will definitely be useful for my future study. I’m grateful that my
assignments have a lot of variety instead of just focusing on a specific area. This allows
me to be able to learn more and also challenge myself to overcome many different kinds
of difficulties encountered during my internship. Having many assignments also required
me to manage my work time efficiently prioritizing the urgent task .
Some tasks require me to do research with less available online documentation other task
requires me to make attempts on works that I have never experienced before just by
learning from documentations. Although the task may be difficult and overwhelming
sometimes, I’m really excited to push my skills to the limit and carry out those task
assigned to me.
Beside technical skills, I also observed and learned a lot of soft skills from my supervisors
and my co-workers such as professional communication and team work. I have also
learned a lot from my supervisor who’s always willing to help me when I face difficulties
and also willing to share a lot of his knowledge and wisdom to me from his post
experience.
• Optimize the models learned and report on the expectancy accuracy that can be achieved
by applying the models.
10
Heart Disease Prediction Reflection Notes
4.2.1 System Requirement Specification
Hardware Requirements
Software Requirements
BACK-END : Python 3
The traditional detection method mainly depends on the doctor’s vision of treating the
patient and his level of experience, which usually are delayed, inaccurate and not in-time.
After following these methods, it may take time for diagnosing the records and giving the
summary and then treating the patient.
There are few disadvantages identified in the existing system and are defined below:
i. Inaccurate results
i. Accurate results
v. Graphical representation
vi. Cost-efficienccy
The above figure represents the data flow diagram of the project.
• Evaluating the trained model and finding the best algorithm for the project.
4.5.1 Modules
• The different algorithms used for training are K-Nearest Neighbors, Logistic
Regression, Support Vector Machine
First ,import all the libraries/packages which are necessary to analyse the dataset
Next ,we have to insert the dataset which is present in the local system using pandas
library
At the end of this project, we have acquired the result of an accurate value of using a
random forest algorithm with new enhancements. In comparison to existing modules, this
proposed module is applicable for the dataset and provides more accurate results. The
Random forest algorithm will provide better performance with many training data, but
speed during testing and application will still suffer. Usage of more pre-processing
techniques would also assist.In this project, we have seen that the accuracy of Random
Forest Algorithm is best when compared to other algorithms.
21
BIBLIOGRAPHY
[1] Machine Learning with Python: Design and Develop Machine Learning and Deep
LearningTechnique using real world code examples, Abhishek Vijayvargia, 1st Edition, 2019.
[2]Python GUI Programming - A Complete Reference Guide: Develop responsive and powerful
GUI applications with PyQt and Tkinter, Alan D. Moore, B. M. Harwani, 2019.
[3]Machine Learning for Beginners: The Definitive Guide to Neural Networks, Random Forests,
and Decision Trees, Jennifer Grange, 2017.
[4] A. S. Abdullah and R. R. Rajalaxmi, ‘‘A data mining model for predicting the coronary heart
disease using random forest classifier,’’ in Proc. Int. Conf. Recent Trends Comput. Methods,
Commun. Controls, Apr. 2012, pp. 22–25.
[5] A. H. Alkeshuosh, M. Z. Moghadam, I. Al Mansoori, and M. Abdar, ‘‘Using PSO algorithm for
producing best rules in diagnosis of heart disease,’’ in Proc. Int. Conf. Comput. Appl. (ICCA), Sep.
2017,pp.306–311.
https://www.kaggle.com/datasets?fileType=csv
22
APPENDIX
Appendix A: Abbreviation
23