ISO 9001:2015 Certified
5th KM. STONE, DELHI-MEERUT ROAD, GHAZIABAD (U.P)-201003
Department of Computer Science & Engineering
6. SUPERVISOR:
…………………………………………………………………………………
PROJECT SYNOPSIS
STATEMENT-:
Nowadays, people face various diseases because of environmental conditions and their living
habits, so predicting a disease at an early stage is an important task. Predicting the disease
correctly is also the most challenging part. Data mining plays an important role here: given
disease data, it finds hidden patterns in large amounts of medical data. We propose general
disease prediction based on the symptoms of the patient.
As we all know, because of COVID-19 people now fear disease even at the slightest symptoms.
The basic idea behind the project is to give people an interface on which they can rely to see
which disease they may have and get medicines accordingly. They just have to enter the
symptoms they are having, and the system will analyse the symptoms and tell the user the
disease he/she may be suffering from.
OBJECTIVE-:
METHODOLOGY-:
KNN (K-Nearest Neighbor): The KNN algorithm assumes that a new case is similar to the
available cases and puts the new case into the category whose existing cases it most resembles.
SVM (Support Vector Machine): SVM finds the best line or decision boundary that separates
n-dimensional space into classes, so that a new data point can easily be put in the correct
category in the future.
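The KNN idea described above can be sketched in a few lines of Python. This is only an illustrative sketch, not the project's implementation: the symptom columns, the disease labels, the toy training rows and the names `TRAIN`, `euclidean` and `knn_predict` are all assumptions made up for this example. Each patient is encoded as a binary symptom vector, and a new case takes the majority label of its k nearest training cases.

```python
import math
from collections import Counter

# Toy training set, invented for illustration only. Each row is a binary
# symptom vector [fever, cough, sneezing, rash] paired with a disease label.
TRAIN = [
    ([1, 1, 0, 0], "flu"),
    ([1, 1, 1, 0], "flu"),
    ([0, 1, 1, 0], "cold"),
    ([0, 0, 1, 0], "cold"),
    ([1, 0, 0, 1], "measles"),
    ([0, 0, 0, 1], "measles"),
]


def euclidean(a, b):
    """Euclidean distance between two equal-length symptom vectors."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


def knn_predict(query, train=TRAIN, k=3):
    """Return the majority label among the k training rows nearest to query."""
    neighbours = sorted(train, key=lambda row: euclidean(query, row[0]))[:k]
    votes = Counter(label for _, label in neighbours)
    return votes.most_common(1)[0][0]


if __name__ == "__main__":
    # A patient reporting fever and cough only.
    print(knn_predict([1, 1, 0, 0]))
```

An odd value of k is a common choice because it reduces tied votes; with real patient data, k would normally be chosen by validation rather than fixed in advance.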
HARDWARE REQUIREMENT-:
A computer system with a minimum of 8 GB of RAM, 50 GB of storage, and a powerful graphics
card to support the calculations and graphics.
SOFTWARE REQUIREMENT-:
Android Studio- Used for its emulator, and to control, edit and develop the application
interface.
Sublime Text- Used to write the code for the web application.
Octave (GUI)- Used to run the ML algorithms and train the model.
TESTING TECHNOLOGY-:
Automated Testing: Automated testing is a process in which the tester uses special tools to
put the software through its paces and find any bugs that may exist. It is an integral part of
the development process. It helps identify bugs and defects early in the development
lifecycle, which saves the time and money otherwise spent fixing them later. Automated tests
are also generally more reliable than manual tests because they are less prone to human
error.
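As a small illustration of what such an automated test can look like, the sketch below uses Python's standard `unittest` module. The function `predict_disease`, the rule table `RULES` and the symptom-to-disease pairs are hypothetical placeholders invented for this example; they stand in for whatever prediction routine the application exposes.

```python
import unittest

# Hypothetical rule table, invented for illustration: symptom set -> label.
RULES = {
    frozenset({"fever", "cough"}): "flu",
    frozenset({"sneezing", "runny nose"}): "common cold",
}


def predict_disease(symptoms):
    """Return the label whose symptom set matches exactly, else 'unknown'."""
    key = frozenset(s.strip().lower() for s in symptoms)
    return RULES.get(key, "unknown")


class PredictDiseaseTests(unittest.TestCase):
    def test_known_symptoms(self):
        self.assertEqual(predict_disease(["fever", "cough"]), "flu")

    def test_order_and_case_do_not_matter(self):
        self.assertEqual(predict_disease(["Cough", " Fever "]), "flu")

    def test_unrecognised_symptoms(self):
        self.assertEqual(predict_disease(["headache"]), "unknown")


def run_tests():
    """Run the test case and return the unittest result object."""
    suite = unittest.defaultTestLoader.loadTestsFromTestCase(PredictDiseaseTests)
    return unittest.TextTestRunner(verbosity=0).run(suite)


if __name__ == "__main__":
    run_tests()
```

Because the checks are code, they can be re-run after every change, which is how defects get caught early.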
Given a patient's record as input, we obtained accurate general disease risk prediction as
output. This helps us understand the level of disease risk, with low time consumption and
the minimal possible cost for disease prediction and risk prediction.
TABLE OF CONTENTS
1. INTRODUCTION
1.1 Objective
1.2 Motivation
1.3 Problem Definition
1.4 Methodology
1.4.1 KNN Algorithm
1.4.2 Support vector machine
1.4.3 Random Forest Algorithm
1.5 Background and related work
1.5.1 Different Approaches
1.6 Related Work
3.1.5 Increment model
3.1.6 Agile model
3.1.7 Iterative model
3.1.8 Big Bang model
3.1.9 Prototype model
3.2 How it affects our model
5. RISK ASSESSMENT
5.1 Evolution Matrices
5.1.1 Feature engineering
5.1.2 Pre-processing
5.1.3 Risk equation
5.2 Classification
5.3 Risk assessment service
5.3.1 Risk score
5.3.2 Risk factor
6. DFD AND APPLICATION ARCHITECTURE
6.1 Zero-layer DFD
6.2 First-layer DFD
6.3 Second-layer DFD
9. PROJECT SNAPSHOTS
9.1 Output screen of web application
9.2 Dataset
LIST OF FIGURES
7. Figure 7.1 Detailed Design model
LIST OF TABLES