Project Synopsis
By
Group No: 12
CERTIFICATE
ACKNOWLEDGEMENT
INTRODUCTION
MOTIVATION OF THE PROJECT
HARDWARE AND SOFTWARE TOOLS TO BE USED
FLOW-CHART OF THE PROJECT
ABOUT DATASET
PREPROCESSING DATASET
ABOUT CLASSIFICATION SUPERVISED MODEL
CONFUSION MATRIX
ROC CURVE
OUTPUT COMPARISON
FUTURE SCOPE
CONCLUSION
REFERENCES
We hereby declare that the work presented in the Project Report entitled Diabetes
Prediction using Machine Learning, in partial fulfilment of the requirements for the award of the degree of Bachelor of
Technology in Information Technology and submitted to the Department of Information Technology of Future
Institute of Engineering and Management, Kolkata, is an authentic record of our own work carried out during the
period from September 2021 to June 2022, under the supervision of Prof. Debjyoti Basu and Prof. Subhasis Mitra.
The matter presented in this thesis has not been submitted by us for the award of any other degree elsewhere.
Date: 09.06.2022
Signature of the Supervisor Signature of the Supervisor
ACKNOWLEDGEMENT
We have put considerable effort into this project. However, it would not have
been possible without the kind support and help of many individuals. We would
like to extend our sincere thanks to all of them.
We are highly indebted to our guides, Prof. Debjyoti Basu and Prof. Subhasis
Mitra, for their guidance and constant supervision, for providing the necessary
information regarding the project, and for their support in completing the
project.
We express our thanks to our Principal, Dr. Aloke Ghosh, and our Head of the
Department, Prof. Prasenjit Basu, for extending their support. We would also
like to thank our Institution and the faculty members, without whom this project
would have remained a distant reality.
Our thanks and appreciation also go to all the people who have willingly helped
us with their abilities.
Abhishek Sinha
Arka Dutta
Ritayan Midya
Pritam Pal
Ehsan Hassan
Rohit Paul
INTRODUCTION
In recent times, a large number of people have been suffering from diabetes.
There are an estimated 72.96 million cases of diabetes in the adult population
of India. The prevalence in urban areas ranges between 10.9% and 14.2%, while
the prevalence in rural India is 3.0-7.8% among the population aged 20 years
and above, with a much higher prevalence among individuals aged over 50 years.
To study this problem, we use the Pima Indian Diabetes Dataset and apply
machine learning classification techniques to predict diabetes. Machine
learning is a method by which computers learn patterns from data instead of
being explicitly programmed for each task. Machine learning techniques extract
knowledge efficiently by building classification and ensemble models from
collected data, and such models can be used to predict diabetes. Many machine
learning techniques are capable of making this prediction; however, it is
difficult to choose the best one. For this reason, we apply two popular
classification methods, K-NN and Logistic Regression, to the dataset. The main
objective of this project is to compare these two methods and choose the better
one for prediction.
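As an illustration of the K-NN method named above, the following is a minimal sketch written with only the Python standard library. It is not the project's actual code: the function names `knn_predict` and `accuracy` are hypothetical, and the few (glucose, BMI) rows are made up for the example and are not taken from the Pima Indian Diabetes Dataset.

```python
import math
from collections import Counter


def knn_predict(train_X, train_y, query, k=3):
    """Classify `query` by majority vote among its k nearest training rows."""
    # Euclidean distance from the query to every training row.
    distances = [
        (math.dist(row, query), label)
        for row, label in zip(train_X, train_y)
    ]
    distances.sort(key=lambda pair: pair[0])
    # Majority vote among the labels of the k closest rows.
    votes = Counter(label for _, label in distances[:k])
    return votes.most_common(1)[0][0]


def accuracy(predict, X, y):
    """Fraction of rows in X whose predicted label matches the true label."""
    correct = sum(1 for row, label in zip(X, y) if predict(row) == label)
    return correct / len(y)


# Made-up (glucose, BMI) rows for illustration only; 1 = positive, 0 = negative.
train_X = [(85, 22.0), (90, 24.5), (95, 26.0), (150, 33.0), (160, 35.5), (170, 38.0)]
train_y = [0, 0, 0, 1, 1, 1]
test_X = [(88, 23.0), (165, 36.0)]
test_y = [0, 1]

knn_acc = accuracy(lambda row: knn_predict(train_X, train_y, row, k=3), test_X, test_y)
print(f"K-NN accuracy on the toy test rows: {knn_acc:.2f}")
```

The same `accuracy` helper could be applied to a second classifier, which is one simple way to compare two methods on the same held-out rows. In practice the features would usually be scaled first, since K-NN distances are sensitive to the units of each column.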
HARDWARE:
Any kind of laptop or desktop (Windows 10) with internet connectivity.
GPU
SOFTWARE:
Google Colab
MS Excel
Python
scikit-learn (sklearn)
Flask
HTML, CSS
This is the most important phase, which covers model building for the
prediction of diabetes. In this phase we implement the machine learning
algorithms discussed above for diabetes prediction.
[Flow chart of the project. Boxes recovered from the figure: data processing; split dataset; test dataset (20%); classifier; load the classifier into the API file; render the HTML file and get data from the HTML file; predict the outcome from the given data and show the result.]
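Two steps from the flow chart, holding out 20% of the dataset for testing and loading a saved classifier before predicting, can be sketched as follows. This is only an illustrative sketch under stated assumptions: `split_dataset` and `predict_outcome` are hypothetical names, the rows are placeholders, and the "model" is a made-up threshold rule standing in for a trained classifier. The Flask and HTML parts of the flow are not shown.

```python
import pickle
import random


def split_dataset(rows, test_fraction=0.2, seed=42):
    """Shuffle a copy of `rows` and hold out `test_fraction` of them for testing."""
    shuffled = list(rows)
    random.Random(seed).shuffle(shuffled)
    n_test = int(round(len(shuffled) * test_fraction))
    # The first n_test shuffled rows form the test set; the rest are for training.
    return shuffled[n_test:], shuffled[:n_test]


def predict_outcome(model, glucose):
    """Return 1 or 0 using the stand-in rule stored in `model` (illustrative only)."""
    return 1 if glucose >= model["glucose_threshold"] else 0


# Placeholder rows of (row id, label); not real patient data.
rows = [(i, i % 2) for i in range(10)]
train_rows, test_rows = split_dataset(rows)
print(len(train_rows), len(test_rows))

# Hypothetical stand-in for a trained classifier, serialised and loaded again
# the way an API file might load a previously saved model.
model = {"glucose_threshold": 140}
saved_bytes = pickle.dumps(model)
loaded_model = pickle.loads(saved_bytes)
print(predict_outcome(loaded_model, 150))
```

Fixing the shuffle seed keeps the split reproducible between runs, which matters when two classifiers are to be compared on the same test rows.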
Algorithm:
It classifies the data in binary form, that is, only as 0 or 1, which
indicates whether a patient is positive or negative for diabetes.
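The binary decision described above can be sketched for a logistic-regression style model, assuming that is the algorithm this section refers to. The model computes a probability with the sigmoid function and then thresholds it at 0.5 to obtain 0 or 1. The weights, bias, and feature values below are made-up numbers chosen for illustration, not coefficients fitted by the project.

```python
import math


def sigmoid(z):
    """Map any real number to a probability between 0 and 1."""
    return 1.0 / (1.0 + math.exp(-z))


def predict_proba(weights, bias, features):
    """Probability of the positive class for one row of features."""
    z = bias + sum(w * x for w, x in zip(weights, features))
    return sigmoid(z)


def predict_label(weights, bias, features, threshold=0.5):
    """Return 1 if the probability reaches the threshold, otherwise 0."""
    return 1 if predict_proba(weights, bias, features) >= threshold else 0


# Hypothetical coefficients for (glucose, BMI); illustrative values only.
weights = [0.05, 0.1]
bias = -9.0

print(predict_label(weights, bias, [90, 24.0]))
print(predict_label(weights, bias, [165, 36.0]))
```

Moving the threshold away from 0.5 trades false positives against false negatives, which is the trade-off that a confusion matrix and an ROC curve are used to inspect.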
FUTURE SCOPE
Implement SVM and Random Forest classification to try to improve the
accuracy rate.
Implement a GUI as the front end.