TechnologyName Phase1
TechnologyName Phase1
Abstract
Many factors can cause a person to get affected by diabetes, like excessive body
weight, abnormal cholesterol level, family history, physical inactivity, bad food
habit etc. People with diabetes for a long time can get several complications like
heart disorder, kidney disease, nerve damage, diabetic retinopathy etc. But its risk
can be reduced if it is predicted early. We are going to use machine learning
classification methods, that is, decision tree, SVM, Random Forest, Logistic
Regression, KNN, and various ensemble techniques, to determine which algorithm
produces the best prediction results. The explainable AI approach with LIME and
SHAP frameworks is implemented to understand how the model predicts the final
results.
Introduction
Diabetes is a chronic disease that directly affects the pancreas, and the body is
incapable of producing insulin.
Many factors, such as excessive body weight, physical inactivity, high blood
pressure, and abnormal cholesterol level, can cause a person get affected by
diabetes.
First, the dataset was collected and preprocessed to remove the necessary
discrepancies from the dataset
Then the dataset was separated into the training set and test set using the
holdout validation technique.
The zero value has been replaced by its corresponding mean value.
The training and test dataset has been separated using the holdout
validation technique, where 80% is the training data and 20% is the
test data.
There are some future scopes of this work, for example, we recommend
getting additional private data with a larger cohort of patients to get
better results.