c20 Final Final
c20 Final Final
2. This diagnosis is a difficult task i.e. it should be performed precisely and efficiently. The
research paper mainly focuses on which patient is more likely to have a diabetic disease based
on various medical attributes.
3. We prepared a diabetic disease prediction system to predict whether the patient is likely to be
diagnosed with a diabetic disease or not using the medical history of the patient.
4. We used different algorithms of machine learning such as SVM to predict and classify the
patient with diabetic disease.
Existing System
1. There was an existing system called Diabetic Disease prediction using Machine
Learning. They have used the data set which was very small i.e it contains around 100-
120 entries and 5-6 parameters to check. The existing model used to possess an
accuracy rate of 77%.
2. GUI of existing model was Static . This model will ask for 4-5 kind of symptoms and
try to predict the chances of getting Diabetic disease .
Proposed System
1. Our System basically provides the enhanced results , data set got improved
which consists around 400 entries of data .
3. Model is designed in such a way that it is capable of providing the best and
accurate results.
Outcome: Class variable (0 or 1), where 0 indicates non-diabetic and 1 indicates diabetic.
Output Screenshots:
Performance Metrics
1. Accuracy
Accuracy is the ratio of correctly predicted instances to the total instances.
Accuracy=True Positives+True NegativesTotal Instances\text{Accuracy} = \frac{\text{True
Positives} + \text{True Negatives}}{\text{Total
Instances}}Accuracy=Total InstancesTrue Positives+True Negatives
2. Precision
Precision is the ratio of correctly predicted positive observations to the total predicted
positives. Precision=True PositivesTrue Positives+False Positives\text{Precision} = \frac{\
text{True Positives}}{\text{True Positives} + \text{False
Positives}}Precision=True Positives+False PositivesTrue Positives
3. Recall (Sensitivity or True Positive Rate)
Recall is the ratio of correctly predicted positive observations to all observations in the actual
class. Recall=True PositivesTrue Positives+False Negatives\text{Recall} = \frac{\text{True
Positives}}{\text{True Positives} + \text{False
Negatives}}Recall=True Positives+False NegativesTrue Positives
Result analysis
1. Analyzing the results of a diabetes prediction model involves evaluating several key
metrics to assess its performance. The confusion matrix provides a detailed breakdown of
predictions, distinguishing between true positives (correctly identified diabetic patients),
true negatives (correctly identified non-diabetic patients), false positives (non-diabetic
patients incorrectly identified as diabetic), and false negatives (diabetic patients incorrectly
identified as non-diabetic). These metrics are essential for understanding how well the
model distinguishes between diabetic and non-diabetic individuals.