0% found this document useful (0 votes)

18 views3 pages

Deep Learning Approach For Diabetes Prediction Using PIMA Indian Dataset

Uploaded by

doraeshin04

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

18 views3 pages

Deep Learning Approach For Diabetes Prediction Using PIMA Indian Dataset

Uploaded by

doraeshin04

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

You are on page 1/ 3

Deep Learning Approach for Diabetes Prediction using PIMA Indian Dataset

The PIMA Indian Diabetes dataset consists of medical records of patients, including several independent
variables (features) and a target variable (Outcome), where the goal is to predict the presence of diabetes (binary
classification problem).
Steps for Designing the Deep Learning Model
1. Dataset Overview:
o The dataset contains 768 samples, with the following features:
 Pregnancies: Number of times the patient has been pregnant.
 Glucose: Plasma glucose concentration.
 BloodPressure: Diastolic blood pressure (mm Hg).
 SkinThickness: Triceps skinfold thickness (mm).
 Insulin: 2-Hour serum insulin (mu U/ml).
 BMI: Body mass index (weight in kg/height in m²).
 DiabetesPedigreeFunction: Diabetes pedigree function (family history of diabetes).
 Age: Age of the patient (years).
 Outcome: Target variable (0 for non-diabetic, 1 for diabetic).
2. Preprocessing:
o Handle missing values (if any).
o Normalize the features (scaling them between 0 and 1) since the features have different
ranges, which helps improve the performance of deep learning models.
o Split the dataset into training and test sets.
3. Deep Learning Model:
o Use a deep neural network for classification.
o Three hidden layers with ReLU activation function.
o Dropout layers to avoid overfitting.
o Sigmoid activation in the output layer for binary classification.
4. Evaluation:
o Use appropriate evaluation metrics like accuracy, precision, recall, and F1-score.
o Cross-validation or a validation set should be used to evaluate the generalization of the model.
Deep Learning Model Implementation
Below is an implementation using TensorFlow and Keras for diabetes prediction:
python
Copy code
# Import necessary libraries
import pandas as pd
import numpy as np
from sklearn.model_selection import train_test_split
from sklearn.preprocessing import StandardScaler
from tensorflow.keras.models import Sequential
from tensorflow.keras.layers import Dense, Dropout
from sklearn.metrics import accuracy_score, classification_report

# Load the PIMA Indian Diabetes dataset

url = 'https://raw.githubusercontent.com/jbrownlee/Datasets/master/pima-indians-diabetes.data.csv'
columns = ['Pregnancies', 'Glucose', 'BloodPressure', 'SkinThickness', 'Insulin', 'BMI',
'DiabetesPedigreeFunction', 'Age', 'Outcome']
data = pd.read_csv(url, names=columns)

# Split the dataset into features (X) and target (y)

X = data.iloc[:, :-1].values # All features except Outcome
y = data.iloc[:, -1].values # Target variable Outcome

# Train-test split (80% training, 20% testing)

X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.2, random_state=42)
# Standardize the features (normalization between 0 and 1)
scaler = StandardScaler()
X_train = scaler.fit_transform(X_train)
X_test = scaler.transform(X_test)

# Build the deep neural network model

def create_model():
model = Sequential()

# Input layer + First hidden layer with 256 neurons, ReLU activation
model.add(Dense(256, input_dim=8, activation='relu'))
model.add(Dropout(0.2))

# Second hidden layer with 256 neurons, ReLU activation

model.add(Dense(256, activation='relu'))
model.add(Dropout(0.2))

# Third hidden layer with 256 neurons, ReLU activation

model.add(Dense(256, activation='relu'))
model.add(Dropout(0.2))

# Output layer (binary classification) with Sigmoid activation

model.add(Dense(1, activation='sigmoid'))

# Compile the model with Adam optimizer and binary crossentropy loss
model.compile(optimizer='adam', loss='binary_crossentropy', metrics=['accuracy'])

return model

# Create the model

model = create_model()

# Train the model

history = model.fit(X_train, y_train, epochs=50, batch_size=32, validation_split=0.2)

# Evaluate the model on test data

y_pred = model.predict(X_test)
y_pred = np.round(y_pred) # Convert probabilities to binary predictions (0 or 1)

# Calculate accuracy and classification report

accuracy = accuracy_score(y_test, y_pred)
print(f"Test Accuracy: {accuracy * 100:.2f}%")
print(classification_report(y_test, y_pred))

# Model summary
model.summary()
Explanation:
 Preprocessing:
o Data is split into training and testing sets.
o Standardization is applied to normalize the data since features like age, insulin, and glucose
are on different scales.
 Model Architecture:
o Input Layer: Accepts 8 features (Pregnancies, Glucose, Blood Pressure, etc.).
o Hidden Layers: Three hidden layers with 256 neurons each and ReLU activation.
o Dropout: Applied after each hidden layer to reduce overfitting by randomly disabling 20% of
neurons during training.
o Output Layer: Uses a single neuron with Sigmoid activation, which outputs a probability
score for binary classification (0 for non-diabetic, 1 for diabetic).
 Optimizer and Loss Function:
o The Adam optimizer is used, which adjusts the learning rate dynamically and efficiently.
o Binary Crossentropy is used as the loss function, which is appropriate for binary classification
tasks.
 Training:
o The model is trained for 50 epochs with a batch size of 32, using 20% of the training data as
validation data.
Mechanisms to Improve the Model:
1. Early Stopping: To prevent overfitting, the training process can be stopped early if the validation
accuracy starts to degrade.
2. Cross-Validation: Use k-fold cross-validation to ensure the model generalizes well.
3. Hyperparameter Tuning: Experiment with different batch sizes, learning rates, number of neurons, or
even layer architectures to find the best-performing configuration.
Evaluation Metrics:
 Accuracy: Measures the proportion of correct predictions.
 Precision, Recall, and F1-Score: Useful for understanding the performance in terms of false positives
and false negatives, especially for an imbalanced dataset like PIMA.
This deep learning approach provides a strong baseline for diabetes prediction using the PIMA Indian dataset,
with room for further optimization and evaluation techniques.

Diabetes Prediciton Model
100% (1)
Diabetes Prediciton Model
23 pages
Digital Signal Processing Ppt-1
100% (1)
Digital Signal Processing Ppt-1
12 pages
Aiml Project Report
No ratings yet
Aiml Project Report
10 pages
Presentation - Yussup Tumgoyev
No ratings yet
Presentation - Yussup Tumgoyev
128 pages
Diabetes - Test Report
No ratings yet
Diabetes - Test Report
62 pages
IPL Winning Prediction Intern Report
No ratings yet
IPL Winning Prediction Intern Report
52 pages
Deep Learning
No ratings yet
Deep Learning
41 pages
Minor Project Report
No ratings yet
Minor Project Report
46 pages
Diabetes Analysis and Prediction
No ratings yet
Diabetes Analysis and Prediction
45 pages
Final
No ratings yet
Final
44 pages
METTL - Logical Building 1 - 2 and 3 Links
100% (1)
METTL - Logical Building 1 - 2 and 3 Links
2 pages
Peerj Cs 1914
No ratings yet
Peerj Cs 1914
30 pages
Assignment 03 AI START
No ratings yet
Assignment 03 AI START
23 pages
CLC Assignment 03 AI START
No ratings yet
CLC Assignment 03 AI START
23 pages
Prepare, Sterilize and Dispense Culture Media
No ratings yet
Prepare, Sterilize and Dispense Culture Media
24 pages
MLPPT 11 45
No ratings yet
MLPPT 11 45
31 pages
DIABETES DETECTION USING NEURAL NETWORKS (1) (Autosaved)
No ratings yet
DIABETES DETECTION USING NEURAL NETWORKS (1) (Autosaved)
30 pages
Estimating Diabetic Risk Accurately
No ratings yet
Estimating Diabetic Risk Accurately
26 pages
An Effective Pre-Processing Techniques For Diabetes Mellitus Prediction in Healthcare Systems
No ratings yet
An Effective Pre-Processing Techniques For Diabetes Mellitus Prediction in Healthcare Systems
15 pages
CIEA Term Project
No ratings yet
CIEA Term Project
19 pages
Bio-Inspired PSO For Improving Neural Based Diabetes Prediction System
No ratings yet
Bio-Inspired PSO For Improving Neural Based Diabetes Prediction System
21 pages
Final Seminar Report Soumya
No ratings yet
Final Seminar Report Soumya
20 pages
c20 Final Final
No ratings yet
c20 Final Final
21 pages
2023 Article 5467
No ratings yet
2023 Article 5467
20 pages
ppt715B.pptm (Autosaved)
No ratings yet
ppt715B.pptm (Autosaved)
15 pages
Sse 25 21 114-1
No ratings yet
Sse 25 21 114-1
14 pages
Risab
No ratings yet
Risab
13 pages
DSPYProject Report
No ratings yet
DSPYProject Report
14 pages
Mini Project
No ratings yet
Mini Project
15 pages
20BCE7620 AP2021228000397 Experiment-6 Removed
No ratings yet
20BCE7620 AP2021228000397 Experiment-6 Removed
19 pages
Sse 25 21 114-2
No ratings yet
Sse 25 21 114-2
13 pages
Innovative
No ratings yet
Innovative
15 pages
Internshippppp Fimnalllll
No ratings yet
Internshippppp Fimnalllll
16 pages
Machine Learning and Deep Learning Techniques
No ratings yet
Machine Learning and Deep Learning Techniques
13 pages
Diabe PDF
No ratings yet
Diabe PDF
11 pages
Ai Datascience Project Grade 10
No ratings yet
Ai Datascience Project Grade 10
14 pages
Binod ML Project-052
No ratings yet
Binod ML Project-052
14 pages
Sse 25 21 114-3
No ratings yet
Sse 25 21 114-3
13 pages
Diabetes Prediction Presentation
No ratings yet
Diabetes Prediction Presentation
12 pages
DIAPRO - Diabetes Prediction Application
No ratings yet
DIAPRO - Diabetes Prediction Application
18 pages
TDP Sem 3
No ratings yet
TDP Sem 3
9 pages
Data Entry
No ratings yet
Data Entry
2 pages
Sse 25 21 114-4
No ratings yet
Sse 25 21 114-4
14 pages
Classification
No ratings yet
Classification
9 pages
241410
No ratings yet
241410
10 pages
An Effective Approach For Detecting Diabetes Using Deep Learning Techniques Based On Convolutional LSTM Networks
No ratings yet
An Effective Approach For Detecting Diabetes Using Deep Learning Techniques Based On Convolutional LSTM Networks
7 pages
MLDA1
No ratings yet
MLDA1
8 pages
Generative AI Binary Classification
No ratings yet
Generative AI Binary Classification
7 pages
Diabetes Decoded: Transitioning From Traditional Models To Hybrid Deep Learning Approaches
No ratings yet
Diabetes Decoded: Transitioning From Traditional Models To Hybrid Deep Learning Approaches
5 pages
Lab Manual-ANN
No ratings yet
Lab Manual-ANN
7 pages
Seetu Papers 1
No ratings yet
Seetu Papers 1
6 pages
Performance Analysis of Deep Neural Network and Machine Learning Algorithms For Diabetes Prediction
No ratings yet
Performance Analysis of Deep Neural Network and Machine Learning Algorithms For Diabetes Prediction
6 pages
5G-Smart Diabetes Toward Personalized Diabetes Diagnosis With Healthcare Big Data Clouds
No ratings yet
5G-Smart Diabetes Toward Personalized Diabetes Diagnosis With Healthcare Big Data Clouds
8 pages
Nishant PandeyKaran PandaNilesh Pal TCET Nishantpandey2004@Gmail - Com Karanpanda1206@Gmail - Com Expenilesh31@
No ratings yet
Nishant PandeyKaran PandaNilesh Pal TCET Nishantpandey2004@Gmail - Com Karanpanda1206@Gmail - Com Expenilesh31@
6 pages
Predicting Diabetes Onset Using Machine Learning
No ratings yet
Predicting Diabetes Onset Using Machine Learning
4 pages
Automated Payroll Management System
No ratings yet
Automated Payroll Management System
4 pages
DSU DevHack
No ratings yet
DSU DevHack
3 pages
Chat-AI ML Project Proposal
No ratings yet
Chat-AI ML Project Proposal
4 pages
5G-Smart Diabetes Toward Personalized Diabetes Diagnosis With Healthcare Big Data Clouds
No ratings yet
5G-Smart Diabetes Toward Personalized Diabetes Diagnosis With Healthcare Big Data Clouds
8 pages
Prediction of Diabetes Using Deep Learning
No ratings yet
Prediction of Diabetes Using Deep Learning
2 pages
BTVN6 Code
No ratings yet
BTVN6 Code
2 pages
ME990-IH-Section 2a - LongBoltFlangeDesignProblems
No ratings yet
ME990-IH-Section 2a - LongBoltFlangeDesignProblems
15 pages
Ficha Técnica American Marsh
No ratings yet
Ficha Técnica American Marsh
8 pages
Color Code Ieee 1580 Table 22
No ratings yet
Color Code Ieee 1580 Table 22
1 page
Diploma in Legal Studies 27.04.22
No ratings yet
Diploma in Legal Studies 27.04.22
17 pages
Jade M Kit
No ratings yet
Jade M Kit
1 page
Alcad Vantex VTX5.5-EN-2206
No ratings yet
Alcad Vantex VTX5.5-EN-2206
2 pages
Intro S4HANA Using Global Bike Exercises FI en v4.1
No ratings yet
Intro S4HANA Using Global Bike Exercises FI en v4.1
10 pages
DCCN Lab
No ratings yet
DCCN Lab
37 pages
Rittal White Paper 401: The Benefits of Busbar Power Distribution Systems For North American & Global Applications
No ratings yet
Rittal White Paper 401: The Benefits of Busbar Power Distribution Systems For North American & Global Applications
9 pages
Direcpeciallfbi Po Prelims
No ratings yet
Direcpeciallfbi Po Prelims
20 pages
RCC11 Element Design
No ratings yet
RCC11 Element Design
6 pages
مهارات الحاسب
No ratings yet
مهارات الحاسب
257 pages
RCH-Series, Hollow Plunger Cylinders: Shown From Left To Right: RCH-306, RCH-120, RCH-1003
No ratings yet
RCH-Series, Hollow Plunger Cylinders: Shown From Left To Right: RCH-306, RCH-120, RCH-1003
2 pages
Ul Ion Inverter
No ratings yet
Ul Ion Inverter
2 pages
Arun Internship Report
No ratings yet
Arun Internship Report
16 pages
Trắc nghiệm CCNA - Chương 5 Dynamic routing
No ratings yet
Trắc nghiệm CCNA - Chương 5 Dynamic routing
13 pages
Proposal Brochure - Academia
No ratings yet
Proposal Brochure - Academia
10 pages
41 Assigment 4 Chapter 6-9
No ratings yet
41 Assigment 4 Chapter 6-9
1 page
Colgate OpenCore ComputerVision
No ratings yet
Colgate OpenCore ComputerVision
8 pages
Whisper Speaker Identification: Leveraging Pre-Trained Multilingual Transformers For Robust Speaker Embeddings
No ratings yet
Whisper Speaker Identification: Leveraging Pre-Trained Multilingual Transformers For Robust Speaker Embeddings
6 pages
HTTPWWW Jamris Org012010saveas Phpquestjamrisno012010p08-19
No ratings yet
HTTPWWW Jamris Org012010saveas Phpquestjamrisno012010p08-19
12 pages
AT04 - AT05 Series Datasheet V2.1
No ratings yet
AT04 - AT05 Series Datasheet V2.1
3 pages
Development of Hydroponic IoT-based Monitoring System and Automatic Nutrition Control Using KNN
No ratings yet
Development of Hydroponic IoT-based Monitoring System and Automatic Nutrition Control Using KNN
6 pages
A High-Efficiency Step-Up Current-Fed PushPull Quasi-Resonant Converter With Fewer Components For Fuel Cell Application
No ratings yet
A High-Efficiency Step-Up Current-Fed PushPull Quasi-Resonant Converter With Fewer Components For Fuel Cell Application
10 pages
Wind Energy Conversion
No ratings yet
Wind Energy Conversion
7 pages
TH460 Service Report 023832
No ratings yet
TH460 Service Report 023832
1 page
DEEP LEARNING TECHNIQUES: CLUSTER ANALYSIS and PATTERN RECOGNITION with NEURAL NETWORKS. Examples with MATLAB
From Everand
DEEP LEARNING TECHNIQUES: CLUSTER ANALYSIS and PATTERN RECOGNITION with NEURAL NETWORKS. Examples with MATLAB
César Pérez López
No ratings yet
DATA MINING and MACHINE LEARNING. PREDICTIVE TECHNIQUES: ENSEMBLE METHODS, BOOSTING, BAGGING, RANDOM FOREST, DECISION TREES and REGRESSION TREES.: Examples with MATLAB
From Everand
DATA MINING and MACHINE LEARNING. PREDICTIVE TECHNIQUES: ENSEMBLE METHODS, BOOSTING, BAGGING, RANDOM FOREST, DECISION TREES and REGRESSION TREES.: Examples with MATLAB
César Pérez López
No ratings yet
DATA MINING AND MACHINE LEARNING. PREDICTIVE TECHNIQUES: REGRESSION, GENERALIZED LINEAR MODELS, SUPPORT VECTOR MACHINE AND NEURAL NETWORKS
From Everand
DATA MINING AND MACHINE LEARNING. PREDICTIVE TECHNIQUES: REGRESSION, GENERALIZED LINEAR MODELS, SUPPORT VECTOR MACHINE AND NEURAL NETWORKS
César Pérez López
No ratings yet

Deep Learning Approach For Diabetes Prediction Using PIMA Indian Dataset

Uploaded by

Deep Learning Approach For Diabetes Prediction Using PIMA Indian Dataset

Uploaded by

Deep Learning Approach for Diabetes Prediction using PIMA Indian Dataset

# Load the PIMA Indian Diabetes dataset

# Split the dataset into features (X) and target (y)

# Train-test split (80% training, 20% testing)

# Build the deep neural network model

# Second hidden layer with 256 neurons, ReLU activation

# Third hidden layer with 256 neurons, ReLU activation

# Output layer (binary classification) with Sigmoid activation

# Create the model

# Train the model

# Evaluate the model on test data

# Calculate accuracy and classification report

You might also like