Report on Case Study

“Diabetes Prediction”

Aishwarya Iyappan- 4201

Siddhi Ambekar- 4202
Anupam Kumari- 4203
Anjali Bansode- 4207

DEPARTMENT OF COMPUTER ENGINEERING

BHARATI VIDYAPEETH COLLEGE OF ENGINEERING,
NAVI MUMBAI
2023-24
Experiment No: 10

Title: Life Cycle of Diabetes Prediction Using SVM (Support Vector Machine).

Tools:
1. NumPy: For numerical computations and array manipulations.
2. Pandas: For data manipulation and analysis, including reading and
loading datasets into DataFrames.
3. Matplotlib: For creating static, interactive, and animated visualizations in
Python.
4. Seaborn: For statistical data visualization based on Matplotlib, providing
a high-level interface for drawing attractive and informative statistical
graphics.
5. Scikit-Learn: For machine learning tasks, including the Support Vector
Machine classifier, which is imported from sklearn.svm.

PROBLEM STATEMENT:
The problem is to predict whether a person has diabetes, based on patient
information such as blood pressure, body mass index (BMI), and age. Using
machine learning techniques, specifically a Support Vector Machine, the aim is
to let users predict diabetes through a prediction engine. The project pursues
this aim by studying statistical models in machine learning and understanding
how the algorithms work. This case study walks through the stages of the data
science workflow.
LIFE CYCLE:

I. Data Collection

II. Data Exploration

III. Data Preparation

IV. Training and Evaluating the Machine Learning Model

V. Interpreting the ML Model

VI. Saving the Model

VII. Making Predictions with the Model

Methodology

1. Data Collection: The dataset used for this model is the Pima Indians
Diabetes dataset which consists of several medical predictor variables
and one target variable, Outcome. Predictor variables include the
number of pregnancies the patient has had, their Body Mass Index,
insulin level, glucose level, diabetes pedigree function, blood pressure,
skin thickness and age.
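
As a minimal sketch of this step, the dataset can be read into a Pandas
DataFrame and split into predictors and target. The column names below follow
the commonly distributed CSV version of the dataset and are an assumption, as
are the helper name load_diabetes_csv and the two made-up rows that stand in
for the real file.

```python
import io

import pandas as pd

# Column names as found in the commonly distributed CSV (an assumption;
# the report does not list them explicitly).
COLUMNS = [
    "Pregnancies", "Glucose", "BloodPressure", "SkinThickness",
    "Insulin", "BMI", "DiabetesPedigreeFunction", "Age", "Outcome",
]


def load_diabetes_csv(source):
    """Read the dataset from a path or file-like object and check its columns."""
    df = pd.read_csv(source)
    missing = [c for c in COLUMNS if c not in df.columns]
    if missing:
        raise ValueError(f"missing expected columns: {missing}")
    return df


# Two made-up rows stand in for the real file so the sketch runs on its own.
sample_csv = io.StringIO(
    ",".join(COLUMNS) + "\n"
    "2,120,70,25,80,28.5,0.45,33,0\n"
    "5,165,82,30,150,34.1,0.70,51,1\n"
)
df = load_diabetes_csv(sample_csv)

X = df.drop(columns="Outcome")  # the eight predictor variables
y = df["Outcome"]               # target: 1 = diabetic, 0 = non-diabetic
print(X.shape, y.tolist())
```

With the real data, the same function would be called with the path to the
downloaded CSV file instead of the in-memory sample.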

2. Data Cleaning: Clean the data by handling missing values and outliers and
ensuring consistency. This step is crucial for accurate predictions.
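
One possible cleaning routine is sketched below. It assumes that a zero in
columns such as Glucose or BMI encodes a missing measurement (a common reading
of this dataset, not something the report states) and fills such entries with
the column median. The function name clean and the toy values are
illustrative only.

```python
import numpy as np
import pandas as pd

# Columns where a zero is physiologically implausible and is treated as
# "missing" (an assumption about how the data was recorded).
ZERO_AS_MISSING = ["Glucose", "BloodPressure", "SkinThickness", "Insulin", "BMI"]


def clean(df):
    """Return a copy with implausible zeros replaced by the column median."""
    out = df.copy()
    cols = [c for c in ZERO_AS_MISSING if c in out.columns]
    out[cols] = out[cols].replace(0, np.nan)          # mark zeros as missing
    out[cols] = out[cols].fillna(out[cols].median())  # impute with the median
    return out.drop_duplicates().reset_index(drop=True)


# Toy values for illustration only.
toy = pd.DataFrame({
    "Glucose": [120, 0, 150, 100],
    "BMI": [28.0, 31.0, 0.0, 25.0],
    "Age": [33, 45, 51, 29],
})
cleaned = clean(toy)
print(cleaned)
```

In a full workflow the medians would normally be computed on the training
split only, so that no information from the test set leaks into the model.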

3. Exploratory Data Analysis (EDA):

i. Understanding the Dataset: Analyze the dataset’s structure, distributions,
and relationships between features. EDA helps you gain insights into the
data.
ii. Visualization: Create visualizations such as histograms, scatter plots, and
correlation matrices to explore feature relationships.
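
The exploration described above might start with summary statistics, class
balance, and a correlation matrix, as in the following sketch. The data here
is synthetic (generated with NumPy) and only mimics two predictors and the
Outcome column; it is not the Pima Indians data.

```python
import numpy as np
import pandas as pd

# Synthetic stand-in data: the outcome is loosely tied to glucose on purpose.
rng = np.random.default_rng(0)
n = 200
glucose = rng.normal(120, 30, n)
bmi = rng.normal(32, 6, n)
outcome = (glucose + rng.normal(0, 20, n) > 130).astype(int)
df = pd.DataFrame({"Glucose": glucose, "BMI": bmi, "Outcome": outcome})

summary = df.describe()                      # per-column distribution summary
class_counts = df["Outcome"].value_counts()  # class balance of the target
corr = df.corr()                             # pairwise Pearson correlations

print(summary.loc[["mean", "std"]])
print(class_counts)
print(corr["Outcome"].sort_values(ascending=False))
```

For the visual part, df.hist() draws one histogram per column, and passing
corr to seaborn.heatmap gives a correlation heatmap.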

4. Feature Engineering:

i. Feature Extraction: Extract meaningful features from the existing ones.
For example, you might create a new feature like “BMI category” based on
BMI values.
ii. Feature Scaling: Normalize or standardize numerical features.
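
Both sub-steps can be sketched in a few lines. The BMI cut-offs (18.5, 25, 30)
are the widely used WHO-style thresholds, chosen here as an assumption since
the report does not give the bins; the small DataFrame is illustrative.

```python
import pandas as pd
from sklearn.preprocessing import StandardScaler

# Illustrative values only.
df = pd.DataFrame({
    "BMI": [17.5, 22.0, 27.3, 35.8],
    "Glucose": [85, 110, 140, 190],
})

# Feature extraction: derive a categorical "BMI category" from the BMI value.
# Intervals are closed on the left: [0, 18.5), [18.5, 25), [25, 30), [30, inf).
bins = [0, 18.5, 25, 30, float("inf")]
labels = ["underweight", "normal", "overweight", "obese"]
df["BMI_category"] = pd.cut(df["BMI"], bins=bins, labels=labels, right=False)

# Feature scaling: standardize to zero mean and unit variance.
scaler = StandardScaler()
scaled = scaler.fit_transform(df[["BMI", "Glucose"]])

print(df)
print(scaled.round(3))
```

Scaling matters for an SVM in particular, because the kernel depends on
distances between samples and unscaled features with large ranges would
dominate them.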

5. Model Selection:
Choose appropriate machine learning models for binary classification
(diabetes vs. non-diabetes). Some common models include:
i. Logistic Regression: A simple yet effective linear model.
ii. Random Forest: An ensemble of decision trees.
iii. Support Vector Machine (SVM): Handles non-linear decision boundaries
through kernel functions.
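
A rough way to compare the three candidates is to cross-validate each one on
the same data, as sketched below. The data comes from scikit-learn's
make_classification, so the scores say nothing about the real dataset; the
hyperparameters are defaults or illustrative choices, not the report's.

```python
from sklearn.datasets import make_classification
from sklearn.ensemble import RandomForestClassifier
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import cross_val_score
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import StandardScaler
from sklearn.svm import SVC

# Synthetic binary-classification data with eight features (a stand-in).
X, y = make_classification(
    n_samples=300, n_features=8, n_informative=5, random_state=0
)

# Scale-sensitive models are wrapped in a pipeline with a scaler.
candidates = {
    "logistic_regression": make_pipeline(
        StandardScaler(), LogisticRegression(max_iter=1000)
    ),
    "random_forest": RandomForestClassifier(n_estimators=100, random_state=0),
    "svm_rbf": make_pipeline(StandardScaler(), SVC(kernel="rbf")),
}

# Mean 5-fold cross-validated accuracy per model.
scores = {
    name: cross_val_score(model, X, y, cv=5, scoring="accuracy").mean()
    for name, model in candidates.items()
}
for name, score in sorted(scores.items(), key=lambda kv: kv[1], reverse=True):
    print(f"{name}: {score:.3f}")
```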

6. Model Training and Evaluation:

i. Data Splitting: Divide your dataset into training and testing subsets.
ii. Model Training: Train each selected model on the training data.
iii. Model Evaluation: Assess model performance using metrics such as
accuracy, sensitivity, specificity, precision, F1 score, and the Receiver
Operating Characteristic (ROC) curve.
iv. Cross-Validation: Use k-fold cross-validation to estimate how well the
model generalizes to unseen data.
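
The four sub-steps above can be sketched for the SVM as follows. The data is
again synthetic, and the 80/20 split, the RBF kernel, and the 5 folds are
assumptions made for the example rather than settings taken from the report.

```python
from sklearn.datasets import make_classification
from sklearn.metrics import (
    accuracy_score,
    confusion_matrix,
    f1_score,
    precision_score,
    roc_auc_score,
)
from sklearn.model_selection import cross_val_score, train_test_split
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import StandardScaler
from sklearn.svm import SVC

# Synthetic stand-in for the eight predictors and the binary outcome.
X, y = make_classification(
    n_samples=400, n_features=8, n_informative=5, random_state=42
)

# i. Data splitting: hold out 20% for testing, keeping the class ratio.
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.2, stratify=y, random_state=42
)

# ii. Model training: scaling and SVM combined in one pipeline.
model = make_pipeline(StandardScaler(), SVC(kernel="rbf"))
model.fit(X_train, y_train)
y_pred = model.predict(X_test)

# iii. Model evaluation: sensitivity and specificity come from the
# confusion matrix; ROC AUC uses the SVM decision scores.
tn, fp, fn, tp = confusion_matrix(y_test, y_pred).ravel()
metrics = {
    "accuracy": accuracy_score(y_test, y_pred),
    "sensitivity": tp / (tp + fn),
    "specificity": tn / (tn + fp),
    "precision": precision_score(y_test, y_pred),
    "f1": f1_score(y_test, y_pred),
    "roc_auc": roc_auc_score(y_test, model.decision_function(X_test)),
}

# iv. Cross-validation on the training data only.
cv_scores = cross_val_score(model, X_train, y_train, cv=5)

for name, value in metrics.items():
    print(f"{name}: {value:.3f}")
print(f"5-fold CV accuracy: {cv_scores.mean():.3f} +/- {cv_scores.std():.3f}")
```

Keeping the scaler inside the pipeline means it is refitted on each training
fold, so the cross-validation estimate is not influenced by the held-out fold.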

7. Model Deployment:

i. Once you have a well-performing model, save it (e.g., using Python’s
pickle).
ii. Deploy the model in a production environment, such as an API, so that
users can interact with it.
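
Saving and reloading a fitted model with pickle could look like the sketch
below. The file name diabetes_svm.pkl is hypothetical, the model is fitted on
synthetic data, and a temporary directory is used so the example leaves no
files behind.

```python
import pickle
import tempfile
from pathlib import Path

from sklearn.datasets import make_classification
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import StandardScaler
from sklearn.svm import SVC

# Fit a small model on synthetic data so there is something to save.
X, y = make_classification(n_samples=200, n_features=8, random_state=0)
model = make_pipeline(StandardScaler(), SVC()).fit(X, y)

with tempfile.TemporaryDirectory() as tmp:
    path = Path(tmp) / "diabetes_svm.pkl"  # hypothetical file name

    # Save the whole pipeline (scaler and classifier together).
    with open(path, "wb") as f:
        pickle.dump(model, f)

    # Load it back, as a serving process would do at start-up.
    with open(path, "rb") as f:
        restored = pickle.load(f)

# The restored model should reproduce the original predictions.
same = bool((restored.predict(X) == model.predict(X)).all())
print("predictions identical after reload:", same)
```

Pickle files should only be loaded from trusted sources, and the scikit-learn
version used for loading should match the one used for saving.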

8. Prediction of Diabetes:

i. Utilize the trained machine learning models to predict the probability of
individuals having diabetes based on their input features (e.g., glucose level,
BMI, etc.).
ii. Implement a user-friendly interface where users can input their data and
receive predictions.
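
An SVM does not output probabilities directly, so one option (an assumption,
not necessarily what the report's implementation does) is to wrap it in
scikit-learn's CalibratedClassifierCV, which fits a sigmoid on the decision
scores. The helper predict_diabetes is a hypothetical name, and the model is
trained on synthetic data with eight features standing in for the real
predictors.

```python
import numpy as np
from sklearn.calibration import CalibratedClassifierCV
from sklearn.datasets import make_classification
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import StandardScaler
from sklearn.svm import SVC

# Synthetic training data; each row has eight features, like the dataset.
X, y = make_classification(
    n_samples=300, n_features=8, n_informative=5, random_state=0
)

# Calibrated SVM: scaling first, then sigmoid calibration over 5 folds.
model = make_pipeline(
    StandardScaler(),
    CalibratedClassifierCV(SVC(kernel="rbf"), method="sigmoid", cv=5),
).fit(X, y)


def predict_diabetes(model, features):
    """Return the predicted probability and label for one patient record."""
    row = np.asarray(features, dtype=float).reshape(1, -1)
    probability = float(model.predict_proba(row)[0, 1])  # P(class 1)
    label = int(model.predict(row)[0])
    return {"probability": probability, "label": label}


result = predict_diabetes(model, X[0])
print(result)
```

A user interface or API endpoint would collect the eight input values, pass
them to a function like this in the same column order used for training, and
display the returned probability.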
Flow Diagram
Result:
Conclusion

After analyzing the patient records, we developed a machine learning model
(specifically a support vector machine, which performed best) that can
effectively predict whether individuals in the dataset have diabetes. Alongside
this, we gained valuable insights from the data through analysis and
visualization, which aided the prediction of diabetes using machine learning
techniques. The project highlights diabetes prediction for patients and the
role of supervised learning techniques in deriving actionable insights from
data. It also underscores the iterative nature of data science projects, which
need continuous evaluation and refinement to meet evolving business objectives
and improve decision-making processes.
