0% found this document useful (0 votes)

11 views39 pages

Anurag008python

Uploaded by

Anurag Kandari

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

11 views39 pages

Anurag008python

Uploaded by

Anurag Kandari

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 39

PYTHON PROJECT

on
PEPPERFRY
A Study on Predictive Analysis of Diabetes in Pima
Indian Women Using Python

Post Graduate Diploma in Management

(PGDM-A)
Batch -2024-26

Submitted to: Submitted by:

Ms. Shilpi Yadav Name Roll No.
Assistant Professor Anurag Kandari 08

Jagannath International Management School

MOR, Pocket-105, Kalkaji, New Delhi-110019

(Approved by All India Council for Technical Education (AICTE) and

Accredited by NBA SAQS and NAAC)

1
CONTENTS
S No. Topic Page No
1 Introduction 3

2 Introduction – Python 4

3 Literature review 9

4 Methodology 10

5 Conclusion 36

6 Learning outcomes 37

7 Bibliography 39

2
About The Data Set

This dataset comes from the National Institute of Diabetes and Digestive and Kidney Diseases. Its
main purpose is to predict whether a patient has diabetes based on various medical test results.
The data includes only female patients who are at least 21 years old and of Pima Indian heritage.
The dataset contains several independent variables, which are medical measurements, and one
dependent variable called Outcome. The Outcome variable shows whether the patient has diabetes
(1 for Yes, 0 for No).

Here’s a brief description of the columns:

 Pregnancies: Number of times the patient has been pregnant.

 Glucose: Blood glucose levels.
 BloodPressure: Blood pressure readings.
 SkinThickness: Thickness of the skin in millimeters.
 Insulin: Insulin level in the blood.
 BMI: Body Mass Index (a measure of body fat based on weight and height).
 DiabetesPedigreeFunction: Likelihood of diabetes based on family history.
 Age: Age of the patient in years.
 Outcome: Final diagnosis (1 = Diabetic, 0 = Non-Diabetic).
This dataset is stored in a CSV file and is useful for understanding the factors that
contribute to diabetes and building predictive models.

3
1. PYTHON
Python is a widely used general-purpose, high level programming language. It
was created by Guido van Rossum in 1991 and further developed by the Python
Software Foundation. It was designed with an emphasis on code readability, and
its syntax allows programmers to express their concepts in fewer lines of code.

Python is a programming language that lets you work quickly and integrate
systems more efficiently.

There are two major Python versions: Python 2 and Python 3. Both are quite
different.

Reason for increasing popularity

1. Emphasis on code readability, shorter codes, ease of writing
2. Programmers can express logical concepts in fewer lines of code in
comparison to languages such as C++ or Java.
3. Python supports multiple programming paradigms, like object-oriented,
imperative and functional programming or procedural.
4. There exists inbuilt functions for almost all of the frequently used concepts.
5. Philosophy is “Simplicity is the best”.

Language Features
 Interpreted
 There are no separate compilation and execution steps like C and C++.
 Directly run the program from the source code.
 Internally, Python converts the source code into an intermediate form called
bytecodes which is then translated into native language of specific
computer to run it.
 No need to worry about linking and loading with libraries, etc.
 Platform Independent
 Python programs can be developed and executed on multiple operating
system platforms.
 Python can be used on Linux, Windows, Macintosh, Solaris and many more.

 Free and Open Source; Redistributable

 High-level Language
 In Python, no need to take care about low-level details such as managing
the memory used by the program.
 Simple
4
 Closer to English language ; Easy to Learn
 More emphasis on the solution to the problem rather than the syntax
 Embeddable
 Python can be used within C/C++ program to give scripting capabilities for
the program’s users.
 Robust:
 Exceptional handling features
 Memory management techniques in built
 Rich Library Support
 The Python Standard Library is very vast.
 Known as the “batteries included” philosophy of Python. It can help do
various things involving regular expressions, documentation generation,
unit testing, threading, databases, web browsers, CGI, email, XML, HTML,
WAV files, cryptography, GUI and many more.
 Besides the standard library, there are various other high-quality libraries
such as the Python Imaging Library which is an amazingly simple image
manipulation library.

2. ANACONDA
Anaconda is the installation program used by Fedora, Red Hat Enterprise Linux
and some other distributions.

During installation, a target computer’s hardware is identified and configured and

the appropriate file systems for the system’s architecture are created. Finally,
anaconda allows the user to install the operating system software on the target
computer. Anaconda can also upgrade existing installations of earlier versions of
the same distribution. After the installation is complete, you can reboot into your
installed system and continue doing customization using the initial setup program.
Anaconda is a fairly sophisticated installer. It supports installation from local and
remote sources such as CDs and DVDs, images stored on a hard drive, NFS, HTTP,
and FTP. Installation can be scripted with kickstart to provide a fully unattended
installation that can be duplicated on scores of machines. It can also
be run over VNC on headless machines. A variety of advanced storage
devices including LVM, RAID, iSCSI, and multipath are supported from the
partitioning program. Anaconda provides advanced debugging features such as
remote logging, access to the python interactive debugger, and remote saving of
exception dumps.

5
What is NumPy?
NumPy stands for Numerical Python. NumPy is a Python library used for working
with arrays. It also has functions for working in domain of linear algebra, fourier
transform, and matrices.

NumPy was created in 2005 by Travis Oliphant. It is an open source project and
you can use it freely.

Why Use NumPy?

In Python we have lists that serve the purpose of arrays, but they are slow to
process. NumPy aims to provide an array object that is up to 50x faster than
traditional Python lists.

The array object in NumPy is called nd array, it provides a lot of supporting

functions that make working with nd array very easy. Arrays are very frequently
used in data science, where speed and resources are very important.

Pandas is an open-source library in Python that is made mainly for working with
relational or labelled data both easily and intuitively. It provides various data
structures and operations for manipulating numerical data and time series. This
library is built on top of the NumPy library of Python. Pandas is fast and it has high
performance & productivity for users.

History of Pandas Library

Pandas were initially developed by Wes McKinney in 2008 while he was working at
AQR Capital Management. He convinced the AQR to allow him to open source the
Pandas. Another AQR employee, Chang She, joined as the second major
contributor to the library in 2012. Over time many versions of pandas have been
released. The latest version of the pandas is 1.5.3, released on Jan 18, 2023

Why Use Pandas?

 Fast and efficient for manipulating and analyzing data.
 Data from different file objects can be easily loaded.
 Flexible reshaping and pivoting of data sets
 Provides time-series functionality.

What can you do using Pandas?

6
Pandas are generally used for data science but have you wondered why? This is
because pandas are used in conjunction with other libraries that are used for data
science. It is built on the top of the NumPy library which means that a lot of
structures of NumPy are used or replicated in Pandas. The data produced by
Pandas are often used as input for plotting functions of Matplotlib, statistical
analysis in SciPy, and machine learning algorithms in Scikit-learn. Here is a list of
things that we can do using Pandas.

Matplotlib graphs your data on Figures (e.g., windows, Jupyter widgets, etc.), each
of which can contain one or more Axes, an area where points can be specified in
terms of x-y coordinates (or theta-r in a polar plot, x-y-z in a 3D plot, etc.). The
simplest way of creating a Figure with an Axes is using pyplot.subplots. We can
then use Axes.plot to draw some data on the Axes:

Note that to get this Figure to display, you may have to call plt.show(), depending
on your backend. For more details of Figures and backends, see creating, viewing
& saving Matplotlib Figures.

Seaborn is a Python data visualization library based on matplotlib. It provides a

high-level interface for drawing attractive and informative statistical graphics.

For a brief introduction to the ideas behind the library, you can read the
introductory notes or the paper. Visit the installation page to see how you can
download the package and get started with it. You can browse the example gallery
to see some of the things that you can do with seaborn, and then check out the
tutorials or API reference to find out how.

Scikit-Learn is a popular Python library used for machine learning. It provides tools to
build models that can learn from data and make predictions. With Scikit-Learn, you can do tasks
like:
 Classifying data (e.g., predicting if a person has diabetes or not).
 Clustering data (e.g., grouping similar items together).
 Regression (e.g., predicting a number like house prices).
It also helps with preprocessing data (cleaning or preparing it) and evaluating how well your
model performs.
7
Plotly

Plotly is a Python library for creating interactive and beautiful visualizations. It allows you to
make:
 Line charts, bar charts, and scatter plots.
 3D graphs and maps.
 Dashboards for presenting data interactively.
What makes Plotly special is that the graphs are interactive, so users can zoom in, hover over
points, or explore data more easily.

Cufflinks

Cufflinks is a library that works with Plotly and Pandas (a library for working with data). It helps
you quickly create interactive Plotly charts directly from your data in Pandas.
For example:
 If you have a DataFrame (a table of data), you can make a chart in just one line of code.
 It makes plotting faster and easier when you're working with a lot of data.
In short, Cufflinks acts as a bridge between Pandas and Plotly to simplify creating interactive
visualizations.

8
The analysis of medical data for predicting diseases like diabetes has gained significant
importance in recent years. The dataset used in this study originates from the National
Institute of Diabetes and Digestive and Kidney Diseases. It contains medical diagnostic
information about female patients of Pima Indian heritage, making it a specialized dataset
for diabetes prediction.

The dataset includes various predictor variables such as glucose levels, insulin, BMI, and
skin thickness, which have been identified as key factors in diagnosing diabetes. Studies
have shown that machine learning techniques can effectively analyze such data to predict
health outcomes.

Scikit-Learn is a widely used library for implementing machine learning algorithms. It

provides robust tools for tasks such as classification, regression, and model evaluation,
making it suitable for analyzing the Pima Indian Diabetes dataset. Its ability to preprocess
data and evaluate model accuracy has been highlighted in numerous studies.
For data visualization, libraries like Plotly and Cufflinks are often used. Plotly allows for
creating interactive and user-friendly visualizations, enabling a deeper understanding of
data trends and relationships. Cufflinks simplifies this process by integrating with Pandas,
allowing for quick plotting of interactive charts directly from data tables. These tools
enhance the ability to visualize and interpret medical data effectively.
This study builds on existing work by leveraging Scikit-Learn for predictive modeling and
Plotly with Cufflinks for visualization. Together, these tools enable a comprehensive
analysis of diabetes prediction using the Pima Indian dataset.

9
This project uses a structured approach to analyze the Pima Indian Diabetes dataset and predict diabetes
outcomes. The methodology is divided into the following steps:

1. Data Collection
The dataset is sourced from the National Institute of Diabetes and Digestive and Kidney Diseases.
It contains key medical measurements, including glucose levels, insulin, BMI, and skin thickness,
along with the outcome variable (diabetic or not).

2. Data Preprocessing
o The data is cleaned to handle any missing or invalid values.
o Features are standardized or normalized to ensure they are on a similar scale, which helps
in improving model accuracy.
o The dataset is split into training and testing sets to evaluate the model's performance.

3. Exploratory Data Analysis (EDA)

Using Plotly and Cufflinks, interactive visualizations are created to explore the data. This includes
analyzing relationships between variables like glucose levels and the outcome. These
visualizations help identify trends and patterns in the data.

4. Model Building
o Scikit-Learn is used to build a machine learning model for predicting diabetes.
o Algorithms such as Logistic Regression, Decision Trees, or Random Forests are applied to
the dataset.
o The model is trained using the training data.

5. Model Evaluation
The performance of the model is evaluated using metrics like accuracy, precision, recall,
and F1 score.
The testing dataset is used to validate the model and ensure it generalizes well to unseen
data.

6. Visualization of Results
o The final results, including model performance and important features, are visualized using
Plotly and Cufflinks.
o Interactive charts help communicate findings effectively.

7. Conclusion
Insights gained from the analysis are summarized, highlighting the significant factors contributing
to diabetes prediction and the effectiveness of the model.

10
#name Anurag kandari
#roll no 08
# PGDM-A
import pandas as pd
import numpy as np
from sklearn.model_selection import train_test_split
from sklearn.linear_model import LogisticRegression
from sklearn.metrics import accuracy_score, classification_report,
confusion_matrix
from sklearn.linear_model import LinearRegression
import matplotlib.pyplot as plt
import seaborn as sns
df = pd.read_csv('Diabitiese.csv')

df.head()

Pregnancies Glucose BloodPressure SkinThickness Insulin

BMI \
0 6 148 72 35 0 33.6

1 1 85 66 29 0 26.6

2 8 183 64 0 0 23.3

3 1 89 66 23 94 28.1

4 0 137 40 35 168 43.1

DiabetesPedigreeFunction Age Outcome

0 0.627 50 1
1 0.351 31 0
2 0.672 32 1
3 0.167 21 0
4 2.288 33 1

df.tail()

Pregnancies Glucose BloodPressure SkinThickness Insulin BMI

\
764 2 122 70 27 0 36.8

765 5 121 72 23 112 26.2

766 1 126 60 0 0 30.1

767 1 93 70 31 0 30.4

768 0 123 77 0 1 36.3

11
DiabetesPedigreeFunction Age Outcome
764 0.340 27 0
765 0.245 30 0
766 0.349 47 1
767 0.315 23 0
768 0.252 55 1
df.describe()

Pregnancies Glucose BloodPressure SkinThickness

Insulin \
count 769.000000 769.000000 769.000000 769.000000
769.000000
mean 3.840052 120.897269 69.115735 20.509753
79.697009
std 3.370237 31.951886 19.345296 15.959020
115.203999
min 0.000000 0.000000 0.000000 0.000000
0.000000
25% 1.000000 99.000000 62.000000 0.000000
0.000000
50% 3.000000 117.000000 72.000000 23.000000
29.000000
75% 6.000000 140.000000 80.000000 32.000000
127.000000
max 17.000000 199.000000 122.000000 99.000000
846.000000

BMI DiabetesPedigreeFunction Age Outcome

count 769.000000 769.000000 769.000000 769.000000
mean 31.998179 0.471590 33.269181 0.349805
std 7.880557 0.331208 11.778737 0.477219
min 0.000000 0.078000 21.000000 0.000000
25% 27.300000 0.244000 24.000000 0.000000
50% 32.000000 0.371000 29.000000 0.000000
75% 36.600000 0.626000 41.000000 1.000000
max 67.100000 2.420000 81.000000 1.000000

df.shape

(769, 9)

df.isnull().sum()

Pregnancies 0
Glucose 0
BloodPressure 0
SkinThickness 0
Insulin 0
BMI 0

12
DiabetesPedigreeFunction 0
Age 0
Outcome 0
dtype: int64

sns.relplot(x='Pregnancies',y='BloodPressure',data=df)
<seaborn.axisgrid.FacetGrid at 0x14628ebf3b0>

sns.relplot(x='DiabetesPedigreeFunction',y='BloodPressure',data=df)
<seaborn.axisgrid.FacetGrid at 0x14628f7e390>

13
df.fillna(0, inplace=True)

print(df)

Pregnancies Glucose BloodPressure SkinThickness Insulin BMI

\
0 6 148 72 35 0 33.6

1 1 85 66 29 0 26.6

2 8 183 64 0 0 23.3

3 1 89 66 23 94 28.1

4 0 137 40 35 168 43.1

.. ... ... ... ... ... ...

764 2 122 70 27 0 36.8

765 5 121 72 23 112 26.2

14
766 1 126 60 0 0 30.1

767 1 93 70 31 0 30.4

768 0 123 77 0 1 36.3

DiabetesPedigreeFunction Age Outcome

0 0.627 50 1
1 0.351 31 0
2 0.672 32 1
3 0.167 21 0
4 2.288 33 1
.. ... ... ...
764 0.340 27 0
765 0.245 30 0
766 0.349 47 1
767 0.315 23 0
768 0.252 55 1

[769 rows x 9 columns]

train = df.drop(['Pregnancies', 'Glucose', 'DiabetesPedigreeFunction',

'BloodPressure', 'SkinThickness', 'Insulin'], axis=1)
test = df['Insulin']

train

BMI Age Outcome

0 33.6 50 1
1 26.6 31 0
2 23.3 32 1
3 28.1 21 0
4 43.1 33 1
.. ... ... ...
764 36.8 27 0
765 26.2 30 0
766 30.1 47 1
767 30.4 23 0
768 36.3 55 1

[769 rows x 3 columns]

test

0 0
1 0
2 0
3 94
4 168
...

15
764 0
765 112
766 0
767 0
768 1
Name: Insulin, Length: 769, dtype: int64

X_train,X_test,y_train,y_test=train_test_split(train,test,test_size=0.
3,random_state=2)

X_train,X_test,y_train,y_test

( BMI Age Outcome

120 53.2 25 1
7 35.3 29 0
751 39.0 28 0
735 35.4 28 0
559 30.1 35 0
.. ... ... ...
534 33.3 24 0
584 28.7 52 1
493 28.9 45 1
527 26.3 24 0
168 31.9 29 0

[538 rows x 3 columns],

BMI Age Outcome
231 46.2 46 1
710 31.2 24 0
440 34.3 41 1
662 37.6 43 1
377 37.2 22 0
.. ... ... ...
495 26.6 66 0
236 35.9 51 1
130 29.7 33 1
225 34.6 22 0
388 32.0 58 1

[231 rows x 3 columns],

120 100
7 0
751 74
735 0
559 0
...
534 56
584 600
493 122
527 105

16
168 0
Name: Insulin, Length: 538, dtype: int64,
231 370
710 387
440 0
662 231
377 75
...
495 0
236 192
130 168
225 32
388 285
Name: Insulin, Length: 231, dtype: int64)

regression = LinearRegression()

regression

LinearRegression()

regression.fit(X_train,y_train)

LinearRegression()

predict = regression.predict(X_test)

predict

array([121.77350334, 78.10800424, 96.17867285, 102.9003638 ,

93.78428834, 81.74236627, 87.30403531, 134.18637683,
61.90647245, 65.36535226, 73.31302082, 61.73218918,
76.684759 , 47.47795137, 62.38814933, 73.20469325,
126.58907322, 71.51762522, 77.38548892, 115.40217735,
109.57009895, 2.35342173, 71.54060957, 82.4648816 ,
63.63651183, 91.02553897, 86.10044257, 1.74043292,
66.67907098, 95.77695157, 91.92353651, 43.4950228 ,
66.56954447, 71.95573128, 67.59885393, 106.18436088,
73.31302082, 101.73255638, 91.02553897, 116.12349372,
87.41476077, 59.0393955 , 93.48587918, 91.33951915,
68.97852835, 126.15156663, 86.21595618, 67.0742057 ,
114.06547427, 89.74199064, 87.63201538, 72.74480178,
66.7885975 , 82.31178426, 67.60065235, 55.82015519,
59.82606811, 64.88367538, 109.42478713, 47.26069676,
49.55775624, 65.94315523, 64.97141648, 97.74722992,
67.0742057 , 51.83482873, 89.87330256, 47.54450654,
76.44452004, 95.03983687, 97.24496657, 64.29247198,
96.33177019, 139.3976809 , 64.27068657, 82.61917788,
48.63797329, 107.43152443, 79.1015284 , 66.04489624,
39.466725 , 87.69797108, 81.26068939, 96.36815497,
68.40911036, 71.75846366, 89.51575157, 82.62576444,

17
64.62105153, 83.60371758, 64.51272396, 65.97834107,
117.10204634, 83.86634143, 99.48385586, 44.17396731,
62.73851429, 50.25848615, 91.93072254, 75.96224368,
74.03553615, 57.04553333, 102.28617605, 88.42887139,
101.07539728, 71.3663263 , 74.5837682 , 96.76927678,
79.18088452, 45.57362872, 97.76901533, 67.89224764,
70.37998818, 71.85458975, 100.03208792, 65.97834107,
61.75277564, 115.86086987, 84.26926166, 95.34603154,
91.04792385, 97.27333855, 76.44392056, 67.13956193,
58.14019902, 43.97669969, 113.58379739, 84.96100712,
72.3290806 , 61.0526452 , 99.06873415, 87.36939153,
83.52376198, 140.14258111, 62.1690963 , 90.98196815,
102.8266226 , 57.37411288, 90.98975366, 84.67659786,
48.55143113, 98.10478091, 92.9522465 , 92.05484843,
95.87187871, 99.08273405, 54.63774839, 99.89299048,
82.99132824, 80.41464765, 99.30178708, 75.97062866,
54.02475958, 104.30122416, 133.55220208, 60.19821847,
90.37736433, 67.5994534 , 78.9836169 , 102.746667 ,
63.26556041, 106.57769718, 38.34907495, 98.44795983,
65.6721464 , 89.64646402, 75.91867286, 97.64488944,
101.73914294, 71.4740544 , 127.22384744, 130.00498169,
89.85091768, 103.9290738 , 71.69310743, 65.34356685,
86.01210199, 108.74584262, 28.86983358, 81.26068939,
87.71975649, 55.64527244, 79.75688908, 82.03516051,
65.12451382, 75.59009331, 92.27390147, 67.40218578,
63.92032161, 78.78754822, 94.99686553, 64.05163354,
60.76883541, 108.81119885, 74.75984989, 90.114141 ,
96.63856433, 85.12129049, 97.29512395, 69.7004442 ,
113.60678174, 89.84373164, 72.48097898, 89.38384017,
76.90381204, 75.69961983, 100.7474172 , 75.02965978,
95.19293421, 99.54981156, 57.30815718, 87.6615863 ,
83.62550299, 79.53184896, 67.90564807, 117.48198221,
107.8254602 , 84.3921886 , 84.23849179, 89.29609906,
105.46184555, 94.12005393, 94.07768205, 49.07787777,
82.0933307 , 58.6878316 , 41.28390601, 93.90219984,
90.00401501, 87.52248887, 80.21857897])

regression.score(X_test,y_test)

0.054992667713272936

pip install plotly

Requirement already satisfied: plotly in c:\users\anurag kandari\

anaconda3\lib\site-packages (5.22.0)
Requirement already satisfied: tenacity>=6.2.0 in c:\users\anurag
kandari\anaconda3\lib\site-packages (from plotly) (8.2.2)
Requirement already satisfied: packaging in c:\users\anurag kandari\
anaconda3\lib\site-packages (from plotly) (23.2)
Note: you may need to restart the kernel to use updated packages.

18
pip install cufflinks

Requirement already satisfied: cufflinks in c:\users\anurag kandari\

anaconda3\lib\site-packages (0.17.3)Note: you may need to restart the
kernel to use updated packages.

Requirement already satisfied: numpy>=1.9.2 in c:\users\anurag

kandari\anaconda3\lib\site-packages (from cufflinks) (1.26.4)
Requirement already satisfied: pandas>=0.19.2 in c:\users\anurag
kandari\anaconda3\lib\site-packages (from cufflinks) (2.2.2)
Requirement already satisfied: plotly>=4.1.1 in c:\users\anurag
kandari\anaconda3\lib\site-packages (from cufflinks) (5.22.0)
Requirement already satisfied: six>=1.9.0 in c:\users\anurag kandari\
anaconda3\lib\site-packages (from cufflinks) (1.16.0)
Requirement already satisfied: colorlover>=0.2.1 in c:\users\anurag
kandari\anaconda3\lib\site-packages (from cufflinks) (0.3.0)
Requirement already satisfied: setuptools>=34.4.1 in c:\users\anurag
kandari\anaconda3\lib\site-packages (from cufflinks) (69.5.1)
Requirement already satisfied: ipython>=5.3.0 in c:\users\anurag
kandari\anaconda3\lib\site-packages (from cufflinks) (8.25.0)
Requirement already satisfied: ipywidgets>=7.0.0 in c:\users\anurag
kandari\anaconda3\lib\site-packages (from cufflinks) (7.8.1)
Requirement already satisfied: decorator in c:\users\anurag kandari\
anaconda3\lib\site-packages (from ipython>=5.3.0->cufflinks) (5.1.1)
Requirement already satisfied: jedi>=0.16 in c:\users\anurag kandari\
anaconda3\lib\site-packages (from ipython>=5.3.0->cufflinks) (0.18.1)
Requirement already satisfied: matplotlib-inline in c:\users\anurag
kandari\anaconda3\lib\site-packages (from ipython>=5.3.0->cufflinks)
(0.1.6)
Requirement already satisfied: prompt-toolkit<3.1.0,>=3.0.41 in c:\
users\anurag kandari\anaconda3\lib\site-packages (from ipython>=5.3.0-
>cufflinks) (3.0.43)
Requirement already satisfied: pygments>=2.4.0 in c:\users\anurag
kandari\anaconda3\lib\site-packages (from ipython>=5.3.0->cufflinks)
(2.15.1)
Requirement already satisfied: stack-data in c:\users\anurag kandari\
anaconda3\lib\site-packages (from ipython>=5.3.0->cufflinks) (0.2.0)
Requirement already satisfied: traitlets>=5.13.0 in c:\users\anurag
kandari\anaconda3\lib\site-packages (from ipython>=5.3.0->cufflinks)
(5.14.3)
Requirement already satisfied: colorama in c:\users\anurag kandari\
anaconda3\lib\site-packages (from ipython>=5.3.0->cufflinks) (0.4.6)
Requirement already satisfied: comm>=0.1.3 in c:\users\anurag kandari\
anaconda3\lib\site-packages (from ipywidgets>=7.0.0->cufflinks)
(0.2.1)
Requirement already satisfied: ipython-genutils~=0.2.0 in c:\users\
anurag kandari\anaconda3\lib\site-packages (from ipywidgets>=7.0.0-
>cufflinks) (0.2.0)
Requirement already satisfied: widgetsnbextension~=3.6.6 in c:\users\
anurag kandari\anaconda3\lib\site-packages (from ipywidgets>=7.0.0-

19
>cufflinks) (3.6.6)
Requirement already satisfied: jupyterlab-widgets<3,>=1.0.0 in c:\
users\anurag kandari\anaconda3\lib\site-packages (from
ipywidgets>=7.0.0->cufflinks) (1.0.0)
Requirement already satisfied: python-dateutil>=2.8.2 in c:\users\
anurag kandari\anaconda3\lib\site-packages (from pandas>=0.19.2-
>cufflinks) (2.9.0.post0)
Requirement already satisfied: pytz>=2020.1 in c:\users\anurag
kandari\anaconda3\lib\site-packages (from pandas>=0.19.2->cufflinks)
(2024.1)
Requirement already satisfied: tzdata>=2022.7 in c:\users\anurag
kandari\anaconda3\lib\site-packages (from pandas>=0.19.2->cufflinks)
(2023.3)
Requirement already satisfied: tenacity>=6.2.0 in c:\users\anurag
kandari\anaconda3\lib\site-packages (from plotly>=4.1.1->cufflinks)
(8.2.2)
Requirement already satisfied: packaging in c:\users\anurag kandari\
anaconda3\lib\site-packages (from plotly>=4.1.1->cufflinks) (23.2)
Requirement already satisfied: parso<0.9.0,>=0.8.0 in c:\users\anurag
kandari\anaconda3\lib\site-packages (from jedi>=0.16->ipython>=5.3.0-
>cufflinks) (0.8.3)
Requirement already satisfied: wcwidth in c:\users\anurag kandari\
anaconda3\lib\site-packages (from prompt-toolkit<3.1.0,>=3.0.41-
>ipython>=5.3.0->cufflinks) (0.2.5)
Requirement already satisfied: notebook>=4.4.1 in c:\users\anurag
kandari\anaconda3\lib\site-packages (from widgetsnbextension~=3.6.6-
>ipywidgets>=7.0.0->cufflinks) (7.0.8)
Requirement already satisfied: executing in c:\users\anurag kandari\
anaconda3\lib\site-packages (from stack-data->ipython>=5.3.0-
>cufflinks) (0.8.3)
Requirement already satisfied: asttokens in c:\users\anurag kandari\
anaconda3\lib\site-packages (from stack-data->ipython>=5.3.0-
>cufflinks) (2.0.5)
Requirement already satisfied: pure-eval in c:\users\anurag kandari\
anaconda3\lib\site-packages (from stack-data->ipython>=5.3.0-
>cufflinks) (0.2.2)
Requirement already satisfied: jupyter-server<3,>=2.4.0 in c:\users\
anurag kandari\anaconda3\lib\site-packages (from notebook>=4.4.1-
>widgetsnbextension~=3.6.6->ipywidgets>=7.0.0->cufflinks) (2.14.1)
Requirement already satisfied: jupyterlab-server<3,>=2.22.1 in c:\
users\anurag kandari\anaconda3\lib\site-packages (from
notebook>=4.4.1->widgetsnbextension~=3.6.6->ipywidgets>=7.0.0-
>cufflinks) (2.25.1)
Requirement already satisfied: jupyterlab<4.1,>=4.0.2 in c:\users\
anurag kandari\anaconda3\lib\site-packages (from notebook>=4.4.1-
>widgetsnbextension~=3.6.6->ipywidgets>=7.0.0->cufflinks) (4.0.11)
Requirement already satisfied: notebook-shim<0.3,>=0.2 in c:\users\
anurag kandari\anaconda3\lib\site-packages (from notebook>=4.4.1-
>widgetsnbextension~=3.6.6->ipywidgets>=7.0.0->cufflinks) (0.2.3)

20
Requirement already satisfied: tornado>=6.2.0 in c:\users\anurag
kandari\anaconda3\lib\site-packages (from notebook>=4.4.1-
>widgetsnbextension~=3.6.6->ipywidgets>=7.0.0->cufflinks) (6.4.1)
Requirement already satisfied: anyio>=3.1.0 in c:\users\anurag
kandari\anaconda3\lib\site-packages (from jupyter-server<3,>=2.4.0-
>notebook>=4.4.1->widgetsnbextension~=3.6.6->ipywidgets>=7.0.0-
>cufflinks) (4.2.0)
Requirement already satisfied: argon2-cffi>=21.1 in c:\users\anurag
kandari\anaconda3\lib\site-packages (from jupyter-server<3,>=2.4.0-
>notebook>=4.4.1->widgetsnbextension~=3.6.6->ipywidgets>=7.0.0-
>cufflinks) (21.3.0)
Requirement already satisfied: jinja2>=3.0.3 in c:\users\anurag
kandari\anaconda3\lib\site-packages (from jupyter-server<3,>=2.4.0-
>notebook>=4.4.1->widgetsnbextension~=3.6.6->ipywidgets>=7.0.0-
>cufflinks) (3.1.4)
Requirement already satisfied: jupyter-client>=7.4.4 in c:\users\
anurag kandari\anaconda3\lib\site-packages (from jupyter-
server<3,>=2.4.0->notebook>=4.4.1->widgetsnbextension~=3.6.6-
>ipywidgets>=7.0.0->cufflinks) (8.6.0)
Requirement already satisfied: jupyter-core!=5.0.*,>=4.12 in c:\users\
anurag kandari\anaconda3\lib\site-packages (from jupyter-
server<3,>=2.4.0->notebook>=4.4.1->widgetsnbextension~=3.6.6-
>ipywidgets>=7.0.0->cufflinks) (5.7.2)
Requirement already satisfied: jupyter-events>=0.9.0 in c:\users\
anurag kandari\anaconda3\lib\site-packages (from jupyter-
server<3,>=2.4.0->notebook>=4.4.1->widgetsnbextension~=3.6.6-
>ipywidgets>=7.0.0->cufflinks) (0.10.0)
Requirement already satisfied: jupyter-server-terminals>=0.4.4 in c:\
users\anurag kandari\anaconda3\lib\site-packages (from jupyter-
server<3,>=2.4.0->notebook>=4.4.1->widgetsnbextension~=3.6.6-
>ipywidgets>=7.0.0->cufflinks) (0.4.4)
Requirement already satisfied: nbconvert>=6.4.4 in c:\users\anurag
kandari\anaconda3\lib\site-packages (from jupyter-server<3,>=2.4.0-
>notebook>=4.4.1->widgetsnbextension~=3.6.6->ipywidgets>=7.0.0-
>cufflinks) (7.10.0)
Requirement already satisfied: nbformat>=5.3.0 in c:\users\anurag
kandari\anaconda3\lib\site-packages (from jupyter-server<3,>=2.4.0-
>notebook>=4.4.1->widgetsnbextension~=3.6.6->ipywidgets>=7.0.0-
>cufflinks) (5.9.2)
Requirement already satisfied: overrides>=5.0 in c:\users\anurag
kandari\anaconda3\lib\site-packages (from jupyter-server<3,>=2.4.0-
>notebook>=4.4.1->widgetsnbextension~=3.6.6->ipywidgets>=7.0.0-
>cufflinks) (7.4.0)
Requirement already satisfied: prometheus-client>=0.9 in c:\users\
anurag kandari\anaconda3\lib\site-packages (from jupyter-
server<3,>=2.4.0->notebook>=4.4.1->widgetsnbextension~=3.6.6-
>ipywidgets>=7.0.0->cufflinks) (0.14.1)
Requirement already satisfied: pywinpty>=2.0.1 in c:\users\anurag
kandari\anaconda3\lib\site-packages (from jupyter-server<3,>=2.4.0-

21
>notebook>=4.4.1->widgetsnbextension~=3.6.6->ipywidgets>=7.0.0-
>cufflinks) (2.0.10)
Requirement already satisfied: pyzmq>=24 in c:\users\anurag kandari\
anaconda3\lib\site-packages (from jupyter-server<3,>=2.4.0-
>notebook>=4.4.1->widgetsnbextension~=3.6.6->ipywidgets>=7.0.0-
>cufflinks) (25.1.2)
Requirement already satisfied: send2trash>=1.8.2 in c:\users\anurag
kandari\anaconda3\lib\site-packages (from jupyter-server<3,>=2.4.0-
>notebook>=4.4.1->widgetsnbextension~=3.6.6->ipywidgets>=7.0.0-
>cufflinks) (1.8.2)
Requirement already satisfied: terminado>=0.8.3 in c:\users\anurag
kandari\anaconda3\lib\site-packages (from jupyter-server<3,>=2.4.0-
>notebook>=4.4.1->widgetsnbextension~=3.6.6->ipywidgets>=7.0.0-
>cufflinks) (0.17.1)
Requirement already satisfied: websocket-client>=1.7 in c:\users\
anurag kandari\anaconda3\lib\site-packages (from jupyter-
server<3,>=2.4.0->notebook>=4.4.1->widgetsnbextension~=3.6.6-
>ipywidgets>=7.0.0->cufflinks) (1.8.0)
Requirement already satisfied: async-lru>=1.0.0 in c:\users\anurag
kandari\anaconda3\lib\site-packages (from jupyterlab<4.1,>=4.0.2-
>notebook>=4.4.1->widgetsnbextension~=3.6.6->ipywidgets>=7.0.0-
>cufflinks) (2.0.4)
Requirement already satisfied: ipykernel in c:\users\anurag kandari\
anaconda3\lib\site-packages (from jupyterlab<4.1,>=4.0.2-
>notebook>=4.4.1->widgetsnbextension~=3.6.6->ipywidgets>=7.0.0-
>cufflinks) (6.28.0)
Requirement already satisfied: jupyter-lsp>=2.0.0 in c:\users\anurag
kandari\anaconda3\lib\site-packages (from jupyterlab<4.1,>=4.0.2-
>notebook>=4.4.1->widgetsnbextension~=3.6.6->ipywidgets>=7.0.0-
>cufflinks) (2.2.0)
Requirement already satisfied: babel>=2.10 in c:\users\anurag kandari\
anaconda3\lib\site-packages (from jupyterlab-server<3,>=2.22.1-
>notebook>=4.4.1->widgetsnbextension~=3.6.6->ipywidgets>=7.0.0-
>cufflinks) (2.11.0)
Requirement already satisfied: json5>=0.9.0 in c:\users\anurag
kandari\anaconda3\lib\site-packages (from jupyterlab-
server<3,>=2.22.1->notebook>=4.4.1->widgetsnbextension~=3.6.6-
>ipywidgets>=7.0.0->cufflinks) (0.9.6)
Requirement already satisfied: jsonschema>=4.18.0 in c:\users\anurag
kandari\anaconda3\lib\site-packages (from jupyterlab-
server<3,>=2.22.1->notebook>=4.4.1->widgetsnbextension~=3.6.6-
>ipywidgets>=7.0.0->cufflinks) (4.19.2)
Requirement already satisfied: requests>=2.31 in c:\users\anurag
kandari\anaconda3\lib\site-packages (from jupyterlab-
server<3,>=2.22.1->notebook>=4.4.1->widgetsnbextension~=3.6.6-
>ipywidgets>=7.0.0->cufflinks) (2.32.2)
Requirement already satisfied: idna>=2.8 in c:\users\anurag kandari\
anaconda3\lib\site-packages (from anyio>=3.1.0->jupyter-
server<3,>=2.4.0->notebook>=4.4.1->widgetsnbextension~=3.6.6-

22
>ipywidgets>=7.0.0->cufflinks) (3.7)
Requirement already satisfied: sniffio>=1.1 in c:\users\anurag
kandari\anaconda3\lib\site-packages (from anyio>=3.1.0->jupyter-
server<3,>=2.4.0->notebook>=4.4.1->widgetsnbextension~=3.6.6-
>ipywidgets>=7.0.0->cufflinks) (1.3.0)
Requirement already satisfied: argon2-cffi-bindings in c:\users\anurag
kandari\anaconda3\lib\site-packages (from argon2-cffi>=21.1->jupyter-
server<3,>=2.4.0->notebook>=4.4.1->widgetsnbextension~=3.6.6-
>ipywidgets>=7.0.0->cufflinks) (21.2.0)
Requirement already satisfied: MarkupSafe>=2.0 in c:\users\anurag
kandari\anaconda3\lib\site-packages (from jinja2>=3.0.3->jupyter-
server<3,>=2.4.0->notebook>=4.4.1->widgetsnbextension~=3.6.6-
>ipywidgets>=7.0.0->cufflinks) (2.1.3)
Requirement already satisfied: attrs>=22.2.0 in c:\users\anurag
kandari\anaconda3\lib\site-packages (from jsonschema>=4.18.0-
>jupyterlab-server<3,>=2.22.1->notebook>=4.4.1-
>widgetsnbextension~=3.6.6->ipywidgets>=7.0.0->cufflinks) (23.1.0)
Requirement already satisfied: jsonschema-specifications>=2023.03.6 in
c:\users\anurag kandari\anaconda3\lib\site-packages (from
jsonschema>=4.18.0->jupyterlab-server<3,>=2.22.1->notebook>=4.4.1-
>widgetsnbextension~=3.6.6->ipywidgets>=7.0.0->cufflinks) (2023.7.1)
Requirement already satisfied: referencing>=0.28.4 in c:\users\anurag
kandari\anaconda3\lib\site-packages (from jsonschema>=4.18.0-
>jupyterlab-server<3,>=2.22.1->notebook>=4.4.1-
>widgetsnbextension~=3.6.6->ipywidgets>=7.0.0->cufflinks) (0.30.2)
Requirement already satisfied: rpds-py>=0.7.1 in c:\users\anurag
kandari\anaconda3\lib\site-packages (from jsonschema>=4.18.0-
>jupyterlab-server<3,>=2.22.1->notebook>=4.4.1-
>widgetsnbextension~=3.6.6->ipywidgets>=7.0.0->cufflinks) (0.10.6)
Requirement already satisfied: platformdirs>=2.5 in c:\users\anurag
kandari\anaconda3\lib\site-packages (from jupyter-core!=5.0.*,>=4.12-
>jupyter-server<3,>=2.4.0->notebook>=4.4.1->widgetsnbextension~=3.6.6-
>ipywidgets>=7.0.0->cufflinks) (3.10.0)
Requirement already satisfied: pywin32>=300 in c:\users\anurag
kandari\anaconda3\lib\site-packages (from jupyter-core!=5.0.*,>=4.12-
>jupyter-server<3,>=2.4.0->notebook>=4.4.1->widgetsnbextension~=3.6.6-
>ipywidgets>=7.0.0->cufflinks) (305.1)
Requirement already satisfied: python-json-logger>=2.0.4 in c:\users\
anurag kandari\anaconda3\lib\site-packages (from jupyter-
events>=0.9.0->jupyter-server<3,>=2.4.0->notebook>=4.4.1-
>widgetsnbextension~=3.6.6->ipywidgets>=7.0.0->cufflinks) (2.0.7)
Requirement already satisfied: pyyaml>=5.3 in c:\users\anurag kandari\
anaconda3\lib\site-packages (from jupyter-events>=0.9.0->jupyter-
server<3,>=2.4.0->notebook>=4.4.1->widgetsnbextension~=3.6.6-
>ipywidgets>=7.0.0->cufflinks) (6.0.1)
Requirement already satisfied: rfc3339-validator in c:\users\anurag
kandari\anaconda3\lib\site-packages (from jupyter-events>=0.9.0-
>jupyter-server<3,>=2.4.0->notebook>=4.4.1->widgetsnbextension~=3.6.6-
>ipywidgets>=7.0.0->cufflinks) (0.1.4)

23
Requirement already satisfied: rfc3986-validator>=0.1.1 in c:\users\
anurag kandari\anaconda3\lib\site-packages (from jupyter-
events>=0.9.0->jupyter-server<3,>=2.4.0->notebook>=4.4.1-
>widgetsnbextension~=3.6.6->ipywidgets>=7.0.0->cufflinks) (0.1.1)
Requirement already satisfied: beautifulsoup4 in c:\users\anurag
kandari\anaconda3\lib\site-packages (from nbconvert>=6.4.4->jupyter-
server<3,>=2.4.0->notebook>=4.4.1->widgetsnbextension~=3.6.6-
>ipywidgets>=7.0.0->cufflinks) (4.12.3)
Requirement already satisfied: bleach!=5.0.0 in c:\users\anurag
kandari\anaconda3\lib\site-packages (from nbconvert>=6.4.4->jupyter-
server<3,>=2.4.0->notebook>=4.4.1->widgetsnbextension~=3.6.6-
>ipywidgets>=7.0.0->cufflinks) (4.1.0)
Requirement already satisfied: defusedxml in c:\users\anurag kandari\
anaconda3\lib\site-packages (from nbconvert>=6.4.4->jupyter-
server<3,>=2.4.0->notebook>=4.4.1->widgetsnbextension~=3.6.6-
>ipywidgets>=7.0.0->cufflinks) (0.7.1)
Requirement already satisfied: jupyterlab-pygments in c:\users\anurag
kandari\anaconda3\lib\site-packages (from nbconvert>=6.4.4->jupyter-
server<3,>=2.4.0->notebook>=4.4.1->widgetsnbextension~=3.6.6-
>ipywidgets>=7.0.0->cufflinks) (0.1.2)
Requirement already satisfied: mistune<4,>=2.0.3 in c:\users\anurag
kandari\anaconda3\lib\site-packages (from nbconvert>=6.4.4->jupyter-
server<3,>=2.4.0->notebook>=4.4.1->widgetsnbextension~=3.6.6-
>ipywidgets>=7.0.0->cufflinks) (2.0.4)
Requirement already satisfied: nbclient>=0.5.0 in c:\users\anurag
kandari\anaconda3\lib\site-packages (from nbconvert>=6.4.4->jupyter-
server<3,>=2.4.0->notebook>=4.4.1->widgetsnbextension~=3.6.6-
>ipywidgets>=7.0.0->cufflinks) (0.8.0)
Requirement already satisfied: pandocfilters>=1.4.1 in c:\users\anurag
kandari\anaconda3\lib\site-packages (from nbconvert>=6.4.4->jupyter-
server<3,>=2.4.0->notebook>=4.4.1->widgetsnbextension~=3.6.6-
>ipywidgets>=7.0.0->cufflinks) (1.5.0)
Requirement already satisfied: tinycss2 in c:\users\anurag kandari\
anaconda3\lib\site-packages (from nbconvert>=6.4.4->jupyter-
server<3,>=2.4.0->notebook>=4.4.1->widgetsnbextension~=3.6.6-
>ipywidgets>=7.0.0->cufflinks) (1.2.1)
Requirement already satisfied: fastjsonschema in c:\users\anurag
kandari\anaconda3\lib\site-packages (from nbformat>=5.3.0->jupyter-
server<3,>=2.4.0->notebook>=4.4.1->widgetsnbextension~=3.6.6-
>ipywidgets>=7.0.0->cufflinks) (2.16.2)
Requirement already satisfied: charset-normalizer<4,>=2 in c:\users\
anurag kandari\anaconda3\lib\site-packages (from requests>=2.31-
>jupyterlab-server<3,>=2.22.1->notebook>=4.4.1-
>widgetsnbextension~=3.6.6->ipywidgets>=7.0.0->cufflinks) (2.0.4)
Requirement already satisfied: urllib3<3,>=1.21.1 in c:\users\anurag
kandari\anaconda3\lib\site-packages (from requests>=2.31->jupyterlab-
server<3,>=2.22.1->notebook>=4.4.1->widgetsnbextension~=3.6.6-
>ipywidgets>=7.0.0->cufflinks) (2.2.2)
Requirement already satisfied: certifi>=2017.4.17 in c:\users\anurag

24
kandari\anaconda3\lib\site-packages (from requests>=2.31->jupyterlab-
server<3,>=2.22.1->notebook>=4.4.1->widgetsnbextension~=3.6.6-
>ipywidgets>=7.0.0->cufflinks) (2024.7.4)
Requirement already satisfied: debugpy>=1.6.5 in c:\users\anurag
kandari\anaconda3\lib\site-packages (from ipykernel-
>jupyterlab<4.1,>=4.0.2->notebook>=4.4.1->widgetsnbextension~=3.6.6-
>ipywidgets>=7.0.0->cufflinks) (1.6.7)
Requirement already satisfied: nest-asyncio in c:\users\anurag
kandari\anaconda3\lib\site-packages (from ipykernel-
>jupyterlab<4.1,>=4.0.2->notebook>=4.4.1->widgetsnbextension~=3.6.6-
>ipywidgets>=7.0.0->cufflinks) (1.6.0)
Requirement already satisfied: psutil in c:\users\anurag kandari\
anaconda3\lib\site-packages (from ipykernel->jupyterlab<4.1,>=4.0.2-
>notebook>=4.4.1->widgetsnbextension~=3.6.6->ipywidgets>=7.0.0-
>cufflinks) (5.9.0)
Requirement already satisfied: webencodings in c:\users\anurag
kandari\anaconda3\lib\site-packages (from bleach!=5.0.0-
>nbconvert>=6.4.4->jupyter-server<3,>=2.4.0->notebook>=4.4.1-
>widgetsnbextension~=3.6.6->ipywidgets>=7.0.0->cufflinks) (0.5.1)
Requirement already satisfied: fqdn in c:\users\anurag kandari\
anaconda3\lib\site-packages (from jsonschema[format-nongpl]>=4.18.0-
>jupyter-events>=0.9.0->jupyter-server<3,>=2.4.0->notebook>=4.4.1-
>widgetsnbextension~=3.6.6->ipywidgets>=7.0.0->cufflinks) (1.5.1)
Requirement already satisfied: isoduration in c:\users\anurag kandari\
anaconda3\lib\site-packages (from jsonschema[format-nongpl]>=4.18.0-
>jupyter-events>=0.9.0->jupyter-server<3,>=2.4.0->notebook>=4.4.1-
>widgetsnbextension~=3.6.6->ipywidgets>=7.0.0->cufflinks) (20.11.0)
Requirement already satisfied: jsonpointer>1.13 in c:\users\anurag
kandari\anaconda3\lib\site-packages (from jsonschema[format-
nongpl]>=4.18.0->jupyter-events>=0.9.0->jupyter-server<3,>=2.4.0-
>notebook>=4.4.1->widgetsnbextension~=3.6.6->ipywidgets>=7.0.0-
>cufflinks) (2.1)
Requirement already satisfied: uri-template in c:\users\anurag
kandari\anaconda3\lib\site-packages (from jsonschema[format-
nongpl]>=4.18.0->jupyter-events>=0.9.0->jupyter-server<3,>=2.4.0-
>notebook>=4.4.1->widgetsnbextension~=3.6.6->ipywidgets>=7.0.0-
>cufflinks) (1.3.0)
Requirement already satisfied: webcolors>=1.11 in c:\users\anurag
kandari\anaconda3\lib\site-packages (from jsonschema[format-
nongpl]>=4.18.0->jupyter-events>=0.9.0->jupyter-server<3,>=2.4.0-
>notebook>=4.4.1->widgetsnbextension~=3.6.6->ipywidgets>=7.0.0-
>cufflinks) (24.11.1)
Requirement already satisfied: cffi>=1.0.1 in c:\users\anurag kandari\
anaconda3\lib\site-packages (from argon2-cffi-bindings->argon2-
cffi>=21.1->jupyter-server<3,>=2.4.0->notebook>=4.4.1-
>widgetsnbextension~=3.6.6->ipywidgets>=7.0.0->cufflinks) (1.16.0)
Requirement already satisfied: soupsieve>1.2 in c:\users\anurag
kandari\anaconda3\lib\site-packages (from beautifulsoup4-
>nbconvert>=6.4.4->jupyter-server<3,>=2.4.0->notebook>=4.4.1-

25
>widgetsnbextension~=3.6.6->ipywidgets>=7.0.0->cufflinks) (2.5)
Requirement already satisfied: pycparser in c:\users\anurag kandari\
anaconda3\lib\site-packages (from cffi>=1.0.1->argon2-cffi-bindings-
>argon2-cffi>=21.1->jupyter-server<3,>=2.4.0->notebook>=4.4.1-
>widgetsnbextension~=3.6.6->ipywidgets>=7.0.0->cufflinks) (2.21)
Requirement already satisfied: arrow>=0.15.0 in c:\users\anurag
kandari\anaconda3\lib\site-packages (from isoduration-
>jsonschema[format-nongpl]>=4.18.0->jupyter-events>=0.9.0->jupyter-
server<3,>=2.4.0->notebook>=4.4.1->widgetsnbextension~=3.6.6-
>ipywidgets>=7.0.0->cufflinks) (1.2.3)

import cufflinks as cf

from plotly.offline import

download_plotlyjs,init_notebook_mode,plot,iplot

init_notebook_mode(connected=True)

cf.go_offline()

pip install chart_studio

Requirement already satisfied: chart_studio in c:\users\anurag

kandari\anaconda3\lib\site-packages (1.1.0)
Requirement already satisfied: plotly in c:\users\anurag kandari\
anaconda3\lib\site-packages (from chart_studio) (5.22.0)
Requirement already satisfied: requests in c:\users\anurag kandari\
anaconda3\lib\site-packages (from chart_studio) (2.32.2)
Requirement already satisfied: retrying>=1.3.3 in c:\users\anurag
kandari\anaconda3\lib\site-packages (from chart_studio) (1.3.4)
Requirement already satisfied: six in c:\users\anurag kandari\
anaconda3\lib\site-packages (from chart_studio) (1.16.0)
Requirement already satisfied: tenacity>=6.2.0 in c:\users\anurag
kandari\anaconda3\lib\site-packages (from plotly->chart_studio)
(8.2.2)
Requirement already satisfied: packaging in c:\users\anurag kandari\
anaconda3\lib\site-packages (from plotly->chart_studio) (23.2)
Requirement already satisfied: charset-normalizer<4,>=2 in c:\users\
anurag kandari\anaconda3\lib\site-packages (from requests-
>chart_studio) (2.0.4)
Requirement already satisfied: idna<4,>=2.5 in c:\users\anurag
kandari\anaconda3\lib\site-packages (from requests->chart_studio)
(3.7)
Requirement already satisfied: urllib3<3,>=1.21.1 in c:\users\anurag
kandari\anaconda3\lib\site-packages (from requests->chart_studio)
(2.2.2)
Requirement already satisfied: certifi>=2017.4.17 in c:\users\anurag
kandari\anaconda3\lib\site-packages (from requests->chart_studio)
(2024.7.4)
Note: you may need to restart the kernel to use updated packages.

26
import chart_studio.plotly as py
import plotly.graph_objs as go

df.plot()

<Axes: >

df.iplot()

df.plot(kind='bar', x='BloodPressure', y='DiabetesPedigreeFunction')

<Axes: xlabel='BloodPressure'>

27
df.iplot(kind='bar')

df.count().iplot(kind='bar')

28
df.sum().iplot(kind='bar')

df.iplot(kind='box')

df['Pregnancies'].iplot(kind='hist',bins=25)

29
df.iplot(kind='hist')

df.iplot(kind='scatter',x='Pregnancies',y='BloodPressure',mode='marker
s',size=20)

df.iplot(kind='bubble',x='Pregnancies',y='BloodPressure',size='Outcome
')

30
df.scatter_matrix()

from sklearn.linear_model import LogisticRegression

from sklearn.metrics import accuracy_score, f1_score, precision_score,
recall_score, classification_report, confusion_matrix

x = df.iloc[:, :-1]
y = df.iloc[:, 6]

Pregnancies Glucose BloodPressure SkinThickness Insulin BMI

\
0 6 148 72 35 0 33.6

1 1 85 66 29 0 26.6

2 8 183 64 0 0 23.3

3 1 89 66 23 94 28.1

4 0 137 40 35 168 43.1

.. ... ... ... ... ... ...

31
764 2 122 70 27 0 36.8

765 5 121 72 23 112 26.2

766 1 126 60 0 0 30.1

767 1 93 70 31 0 30.4

768 0 123 77 0 1 36.3

DiabetesPedigreeFunction Age
0 0.627 50
1 0.351 31
2 0.672 32
3 0.167 21
4 2.288 33
.. ... ...
764 0.340 27
765 0.245 30
766 0.349 47
767 0.315 23
768 0.252 55

[769 rows x 8 columns]

0 0.627
1 0.351
2 0.672
3 0.167
4 2.288
...
764 0.340
765 0.245
766 0.349
767 0.315
768 0.252
Name: DiabetesPedigreeFunction, Length: 769, dtype: float64

x.head()

Pregnancies Glucose BloodPressure SkinThickness Insulin

BMI \
0 6 148 72 35 0 33.6

1 1 85 66 29 0 26.6

2 8 183 64 0 0 23.3

32
3 1 89 66 23 94 28.1

4 0 137 40 35 168 43.1

DiabetesPedigreeFunction Age
0 0.627 50
1 0.351 31
2 0.672 32
3 0.167 21
4 2.288 33

x = pd.get_dummies(x, prefix_sep='_', drop_first=True)

Pregnancies Glucose BloodPressure SkinThickness Insulin BMI

\
0 6 148 72 35 0 33.6

1 1 85 66 29 0 26.6

2 8 183 64 0 0 23.3

3 1 89 66 23 94 28.1

4 0 137 40 35 168 43.1

.. ... ... ... ... ... ...

764 2 122 70 27 0 36.8

765 5 121 72 23 112 26.2

766 1 126 60 0 0 30.1

767 1 93 70 31 0 30.4

768 0 123 77 0 1 36.3

DiabetesPedigreeFunction Age
0 0.627 50
1 0.351 31
2 0.672 32
3 0.167 21
4 2.288 33
.. ... ...
764 0.340 27
765 0.245 30
766 0.349 47

33
767 0.315 23
768 0.252 55

[769 rows x 8 columns]

y.describe()

count 769.000000
mean 0.471590
std 0.331208
min 0.078000
25% 0.244000
50% 0.371000
75% 0.626000
max 2.420000
Name: DiabetesPedigreeFunction, dtype: float64

x_train, x_test, y_train, y_test = train_test_split(x, y, test_size =

0.25, random_state = 0)

from sklearn.linear_model import LogisticRegression

from sklearn.metrics import accuracy_score, f1_score, precision_score,
recall_score, classification_report, confusion_matrix

data = {

"Pregnancies": [2.3, 1.8, 3.1, 2.5, 1.9, 3.6, 2.2, 3.4],

"BloodPressure ": [1.5, 2.3, 1.9, 2.8, 2.1, 3.2, 1.8, 3.0],

"Insulin": [1, 0, 1, 1, 0, 1, 0, 1],

}

df = pd.DataFrame(data)

X = df[["Pregnancies", "Insulin"]]
y = df["Insulin"]

X_train, X_test, y_train, y_test = train_test_split(X, y,

test_size=0.3, random_state=42)

model = LogisticRegression()
model.fit(X_train, y_train)

LogisticRegression()

y_pred = model.predict(X_test)

print("Accuracy:", accuracy_score(y_test, y_pred))

print("\nConfusion Matrix:\n", confusion_matrix(y_test, y_pred))

34
print("\nClassification Report:\n", classification_report(y_test,
y_pred))

Accuracy: 1.0

Confusion Matrix:
[[1 0]
[0 2]]

Classification Report:
precision recall f1-score support

0 1.00 1.00 1.00 1

1 1.00 1.00 1.00 2

accuracy 1.00 3
macro avg 1.00 1.00 1.00 3
weighted avg 1.00 1.00 1.00 3

print("\nIntercept:", model.intercept_)
print("Coefficients:", model.coef_)

Intercept: [-1.83267869]
Coefficients: [[0.69083358 0.79529279]]

35
In this project, we analyzed the Pima Indian Diabetes dataset to predict diabetes in
female patients based on medical diagnostic measurements. The use of Python and its
powerful libraries like Scikit-Learn, Plotly, and Cufflinks enabled a systematic and
efficient approach to data analysis and modeling.

Through data preprocessing and exploratory analysis, we identified key factors such as
glucose levels, BMI, and insulin as significant predictors of diabetes. By building and
evaluating machine learning models, we achieved a reliable prediction system for
diagnosing diabetes.

The visualizations created using Plotly and Cufflinks provided valuable insights into the
relationships between variables, making the data easier to interpret and understand.
This project highlights the potential of machine learning and data visualization in
healthcare, emphasizing how data-driven approaches can aid in early diagnosis and
decision-making. Future work can explore more advanced algorithms or larger
datasets to further improve prediction accuracy and applicability in real-world
scenarios.

36
By completing this project, the following key learning outcomes were
achieved:

1. Understanding of the Dataset

o Gained knowledge of the Pima Indian Diabetes dataset and
its significance in predicting diabetes using medical
diagnostic measurements.
o Learned about key variables such as glucose levels, BMI,
insulin, and their role in determining health outcomes.

2. Data Preprocessing Skills

o Developed the ability to clean, preprocess, and prepare
raw data for analysis, including handling missing values
and normalizing features.
o Understood the importance of splitting data into training
and testing sets for model evaluation.

3. Exploratory Data Analysis (EDA)

o Acquired skills to explore data trends, relationships, and
distributions using interactive visualizations created
with Plotly and Cufflinks.
o Gained insights into identifying key patterns that
influence diabetes outcomes.

4. Machine Learning Model Development

o Learned to apply machine learning algorithms using Scikit-
Learn to build predictive models.
o Understood the process of training models, selecting
algorithms, and tuning hyperparameters for better
performance.

5. Model Evaluation Techniques

o Gained experience in evaluating model performance using
metrics such as accuracy, precision, recall, and F1 score.
o Learned the importance of validation to ensure models
generalize well to new data.

6. Visualization and Communication

o Developed the ability to present findings effectively

37
using interactive charts and graphs.
o Enhanced skills in communicating technical results in a
clear and visually appealing way.

7. Practical Application of Tools

o Gained hands-on experience with Python libraries like
Scikit-Learn, Plotly, and Cufflinks for data analysis and
visualization.
o Understood how to integrate different tools to solve real-
world problems in healthcare.

8. Insights into Healthcare Analytics

o Recognized the potential of machine learning in improving
healthcare outcomes through data-driven decision-making.
o Learned how predictive models can assist in early
diagnosis and treatment planning.

38
BIBLOGRAPHY
DATASET

https://www.kaggle.com/datasets/whenamancodes/predict-diabities

National Institute of Diabetes and Digestive and Kidney Diseases

Source of the Pima Indian Diabetes dataset.

Pedregosa, F., et al. (2011). "Scikit-learn: Machine Learning in Python."

Journal of Machine Learning Research, 12, 2825–2830.

Plotly Technologies Inc. (2015). Collaborative Data Science. Retrieved from

https://plotly.com.

Python Software Foundation. (2023). Pandas Documentation. Retrieved from

https://pandas.pydata.org.

Sundararajan, A. (2020). Interactive Visualizations with Plotly and Cufflinks in

Python. Medium Article.

Jason Brownlee. (2016). How to Evaluate Machine Learning Algorithms. Retrieved

from https://machinelearningmastery.com.

Zhang, Y., et al. (2019). "Machine Learning Applications in Healthcare."

Health Informatics Journal, 25(3), 567–582.

Kaggle. (2023). Pima Indian Diabetes Dataset Overview. Retrieved from

https://www.kaggle.com.

McKinney, W. (2010). "Data Structures for Statistical Computing in Python."

Proceedings of the 9th Python in Science Conference, 51–56.

Hastie, T., Tibshirani, R., & Friedman, J. (2009). The Elements of Statistical
Learning: Data Mining, Inference, and Prediction. Springer.

Learning The Pandas Library Python Tools For Data Munging Analysis and Visual PDF
100% (18)
Learning The Pandas Library Python Tools For Data Munging Analysis and Visual PDF
208 pages
Learning Pandas Library
100% (1)
Learning Pandas Library
271 pages
sample project
No ratings yet
sample project
12 pages
Python Libraries Seminar Report
100% (2)
Python Libraries Seminar Report
16 pages
Python-2
No ratings yet
Python-2
18 pages
Machine Learning with Python: A Comprehensive Guide with a Practical Example
From Everand
Machine Learning with Python: A Comprehensive Guide with a Practical Example
MARTIN NEEL
No ratings yet
Software Environment
No ratings yet
Software Environment
11 pages
Data Science Lab Manual
No ratings yet
Data Science Lab Manual
18 pages
Data Science Lecture No 5
No ratings yet
Data Science Lecture No 5
16 pages
Python Programming: Learn, Code, Create
From Everand
Python Programming: Learn, Code, Create
Sachin Naha
No ratings yet
Python Basic
No ratings yet
Python Basic
145 pages
Python For Data Analytics Scientific and Technical Applications
No ratings yet
Python For Data Analytics Scientific and Technical Applications
6 pages
Introduction
No ratings yet
Introduction
45 pages
Python
No ratings yet
Python
23 pages
CS3361 - Data Science Laboratory
No ratings yet
CS3361 - Data Science Laboratory
31 pages
Data science lab
No ratings yet
Data science lab
61 pages
Python Module 1
No ratings yet
Python Module 1
9 pages
Data Manipulation with Python Step by Step: A Practical Guide with Examples
From Everand
Data Manipulation with Python Step by Step: A Practical Guide with Examples
William E. Clark
No ratings yet
Python
No ratings yet
Python
323 pages
Module03-Introduction To Python
No ratings yet
Module03-Introduction To Python
40 pages
1 Introduction Python Programming For Data Science
No ratings yet
1 Introduction Python Programming For Data Science
11 pages
Python Week+1 New
No ratings yet
Python Week+1 New
44 pages
Introduction to Python
No ratings yet
Introduction to Python
71 pages
Fds PDF
No ratings yet
Fds PDF
4 pages
Python for Engineers: Solving Real-World Technical Challenges
From Everand
Python for Engineers: Solving Real-World Technical Challenges
Robert Johnson
No ratings yet
Bhabesh - Chapter 5
No ratings yet
Bhabesh - Chapter 5
19 pages
Your First Python Program
From Everand
Your First Python Program
Alexander Paz
No ratings yet
Amitesh Sharma ML
No ratings yet
Amitesh Sharma ML
28 pages
Python Programming For Economics Finance
No ratings yet
Python Programming For Economics Finance
267 pages
AIES Assignment1
No ratings yet
AIES Assignment1
15 pages
MLk65opyk45o4v 22i5vi2 It9359ci5ji3tjui3wmdlakmlmakmkmfiejrieuighegiurhgiurguir
No ratings yet
MLk65opyk45o4v 22i5vi2 It9359ci5ji3tjui3wmdlakmlmakmkmfiejrieuighegiurhgiurguir
23 pages
Python Programming For Economics Finance
No ratings yet
Python Programming For Economics Finance
267 pages
dsbda Unit4
No ratings yet
dsbda Unit4
110 pages
Data Science ppt
No ratings yet
Data Science ppt
17 pages
School of Computing and Creative Media XBIS 2023 Data Science Assignment Report
No ratings yet
School of Computing and Creative Media XBIS 2023 Data Science Assignment Report
21 pages
Data Science Handwritten Notes
No ratings yet
Data Science Handwritten Notes
44 pages
python notes sarang sir (1)
No ratings yet
python notes sarang sir (1)
24 pages
Dsbda Lab
100% (1)
Dsbda Lab
76 pages
unit 4
No ratings yet
unit 4
105 pages
Ip Project Class Xii
No ratings yet
Ip Project Class Xii
51 pages
1.1. Scope
No ratings yet
1.1. Scope
45 pages
ML Lab 1
No ratings yet
ML Lab 1
24 pages
Ds Python Unit-I
No ratings yet
Ds Python Unit-I
30 pages
Mastering Python in 7 Days
From Everand
Mastering Python in 7 Days
Alex Wood
No ratings yet
Sec-D ML Practical File PDF
No ratings yet
Sec-D ML Practical File PDF
19 pages
Lesson 03 Programming Environment Setup
No ratings yet
Lesson 03 Programming Environment Setup
33 pages
ML Lab Manual VI Sem
No ratings yet
ML Lab Manual VI Sem
35 pages
DS LAB MANUAL (1)
No ratings yet
DS LAB MANUAL (1)
113 pages
Python For Beginners
From Everand
Python For Beginners
TUDOR MARCIANTI
5/5 (1)
Ganesh
No ratings yet
Ganesh
28 pages
Python for AI: Applying Machine Learning in Everyday Projects
From Everand
Python for AI: Applying Machine Learning in Everyday Projects
Robert Johnson
No ratings yet
Centurion University of Technology and Managament: Topic: Numpy, Pandas, Matplotlib
No ratings yet
Centurion University of Technology and Managament: Topic: Numpy, Pandas, Matplotlib
12 pages
Python Data Persistence
From Everand
Python Data Persistence
Malhar Lathkar
No ratings yet
CS3361 Data Science Lab Manual
No ratings yet
CS3361 Data Science Lab Manual
65 pages
Data Science Lab Manual
No ratings yet
Data Science Lab Manual
74 pages
Machine Learning Lab Manual
No ratings yet
Machine Learning Lab Manual
42 pages
suraj report file
No ratings yet
suraj report file
17 pages
NumPy: Beginner's Guide - Third Edition - Sample Chapter
75% (4)
NumPy: Beginner's Guide - Third Edition - Sample Chapter
54 pages
Application Based Programming in Python_ACE_INTL - Copy
No ratings yet
Application Based Programming in Python_ACE_INTL - Copy
466 pages
Data Ty
No ratings yet
Data Ty
59 pages
Module 4 Part_4
No ratings yet
Module 4 Part_4
23 pages
GEN101 ME Dr. Neelam Tandon
No ratings yet
GEN101 ME Dr. Neelam Tandon
2 pages
FIN201_FM_Dr. Anuj Verma
No ratings yet
FIN201_FM_Dr. Anuj Verma
2 pages
Untitled
No ratings yet
Untitled
259 pages
E-Commerce (Unit-1)
No ratings yet
E-Commerce (Unit-1)
98 pages
Goals of E-Commerce
100% (2)
Goals of E-Commerce
18 pages
History of Taxation in India: CA Mohit Punetha
No ratings yet
History of Taxation in India: CA Mohit Punetha
161 pages
Network Techniques Revised
No ratings yet
Network Techniques Revised
85 pages
ATOK-BIG WEDGE MUTUAL BENEFIT ASSOCIATION v. ATOK-BIG WEDGE MINING COMPANY, INCORPORATED
No ratings yet
ATOK-BIG WEDGE MUTUAL BENEFIT ASSOCIATION v. ATOK-BIG WEDGE MINING COMPANY, INCORPORATED
2 pages
Chemical Reactors
No ratings yet
Chemical Reactors
36 pages
Child Development Theorist Webquest
No ratings yet
Child Development Theorist Webquest
3 pages
Dewatering
No ratings yet
Dewatering
18 pages
A Comparison of DNA and RNA Worksheet
No ratings yet
A Comparison of DNA and RNA Worksheet
5 pages
Baderia Global Institute Project
No ratings yet
Baderia Global Institute Project
47 pages
Jul-Sep 2008 Voice For Native Plants Newsletter, Native Plant Society of New Mexico
No ratings yet
Jul-Sep 2008 Voice For Native Plants Newsletter, Native Plant Society of New Mexico
16 pages
Port Alberni Plywood Site Environmental Assessement
No ratings yet
Port Alberni Plywood Site Environmental Assessement
39 pages
Formation of Hi
No ratings yet
Formation of Hi
2 pages
Keyence Hi 02kdatasheet
No ratings yet
Keyence Hi 02kdatasheet
8 pages
Icf 1
No ratings yet
Icf 1
5 pages
Interview Questions ISCP
No ratings yet
Interview Questions ISCP
4 pages
DR Dychangco
No ratings yet
DR Dychangco
26 pages
Refinery Integration CAAFI PNNL
100% (1)
Refinery Integration CAAFI PNNL
61 pages
School Health Services
No ratings yet
School Health Services
10 pages
Nso 9
No ratings yet
Nso 9
4 pages
Fundamentals of Chemistry PDF
No ratings yet
Fundamentals of Chemistry PDF
7 pages
Mid Term Examinations Fall 2020: 0830hrs November 09, 2020 Block - A Block - B
No ratings yet
Mid Term Examinations Fall 2020: 0830hrs November 09, 2020 Block - A Block - B
4 pages
Aptis General - Aptis SV - Reading Test
100% (1)
Aptis General - Aptis SV - Reading Test
7 pages
Form 941 Summary
100% (1)
Form 941 Summary
5 pages
Toxicity of Heavy Metals
No ratings yet
Toxicity of Heavy Metals
19 pages
9th Science EM Half Yearly Exam 2023 Question Paper Virudhunagar District English Medium PDF Download
No ratings yet
9th Science EM Half Yearly Exam 2023 Question Paper Virudhunagar District English Medium PDF Download
2 pages
MC33030 Motorola
No ratings yet
MC33030 Motorola
16 pages
AC Practice Exercise
No ratings yet
AC Practice Exercise
3 pages
CDM 2015 For Designer PDF
100% (1)
CDM 2015 For Designer PDF
21 pages
USE OF ENGLISH -1
No ratings yet
USE OF ENGLISH -1
4 pages
MREN502 Lecture 5 Thermal Collectors in Simple Energy Systems
No ratings yet
MREN502 Lecture 5 Thermal Collectors in Simple Energy Systems
57 pages
IJAA VOL. 7 NO.6 November-December 2020
No ratings yet
IJAA VOL. 7 NO.6 November-December 2020
240 pages
FAL09014 Solar PWM 10A USB Manual
No ratings yet
FAL09014 Solar PWM 10A USB Manual
1 page
Barista Case Study
No ratings yet
Barista Case Study
17 pages