DS Task-3 - Jupyter Notebook

The document details the installation of various Python packages including pandas, seaborn, and scikit-learn within a Jupyter Notebook environment. It also includes data loading and initial exploration of a dataset related to bank marketing, showing the structure and types of data present. The dataset contains 4119 entries with 21 columns, and no missing or duplicated values were found.

Uploaded by

clavu


10/11/23, 8:45 PM DS Task-3 - Jupyter Notebook

In [4]: pip install pandas

Collecting pandas
Downloading pandas-2.1.1-cp310-cp310-win_amd64.whl (10.7 MB)
---------------------------------------- 10.7/10.7 MB 4.5 MB/s eta 0:00:00
Requirement already satisfied: pytz>=2020.1 in c:\users\ram\appdata\local\programs\python\python310\lib\site-packages (from pandas) (2022.2.1)
Requirement already satisfied: python-dateutil>=2.8.2 in c:\users\ram\appdata\local\programs\python\python310\lib\site-packages (from pandas) (2.8.2)
Collecting numpy>=1.22.4
Downloading numpy-1.26.0-cp310-cp310-win_amd64.whl (15.8 MB)
---------------------------------------- 15.8/15.8 MB 5.2 MB/s eta 0:00:00
Collecting tzdata>=2022.1
Downloading tzdata-2023.3-py2.py3-none-any.whl (341 kB)
-------------------------------------- 341.8/341.8 KB 4.3 MB/s eta 0:00:00
Requirement already satisfied: six>=1.5 in c:\users\ram\appdata\local\programs\python\python310\lib\site-packages (from python-dateutil>=2.8.2->pandas) (1.16.0)
Installing collected packages: tzdata, numpy, pandas
Successfully installed numpy-1.26.0 pandas-2.1.1 tzdata-2023.3
Note: you may need to restart the kernel to use updated packages.

WARNING: You are using pip version 22.0.4; however, version 23.2.1 is available.
You should consider upgrading via the 'C:\Users\RAM\AppData\Local\Programs\Python\Python310\python.exe -m pip install --upgrade pip' command.

localhost:8888/notebooks/DS Task-3.ipynb# 1/16

In [5]: pip install seaborn

Collecting seaborn
Downloading seaborn-0.13.0-py3-none-any.whl (294 kB)
-------------------------------------- 294.6/294.6 KB 2.0 MB/s eta 0:00:00
Collecting matplotlib!=3.6.1,>=3.3
Downloading matplotlib-3.8.0-cp310-cp310-win_amd64.whl (7.6 MB)
---------------------------------------- 7.6/7.6 MB 4.7 MB/s eta 0:00:00
Requirement already satisfied: pandas>=1.2 in c:\users\ram\appdata\local\programs\python\python310\lib\site-packages (from seaborn) (2.1.1)
Requirement already satisfied: numpy!=1.24.0,>=1.20 in c:\users\ram\appdata\local\programs\python\python310\lib\site-packages (from seaborn) (1.26.0)
Requirement already satisfied: packaging>=20.0 in c:\users\ram\appdata\local\programs\python\python310\lib\site-packages (from matplotlib!=3.6.1,>=3.3->seaborn) (21.3)
Requirement already satisfied: pyparsing>=2.3.1 in c:\users\ram\appdata\local\programs\python\python310\lib\site-packages (from matplotlib!=3.6.1,>=3.3->seaborn) (3.0.9)
Collecting kiwisolver>=1.0.1
Downloading kiwisolver-1.4.5-cp310-cp310-win_amd64.whl (56 kB)
-------------------------------------- 56.1/56.1 KB 975.6 kB/s eta 0:00:00
Collecting fonttools>=4.22.0
Downloading fonttools-4.43.1-cp310-cp310-win_amd64.whl (2.1 MB)
---------------------------------------- 2.1/2.1 MB 2.1 MB/s eta 0:00:00
Collecting contourpy>=1.0.1
Downloading contourpy-1.1.1-cp310-cp310-win_amd64.whl (477 kB)
-------------------------------------- 478.0/478.0 KB 1.7 MB/s eta 0:00:00
Collecting cycler>=0.10
Downloading cycler-0.12.1-py3-none-any.whl (8.3 kB)
Collecting pillow>=6.2.0
Downloading Pillow-10.0.1-cp310-cp310-win_amd64.whl (2.5 MB)
---------------------------------------- 2.5/2.5 MB 2.9 MB/s eta 0:00:00
Requirement already satisfied: python-dateutil>=2.7 in c:\users\ram\appdata\local\programs\python\python310\lib\site-packages (from matplotlib!=3.6.1,>=3.3->seaborn) (2.8.2)
Requirement already satisfied: tzdata>=2022.1 in c:\users\ram\appdata\local\programs\python\python310\lib\site-packages (from pandas>=1.2->seaborn) (2023.3)
Requirement already satisfied: pytz>=2020.1 in c:\users\ram\appdata\local\programs\python\python310\lib\site-packages (from pandas>=1.2->seaborn) (2022.2.1)
Requirement already satisfied: six>=1.5 in c:\users\ram\appdata\local\programs\python\python310\lib\site-packages (from python-dateutil>=2.7->matplotlib!=3.6.1,>=3.3->seaborn) (1.16.0)
Installing collected packages: pillow, kiwisolver, fonttools, cycler, contourpy, matplotlib, seaborn
Successfully installed contourpy-1.1.1 cycler-0.12.1 fonttools-4.43.1 kiwisolver-1.4.5 matplotlib-3.8.0 pillow-10.0.1 seaborn-0.13.0
Note: you may need to restart the kernel to use updated packages.

In [6]: !pip install numpy

Requirement already satisfied: numpy in c:\users\ram\appdata\local\programs\python\python310\lib\site-packages (1.26.0)


In [10]: pip install -U scikit-learn

Collecting scikit-learn
Downloading scikit_learn-1.3.1-cp310-cp310-win_amd64.whl (9.3 MB)
---------------------------------------- 9.3/9.3 MB 4.0 MB/s eta 0:00:00
Collecting threadpoolctl>=2.0.0
Downloading threadpoolctl-3.2.0-py3-none-any.whl (15 kB)
Requirement already satisfied: numpy<2.0,>=1.17.3 in c:\users\ram\appdata\local\programs\python\python310\lib\site-packages (from scikit-learn) (1.26.0)
Collecting joblib>=1.1.1
Downloading joblib-1.3.2-py3-none-any.whl (302 kB)
-------------------------------------- 302.2/302.2 KB 2.7 MB/s eta 0:00:00
Collecting scipy>=1.5.0
Downloading scipy-1.11.3-cp310-cp310-win_amd64.whl (44.1 MB)
---------------------------------------- 44.1/44.1 MB 3.0 MB/s eta 0:00:00
Installing collected packages: threadpoolctl, scipy, joblib, scikit-learn
Successfully installed joblib-1.3.2 scikit-learn-1.3.1 scipy-1.11.3 threadpoolctl-3.2.0
Note: you may need to restart the kernel to use updated packages.

In [23]: import pandas as pd

import numpy as np
import seaborn as sns
import warnings
import csv
from sklearn.tree import plot_tree
import matplotlib.pyplot as plt
warnings.filterwarnings("ignore")
%matplotlib inline

In [30]: df = pd.read_csv(r'G:\programming\bank-additional\bank-additional.csv',delimiter=
df.rename(columns={'y': 'deposit'}, inplace=True)
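
The `delimiter=` argument in cell 30 is cut off in this printout. The sketch below assumes a semicolon separator, which is how the public UCI bank-marketing CSV files are laid out; the three inline rows are a made-up sample, not the author's file, and `demo` is a hypothetical name.

```python
import io

import pandas as pd

# Made-up sample in a semicolon-separated layout (the separator is an assumption).
sample = io.StringIO(
    'age;job;marital;y\n'
    '30;"blue-collar";"married";"no"\n'
    '39;"services";"single";"no"\n'
    '25;"services";"married";"yes"\n'
)

# sep=";" splits on semicolons; quoted fields are unquoted by default.
demo = pd.read_csv(sample, sep=";")

# Same rename as in the notebook: the target column 'y' becomes 'deposit'.
demo = demo.rename(columns={"y": "deposit"})
print(demo.dtypes)
```

If the separator were wrong, every row would load as a single string column, so checking `demo.shape` right after loading is a cheap sanity test.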

In [31]: df.head()

Out[31]:
   age  job          marital  education          default  housing  loan     contact    month  day_of_wee
0  30   blue-collar  married  basic.9y           no       yes      no       cellular   may    f
1  39   services     single   high.school        no       no       no       telephone  may    f
2  25   services     married  high.school        no       yes      no       telephone  jun    we
3  38   services     married  basic.9y           no       unknown  unknown  telephone  jun    f
4  47   admin.       married  university.degree  no       yes      no       cellular   nov    mo

5 rows × 21 columns

In [32]: df.tail()
Out[32]:
age job marital education default housing loan contact month day_of_week

4114 30 admin. married basic.6y no yes yes cellular jul thu

4115 39 admin. married high.school no yes no telephone jul fri

4116 27 student single high.school no no no cellular may mon

4117 58 admin. married high.school no no no cellular aug fri

4118 34 management single high.school no yes no cellular nov wed

5 rows × 21 columns

In [33]: df.shape

Out[33]: (4119, 21)

In [34]: df.columns

Out[34]: Index(['age', 'job', 'marital', 'education', 'default', 'housing', 'loan',

'contact', 'month', 'day_of_week', 'duration', 'campaign', 'pdays',
'previous', 'poutcome', 'emp.var.rate', 'cons.price.idx',
'cons.conf.idx', 'euribor3m', 'nr.employed', 'deposit'],
dtype='object')

In [37]: df.dtypes.value_counts()

Out[37]: object 11
int64 5
float64 5
Name: count, dtype: int64

In [38]: df.info()

<class 'pandas.core.frame.DataFrame'>
RangeIndex: 4119 entries, 0 to 4118
Data columns (total 21 columns):
# Column Non-Null Count Dtype
--- ------ -------------- -----
0 age 4119 non-null int64
1 job 4119 non-null object
2 marital 4119 non-null object
3 education 4119 non-null object
4 default 4119 non-null object
5 housing 4119 non-null object
6 loan 4119 non-null object
7 contact 4119 non-null object
8 month 4119 non-null object
9 day_of_week 4119 non-null object
10 duration 4119 non-null int64
11 campaign 4119 non-null int64
12 pdays 4119 non-null int64
13 previous 4119 non-null int64
14 poutcome 4119 non-null object
15 emp.var.rate 4119 non-null float64
16 cons.price.idx 4119 non-null float64
17 cons.conf.idx 4119 non-null float64
18 euribor3m 4119 non-null float64
19 nr.employed 4119 non-null float64
20 deposit 4119 non-null object
dtypes: float64(5), int64(5), object(11)
memory usage: 675.9+ KB

In [39]: df.duplicated().sum()

Out[39]: 0

In [40]: df.isna().sum()

Out[40]: age 0
job 0
marital 0
education 0
default 0
housing 0
loan 0
contact 0
month 0
day_of_week 0
duration 0
campaign 0
pdays 0
previous 0
poutcome 0
emp.var.rate 0
cons.price.idx 0
cons.conf.idx 0
euribor3m 0
nr.employed 0
deposit 0
dtype: int64

In [42]: cat_cols = df.select_dtypes(include='object').columns
print(cat_cols)
num_cols = df.select_dtypes(exclude='object').columns
print(num_cols)

Index(['job', 'marital', 'education', 'default', 'housing', 'loan', 'contact',

'month', 'day_of_week', 'poutcome', 'deposit'],
dtype='object')
Index(['age', 'duration', 'campaign', 'pdays', 'previous', 'emp.var.rate',
'cons.price.idx', 'cons.conf.idx', 'euribor3m', 'nr.employed'],
dtype='object')
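
`df.isna().sum()` reports zero missing values, yet the `df.head()` output shows the string `unknown` in the `housing` and `loan` columns. A minimal sketch of counting such placeholders per categorical column follows; the four rows are copied from the first rows of the `df.head()` output, and treating `unknown` as a missing-value marker is an assumption.

```python
import pandas as pd

# Rows 0-3 of the notebook's df.head() output, three columns only.
demo = pd.DataFrame({
    "age": [30, 39, 25, 38],
    "housing": ["yes", "no", "yes", "unknown"],
    "loan": ["no", "no", "no", "unknown"],
})

# Non-numeric columns play the role of cat_cols in the notebook.
cat_cols = demo.select_dtypes(exclude="number").columns

# Element-wise comparison gives a boolean frame; summing counts the matches.
unknown_counts = (demo[cat_cols] == "unknown").sum()
print(unknown_counts)
```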

In [43]: df.describe()

Out[43]:
age duration campaign pdays previous emp.var.rate cons.price.idx

count 4119.000000 4119.000000 4119.000000 4119.000000 4119.000000 4119.000000 4119.000000

mean 40.113620 256.788055 2.537266 960.422190 0.190337 0.084972 93.579704

std 10.313362 254.703736 2.568159 191.922786 0.541788 1.563114 0.579349

min 18.000000 0.000000 1.000000 0.000000 0.000000 -3.400000 92.201000

25% 32.000000 103.000000 1.000000 999.000000 0.000000 -1.800000 93.075000

50% 38.000000 181.000000 2.000000 999.000000 0.000000 1.100000 93.749000

75% 47.000000 317.000000 3.000000 999.000000 0.000000 1.400000 93.994000

max 88.000000 3643.000000 35.000000 999.000000 6.000000 1.400000 94.767000

In [44]: df.describe(include= 'object')

Out[44]:
job marital education default housing loan contact month day_of_week pou

count 4119 4119 4119 4119 4119 4119 4119 4119 4119

unique 12 4 8 3 3 3 2 10 5

top admin. married university.degree no yes no cellular may thu non

freq 1012 2509 1264 3315 2175 3349 2652 1378 860

In [45]: df.hist(figsize=(10, 10), color="#00FFFF")

plt.show()


In [53]: for feature in cat_cols:
    plt.figure(figsize=(5, 5))
    sns.countplot(x=feature, data=df, palette='Blues')
    plt.title(f'Bar Plot of {feature}')
    plt.xlabel(feature)
    plt.ylabel('Count')
    plt.xticks(rotation=90)
    plt.show()

In [54]: df.plot(kind="box", subplots=True, layout=(2, 5), figsize=(20, 18), color="#1F618

plt.show()

In [62]: column = df[['age','campaign', 'duration']]

q1 = np.percentile(column, 25)  # no axis given: one percentile pooled over all three columns
q3 = np.percentile(column, 75)
iqr = q3 - q1
lower_bound = q1 - 1.5 * iqr
upper_bound = q3 + 1.5 * iqr
df[['age','campaign','duration']] = column[(column> lower_bound) & (column < uppe
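
Because `np.percentile` is called without an `axis`, cell 62 derives a single pair of bounds from the pooled values of `age`, `campaign` and `duration`, which live on very different scales. A per-column variant might look like the sketch below; the toy values and the names `demo`, `inside` and `cleaned` are illustrative assumptions, and this is an alternative, not the notebook's method.

```python
import pandas as pd

# Toy data with one obvious outlier per column (values are illustrative).
demo = pd.DataFrame({
    "age": [30, 39, 25, 38, 47, 88],
    "campaign": [1, 2, 1, 3, 2, 35],
})

# DataFrame.quantile works column by column and returns one value per column.
q1 = demo.quantile(0.25)
q3 = demo.quantile(0.75)
iqr = q3 - q1
lower = q1 - 1.5 * iqr
upper = q3 + 1.5 * iqr

# Comparing a frame with a Series aligns the Series on column labels,
# so each column is tested against its own bounds.
inside = (demo > lower) & (demo < upper)

# Cells outside the bounds become NaN, mirroring the masking in cell 62.
cleaned = demo.where(inside)
print(cleaned)
```

Masked cells are left as NaN, so a later step has to drop or impute them before modelling.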

In [65]: df.plot(kind='box', subplots=True, layout=(2,5), figsize=(20,10), color= '#808000

plt.show()

In [68]: corr = df.corr()

print(corr)
corr =corr[abs(corr) >= 0.90]
sns.heatmap(corr, annot=True, cmap='coolwarm', linewidths=0.2)
plt.show()

---------------------------------------------------------------------------
ValueError Traceback (most recent call last)
Input In [68], in <cell line: 1>()
----> 1 corr = df.corr()
2 print(corr)
3 corr =corr[abs(corr) >= 0.90]

File ~\AppData\Local\Programs\Python\Python310\lib\site-packages\pandas\core\frame.py:10707, in DataFrame.corr(self, method, min_periods, numeric_only)
10705 cols = data.columns
10706 idx = cols.copy()
> 10707 mat = data.to_numpy(dtype=float, na_value=np.nan, copy=False)
10709 if method == "pearson":
10710 correl = libalgos.nancorr(mat, minp=min_periods)

File ~\AppData\Local\Programs\Python\Python310\lib\site-packages\pandas\core\frame.py:1892, in DataFrame.to_numpy(self, dtype, copy, na_value)
1890 if dtype is not None:
1891 dtype = np.dtype(dtype)
-> 1892 result = self._mgr.as_array(dtype=dtype, copy=copy, na_value=na_value)
1893 if result.dtype is not dtype:
1894 result = np.array(result, dtype=dtype, copy=False)

File ~\AppData\Local\Programs\Python\Python310\lib\site-packages\pandas\core\internals\managers.py:1656, in BlockManager.as_array(self, dtype, copy, na_value)
1654 arr.flags.writeable = False
1655 else:
-> 1656 arr = self._interleave(dtype=dtype, na_value=na_value)
1657 # The underlying data was copied within _interleave, so no need
1658 # to further copy if copy=True or setting na_value
1660 if na_value is lib.no_default:

File ~\AppData\Local\Programs\Python\Python310\lib\site-packages\pandas\core\internals\managers.py:1715, in BlockManager._interleave(self, dtype, na_value)
1713 else:
1714 arr = blk.get_values(dtype)
-> 1715 result[rl.indexer] = arr
1716 itemmask[rl.indexer] = 1
1718 if not itemmask.all():

ValueError: could not convert string to float: 'blue-collar'
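
The `ValueError` arises because `df` still holds string columns such as `job`, and `DataFrame.corr` in pandas 2.x tries to cast every column to float. One way around it, sketched here on made-up values, is to pass `numeric_only=True`; the frame `demo` and its numbers are assumptions for illustration only.

```python
import pandas as pd

# Made-up frame mixing one string column with two numeric ones.
demo = pd.DataFrame({
    "job": ["blue-collar", "services", "services", "admin."],
    "emp.var.rate": [-1.8, 1.1, 1.4, -0.1],
    "euribor3m": [1.3, 4.9, 4.9, 4.2],
})

# numeric_only=True skips non-numeric columns instead of raising.
corr = demo.corr(numeric_only=True)

# Keep only strong correlations; weaker cells become NaN, as in cell 68.
strong = corr.where(corr.abs() >= 0.90)
print(strong)
```

Selecting the numeric columns first, for example with `df[num_cols].corr()`, would be an equivalent route.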

In [70]: high_corr_cols = ['emp.var.rate', 'euribor3m', 'nr.employed']

In [71]: df1 = df.copy()
df1.columns

Out[71]: Index(['age', 'job', 'marital', 'education', 'default', 'housing', 'loan',
'contact', 'month', 'day_of_week', 'duration', 'campaign', 'pdays',
'previous', 'poutcome', 'emp.var.rate', 'cons.price.idx',
'cons.conf.idx', 'euribor3m', 'nr.employed', 'deposit'],
dtype='object')

In [74]: df1.shape

Out[74]: (4119, 21)

In [76]: from sklearn.preprocessing import LabelEncoder
lb = LabelEncoder()
df_encoded = df1.apply(lb.fit_transform)
df_encoded
Out[76]:
age job marital education default housing loan contact month day_of_week ... campa

0 12 1 1 2 0 2 0 0 6 0 ...

1 21 7 2 3 0 0 0 1 6 0 ...

2 7 7 1 3 0 2 0 1 4 4 ...

3 20 7 1 2 0 1 1 1 4 0 ...

4 29 0 1 6 0 2 0 0 7 1 ...

... ... ... ... ... ... ... ... ... ... ... ...

4114 12 0 1 1 0 2 2 0 3 2 ...

4115 21 0 1 3 0 2 0 1 3 0 ...

4116 9 8 2 3 0 0 0 0 6 1 ...

4117 40 0 1 3 0 0 0 0 1 0 ...

4118 16 4 2 3 0 2 0 0 7 4 ...

4119 rows × 21 columns
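
Applying `lb.fit_transform` to every column also re-codes the numeric ones: in the output above, age 30 appears as 12. A sketch that encodes only the non-numeric columns and keeps one fitted encoder per column is shown below. The `deposit` values and the names `encoded` and `encoders` are assumptions; scikit-learn documents `LabelEncoder` as meant for target labels, so this is a convenience sketch, not a recommendation.

```python
import pandas as pd
from sklearn.preprocessing import LabelEncoder

# age/job follow the notebook's df.head(); the deposit values are made up.
demo = pd.DataFrame({
    "age": [30, 39, 25, 38, 47],
    "job": ["blue-collar", "services", "services", "services", "admin."],
    "deposit": ["no", "no", "yes", "no", "no"],
})

encoded = demo.copy()
encoders = {}  # one fitted encoder per column, so codes can be mapped back

for col in encoded.select_dtypes(exclude="number").columns:
    enc = LabelEncoder()
    encoded[col] = enc.fit_transform(encoded[col])
    encoders[col] = enc

print(encoded)
print(list(encoders["job"].classes_))
```

Keeping the encoders allows `encoders["job"].inverse_transform(...)` to recover the original labels when reading the fitted tree.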

In [77]: df_encoded['deposit'].value_counts()

Out[77]: deposit
0 3668
1 451
Name: count, dtype: int64
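
The counts above show a strongly imbalanced target. A quick piece of arithmetic, using only the counts printed in cell 77 and the test-set supports from the classification report at the end of the notebook, gives the accuracy that a model would reach by always predicting class 0.

```python
# Class counts from cell 77 (full data) and supports from the final
# classification report (test split).
full_counts = {0: 3668, 1: 451}
test_counts = {0: 930, 1: 100}


def majority_baseline(counts):
    """Accuracy obtained by always predicting the most frequent class."""
    return max(counts.values()) / sum(counts.values())


full_baseline = majority_baseline(full_counts)
test_baseline = majority_baseline(test_counts)
positive_share = full_counts[1] / sum(full_counts.values())

print(f"baseline (full data): {full_baseline:.4f}")
print(f"baseline (test set):  {test_baseline:.4f}")
print(f"share of class 1:     {positive_share:.4f}")
```

Accuracies of about 0.91 should be read against this baseline of about 0.90 on the test split, which is why per-class recall matters here.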

In [78]: x = df_encoded.drop('deposit', axis=1)
y = df_encoded['deposit']
print(x.shape)
print(y.shape)
print(type(x))
print(type(y))

(4119, 20)
(4119,)
<class 'pandas.core.frame.DataFrame'>
<class 'pandas.core.series.Series'>

In [79]: from sklearn.model_selection import train_test_split

In [80]: print(4119*0.25)

1029.75

In [84]: x_train,x_test,y_train,y_test = train_test_split(x,y,test_size=0.25, random_stat

print(x_train.shape)
print(x_test.shape)
print(y_train.shape)
print(y_test.shape)

(3089, 20)
(1030, 20)
(3089,)
(1030,)
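
The test split has 1030 rows because `train_test_split` rounds `4119 * 0.25 = 1029.75` up. With a minority class of roughly 11%, passing `stratify=y` keeps the class ratio the same in both parts. The sketch below uses toy arrays; `random_state=0` is an assumption, since the value in cell 84 is cut off in this printout.

```python
import numpy as np
from sklearn.model_selection import train_test_split

# Toy data: 20 samples, 4 of them in the minority class.
X = np.arange(40).reshape(20, 2)
y = np.array([0] * 16 + [1] * 4)

# stratify=y preserves the 16:4 class ratio in both splits.
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.25, stratify=y, random_state=0
)

print(X_train.shape, X_test.shape)
print("minority in train:", int(y_train.sum()), "in test:", int(y_test.sum()))
```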

In [85]: from sklearn.metrics import confusion_matrix, classification_report,accuracy_scor

In [89]: def eval_model(y_test,y_pred):
    acc = accuracy_score(y_test,y_pred)
    print('Accuracy Score', acc)
    cm = confusion_matrix(y_test,y_pred)
    print('Confusion Matrix\n', cn)  # prints `cn`, not the `cm` computed above
    print('Classification Report\n', classification_report(y_test,y_pred))

def escore(model):
    train_score = model.score(x_train,y_train)
    test_score = model.score(x_test,y_test)
    print('Training Score', train_score)
    print('Testing Score', test_score)

In [92]: from sklearn.tree import DecisionTreeClassifier

In [93]: dt = DecisionTreeClassifier(criterion='gini', max_depth=5,min_samples_split=10)

dt.fit(x_train,y_train)

Out[93]: ▾ DecisionTreeClassifier
DecisionTreeClassifier(max_depth=5, min_samples_split=10)

In [95]: ypred_dt = dt.predict(x_test)

print(ypred_dt)

[0 0 1 ... 1 0 0]

In [96]: eval_model(y_test, ypred_dt)

Accuracy Score 0.9087378640776699

---------------------------------------------------------------------------
NameError Traceback (most recent call last)
Input In [96], in <cell line: 1>()
----> 1 eval_model(y_test, ypred_dt)

Input In [89], in eval_model(y_test, y_pred)

3 print('Accuracy Score', acc)
4 cm = confusion_matrix(y_test,y_pred)
----> 5 print('Confusion Matrix\n', cn)
6 print('Classification Report\n', classification_report(y_test,y_pred))

NameError: name 'cn' is not defined

In [97]: from sklearn.tree import plot_tree

In [98]: cn = ['no', 'yes']
fn = x_train.columns
print(fn)
print(cn)

Index(['age', 'job', 'marital', 'education', 'default', 'housing', 'loan',

'contact', 'month', 'day_of_week', 'duration', 'campaign', 'pdays',
'previous', 'poutcome', 'emp.var.rate', 'cons.price.idx',
'cons.conf.idx', 'euribor3m', 'nr.employed'],
dtype='object')
['no', 'yes']
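
In cell 89, `eval_model` prints `cn` where the freshly computed `cm` was presumably intended. That explains the `NameError` in cell 96 and, now that `cn` holds the class-name list, why a later call prints `['no', 'yes']` under "Confusion Matrix". A corrected sketch on toy labels follows; the return value and the toy arrays are additions for illustration.

```python
import numpy as np
from sklearn.metrics import accuracy_score, classification_report, confusion_matrix


def eval_model(y_true, y_pred):
    """Print accuracy, confusion matrix and per-class report; return the first two."""
    acc = accuracy_score(y_true, y_pred)
    cm = confusion_matrix(y_true, y_pred)  # rows: true class, columns: predicted
    print("Accuracy Score", acc)
    print("Confusion Matrix\n", cm)
    print("Classification Report\n", classification_report(y_true, y_pred))
    return acc, cm


# Toy labels: four negatives and two positives, with two mistakes.
y_true = np.array([0, 0, 0, 0, 1, 1])
y_pred = np.array([0, 0, 0, 1, 1, 0])
acc, cm = eval_model(y_true, y_pred)
```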

In [100]: feature_names = x_train.columns.tolist()

class_names = ["class_0", "class_1"]
plot_tree(dt, feature_names=feature_names, class_names=class_names, filled=True)
plt.show()

In [102]: dt1 = DecisionTreeClassifier(criterion='entropy', max_depth=4,min_samples_split=1

dt1.fit(x_train,y_train)

Out[102]: ▾ DecisionTreeClassifier
DecisionTreeClassifier(criterion='entropy', max_depth=4, min_samples_split=15)

In [103]: mscore(dt1)

---------------------------------------------------------------------------
NameError Traceback (most recent call last)
Input In [103], in <cell line: 1>()
----> 1 mscore(dt1)

NameError: name 'mscore' is not defined
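
The call fails because cell 89 defines the helper under the name `escore`, not `mscore`. A sketch of such a helper under the name used here, taking the data explicitly instead of reading notebook globals, could look like this; the synthetic dataset from `make_classification` and every `random_state` value are assumptions for illustration.

```python
from sklearn.datasets import make_classification
from sklearn.model_selection import train_test_split
from sklearn.tree import DecisionTreeClassifier


def mscore(model, x_train, y_train, x_test, y_test):
    """Print and return training and testing accuracy of a fitted model."""
    train_score = model.score(x_train, y_train)
    test_score = model.score(x_test, y_test)
    print("Training Score", train_score)
    print("Testing Score", test_score)
    return train_score, test_score


# Synthetic stand-in for the encoded bank data.
X, y = make_classification(n_samples=200, n_features=5, random_state=0)
x_tr, x_te, y_tr, y_te = train_test_split(X, y, test_size=0.25, random_state=0)

# Same hyperparameters as the fitted dt1 shown in Out[102].
model = DecisionTreeClassifier(
    criterion="entropy", max_depth=4, min_samples_split=15, random_state=0
)
model.fit(x_tr, y_tr)
train_score, test_score = mscore(model, x_tr, y_tr, x_te, y_te)
```

A training score far above the testing score would point to overfitting, which `max_depth` and `min_samples_split` are there to limit.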

In [104]: ypred_dt1 = dt1.predict(x_test)

In [105]: eval_model(y_test,ypred_dt1)

Accuracy Score 0.9106796116504854

Confusion Matrix
['no', 'yes']
Classification Report
              precision    recall  f1-score   support

           0       0.94      0.96      0.95       930
           1       0.55      0.42      0.48       100

    accuracy                           0.91      1030
   macro avg       0.75      0.69      0.71      1030
weighted avg       0.90      0.91      0.91      1030

In [106]: plt.figure(figsize=(15, 15))

plot_tree(dt1, feature_names=fn.tolist(), class_names=cn, filled=True)
plt.show()
