
DM_Lab_8 - Jupyter Notebook

This lab covers classification and regression trees using the CART and ID3 approaches. It loads a play-tennis dataset and a position/salary dataset, label-encodes the categorical variables, and separates features from targets. A CART classifier is fitted to predict whether tennis is played, a CART regressor learns the relationship between job level and salary, and an entropy-based (ID3-style) classifier is fitted for comparison; each tree is plotted to visualize the model.


Topic: CART - Classification & Regression Tree, ID3

In [22]:

import pandas as pd
import numpy as np

# Play-tennis data; Day is only a row label, so drop it
df1 = pd.read_csv('/home/c0nqu3r0r/Desktop/_Second sem/Data Mining/Dataset/Te
df1.drop('Day', axis=1, inplace=True)  # for classification

# Position/level/salary data, used later for the regression tree
df2 = pd.read_csv('/home/c0nqu3r0r/Desktop/_Second sem/Data Mining/Dataset/ar

In [23]:

df1.head()

Out[23]:

    Outlook Temperature Humidity    Wind PlayTennis
0     Sunny         Hot     High    Weak         No
1     Sunny         Hot     High  Strong         No
2  Overcast         Hot     High    Weak        Yes
3      Rain        Mild     High    Weak        Yes
4      Rain        Cool   Normal    Weak        Yes

In [24]:

from sklearn.preprocessing import LabelEncoder

Le = LabelEncoder()

# Encode each categorical column as integers; classes are sorted
# alphabetically, so e.g. Outlook maps Overcast=0, Rain=1, Sunny=2
df1['Outlook'] = Le.fit_transform(df1['Outlook'])
df1['Temperature'] = Le.fit_transform(df1['Temperature'])
df1['Humidity'] = Le.fit_transform(df1['Humidity'])
df1['Wind'] = Le.fit_transform(df1['Wind'])
df1['PlayTennis'] = Le.fit_transform(df1['PlayTennis'])
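Note that refitting the same Le instance overwrites its remembered classes each time, so after this cell only the PlayTennis mapping can still be recovered. A minimal alternative sketch (the encoders dict is illustrative, not part of the lab) that keeps one encoder per column so any mapping can be inspected or inverted later:

from sklearn.preprocessing import LabelEncoder

# Hypothetical variant: one encoder per column, so each mapping survives
encoders = {}
for col in ['Outlook', 'Temperature', 'Humidity', 'Wind', 'PlayTennis']:
    encoders[col] = LabelEncoder()
    df1[col] = encoders[col].fit_transform(df1[col])

# e.g. encoders['Outlook'].inverse_transform([0, 1, 2])
# -> array(['Overcast', 'Rain', 'Sunny'], dtype=object)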

In [25]:

# Target: PlayTennis; features: the four encoded weather columns
y1 = df1['PlayTennis']
x1 = df1.drop(['PlayTennis'], axis=1)
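With only 14 rows the lab fits on the full dataset, but on realistic data one would hold out a test set before training. A minimal sketch using scikit-learn's standard splitter (not part of the original notebook; the 30% split and seed are arbitrary):

from sklearn.model_selection import train_test_split

# Hold out 30% of the rows for evaluation on unseen data
x1_train, x1_test, y1_train, y1_test = train_test_split(
    x1, y1, test_size=0.3, random_state=42)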


In [26]:

# CART classification tree (Gini impurity criterion)
from sklearn import tree

clf1 = tree.DecisionTreeClassifier(criterion='gini')
clf1 = clf1.fit(x1, y1)
tree.plot_tree(clf1)

Out[26]:

[Text(0.4444444444444444, 0.9, 'X[0] <= 0.5\ngini = 0.459\nsamples = 14\nvalue = [5, 9]'),
 Text(0.3333333333333333, 0.7, 'gini = 0.0\nsamples = 4\nvalue = [0, 4]'),
 Text(0.5555555555555556, 0.7, 'X[2] <= 0.5\ngini = 0.5\nsamples = 10\nvalue = [5, 5]'),
 Text(0.3333333333333333, 0.5, 'X[0] <= 1.5\ngini = 0.32\nsamples = 5\nvalue = [4, 1]'),
 Text(0.2222222222222222, 0.3, 'X[3] <= 0.5\ngini = 0.5\nsamples = 2\nvalue = [1, 1]'),
 Text(0.1111111111111111, 0.1, 'gini = 0.0\nsamples = 1\nvalue = [1, 0]'),
 Text(0.3333333333333333, 0.1, 'gini = 0.0\nsamples = 1\nvalue = [0, 1]'),
 Text(0.4444444444444444, 0.3, 'gini = 0.0\nsamples = 3\nvalue = [3, 0]'),
 Text(0.7777777777777778, 0.5, 'X[3] <= 0.5\ngini = 0.32\nsamples = 5\nvalue = [1, 4]'),
 Text(0.6666666666666666, 0.3, 'X[0] <= 1.5\ngini = 0.5\nsamples = 2\nvalue = [1, 1]'),
 Text(0.5555555555555556, 0.1, 'gini = 0.0\nsamples = 1\nvalue = [1, 0]'),
 Text(0.7777777777777778, 0.1, 'gini = 0.0\nsamples = 1\nvalue = [0, 1]'),
 Text(0.8888888888888888, 0.3, 'gini = 0.0\nsamples = 3\nvalue = [0, 3]')]


In [27]:

df2.head()

Out[27]:

            Position  Level  Salary
0   Business Analyst      1   45000
1  Junior Consultant      2   50000
2  Senior Consultant      3   60000
3            Manager      4   80000
4    Country Manager      5  110000

In [28]:

# Level as the single feature (1:2 keeps a 2-D array), Salary as target
x2 = df2.iloc[:, 1:2].values
y2 = df2.iloc[:, 2].values


In [29]:

# CART regression tree (default criterion: squared error)
from sklearn import tree

clf2 = tree.DecisionTreeRegressor()
clf2 = clf2.fit(x2, y2)
tree.plot_tree(clf2)

Out[29]:

[Text(0.703125, 0.9285714285714286, 'X[0] <= 8.5\nsquared_error = 80662250000.0\nsamples = 10\nvalue = 249500.0'),
 Text(0.53125, 0.7857142857142857, 'X[0] <= 6.5\nsquared_error = 6921484375.0\nsamples = 8\nvalue = 124375.0'),
 Text(0.375, 0.6428571428571429, 'X[0] <= 4.5\nsquared_error = 1381250000.0\nsamples = 6\nvalue = 82500.0'),
 Text(0.25, 0.5, 'X[0] <= 3.5\nsquared_error = 179687500.0\nsamples = 4\nvalue = 58750.0'),
 Text(0.1875, 0.35714285714285715, 'X[0] <= 2.5\nsquared_error = 38888888.889\nsamples = 3\nvalue = 51666.667'),
 Text(0.125, 0.21428571428571427, 'X[0] <= 1.5\nsquared_error = 6250000.0\nsamples = 2\nvalue = 47500.0'),
 Text(0.0625, 0.07142857142857142, 'squared_error = 0.0\nsamples = 1\nvalue = 45000.0'),
 Text(0.1875, 0.07142857142857142, 'squared_error = 0.0\nsamples = 1\nvalue = 50000.0'),
 Text(0.25, 0.21428571428571427, 'squared_error = 0.0\nsamples = 1\nvalue = 60000.0'),
 Text(0.3125, 0.35714285714285715, 'squared_error = 0.0\nsamples = 1\nvalue = 80000.0'),
 Text(0.5, 0.5, 'X[0] <= 5.5\nsquared_error = 400000000.0\nsamples = 2\nvalue = 130000.0'),
 Text(0.4375, 0.35714285714285715, 'squared_error = 0.0\nsamples = 1\nvalue = 110000.0'),
 Text(0.5625, 0.35714285714285715, 'squared_error = 0.0\nsamples = 1\nvalue = 150000.0'),
 Text(0.6875, 0.6428571428571429, 'X[0] <= 7.5\nsquared_error = 2500000000.0\nsamples = 2\nvalue = 250000.0'),
 Text(0.625, 0.5, 'squared_error = 0.0\nsamples = 1\nvalue = 200000.0'),
 Text(0.75, 0.5, 'squared_error = 0.0\nsamples = 1\nvalue = 300000.0'),
 Text(0.875, 0.7857142857142857, 'X[0] <= 9.5\nsquared_error = 62500000000.0\nsamples = 2\nvalue = 750000.0'),
 Text(0.8125, 0.6428571428571429, 'squared_error = 0.0\nsamples = 1\nvalue = 500000.0'),
 Text(0.9375, 0.6428571428571429, 'squared_error = 0.0\nsamples = 1\nvalue = 1000000.0')]
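With no depth limit and one training row per level, the regressor memorizes the table: every leaf holds a single salary, and a query simply returns the value of the leaf it reaches. A quick illustrative check (the 6.5 query is not in the lab; tracing the splits above, it falls into the level-6 leaf):

# 6.5 <= 8.5, 6.5 <= 6.5, 6.5 > 4.5, 6.5 > 5.5  ->  leaf value 150000.0
print(clf2.predict([[6.5]]))  # expected: [150000.]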


In [30]:

# ID3-style tree: entropy (information gain) criterion
from sklearn import tree

clf3 = tree.DecisionTreeClassifier(criterion='entropy')
clf3 = clf3.fit(x1, y1)
tree.plot_tree(clf3)

Out[30]:

[Text(0.4444444444444444, 0.9, 'X[0] <= 0.5\nentropy = 0.94\nsamples = 14\nvalue = [5, 9]'),
 Text(0.3333333333333333, 0.7, 'entropy = 0.0\nsamples = 4\nvalue = [0, 4]'),
 Text(0.5555555555555556, 0.7, 'X[2] <= 0.5\nentropy = 1.0\nsamples = 10\nvalue = [5, 5]'),
 Text(0.3333333333333333, 0.5, 'X[0] <= 1.5\nentropy = 0.722\nsamples = 5\nvalue = [4, 1]'),
 Text(0.2222222222222222, 0.3, 'X[3] <= 0.5\nentropy = 1.0\nsamples = 2\nvalue = [1, 1]'),
 Text(0.1111111111111111, 0.1, 'entropy = 0.0\nsamples = 1\nvalue = [1, 0]'),
 Text(0.3333333333333333, 0.1, 'entropy = 0.0\nsamples = 1\nvalue = [0, 1]'),
 Text(0.4444444444444444, 0.3, 'entropy = 0.0\nsamples = 3\nvalue = [3, 0]'),
 Text(0.7777777777777778, 0.5, 'X[3] <= 0.5\nentropy = 0.722\nsamples = 5\nvalue = [1, 4]'),
 Text(0.6666666666666666, 0.3, 'X[0] <= 1.5\nentropy = 1.0\nsamples = 2\nvalue = [1, 1]'),
 Text(0.5555555555555556, 0.1, 'entropy = 0.0\nsamples = 1\nvalue = [1, 0]'),
 Text(0.7777777777777778, 0.1, 'entropy = 0.0\nsamples = 1\nvalue = [0, 1]'),
 Text(0.8888888888888888, 0.3, 'entropy = 0.0\nsamples = 3\nvalue = [0, 3]')]
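scikit-learn does not implement ID3 itself; a binary CART grown with the entropy (information gain) criterion, as above, is the closest built-in analogue. To read the learned rules without a plot, export_text works on any fitted tree (a small sketch; the feature names mirror the columns of x1):

from sklearn.tree import export_text

# Plain-text dump of the entropy tree's decision rules
print(export_text(clf3, feature_names=list(x1.columns)))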
