
Decision Tree

Decision Tree: A Decision Tree is a supervised learning algorithm. It is a graphical representation of all the possible solutions to a decision, where each decision is made based on some condition.

Do you know why this algorithm is known as a Decision Tree? Because it starts from a root node and branches off into a number of solutions, just like a tree: a tree also starts from its root and keeps growing its branches as it gets bigger.

To understand this better, take the example represented in the tree model below.

Here the root node is a question asking whether you are hungry or not. If you are not hungry, go back to sleep. If you are hungry, check whether you have 100 dollars. If you have sufficient money, go to a restaurant; if you don't have enough money, just buy some juice.

In this way, the Decision Tree divides the data into different groups based on conditions.

In this dataset, fruits are labelled as either Mango, Grape, or Lemon based on their colour and diameter.

Here we take diameter as the root node. If the diameter is greater than 3, the colour is either green or yellow; if the diameter is less than 3, it is certainly a grape, which is red.

Next we check the colour. If it is not yellow, it is certainly a mango. But if it is yellow, there are two options: it can be either a mango or a lemon, so there is a 50% chance of mango and a 50% chance of lemon.
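To make these rules concrete, here is a minimal Python sketch of the fruit example as plain if/else checks (the function name and the way the yellow case is reported are ours, just for illustration):

def classify_fruit(diameter, colour):
    # Diameter less than 3 -> certainly a grape (red).
    if diameter < 3:
        return "Grape"
    # Diameter 3 or more and not yellow -> certainly a mango.
    if colour != "Yellow":
        return "Mango"
    # Diameter 3 or more and yellow -> 50% mango, 50% lemon.
    return "Mango or Lemon"

print(classify_fruit(1, "Red"))      # Grape
print(classify_fruit(4, "Green"))    # Mango
print(classify_fruit(4, "Yellow"))   # Mango or Lemon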

But which question should come at the root node, and which question comes next? We need to see which attribute best unmixes the labels at that particular point. We can measure the amount of uncertainty at a single node with something known as Gini impurity, and we can measure how much a question reduces that uncertainty with something known as Information Gain.

We use these measures to decide which question is asked, and at which point.
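As a rough illustration of the idea, here is a small Python sketch of Gini impurity computed over a list of class labels (the helper name is ours, not from any particular library):

from collections import Counter

def gini_impurity(labels):
    # Gini impurity = 1 - sum of squared class probabilities.
    counts = Counter(labels)
    total = len(labels)
    return 1.0 - sum((n / total) ** 2 for n in counts.values())

print(gini_impurity(["Mango", "Mango", "Mango"]))           # 0.0 -> pure node, no uncertainty
print(gini_impurity(["Mango", "Lemon", "Mango", "Lemon"]))  # 0.5 -> maximally mixed for two classes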

Terminologies

Root node: The root node is the base node of the tree, from which the entire tree starts.

Terminal Node/Leaf Node: This is the final node where no further segregation is possible.

Branch/Sub-tree: A branch is formed by splitting a node.

Pruning: Pruning is the opposite of splitting. Pruning is the process of removing nodes to decrease the size of the decision tree.

Parent Node/Child Node: The root node is the topmost parent node, and all the nodes derived from a parent node are known as its child nodes.
CART Algorithm

The CART (Classification and Regression Tree) algorithm is a predictive model that shows how the outcome of a variable is predicted based on other values.

Let’s have a look at this data.

Here there are attributes like Outlook, Temperature, Humidity, and Windy, and a label Play. All these attributes together decide whether to play or not.

So, among all of them, which one should we pick first? The attribute that classifies the training data best will be picked first.

To decide that we need to learn some terms.

Gini Index: The Gini Index is the measure of impurity (or purity) used in building a decision tree in the CART algorithm.

Information Gain: Information gain is the measure of how much information a feature gives about the class. It is the decrease in entropy after splitting the dataset based on the attribute.

Constructing a decision tree is all about finding the attribute that has the
highest information gain.

Reduction in Variance: In general, variance measures how much your data varies. Here the attribute whose split leaves the lower variance in the sub-nodes is chosen first (a small sketch follows this list of criteria).

Chi-square: This is an algorithm to find the statistical significance of the differences between sub-nodes and the parent node.
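Here is that small sketch of the reduction-in-variance criterion for a regression split; the target values below are made-up illustration numbers, not from the dataset in this article:

import statistics

def variance_reduction(parent, children):
    # Reduction in variance = Var(parent) - size-weighted average of Var(child).
    n = len(parent)
    weighted = sum(len(child) / n * statistics.pvariance(child) for child in children)
    return statistics.pvariance(parent) - weighted

# Made-up target values for a regression node and two candidate splits.
parent = [10, 12, 11, 30, 32, 31]
good_split = [[10, 12, 11], [30, 32, 31]]   # separates low values from high values
poor_split = [[10, 30, 11], [12, 32, 31]]   # sub-nodes are still mixed

print(round(variance_reduction(parent, good_split), 2))  # large reduction -> preferred split
print(round(variance_reduction(parent, poor_split), 2))  # small reduction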

The first step before building the decision tree is to compute entropy, which is used to find information gain. As we know, splitting is done based on information gain: the attribute with the highest information gain is selected first.

Entropy: Entropy is the measure of uncertainty. It is a metric that measures the impurity of something.

Let's first understand what impurity is.

Imagine a basket full of cherries and a bowl that contains labels, each with "cherry" written on it. Now if you take one fruit from the basket and one label from the bowl, the probability of matching cherry with cherry is 1, so there is no impurity here.

Now imagine another situation with different fruits in the basket and labels with different fruit names in the bowl. If you pick one random fruit from the basket and one random label from the bowl, the probability of matching cherry with cherry is certainly less than 1. Here there is impurity.

Entropy = -Σ P(x) log P(x)

Entropy(s) = -P(yes) log P(yes) - P(no) log P(no)

where,

s = total sample space

P(yes) = Probability of yes

P(no) = Probability of no

If the number of yes = the number of no, then

P(yes) = P(no) = 0.5 and Entropy(s) = 1

If it contains either all yes or all no, then

P(yes) = 1 or 0 and Entropy(s) = 0

Let's see the first case, where the number of yes = the number of no:

Entropy(s) = -P(yes) log2 P(yes) - P(no) log2 P(no)

E(s) = -0.5 log2(0.5) - 0.5 log2(0.5)

E(s) = 0.5 + 0.5

E(s) = 1

Let's see the second case, where it contains either all yes or all no. For all yes:

Entropy(s) = -P(yes) log2 P(yes)

E(s) = -1 log2(1)

E(s) = 0

Similarly for all no:

Entropy(s) = -P(no) log2 P(no)

E(s) = -1 log2(1)

E(s) = 0
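A minimal Python sketch of the entropy formula, just to confirm the two cases above (the helper name is ours):

import math

def entropy(p_yes, p_no):
    # Entropy(s) = -P(yes)*log2(P(yes)) - P(no)*log2(P(no)), treating 0*log(0) as 0.
    return -sum(p * math.log2(p) for p in (p_yes, p_no) if p > 0)

print(entropy(0.5, 0.5))  # 1.0 -> equal yes and no, maximum uncertainty
print(entropy(1.0, 0.0))  # 0.0 -> all yes, no uncertainty
print(entropy(0.0, 1.0))  # 0.0 -> all no, no uncertainty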

Calculating Information Gain

Information Gain = Entropy(s) - [ (weighted average) * Entropy(each feature) ]

Let's calculate the entropy for this dataset. Here, in total, we have 14 data points, of which 9 are yes and 5 are no.
Entropy(s)=-P(yes)logP(yes)-P(no)logP(no)

E(s)=-(9/14)log(9/14) – (5/14)log(5/14)

E(s)=0.41+0.53

E(s)=0.94

So entropy for this dataset is 0.94.
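A quick check of this calculation in Python:

import math

# 9 "yes" and 5 "no" out of 14 data points.
p_yes, p_no = 9 / 14, 5 / 14
E_s = -p_yes * math.log2(p_yes) - p_no * math.log2(p_no)
print(round(E_s, 2))  # 0.94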

Now, out of Outlook, Temperature, Humidity, and Windy, which node is selected as the root node?

Let's see them one by one.


Source: Author
Coming to Outlook:

Here for Sunny we have 2 yes and 3 no. For Overcast, we have all yes. And for Rainy we have 3 yes and 2 no.

E(Outlook=Sunny) = -(2/5) log(2/5) - (3/5) log(3/5) = 0.971

E(Outlook=Overcast) = -1 log(1) = 0

E(Outlook=Rainy) = -(3/5) log(3/5) - (2/5) log(2/5) = 0.971

Information from Outlook:

I(Outlook) = (5/14) * 0.971 + (4/14) * 0 + (5/14) * 0.971 = 0.693

Information gain from Outlook:

IG(Outlook) = E(s) - I(Outlook) = 0.94 - 0.693 = 0.247

So Information gain for outlook is 0.247.

Now let's consider Windy as the root node and calculate the information gain.

In the case of False, we have 6 yes and 2 no, whereas in the case of True we have 3 yes and 3 no.

E(Windy=True) = 1

E(Windy=False) = -(6/8) log(6/8) - (2/8) log(2/8) = 0.811

I(Windy) = (8/14) * 0.811 + (6/14) * 1 = 0.892

IG(windy) = 0.94 – 0.892 = 0.048

So information gain for windy is 0.048.
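A quick Python check of both information gains, using the per-branch counts given above (the helper name is ours):

import math

def entropy_from_counts(yes, no):
    # Entropy from raw yes/no counts, treating 0*log(0) as 0.
    total = yes + no
    return -sum((n / total) * math.log2(n / total) for n in (yes, no) if n > 0)

E_s = entropy_from_counts(9, 5)  # 0.94 for the whole dataset

# Outlook branches: Sunny (2 yes, 3 no), Overcast (4 yes, 0 no), Rainy (3 yes, 2 no).
I_outlook = (5 / 14) * entropy_from_counts(2, 3) \
          + (4 / 14) * entropy_from_counts(4, 0) \
          + (5 / 14) * entropy_from_counts(3, 2)
print(round(E_s - I_outlook, 3))  # 0.247

# Windy branches: False (6 yes, 2 no), True (3 yes, 3 no).
I_windy = (8 / 14) * entropy_from_counts(6, 2) + (6 / 14) * entropy_from_counts(3, 3)
print(round(E_s - I_windy, 3))  # 0.048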

Similarly, calculate it for the other attributes too. Finally you will get:

Information Gain(Outlook) = 0.247

Information Gain(Temperature) = 0.029

Information Gain(Humidity) = 0.152

Information Gain(Windy) = 0.048

The attribute with the highest information gain is selected. Here we can see that the information gain for Outlook is higher than for the rest, so Outlook is selected as the root node.

We can observe that if the outlook is Overcast, the final output is always yes. But if it is Sunny or Rainy, we again have to calculate the entropy for the remaining attributes and compute the information gain to decide the next node. When all these calculations are done, the final tree will look like this.

Source: Author
If Outlook is Overcast, you can play without checking any other condition. If Outlook is Sunny, check Humidity: if humidity is high, don't play; if humidity is normal, you can play. If Outlook is Rainy, check the Windy condition: if it is strong, don't play; if it is weak, you can play.
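As a closing illustration, the final tree can be written as plain if/else rules; this is only a sketch, and the function name is ours:

def play_decision(outlook, humidity, windy):
    # Overcast -> always play.
    if outlook == "Overcast":
        return "Yes"
    # Sunny -> decided by Humidity.
    if outlook == "Sunny":
        return "Yes" if humidity == "Normal" else "No"
    # Rainy -> decided by Windy.
    if outlook == "Rainy":
        return "No" if windy == "Strong" else "Yes"
    return "Unknown"

print(play_decision("Overcast", "High", "Strong"))  # Yes
print(play_decision("Sunny", "High", "Weak"))       # No
print(play_decision("Rainy", "Normal", "Weak"))     # Yes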
