Decision Trees For Classification - A Machine Learning Algorithm - Xoriant

The document discusses decision trees, which are a type of supervised machine learning model that splits data according to parameters. It explains key concepts like decision nodes, leaves, entropy, and information gain. It also describes the ID3 algorithm for building decision trees and provides an example to illustrate the process.

Uploaded by

maanav8098


Introduction

Decision Trees are a type of supervised machine learning (that is, the training data specifies both the input and the corresponding output) in which the data is repeatedly split according to a certain parameter. A tree can be described by two entities: decision nodes and leaves. The decision nodes are where the data is split, and the leaves are the decisions or final outcomes.

A decision tree can be illustrated with a simple binary tree. Let’s say you want to predict whether a person is fit given information such as their age, eating habits, and physical activity. The decision nodes here are questions like ‘What’s the age?’, ‘Does he exercise?’, and ‘Does he eat a lot of pizzas?’. The leaves are outcomes such as ‘fit’ or ‘unfit’. This is a binary classification problem (a yes/no type of problem). There are two main types of Decision Trees:

1. Classification trees (Yes/No types)

What we’ve seen above is an example of a classification tree, where the outcome was a variable like ‘fit’ or ‘unfit’. Here the decision variable is categorical.

2. Regression trees (Continuous data types)

Here the decision or outcome variable is continuous, e.g. a number like 123.

Working

Now that we know what a Decision Tree is, we’ll see how it works internally. There are many algorithms for constructing Decision Trees, but one of the best known is the ID3 algorithm. ID3 stands for Iterative Dichotomiser 3. Before discussing the ID3 algorithm, we’ll go through a few definitions.

Entropy:

Entropy, also called Shannon entropy and denoted by H(S) for a finite set S, is a measure of the amount of uncertainty or randomness in data.

$$H(S) = -\sum_{x} P(x) \log_2 P(x)$$

where P(x) is the probability of event x. Intuitively, it tells us about the predictability of a certain event. For example, consider a coin toss whose probability of heads is 0.5 and probability of tails is 0.5. Here the entropy is the highest possible, since there’s no way of determining what the outcome might be. Alternatively, consider a coin that has heads on both sides. The outcome of such an event can be predicted perfectly, since we know beforehand that it’ll always be heads. In other words, this event has no randomness, hence its entropy is zero. In general, lower values imply less uncertainty while higher values imply more uncertainty.
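The two coin examples above can be checked numerically. The following is a minimal sketch, assuming entropy is measured in bits (base-2 logarithm); the function name `entropy` and the variable names are illustrative choices, not part of the original article.

```python
import math


def entropy(probabilities):
    """Shannon entropy, in bits, of a discrete distribution given as probabilities."""
    # Outcomes with probability 0 are skipped, following the
    # usual convention that 0 * log2(0) contributes nothing.
    return sum(-p * math.log2(p) for p in probabilities if p > 0)


fair_coin = entropy([0.5, 0.5])        # heads and tails equally likely
two_headed_coin = entropy([1.0, 0.0])  # always heads, nothing to predict

print("fair coin:", fair_coin)
print("two-headed coin:", two_headed_coin)
```

The fair coin gives the largest value a two-outcome event can have, while the two-headed coin gives zero, matching the intuition described above.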

Information Gain:

Information gain, which is closely related to the Kullback-Leibler divergence, is denoted by IG(S, A) for a set S and is the effective change in entropy after deciding on a particular attribute A. It measures the relative change in entropy with respect to the independent variables.

$$IG(S, A) = H(S) - H(S, A)$$

Alternatively,

$$IG(S, A) = H(S) - \sum_{x} P(x) \, H(S_x)$$

where IG(S, A) is the information gain from applying feature A. H(S) is the entropy of the entire set, while the second term calculates the entropy after applying the feature A, where x ranges over the values of A, S_x is the subset of S in which A takes the value x, and P(x) is the probability of event x.
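As a rough illustration of the definition, the sketch below computes information gain from a list of parent labels and the groups produced by a split. The helper names (`entropy_of_labels`, `information_gain`) and the four toy labels are hypothetical and chosen only to show the two extreme cases; they do not come from the article.

```python
import math
from collections import Counter


def entropy_of_labels(labels):
    """Entropy, in bits, of the class distribution in a list of labels."""
    total = len(labels)
    return sum(-(n / total) * math.log2(n / total) for n in Counter(labels).values())


def information_gain(parent_labels, groups):
    """H(parent) minus the size-weighted entropy of the groups after a split."""
    total = len(parent_labels)
    weighted = sum(len(g) / total * entropy_of_labels(g) for g in groups)
    return entropy_of_labels(parent_labels) - weighted


# Hypothetical toy labels, for illustration only.
parent = ["yes", "yes", "no", "no"]
perfect_split = [["yes", "yes"], ["no", "no"]]  # each group is pure
useless_split = [["yes", "no"], ["yes", "no"]]  # each group is as mixed as the parent

gain_perfect = information_gain(parent, perfect_split)
gain_useless = information_gain(parent, useless_split)
print(gain_perfect, gain_useless)
```

A split that separates the classes completely recovers all of the parent's entropy as gain, while a split that leaves every group as mixed as the parent gains nothing.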

Let’s understand this with the help of an example. Consider a dataset collected over the course of 14 days, where the features are Outlook, Temperature, Humidity, and Wind, and the outcome variable is whether Golf was played on that day. Our job is to build a predictive model that takes in the above 4 parameters and predicts whether Golf will be played on the day. We’ll build a decision tree to do that using the ID3 algorithm.
Day   Outlook    Temperature   Humidity   Wind     Play Golf
D1    Sunny      Hot           High       Weak     No
D2    Sunny      Hot           High       Strong   No
D3    Overcast   Hot           High       Weak     Yes
D4    Rain       Mild          High       Weak     Yes
D5    Rain       Cool          Normal     Weak     Yes
D6    Rain       Cool          Normal     Strong   No
D7    Overcast   Cool          Normal     Strong   Yes
D8    Sunny      Mild          High       Weak     No
D9    Sunny      Cool          Normal     Weak     Yes
D10   Rain       Mild          Normal     Weak     Yes
D11   Sunny      Mild          Normal     Strong   Yes
D12   Overcast   Mild          High       Strong   Yes
D13   Overcast   Hot           Normal     Weak     Yes
D14   Rain       Mild          High       Strong   No
The ID3 algorithm performs the following tasks recursively:

1. Create a root node for the tree.
2. If all examples are positive, return the leaf node ‘positive’.
3. Else if all examples are negative, return the leaf node ‘negative’.
4. Calculate the entropy of the current state, H(S).
5. For each attribute x, calculate the entropy with respect to that attribute, denoted by H(S, x).
6. Select the attribute with the maximum value of IG(S, x).
7. Remove the attribute that offers the highest IG from the set of attributes.
8. Repeat until we run out of attributes or the decision tree has all leaf nodes.
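The steps above can be sketched as a short recursive function. This is a minimal illustration rather than a reference implementation: the nested-dict tree representation, the function names, and the majority-class fallback used when attributes run out are all assumptions made for this sketch. The rows are the 14-day golf table.

```python
import math
from collections import Counter

COLUMNS = ["Outlook", "Temperature", "Humidity", "Wind", "Play Golf"]
ROWS = [
    ("Sunny", "Hot", "High", "Weak", "No"),
    ("Sunny", "Hot", "High", "Strong", "No"),
    ("Overcast", "Hot", "High", "Weak", "Yes"),
    ("Rain", "Mild", "High", "Weak", "Yes"),
    ("Rain", "Cool", "Normal", "Weak", "Yes"),
    ("Rain", "Cool", "Normal", "Strong", "No"),
    ("Overcast", "Cool", "Normal", "Strong", "Yes"),
    ("Sunny", "Mild", "High", "Weak", "No"),
    ("Sunny", "Cool", "Normal", "Weak", "Yes"),
    ("Rain", "Mild", "Normal", "Weak", "Yes"),
    ("Sunny", "Mild", "Normal", "Strong", "Yes"),
    ("Overcast", "Mild", "High", "Strong", "Yes"),
    ("Overcast", "Hot", "Normal", "Weak", "Yes"),
    ("Rain", "Mild", "High", "Strong", "No"),
]
DATA = [dict(zip(COLUMNS, row)) for row in ROWS]


def entropy(rows, target):
    """Entropy (bits) of the target column over the given rows."""
    counts = Counter(r[target] for r in rows)
    return sum(-(n / len(rows)) * math.log2(n / len(rows)) for n in counts.values())


def information_gain(rows, attribute, target):
    """Entropy of rows minus the weighted entropy after splitting on attribute."""
    remainder = 0.0
    for value in {r[attribute] for r in rows}:
        subset = [r for r in rows if r[attribute] == value]
        remainder += len(subset) / len(rows) * entropy(subset, target)
    return entropy(rows, target) - remainder


def id3(rows, attributes, target):
    labels = [r[target] for r in rows]
    if len(set(labels)) == 1:
        return labels[0]  # steps 2-3: all examples share one class, so return a leaf
    if not attributes:
        # Assumption of this sketch: fall back to the majority class
        # when no attributes are left to split on.
        return Counter(labels).most_common(1)[0][0]
    # Steps 4-6: pick the attribute with the highest information gain.
    best = max(attributes, key=lambda a: information_gain(rows, a, target))
    remaining = [a for a in attributes if a != best]  # step 7
    branches = {}
    for value in sorted({r[best] for r in rows}):
        subset = [r for r in rows if r[best] == value]
        branches[value] = id3(subset, remaining, target)  # step 8: recurse
    return {best: branches}


tree = id3(DATA, COLUMNS[:-1], "Play Golf")
print(tree)
```

Representing each decision node as a one-key dict keeps the sketch short; a production implementation would more likely use a dedicated node class and handle attribute values unseen during training.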

Now, let’s go ahead and grow the decision tree. The initial step is to calculate H(S), the entropy of the current state. In the above example, there are 5 No’s and 9 Yes’s in total.

Yes   No   Total
9     5    14

$$H(S) = -\frac{9}{14}\log_2\frac{9}{14} - \frac{5}{14}\log_2\frac{5}{14} = 0.940$$

Remember that the entropy is 0 if all members belong to the same class, and 1 when half of them belong to one class and the other half to the other class, which is perfect randomness. Here it’s 0.94, which means the distribution is fairly random. The next step is to choose the attribute that gives the highest possible Information Gain, which we’ll choose as the root node. Let’s start with ‘Wind’.

$$IG(S, Wind) = H(S) - \sum_{x} P(x) \, H(S_x)$$

where x ranges over the possible values of the attribute and S_x is the subset of examples with that value. Here, the attribute ‘Wind’ takes two possible values in the sample data, hence x = {Weak, Strong}. We’ll have to calculate H(S_weak), H(S_strong), P(S_weak), and P(S_strong).

Amongst all 14 examples, we have 8 where the wind is Weak and 6 where the wind is Strong.

Wind = Weak   Wind = Strong   Total
8             6               14

$$P(S_{weak}) = \frac{8}{14}, \qquad P(S_{strong}) = \frac{6}{14}$$

Now, out of the 8 Weak examples, 6 were ‘Yes’ for Play Golf and 2 were ‘No’. So we have

$$H(S_{weak}) = -\frac{6}{8}\log_2\frac{6}{8} - \frac{2}{8}\log_2\frac{2}{8} = 0.811$$

Similarly, out of the 6 Strong examples, 3 had the outcome ‘Yes’ for Play Golf and 3 had ‘No’.

$$H(S_{strong}) = -\frac{3}{6}\log_2\frac{3}{6} - \frac{3}{6}\log_2\frac{3}{6} = 1.000$$

Remember, here half the items belong to one class while the other half belong to the other, so we have perfect randomness. Now we have all the pieces required to calculate the Information Gain:

$$IG(S, Wind) = H(S) - P(S_{weak}) \, H(S_{weak}) - P(S_{strong}) \, H(S_{strong}) = 0.940 - \frac{8}{14}(0.811) - \frac{6}{14}(1.000) = 0.048$$

This tells us that considering ‘Wind’ as the feature gives an information gain of 0.048. Now we must similarly calculate the Information Gain for all the features.
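The arithmetic for ‘Wind’ can be reproduced directly from the counts quoted above (9 Yes / 5 No overall, 6 / 2 for Weak, 3 / 3 for Strong). This is a small sketch; the helper name `entropy_from_counts` is an illustrative choice.

```python
import math


def entropy_from_counts(*counts):
    """Entropy (bits) of a class distribution given as raw class counts."""
    total = sum(counts)
    return sum(-(c / total) * math.log2(c / total) for c in counts if c > 0)


h_s = entropy_from_counts(9, 5)       # whole set: 9 Yes, 5 No
h_weak = entropy_from_counts(6, 2)    # Wind = Weak: 6 Yes, 2 No
h_strong = entropy_from_counts(3, 3)  # Wind = Strong: 3 Yes, 3 No

# Weight each subset's entropy by its share of the 14 examples.
ig_wind = h_s - (8 / 14) * h_weak - (6 / 14) * h_strong

print(f"H(S) = {h_s:.3f}")
print(f"H(S_weak) = {h_weak:.3f}, H(S_strong) = {h_strong:.3f}")
print(f"IG(S, Wind) = {ig_wind:.3f}")
```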

Comparing the results, IG(S, Outlook) is the highest at 0.246, so we choose the Outlook attribute as the root node. At this point, the decision tree has Outlook at the root, with one branch for each of its values: Sunny, Overcast, and Rain.
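To compare all four attributes at once, the gain can be computed for each column of the 14-day table. The sketch below is one way to do it; the names `gain`, `gains`, and `best_attribute` are illustrative. Note that it prints values rounded to three decimals, so the last digit can differ slightly from figures that were truncated instead (for example, 0.246 for Outlook).

```python
import math
from collections import Counter, defaultdict

ATTRIBUTES = ["Outlook", "Temperature", "Humidity", "Wind"]
# Each row: Outlook, Temperature, Humidity, Wind, Play Golf (days D1..D14).
ROWS = [
    ("Sunny", "Hot", "High", "Weak", "No"),
    ("Sunny", "Hot", "High", "Strong", "No"),
    ("Overcast", "Hot", "High", "Weak", "Yes"),
    ("Rain", "Mild", "High", "Weak", "Yes"),
    ("Rain", "Cool", "Normal", "Weak", "Yes"),
    ("Rain", "Cool", "Normal", "Strong", "No"),
    ("Overcast", "Cool", "Normal", "Strong", "Yes"),
    ("Sunny", "Mild", "High", "Weak", "No"),
    ("Sunny", "Cool", "Normal", "Weak", "Yes"),
    ("Rain", "Mild", "Normal", "Weak", "Yes"),
    ("Sunny", "Mild", "Normal", "Strong", "Yes"),
    ("Overcast", "Mild", "High", "Strong", "Yes"),
    ("Overcast", "Hot", "Normal", "Weak", "Yes"),
    ("Rain", "Mild", "High", "Strong", "No"),
]


def entropy(labels):
    """Entropy (bits) of the class distribution in a list of labels."""
    total = len(labels)
    return sum(-(n / total) * math.log2(n / total) for n in Counter(labels).values())


def gain(rows, column):
    """Information gain from splitting rows on the attribute at index `column`."""
    groups = defaultdict(list)
    for row in rows:
        groups[row[column]].append(row[-1])  # collect outcome labels per attribute value
    remainder = sum(len(g) / len(rows) * entropy(g) for g in groups.values())
    return entropy([row[-1] for row in rows]) - remainder


gains = {name: gain(ROWS, i) for i, name in enumerate(ATTRIBUTES)}
best_attribute = max(gains, key=gains.get)

for name, value in gains.items():
    print(f"IG(S, {name}) = {value:.3f}")
print("root:", best_attribute)
```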

Here we observe that whenever the outlook is Overcast, Play Golf is always ‘Yes’. This is no coincidence: the simple tree results because the attribute Outlook gives the highest information gain. How do we proceed from this point? We simply apply recursion, following the algorithm steps described earlier. Now that we’ve used Outlook, three attributes remain: Humidity, Temperature, and Wind. Outlook had three possible values: Sunny, Overcast, and Rain. The Overcast node already ended up as the leaf node ‘Yes’, so we’re left with two subtrees to compute: Sunny and Rain.
The table of examples where the value of Outlook is Sunny looks like:

Temperature   Humidity   Wind     Play Golf
Hot           High       Weak     No
Hot           High       Strong   No
Mild          High       Weak     No
Cool          Normal     Weak     Yes
Mild          Normal     Strong   Yes
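The same gain calculation can be applied to just these five Sunny rows. The following sketch assumes the rows exactly as listed in the table; the names `SUNNY_ROWS`, `gain`, and `sunny_gains` are illustrative.

```python
import math
from collections import Counter, defaultdict

REMAINING = ["Temperature", "Humidity", "Wind"]
# Rows where Outlook = Sunny: Temperature, Humidity, Wind, Play Golf.
SUNNY_ROWS = [
    ("Hot", "High", "Weak", "No"),
    ("Hot", "High", "Strong", "No"),
    ("Mild", "High", "Weak", "No"),
    ("Cool", "Normal", "Weak", "Yes"),
    ("Mild", "Normal", "Strong", "Yes"),
]


def entropy(labels):
    """Entropy (bits) of the class distribution in a list of labels."""
    total = len(labels)
    return sum(-(n / total) * math.log2(n / total) for n in Counter(labels).values())


def gain(rows, column):
    """Information gain from splitting rows on the attribute at index `column`."""
    groups = defaultdict(list)
    for row in rows:
        groups[row[column]].append(row[-1])
    remainder = sum(len(g) / len(rows) * entropy(g) for g in groups.values())
    return entropy([row[-1] for row in rows]) - remainder


sunny_gains = {name: gain(SUNNY_ROWS, i) for i, name in enumerate(REMAINING)}
best_for_sunny = max(sunny_gains, key=sunny_gains.get)

for name, value in sunny_gains.items():
    print(f"IG(S_sunny, {name}) = {value:.3f}")
print("split Sunny on:", best_for_sunny)
```

In this subset, Humidity separates the outcomes completely (every High row is ‘No’ and every Normal row is ‘Yes’), which is why its gain equals the full entropy of the subset.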

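In a similar fashion, we compute the information gain of each remaining attribute on this subset. The highest Information Gain is given by Humidity. Proceeding in the same way with the Rain subset gives Wind as the attribute with the highest information gain. In the final Decision Tree, Outlook is the root, the Sunny branch splits on Humidity, the Overcast branch is the leaf ‘Yes’, and the Rain branch splits on Wind.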
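For illustration, the resulting tree can be written down as a nested dictionary and used to classify a day. The dictionary layout and the `classify` helper are assumptions of this sketch; the leaf labels follow from the 14-day table (for example, every Sunny day with High humidity is ‘No’).

```python
# Decision nodes are one-key dicts {attribute: {value: subtree}}; leaves are strings.
FINAL_TREE = {
    "Outlook": {
        "Sunny": {"Humidity": {"High": "No", "Normal": "Yes"}},
        "Overcast": "Yes",
        "Rain": {"Wind": {"Weak": "Yes", "Strong": "No"}},
    }
}


def classify(tree, example):
    """Walk from the root to a leaf, following the example's attribute values."""
    while isinstance(tree, dict):
        attribute = next(iter(tree))            # the attribute tested at this node
        tree = tree[attribute][example[attribute]]
    return tree


day_1 = {"Outlook": "Sunny", "Temperature": "Hot", "Humidity": "High", "Wind": "Weak"}
day_10 = {"Outlook": "Rain", "Temperature": "Mild", "Humidity": "Normal", "Wind": "Weak"}

print("D1:", classify(FINAL_TREE, day_1))
print("D10:", classify(FINAL_TREE, day_10))
```

Temperature never appears in the tree: once Outlook, Humidity, and Wind have been used, every branch already ends in a pure leaf.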
