Entropy and Information Gain Explained

The document discusses entropy and information gain, which are concepts used in decision tree algorithms like ID3 to build classification models. Entropy measures the homogeneity or purity of a sample, with lower entropy indicating a more homogeneous sample. Information gain is the expected reduction in entropy from splitting the data on an attribute, with the attribute giving the largest information gain used to split the data at each node. The ID3 algorithm uses these concepts to recursively split the data into increasingly homogeneous subsets and build a decision tree in a top-down manner until reaching leaf nodes containing single class labels.


Source: https://www.saedsayad.com/decision_tree.htm

ENTROPY AND INFORMATION GAIN

Decision Tree - Classification


A decision tree builds classification or regression models in the form of a tree structure. It breaks down a dataset into smaller and smaller subsets while, at the same time, an associated decision tree is incrementally developed. The final result is a tree with decision nodes and leaf nodes. A decision node (e.g., Outlook) has two or more branches (e.g., Sunny, Overcast and Rainy). A leaf node (e.g., Play) represents a classification or decision. The topmost decision node in a tree, which corresponds to the best predictor, is called the root node. Decision trees can handle both categorical and numerical data.
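To make the structure concrete, here is a rough sketch in Python of how such a tree could be represented; the class and field names (DecisionNode, attribute, branches, label) are illustrative choices of mine, not part of the original text.

```python
from dataclasses import dataclass, field
from typing import Dict, Optional


@dataclass
class DecisionNode:
    """One node of a decision tree.

    A decision node tests one attribute (e.g. Outlook) and has one child per
    attribute value (e.g. Sunny, Overcast, Rainy); a leaf node carries a class
    label (e.g. Play = Yes / No) and has no children.
    """
    attribute: Optional[str] = None                    # attribute tested at this node
    branches: Dict[str, "DecisionNode"] = field(default_factory=dict)
    label: Optional[str] = None                        # set only on leaf nodes

    def is_leaf(self) -> bool:
        return self.label is not None


# The topmost decision node, corresponding to the best predictor, is the root:
root = DecisionNode(attribute="Outlook")
root.branches["Overcast"] = DecisionNode(label="Yes")  # a leaf node
```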
Algorithm
The core algorithm for building decision trees, called ID3 and developed by J. R. Quinlan, employs a top-down, greedy search through the space of possible branches with no backtracking. ID3 uses Entropy and Information Gain to construct a decision tree.

Entropy
A decision tree is built top-down from a root node and involves partitioning the data into subsets that contain instances with similar values (homogeneous). The ID3 algorithm uses entropy to calculate the homogeneity of a sample. If the sample is completely homogeneous the entropy is zero, and if the sample is equally divided between the classes the entropy is one.
To build a decision tree, we need to calculate two types of entropy using frequency tables, as follows:

a) Entropy using the frequency table of one attribute:

E(S) = \sum_{i=1}^{c} -p_i \log_2 p_i

Here c is the number of classes (in this case 2, i.e. 'yes' and 'no') and p_i is the number of occurrences of class i divided by the total number of instances.

For class 'no' there are 5 instances out of a total of 14, so p_i = 5/14 ≈ 0.36. For class 'yes' there are 9 instances, so p_i = 9/14 ≈ 0.64. Summing -p_i \log_2 p_i over the two classes gives an entropy of 0.94.

The minimum value of entropy is 0, when all instances have the same class. The maximum value of entropy is 1, when the classes are equally distributed among the instances. So this frequency table shows a high level of 'uncertainty'.
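As a quick check of this arithmetic, here is a minimal sketch in Python (the entropy helper is a name of my own, not something from the original page) that computes E(S) from the class counts in a frequency table:

```python
import math


def entropy(counts):
    """E(S) = sum over classes of -p_i * log2(p_i), with p_i = count_i / total.
    Classes with a zero count contribute nothing (0 * log 0 is treated as 0)."""
    total = sum(counts)
    return sum(-(c / total) * math.log2(c / total) for c in counts if c > 0)


# Play golf: 9 'yes' and 5 'no' out of 14 instances.
print(round(entropy([9, 5]), 2))   # 0.94 -> high uncertainty
print(entropy([14, 0]))            # 0.0  -> completely homogeneous sample
print(entropy([7, 7]))             # 1.0  -> classes equally distributed
```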
b) Entropy using the frequency table of two attributes:

E(T, X) = \sum_{c \in X} P(c) E(c)

For the entropy of 'play golf' given 'outlook', you work out the proportion of instances taking each value of 'outlook' out of the total number of instances, multiply it by the entropy of 'play golf' within that subset, and sum over the values.

So for 'sunny' there are 5 of the 14 total instances (each either a 'yes' or a 'no'), and the entropy of the class occurrences within them (3 'yes' and 2 'no'), worked out using the first formula, is 0.971.

Adding together the weighted calculations for all the 'outlook' values gives a total entropy of 0.693.
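The 0.693 figure can be reproduced with the same kind of sketch (again, the helper names are mine); the entropy of each 'outlook' branch is weighted by the fraction of instances that fall into it:

```python
import math


def entropy(counts):
    """E(S) = sum over classes of -p_i * log2(p_i)."""
    total = sum(counts)
    return sum(-(c / total) * math.log2(c / total) for c in counts if c > 0)


# Frequency table of 'play golf' (yes, no) against 'outlook'.
outlook = {"Sunny": (3, 2), "Overcast": (4, 0), "Rainy": (2, 3)}
total = sum(sum(branch) for branch in outlook.values())   # 14 instances

# E(T, X) = sum over outlook values of P(value) * E(value)
e_split = sum(sum(branch) / total * entropy(branch) for branch in outlook.values())
print(round(entropy((3, 2)), 3))   # 0.971  -> entropy of the 'Sunny' branch
print(round(e_split, 4))           # 0.6935 -> the ~0.693 total entropy for the split
```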
Information Gain
The information gain is based on the decrease in entropy after a dataset is split on an attribute. Constructing a decision tree is all about finding the attribute that returns the highest information gain (i.e., the most homogeneous branches).

Step 1: Calculate the entropy of the target. For the golf data above this is the entropy of the 9 'yes' / 5 'no' split, i.e. 0.94.

Step 2: The dataset is then split on the different attributes. The entropy for each branch is calculated and added proportionally to get the total entropy for the split. The resulting entropy is subtracted from the entropy before the split. The result is the Information Gain, or decrease in entropy.

Gain(T, X) = E(T) - E(T, X)

So the Information Gain is calculated as the entropy of the unsplit (parent) data set T minus the weighted average of the entropies of the split (child) sets. In other words, the 'entropy method' of selecting attributes to split on is to choose the attribute that gives the greatest reduction in average entropy, i.e. the one that maximises the value of Information Gain.
Information Gain is the expected reduction in entropy caused by partitioning the examples according to a particular attribute. When the number of instances of each class is equal (e.g. the same number of circles as crosses), entropy reaches its maximum because we are very uncertain about the outcome.
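Putting the two calculations together, here is a short sketch (with the same hypothetical helpers as above) of the gain obtained by splitting the golf data on 'outlook':

```python
import math


def entropy(counts):
    total = sum(counts)
    return sum(-(c / total) * math.log2(c / total) for c in counts if c > 0)


# Parent set: 9 'yes' and 5 'no'; child subsets after splitting on 'outlook'.
parent = (9, 5)
branches = {"Sunny": (3, 2), "Overcast": (4, 0), "Rainy": (2, 3)}
total = sum(parent)

# Gain(T, X) = E(T) - E(T, X): parent entropy minus the weighted
# average of the child entropies.
e_split = sum(sum(c) / total * entropy(c) for c in branches.values())
gain = entropy(parent) - e_split
print(round(gain, 3))   # 0.247 -> information gained by splitting on 'outlook'
```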

Step 3: Choose the attribute with the largest information gain as the decision node, divide the dataset by its branches and repeat the same process on every branch.

Step 4a: A branch with entropy of 0 is a leaf node.


Step 4b: A branch with entropy more than 0 needs further splitting.

Step 5: The ID3 algorithm is run recursively on the non-leaf branches, until all data is classified.
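Steps 3 to 5 together describe a recursion, which could be sketched along the following lines. This is a simplified illustration of the ID3 idea with helper names of my own (info_gain, id3), not Quinlan's original code, and it ignores numeric attributes and missing values:

```python
import math
from collections import Counter


def entropy(labels):
    """E(S) over a list of class labels."""
    total = len(labels)
    return sum(-(n / total) * math.log2(n / total) for n in Counter(labels).values())


def info_gain(rows, labels, attr):
    """Gain(T, X): entropy before the split minus the weighted entropy after it."""
    total = len(labels)
    split_entropy = 0.0
    for value in {row[attr] for row in rows}:
        subset = [lab for row, lab in zip(rows, labels) if row[attr] == value]
        split_entropy += len(subset) / total * entropy(subset)
    return entropy(labels) - split_entropy


def id3(rows, labels, attributes):
    """Build a tree as nested dicts: {attribute: {value: subtree or class label}}."""
    # Step 4a: a branch with entropy 0 (a single class) becomes a leaf node.
    if len(set(labels)) == 1:
        return labels[0]
    # No attributes left to split on: fall back to the majority class label.
    if not attributes:
        return Counter(labels).most_common(1)[0][0]
    # Step 3: choose the attribute with the largest information gain.
    best = max(attributes, key=lambda a: info_gain(rows, labels, a))
    node = {best: {}}
    # Steps 4b and 5: branches with entropy > 0 are split again, recursively.
    for value in {row[best] for row in rows}:
        sub = [(row, lab) for row, lab in zip(rows, labels) if row[best] == value]
        sub_rows, sub_labels = (list(x) for x in zip(*sub))
        node[best][value] = id3(sub_rows, sub_labels,
                                [a for a in attributes if a != best])
    return node
```

On the 14-instance golf data, the attribute chosen at the root would be Outlook, since it gives the largest information gain (0.247).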
Decision Tree to Decision Rules
A decision tree can easily be transformed into a set of rules by mapping each path from the root node to a leaf node, one rule per path.
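As a sketch of that mapping (using the nested-dict tree representation from the previous snippet; the tree below and the rule wording are illustrative assumptions, not taken from the original page), each path from the root to a leaf becomes one IF ... THEN rule:

```python
def tree_to_rules(node, conditions=()):
    """Turn a nested-dict decision tree into IF ... THEN rules,
    one rule per path from the root node to a leaf node."""
    if not isinstance(node, dict):                      # leaf: emit the finished rule
        yield "IF " + " AND ".join(conditions) + f" THEN Play = {node}"
        return
    (attribute, branches), = node.items()
    for value, child in branches.items():
        yield from tree_to_rules(child, conditions + (f"{attribute} = {value}",))


# A tree of the kind ID3 builds for the golf data (structure assumed here).
tree = {"Outlook": {
    "Overcast": "Yes",
    "Sunny": {"Humidity": {"High": "No", "Normal": "Yes"}},
    "Rainy": {"Windy": {"True": "No", "False": "Yes"}},
}}

for rule in tree_to_rules(tree):
    print(rule)
# IF Outlook = Overcast THEN Play = Yes
# IF Outlook = Sunny AND Humidity = High THEN Play = No
# ... and so on, one rule per leaf.
```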
