
Decision Tree Induction: Using Entropy for Attribute Selection
Principles of Data Mining, Chap. 5
Dra. María Hallo
Attribute selection
• Depending on the order in which attributes are selected for the tree, we obtain different trees.
• No attribute may be selected twice in the same branch.
• Figure 5.1 shows the results of running the TDIDT algorithm with the attribute selection strategies takefirst, takelast and random in turn to generate decision trees for the seven datasets contact lenses, lens24, chess, vote, monk1, monk2 and monk3.
Number of Branches Generated by TDIDT with Three Attribute Selection Methods
Example of a Dataset
Decision Tree 1 - TDIDT algorithm
Decision Tree 2 - TDIDT algorithm
Choosing Attributes to Split On: Using Entropy
• One commonly used method is to select the attribute that minimises the value of entropy, thus maximising the information gain.
• Entropy is an information-theoretic measure of the ‘uncertainty’ contained in a training set, due to the presence of more than one possible classification.
Choosing Attributes to Split On: Using Entropy
• If there are K classes, we can denote the proportion of instances with classification i by pi, for i = 1 to K. The value of pi is the number of occurrences of class i divided by the total number of instances, which is a number between 0 and 1 inclusive.
• The entropy of the training set is denoted by E. It is measured in ‘bits’ of information and is defined by the formula
E = -Σ pi log2 pi, summed over the classes for which pi is non-zero.
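A minimal Python sketch of this definition (the function name entropy and the use of collections.Counter are illustrative, not taken from the book):

from collections import Counter
from math import log2

def entropy(labels):
    # Entropy in bits of a collection of class labels.
    total = len(labels)
    return -sum((n / total) * log2(n / total)
                for n in Counter(labels).values())

For example, entropy(['a', 'a', 'b', 'b']) returns 1.0 bit, the uncertainty of two equally likely classes.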
Choosing Attributes to Split On: Using Entropy (number of generated branches)
There is no guarantee that using entropy will always lead to a small decision tree, but experience shows that it generally produces trees with fewer branches than other attribute selection criteria.
Estart
For the initial lens24 training set of 24 instances, there are 3 classes. There are 4 instances with classification 1, 5 instances with classification 2 and 15 instances with classification 3, so p1 = 4/24, p2 = 5/24 and p3 = 15/24.
The entropy Estart is given by
Estart = -(4/24) log2 (4/24) - (5/24) log2 (5/24) - (15/24) log2 (15/24)
= 0.4308 + 0.4715 + 0.4238
= 1.3261 bits
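As a quick check of this arithmetic, the same value can be computed directly from the class counts (a sketch; the variable names are illustrative):

from math import log2

counts = [4, 5, 15]                 # classes 1, 2 and 3 in lens24
total = sum(counts)                 # 24 instances
e_start = -sum((n / total) * log2(n / total) for n in counts)
print(round(e_start, 4))            # 1.3261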
Using Entropy for Attribute Selection
• Training set 1 (age = 1)
Entropy E1 = -(2/8) log2 (2/8) - (2/8) log2 (2/8) - (4/8) log2 (4/8)
= 0.5 + 0.5 + 0.5 = 1.5
• Training set 2 (age = 2)
Entropy E2 = -(1/8) log2 (1/8) - (2/8) log2 (2/8) - (5/8) log2 (5/8)
= 0.375 + 0.5 + 0.4238 = 1.2988
• Training set 3 (age = 3)
Entropy E3 = -(1/8) log2 (1/8) - (1/8) log2 (1/8) - (6/8) log2 (6/8)
= 0.375 + 0.375 + 0.3113 = 1.0613
Average entropy
The values E1, E2 and E3 need to be weighted by the proportion of the original instances in each of the three subsets. In this case all the weights are the same, i.e. 8/24.
If the average entropy of the three training sets produced by splitting on attribute age is denoted by Enew, then
Enew = (8/24)E1 + (8/24)E2 + (8/24)E3 = 1.2867
Information Gain = Estart - Enew
The information gain from splitting on attribute age is 1.3261 - 1.2867 = 0.0394 bits.
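The same weighted-average calculation can be sketched in Python using the class counts of the three age subsets given above (helper and variable names are illustrative):

from math import log2

def entropy_from_counts(counts):
    total = sum(counts)
    return -sum((n / total) * log2(n / total) for n in counts if n > 0)

subsets = [[2, 2, 4], [1, 2, 5], [1, 1, 6]]   # class counts for age = 1, 2, 3
total = sum(sum(s) for s in subsets)          # 24 instances in all
e_new = sum(sum(s) / total * entropy_from_counts(s) for s in subsets)
e_start = entropy_from_counts([4, 5, 15])
print(round(e_new, 4))              # 1.2867
print(round(e_start - e_new, 4))    # 0.0394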
Minimising the value of Enew
The ‘entropy method’ of attribute selection is to choose to split on the attribute that gives the greatest reduction in (average) entropy, i.e. the one that maximises the value of Information Gain. This is equivalent to minimising the value of Enew, as Estart is fixed.
Information Gain
Maximising Information Gain
• attribute age: Enew = 1.2867
Information Gain = 1.3261 - 1.2867 = 0.0394 bits
• attribute specRx: Enew = 1.2866
Information Gain = 1.3261 - 1.2866 = 0.0395 bits
• attribute astig: Enew = 0.9491
Information Gain = 1.3261 - 0.9491 = 0.3770 bits
• attribute tears: Enew = 0.7773
Information Gain = 1.3261 - 0.7773 = 0.5488 bits
Thus the largest value of Information Gain (and the smallest value of the new entropy Enew) is obtained by splitting on attribute tears.
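The selection step itself then amounts to comparing these gains, as in this short sketch (the Enew values are the ones listed above; the dictionary layout is illustrative):

e_start = 1.3261
e_new = {'age': 1.2867, 'specRx': 1.2866, 'astig': 0.9491, 'tears': 0.7773}
gains = {attr: round(e_start - e, 4) for attr, e in e_new.items()}
best = max(gains, key=gains.get)
print(best, gains[best])            # tears 0.5488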
Process of splitting on nodes
The process of splitting on nodes is repeated for each branch of the evolving decision tree, terminating when the subset at every leaf node has entropy zero.
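A compact sketch of how this recursion might look in Python, assuming each instance is stored as an (attribute_dict, class_label) pair; all names are illustrative and this is not the book's pseudocode:

from collections import Counter
from math import log2

def entropy_of(instances):
    counts = Counter(cls for _, cls in instances)
    total = len(instances)
    return -sum((n / total) * log2(n / total) for n in counts.values())

def split(instances, attr):
    # Partition the instances by the value they take for attr.
    parts = {}
    for row, cls in instances:
        parts.setdefault(row[attr], []).append((row, cls))
    return parts

def build_tree(instances, attributes):
    # Stop when the subset is pure (entropy zero) or no attributes remain.
    if entropy_of(instances) == 0 or not attributes:
        return Counter(cls for _, cls in instances).most_common(1)[0][0]
    # Choose the attribute whose split minimises the weighted new entropy,
    # i.e. maximises information gain.
    def new_entropy(attr):
        return sum(len(sub) / len(instances) * entropy_of(sub)
                   for sub in split(instances, attr).values())
    best = min(attributes, key=new_entropy)
    remaining = [a for a in attributes if a != best]   # never reuse an attribute in a branch
    return {(best, value): build_tree(subset, remaining)
            for value, subset in split(instances, best).items()}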
