Decision Tree – Using Entropy
Example
Decision tree representation (PlayTennis)
Decision trees expressivity
Decision trees represent a disjunction of conjunctions of
constraints on the values of the attributes:
(Outlook = Sunny ∧ Humidity = Normal)
∨ (Outlook = Overcast)
∨ (Outlook = Rain ∧ Wind = Weak)
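As a small illustration (not part of the original slides), this disjunction of conjunctions can be evaluated directly as a Boolean rule. The sketch below assumes each example is a Python dict keyed by the attribute names used on the slide:

```python
def play_tennis(example):
    """Evaluate the disjunction of conjunctions read off the PlayTennis tree."""
    return (
        (example["Outlook"] == "Sunny" and example["Humidity"] == "Normal")
        or (example["Outlook"] == "Overcast")
        or (example["Outlook"] == "Rain" and example["Wind"] == "Weak")
    )

# Hypothetical examples, for illustration only
print(play_tennis({"Outlook": "Sunny", "Humidity": "High", "Wind": "Weak"}))       # False
print(play_tennis({"Outlook": "Overcast", "Humidity": "High", "Wind": "Strong"}))  # True
```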
Top-down induction of Decision Trees
ID3 (Quinlan, 1986) is a basic algorithm used to build decision trees.
Given a training set of examples, the algorithm performs a search
in the space of decision trees.
The construction of the tree is top-down, and the algorithm is greedy.
The fundamental question is “which attribute should be tested next?
Which attribute gives us the most information?”
Select the best attribute.
A descendant node is then created for each possible value of this
attribute, and the data set is partitioned accordingly.
The process is repeated for each successor node until all the
examples are classified correctly or there are no attributes left
(a minimal sketch of the procedure is given below).
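The sketch below is a minimal, illustrative rendering of this greedy top-down procedure, not Quinlan's original code. It assumes examples are Python dicts keyed by attribute name and relies on an `information_gain` helper like the one sketched after the information-gain slide further below:

```python
from collections import Counter

def id3(examples, target, attributes):
    """Greedy top-down induction of a decision tree (ID3-style sketch).

    examples   : non-empty list of dicts mapping attribute name -> value
    target     : name of the class attribute
    attributes : attribute names still available for testing
    Returns a class label (leaf) or {"attribute": A, "branches": {value: subtree}}.
    """
    labels = [e[target] for e in examples]

    # Stop if all examples share one class, or no attributes are left to test.
    if len(set(labels)) == 1:
        return labels[0]
    if not attributes:
        return Counter(labels).most_common(1)[0][0]  # majority class

    # Greedy choice: the attribute with the highest information gain.
    best = max(attributes, key=lambda a: information_gain(examples, target, a))

    tree = {"attribute": best, "branches": {}}
    for value in set(e[best] for e in examples):
        subset = [e for e in examples if e[best] == value]
        remaining = [a for a in attributes if a != best]
        tree["branches"][value] = id3(subset, target, remaining)
    return tree
```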
Which attribute is the best classifier?
Entropy in binary classification
Entropy measures the impurity of a collection of examples. It
depends on the distribution of the random variable p.
S is a collection of training examples
p+ is the proportion of positive examples in S
p– is the proportion of negative examples in S
Entropy(S) = – p+ log2 p+ – p– log2 p–   [with 0 log2 0 = 0]
Entropy([14+, 0–]) = – 14/14 log2 (14/14) – 0 log2 0 = 0
Entropy([9+, 5–]) = – 9/14 log2 (9/14) – 5/14 log2 (5/14) = 0.94
Entropy([7+, 7–]) = – 7/14 log2 (7/14) – 7/14 log2 (7/14) = 1/2 + 1/2 = 1   [log2 1/2 = – 1]
Note: the log of a number < 1 is negative; 0 ≤ p+ ≤ 1 and 0 ≤ Entropy(S) ≤ 1
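As a quick, informal check of these numbers (not part of the original slides):

```python
import math

def binary_entropy(pos, neg):
    """Entropy of a collection with `pos` positive and `neg` negative examples."""
    total = pos + neg
    entropy = 0.0
    for count in (pos, neg):
        p = count / total
        if p > 0:                     # convention: 0 * log2(0) = 0
            entropy -= p * math.log2(p)
    return entropy

print(binary_entropy(14, 0))  # 0.0
print(binary_entropy(9, 5))   # ~0.940
print(binary_entropy(7, 7))   # 1.0
```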
Entropy in general
Entropy measures the amount of information in a random
variable.
H(X) = – p+ log2 p+ – p– log2 p–,   X = {+, –}
for binary classification [a two-valued random variable]
H(X) = – Σi=1..c pi log2 pi = Σi=1..c pi log2 (1/pi),   X = {1, …, c}
for classification in c classes
Example: rolling a die with 8 equally probable sides
H(X) = – Σi=1..8 1/8 log2 (1/8) = – log2 (1/8) = log2 8 = 3
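A minimal sketch of the general formula, again only for illustration:

```python
import math

def entropy(probabilities):
    """H(X) = - sum_i p_i * log2(p_i) for a discrete distribution."""
    return -sum(p * math.log2(p) for p in probabilities if p > 0)

print(entropy([1/8] * 8))     # 3.0    (fair 8-sided die)
print(entropy([9/14, 5/14]))  # ~0.940 (matches the binary example above)
```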
Information gain as entropy reduction
Information gain is the expected reduction in entropy caused by
partitioning the examples on an attribute.
The higher the information gain, the more effective the attribute is
in classifying the training data.
Expected reduction in entropy knowing A:
Gain(S, A) = Entropy(S) – Σv∈Values(A) (|Sv| / |S|) Entropy(Sv)
where Values(A) is the set of values of attribute A and Sv is the subset of S for which A has value v.
[Figure: partition of the training examples with their class labels: {D1, D2, D8} → No, {D9, D11} → Yes, {D4, D5, D10} → Yes, {D6, D14} → No]
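The following helper (illustrative only; the names and the list-of-dicts data format are assumptions) computes Gain(S, A) as defined above and serves as the `information_gain` used in the ID3 sketch earlier:

```python
import math
from collections import Counter

def entropy_of_labels(labels):
    """Entropy of a non-empty list of class labels."""
    total = len(labels)
    return -sum((c / total) * math.log2(c / total) for c in Counter(labels).values())

def information_gain(examples, target, attribute):
    """Gain(S, A) = Entropy(S) - sum_v |S_v|/|S| * Entropy(S_v)."""
    labels = [e[target] for e in examples]
    gain = entropy_of_labels(labels)
    for value in set(e[attribute] for e in examples):
        subset = [e[target] for e in examples if e[attribute] == value]
        gain -= (len(subset) / len(examples)) * entropy_of_labels(subset)
    return gain
```

With this helper in place, the ID3 sketch above can be run directly on a list of example dicts.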
Thanks