Decision Tree
Entropy
• Entropy is a measure of the uncertainty of a random variable; it
characterizes the impurity of an arbitrary collection of examples.
The higher the entropy, the higher the information content.
• Information gain
It is also known as the Kullback-Leibler divergence, denoted IG(S, A), and is the effective
change in entropy after deciding on a particular attribute A:
IG(S, A) = H(S) − H(S, A) = H(S) − Σ P(x) × H(Sₓ)
where the sum runs over the possible values x of attribute A, Sₓ is the subset of examples
with A = x, and P(x) is the fraction of examples with A = x.
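As a minimal sketch (not from the slides), the two definitions above can be written as plain Python helpers; the function names, the dict-per-row data format, and the target argument are assumptions made for illustration:

```python
import math
from collections import Counter

def entropy(labels):
    """H(S) = -sum over classes x of P(x) * log2 P(x)."""
    total = len(labels)
    return -sum((c / total) * math.log2(c / total) for c in Counter(labels).values())

def information_gain(rows, attribute, target):
    """IG(S, A) = H(S) - sum over values x of A of P(x) * H(S_x)."""
    labels = [row[target] for row in rows]
    gain = entropy(labels)
    for value in set(row[attribute] for row in rows):
        subset = [row[target] for row in rows if row[attribute] == value]
        gain -= (len(subset) / len(rows)) * entropy(subset)
    return gain
```

With the dataset below stored as a list of such dicts, information_gain(rows, "Outlook", "Play Golf") would measure how much splitting on Outlook reduces uncertainty about the class.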
Dataset
Day Outlook Temperature Humidity Wind Play Golf
D1 Sunny Hot High Weak No
D2 Sunny Hot High Strong No
D3 Overcast Hot High Weak Yes
D4 Rain Mild High Weak Yes
D5 Rain Cool Normal Weak Yes
D6 Rain Cool Normal Strong No
D7 Overcast Cool Normal Strong Yes
D8 Sunny Mild High Weak No
D9 Sunny Cool Normal Weak Yes
D10 Rain Mild Normal Weak Yes
D11 Sunny Mild Normal Strong Yes
D12 Overcast Mild Normal Strong Yes
D13 Overcast Hot Normal Weak Yes
D14 Rain Mild High Strong No
1. Play(Yes) = 9, Play(No) = 5, Total = 14
Entropy(S) = H(S) = −Σ P(x) log₂ P(x)
           = −(9/14) log₂(9/14) − (5/14) log₂(5/14) = 0.94
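This hand calculation can be double-checked with a couple of lines of Python (standard library only):

```python
# Quick check of the hand calculation above.
import math

p_yes, p_no = 9 / 14, 5 / 14
h_s = -p_yes * math.log2(p_yes) - p_no * math.log2(p_no)
print(round(h_s, 2))  # 0.94
```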
[Tree after the first split: Outlook gives the highest information gain and becomes the root; the Overcast branch is pure and becomes a Yes leaf, while the Sunny and Rain branches still need further splitting.]
• To further split the Sunny node, consider the Sunny examples only:
Temperature Humidity Wind Play
Hot High Weak No
Hot High Strong No
Mild High Weak No
Cool Normal Weak Yes
Mild Normal Strong Yes
On this subset, Humidity gives the highest information gain, so the Sunny node is split on Humidity. In the same way, S_rain gives Wind as the attribute with the highest information gain (a verification sketch follows below). So the decision tree becomes:
[Final tree: Outlook at the root; Overcast → Yes; Sunny → Humidity (High → No, Normal → Yes); Rain → Wind (Strong → No, Weak → Yes).]
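As a check on the two sub-splits claimed above, here is a small sketch (the helper names and the dict-based row format are my own, not from the slides) that recomputes the information gain of each remaining attribute on the Sunny and Rain subsets:

```python
import math
from collections import Counter

def entropy(labels):
    total = len(labels)
    return -sum((c / total) * math.log2(c / total) for c in Counter(labels).values())

def information_gain(rows, attribute, target="Play"):
    labels = [r[target] for r in rows]
    gain = entropy(labels)
    for value in set(r[attribute] for r in rows):
        subset = [r[target] for r in rows if r[attribute] == value]
        gain -= (len(subset) / len(rows)) * entropy(subset)
    return gain

sunny = [  # D1, D2, D8, D9, D11
    {"Temperature": "Hot",  "Humidity": "High",   "Wind": "Weak",   "Play": "No"},
    {"Temperature": "Hot",  "Humidity": "High",   "Wind": "Strong", "Play": "No"},
    {"Temperature": "Mild", "Humidity": "High",   "Wind": "Weak",   "Play": "No"},
    {"Temperature": "Cool", "Humidity": "Normal", "Wind": "Weak",   "Play": "Yes"},
    {"Temperature": "Mild", "Humidity": "Normal", "Wind": "Strong", "Play": "Yes"},
]
rain = [  # D4, D5, D6, D10, D14
    {"Temperature": "Mild", "Humidity": "High",   "Wind": "Weak",   "Play": "Yes"},
    {"Temperature": "Cool", "Humidity": "Normal", "Wind": "Weak",   "Play": "Yes"},
    {"Temperature": "Cool", "Humidity": "Normal", "Wind": "Strong", "Play": "No"},
    {"Temperature": "Mild", "Humidity": "Normal", "Wind": "Weak",   "Play": "Yes"},
    {"Temperature": "Mild", "Humidity": "High",   "Wind": "Strong", "Play": "No"},
]
for name, subset in (("Sunny", sunny), ("Rain", rain)):
    gains = {a: round(information_gain(subset, a), 3)
             for a in ("Temperature", "Humidity", "Wind")}
    print(name, gains)
# Sunny: Humidity has the largest gain (0.971); Rain: Wind has the largest (0.971).
```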
ID3 Algorithm
1. Create a root node for the tree.
2. If all the examples are positive, return a leaf node labelled positive.
3. Else if all the examples are negative, return a leaf node labelled negative.
4. Calculate the entropy of the current state, H(S).
5. For each attribute x, compute the entropy with respect to x, H(S, x).
6. Select the attribute with the maximum IG(S, x) and split on it.
7. Remove the attribute that offered the highest IG from the set of candidate attributes.
8. Repeat until there are no attributes left or the decision tree consists entirely of leaf nodes.
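A compact sketch of this loop in Python (recursive form; the function names, the dict-based row format, and the "Play" target key are assumptions, not from the slides):

```python
import math
from collections import Counter

def entropy(labels):
    total = len(labels)
    return -sum((c / total) * math.log2(c / total) for c in Counter(labels).values())

def information_gain(rows, attribute, target="Play"):
    labels = [r[target] for r in rows]
    gain = entropy(labels)
    for value in set(r[attribute] for r in rows):
        subset = [r[target] for r in rows if r[attribute] == value]
        gain -= (len(subset) / len(rows)) * entropy(subset)
    return gain

def id3(rows, attributes, target="Play"):
    labels = [r[target] for r in rows]
    # Steps 2-3: if the node is pure, return a leaf with that class.
    if len(set(labels)) == 1:
        return labels[0]
    # Step 8 stopping case: no attributes left, return the majority class.
    if not attributes:
        return Counter(labels).most_common(1)[0][0]
    # Steps 4-6: pick the attribute with maximum information gain.
    best = max(attributes, key=lambda a: information_gain(rows, a, target))
    tree = {best: {}}
    # Step 7: remove the chosen attribute before recursing on each branch.
    remaining = [a for a in attributes if a != best]
    for value in set(r[best] for r in rows):
        branch_rows = [r for r in rows if r[best] == value]
        tree[best][value] = id3(branch_rows, remaining, target)
    return tree
```

Calling id3 on the 14-row golf dataset with attributes ["Outlook", "Temperature", "Humidity", "Wind"] reproduces the tree shown earlier: Outlook at the root, Humidity under Sunny, and Wind under Rain.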