
Decision Tree

A decision tree is a supervised learning algorithm used for both classification and regression. The main goal of a decision tree is to create a model that predicts the value of a target variable by learning simple decision rules inferred from the data features.

[Figure: an example decision tree for predicting a car's mileage. The root node tests Weight, with branches "Heavy" and "Not Heavy"; one branch leads directly to the leaf "High Mileage", while the other leads to a Horsepower test that predicts "High Mileage" for <= 86 and "Low Mileage" for > 86.]
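For orientation (not part of the original note), here is a minimal sketch of how such a tree can be fitted in practice with scikit-learn. The tiny weight/horsepower dataset is invented purely to mirror the figure above, so the splits the library learns are only illustrative and may not match the figure exactly.

```python
# Minimal sketch: fitting a decision tree with scikit-learn (assumed toy data).
from sklearn.tree import DecisionTreeClassifier, export_text

# Hypothetical examples: [weight_is_heavy (0/1), horsepower]
X = [
    [1, 90], [1, 95], [1, 80], [1, 84],
    [0, 92], [0, 96], [0, 81], [0, 85],
]
# Hypothetical mileage labels for the rows above.
y = ["low", "low", "high", "high", "high", "high", "high", "high"]

# criterion="entropy" makes the splits be chosen by information gain,
# which is the criterion described in this note.
clf = DecisionTreeClassifier(criterion="entropy", max_depth=2, random_state=0)
clf.fit(X, y)

# Print the learned decision rules as text.
print(export_text(clf, feature_names=["weight_is_heavy", "horsepower"]))
```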

Q. How do we build a decision tree?

Ans: We can build the decision tree using the sequence of formulas given below:
1. Information Gain: It is also known as the "Kullback-Leibler divergence" and is denoted by IG(S,A). It is the effective change in entropy of a set "S" after splitting on a particular attribute "A"; in other words, it measures how much the split reduces our uncertainty about the target variable.
[ IG(S,A) = H(S) - H(S,A) = H(S) - ∑ P(x) * H(x) ]
Here, H(S) -> entropy of S, and H(S,A) -> the weighted average entropy of the subsets obtained by splitting S on attribute A (the sum runs over the values x of A).
For a two-class ("Yes"/"No") problem with p positive and n negative examples, the entropy term can be written directly in terms of the class counts:
[ H(S) = - (p/(p+n)) * log2(p/(p+n)) - (n/(p+n)) * log2(n/(p+n)) ]
This is the quantity computed as IG(S,A) in Step-1 of the example below, where S and A stand for the counts of "Yes" and "No" examples.

2. Entropy: It is also known as "Shannon entropy" and is denoted by H(S) for a finite set "S". It is a measure of the amount of uncertainty or randomness in the data.
[ H(S) = ∑ P(i) * log2(1 / P(i)) = - ∑ P(i) * log2 P(i) ]
Here, n -> number of classes (the sum runs over i = 1, ..., n)
P(i) -> probability of class "i"
For a split on an attribute, the weighted entropy of the resulting subsets can be written as:
[ H(S, attribute) = ∑ ((S_i + A_i) / (S + A)) * IG(S_i, A_i) ]
where S_i and A_i are the numbers of "Yes" and "No" examples that take the i-th value of the attribute; this is the quantity computed as H(Size), H(Shape) and H(Colour) in Step-2 below.
We can say that entropy tells us how predictable a certain event is.
3. Final Gain: For each attribute we compute the final gain, i.e. the drop in entropy obtained by splitting on that attribute, and we choose the attribute with the highest final gain as the root node of our decision tree.
[ Final Gain(attribute) = IG(S,A) - H(S, attribute) ]
In standard terminology this difference is simply the information gain of the attribute. (A small Python sketch of these three quantities follows right after this list.)
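To make the three formulas concrete, here is a small Python sketch that implements them exactly as they are used in this note. The function names (ig_counts, weighted_entropy, final_gain) are my own and not part of the original.

```python
# Sketch of the three quantities used in this note (helper names are assumed).
from math import log2

def ig_counts(yes: int, no: int) -> float:
    """Entropy of a set with `yes` positive and `no` negative examples.
    This is the quantity written as IG(S, A) in Step-1 below."""
    total = yes + no
    result = 0.0
    for count in (yes, no):
        if count:                      # treat 0 * log2(0) as 0
            p = count / total
            result -= p * log2(p)
    return result

def weighted_entropy(groups) -> float:
    """H(S, attribute): weighted average entropy of the subsets produced by
    splitting on one attribute. `groups` holds one (yes, no) pair per value."""
    total = sum(yes + no for yes, no in groups)
    return sum((yes + no) / total * ig_counts(yes, no) for yes, no in groups)

def final_gain(yes: int, no: int, groups) -> float:
    """Final Gain = entropy of the whole set minus the weighted entropy
    after the split (i.e. the information gain of the attribute)."""
    return ig_counts(yes, no) - weighted_entropy(groups)

# The Size attribute from the example below: S -> (1, 1), M -> (1, 0), L -> (2, 2)
print(round(final_gain(4, 3, [(1, 1), (1, 0), (2, 2)]), 3))   # 0.128
```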

Example:
Consider the dataset given below:

Size Shape Colour Choice


M Brick Blue Yes
S Wedge Red No
L Wedge Red No
S Sphere Red Yes
L Pillar Green Yes
L Pillar Red No
L Sphere Green Yes

Here we have 2 classes:

1. "Yes", with S = 4 examples
2. "No", with A = 3 examples
Step-1: Information Gain
IG(S,A) = - (4/(4+3)) * log2(4/(4+3)) - (3/(4+3)) * log2(3/(4+3)) = 0.985
(This is the entropy of the whole dataset, computed from its 4 "Yes" and 3 "No" examples.)
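As a quick numerical check of this value (my own verification, not part of the original note):

```python
from math import log2

# Entropy of the full dataset: 4 "Yes" and 3 "No" examples.
print(-(4/7) * log2(4/7) - (3/7) * log2(3/7))   # ~0.985
```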

Step-2: Entropy

Size    S_i    A_i    IG(S_i, A_i)
S       1      1      1
M       1      0      0
L       2      2      1

H(Size) = (2/7)*1 + (1/7)*0 + (4/7)*1 = 0.857

Shape    S_i    A_i    IG(S_i, A_i)
Brick    1      0      0
Wedge    0      2      0
Sphere   2      0      0
Pillar   1      1      1

H(Shape) = (1/7)*0 + (2/7)*0 + (2/7)*0 + (2/7)*1 = 0.286
Colour   S_i    A_i    IG(S_i, A_i)
Blue     1      0      0
Red      1      3      0.811
Green    2      0      0

H(Colour) = (1/7)*0 + (4/7)*0.811 + (2/7)*0 = 0.464

Step-3: Final Gain

FG(Size) = IG(S,A) - H(Size) = 0.985 - 0.857 = 0.128
FG(Shape) = IG(S,A) - H(Shape) = 0.985 - 0.286 = 0.699
FG(Colour) = IG(S,A) - H(Colour) = 0.985 - 0.464 = 0.521
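The whole calculation can be reproduced from the raw table with a few lines of Python. This is my own verification sketch, not part of the original note; only the attribute names and rows of the example dataset are taken from the text.

```python
# Verification sketch for the worked example above (not in the original note).
from collections import Counter
from math import log2

rows = [  # (Size, Shape, Colour, Choice)
    ("M", "Brick",  "Blue",  "Yes"),
    ("S", "Wedge",  "Red",   "No"),
    ("L", "Wedge",  "Red",   "No"),
    ("S", "Sphere", "Red",   "Yes"),
    ("L", "Pillar", "Green", "Yes"),
    ("L", "Pillar", "Red",   "No"),
    ("L", "Sphere", "Green", "Yes"),
]

def entropy(labels):
    counts = Counter(labels)
    return -sum(c / len(labels) * log2(c / len(labels)) for c in counts.values())

def final_gain(rows, col):
    labels = [r[-1] for r in rows]
    h_s = entropy(labels)                           # Step-1: 0.985
    h_attr = sum(                                   # Step-2: weighted entropy
        len(subset) / len(rows) * entropy([r[-1] for r in subset])
        for subset in (
            [r for r in rows if r[col] == value]
            for value in {r[col] for r in rows}
        )
    )
    return h_s - h_attr                             # Step-3: final gain

for col, name in enumerate(["Size", "Shape", "Colour"]):
    print(name, round(final_gain(rows, col), 3))
# Prints Size 0.128, Shape 0.7, Colour 0.522 (the hand calculation above uses
# rounded intermediate values, hence the tiny differences in the last digit).
```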
Now, we select the feature with the highest final gain and make it the root node, i.e. Shape:

Shape
├── Brick
├── Wedge
├── Sphere
└── Pillar

It is clear from the data that the Choice class for these values is:


1. Brick = “Yes”
2. Wedge = “No”
3. Sphere = “Yes”

But for "Pillar" we have two choices:

Size Shape Colour Choice


L Pillar Green Yes
L Pillar Red No

Now we remove the "Shape" attribute from consideration and repeat the procedure on the remaining rows. The values Brick, Wedge and Sphere already give a unique choice, so only the "Pillar" branch needs a further split; for these two rows the attribute with the next highest final gain, "Colour", separates them perfectly. Our decision tree will then look like:
Shape
├── Brick  -> Yes
├── Wedge  -> No
├── Sphere -> Yes
└── Pillar -> Colour
              ├── Green -> Yes
              └── Red   -> No

Now, if we get an unlabelled example X = <S, Pillar, Red>, the decision tree above lets us predict its label easily: Shape = Pillar, then Colour = Red, so the prediction is "No".
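The finished tree is small enough to write down and query directly. A minimal sketch (the nested-dictionary representation and the predict helper are my own choices, not from the original note):

```python
# The final tree from this example as nested dictionaries (representation assumed).
tree = {
    "Shape": {
        "Brick":  "Yes",
        "Wedge":  "No",
        "Sphere": "Yes",
        "Pillar": {"Colour": {"Green": "Yes", "Red": "No"}},
    }
}

def predict(tree, example):
    """Walk the nested-dict tree until a leaf label is reached."""
    node = tree
    while isinstance(node, dict):
        attribute = next(iter(node))              # e.g. "Shape", then "Colour"
        node = node[attribute][example[attribute]]
    return node

x = {"Size": "S", "Shape": "Pillar", "Colour": "Red"}
print(predict(tree, x))   # -> No
```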
