Decision Tree
R. Akerkar
TMRF, Kolhapur, India
Introduction
Top-down strategy
Decision Tree
Example
Classification
In our tree, we can carry out the classification for an unknown record as follows.
Let us assume that, for this record, we know the values of the first four attributes, but we do not know the value of the class attribute.
The accuracy of the classifier is determined by the percentage of the test data set that is correctly classified.
We can see that for Rule 1 there are two records of the test data set
satisfying outlook = sunny and humidity <= 75, and only one of these
is correctly classified as play.
Thus, the accuracy of this rule is 0.5 (or 50%). Similarly, the
accuracy of Rule 2 is also 0.5 (or 50%). The accuracy of Rule 3 is
0.66.
RULE 1: If it is sunny and the humidity is not above 75%, then play.
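To make the arithmetic concrete, here is a minimal Python sketch of the rule-accuracy calculation. The rule_accuracy helper and the three test records are illustrative assumptions, not the slide's actual test set; the records are merely chosen so that, as stated above, two of them satisfy Rule 1's antecedent and only one of those is labelled play.

def rule_accuracy(records, antecedent, predicted_class):
    # Accuracy of a rule = fraction of test records satisfying its antecedent
    # whose actual class matches the class predicted by the rule.
    covered = [r for r in records if antecedent(r)]
    if not covered:
        return None  # the rule fires on no test record
    correct = sum(1 for r in covered if r["class"] == predicted_class)
    return correct / len(covered)

# Illustrative test records (hypothetical values, consistent with the counts above).
test_data = [
    {"outlook": "sunny", "humidity": 70, "class": "play"},
    {"outlook": "sunny", "humidity": 72, "class": "don't play"},
    {"outlook": "rain",  "humidity": 80, "class": "don't play"},
]

# RULE 1: if it is sunny and the humidity is not above 75%, then play.
rule1 = lambda r: r["outlook"] == "sunny" and r["humidity"] <= 75

print(rule_accuracy(test_data, rule1, "play"))  # 0.5, i.e. 50%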
Concept of Categorical Attributes
The figure gives a decision tree for the training data.
Advantages and Shortcomings of Decision Tree Classification
A decision tree construction process is concerned with identifying the splitting attributes and splitting criterion at every level of the tree.
Weaknesses are:
The process of growing a decision tree is computationally expensive. At each node, each candidate splitting field is examined before its best split can be found.
Some decision trees can only deal with binary-valued target classes.
Iterative Dichotomizer 3 (ID3)
Quinlan (1986)
Each node corresponds to a splitting attribute
Each arc is a possible value of that attribute.
Training Dataset
This follows an example from Quinlan’s ID3
Extracting Classification Rules from Trees
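The tree diagram for this slide is not reproduced here, so the following is a rough Python sketch of the general idea: trace every root-to-leaf path and turn it into one IF ... THEN rule. The nested-dict weather_tree is an assumed stand-in for the weather/play example (its sunny branch matches RULE 1 above); the traversal function is the point.

# Assumed weather/play tree: internal nodes map a splitting attribute to
# {branch value: subtree}; leaves are class labels.
weather_tree = {
    "outlook": {
        "sunny":    {"humidity": {"<= 75": "play", "> 75": "don't play"}},
        "overcast": "play",
        "rain":     {"windy": {"false": "play", "true": "don't play"}},
    }
}

def condition(attribute, value):
    # Numeric branches already carry their operator; categorical ones get "=".
    return f"{attribute} {value}" if value[0] in "<>" else f"{attribute} = {value}"

def extract_rules(tree, conditions=()):
    # Depth-first walk: at each leaf, emit the conjunction of branch conditions.
    if not isinstance(tree, dict):
        yield "IF " + " AND ".join(conditions) + f" THEN {tree}"
        return
    (attribute, branches), = tree.items()
    for value, subtree in branches.items():
        yield from extract_rules(subtree, conditions + (condition(attribute, value),))

for rule in extract_rules(weather_tree):
    print(rule)
# First rule printed: IF outlook = sunny AND humidity <= 75 THEN play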
Solution (Rules)
Algorithm for Decision Tree Induction
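As a companion to the algorithm (whose pseudocode is not reproduced here), this is a minimal Python sketch of the greedy, top-down induction loop: choose the best splitting attribute at each level, partition the data on its values, and recurse. Information gain is used as the splitting criterion, and the tree representation (dicts for internal nodes, class labels for leaves) is an assumption of the sketch.

import math
from collections import Counter

def entropy(labels):
    # Entropy of a list of class labels.
    n = len(labels)
    return -sum((c / n) * math.log2(c / n) for c in Counter(labels).values())

def info_gain(rows, labels, attr):
    # Reduction in entropy obtained by splitting on attribute `attr`.
    n = len(labels)
    remainder = 0.0
    for value in set(row[attr] for row in rows):
        subset = [lab for row, lab in zip(rows, labels) if row[attr] == value]
        remainder += len(subset) / n * entropy(subset)
    return entropy(labels) - remainder

def build_tree(rows, labels, attributes):
    # Pure node: return a leaf labelled with the single class.
    if len(set(labels)) == 1:
        return labels[0]
    # No attributes left: return the majority class.
    if not attributes:
        return Counter(labels).most_common(1)[0][0]
    # Greedy step: choose the splitting attribute with the highest gain;
    # each branch key below is one value of that attribute.
    best = max(attributes, key=lambda a: info_gain(rows, labels, a))
    node = {best: {}}
    for value in set(row[best] for row in rows):
        sub = [(row, lab) for row, lab in zip(rows, labels) if row[best] == value]
        sub_rows, sub_labels = [r for r, _ in sub], [l for _, l in sub]
        node[best][value] = build_tree(sub_rows, sub_labels,
                                       [a for a in attributes if a != best])
    return node

# e.g. build_tree(rows, labels, ["age", "income", "student", "credit_rating"])
# on the training table shown in the information-gain computation below.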
Attribute Selection Measure: Information Gain (ID3/C4.5)
Entropy
The entropy for a completely pure set is 0, and it is 1 for a set with equal occurrences of both classes.
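For reference, this is the standard two-class expected-information (entropy) measure behind that statement, written in the I(p, n), E(A), Gain(A) notation used in the computation that follows (the slide's own formula is not reproduced here):

I(p, n) = -\frac{p}{p+n}\log_2\frac{p}{p+n} - \frac{n}{p+n}\log_2\frac{n}{p+n}

E(A) = \sum_i \frac{p_i + n_i}{p + n}\, I(p_i, n_i), \qquad \mathrm{Gain}(A) = I(p, n) - E(A)

It equals 0 for a completely pure set and 1 when p = n, as stated above.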
Attribute Selection by Information Gain Computation

Class P: buys_computer = "yes"
Class N: buys_computer = "no"
I(p, n) = I(9, 5) = 0.940

Training samples:

age   | income | student | credit_rating | buys_computer
------|--------|---------|---------------|--------------
<=30  | high   | no      | fair          | no
<=30  | high   | no      | excellent     | no
31…40 | high   | no      | fair          | yes
>40   | medium | no      | fair          | yes
>40   | low    | yes     | fair          | yes
>40   | low    | yes     | excellent     | no
31…40 | low    | yes     | excellent     | yes
<=30  | medium | no      | fair          | no
<=30  | low    | yes     | fair          | yes
>40   | medium | yes     | fair          | yes
<=30  | medium | yes     | excellent     | yes
31…40 | medium | no      | excellent     | yes
31…40 | high   | yes     | fair          | yes
>40   | medium | no      | excellent     | no

Compute the entropy for age:

age   | pi | ni | I(pi, ni)
------|----|----|----------
<=30  | 2  | 3  | 0.971
31…40 | 4  | 0  | 0
>40   | 3  | 2  | 0.971

E(age) = (5/14) I(2,3) + (4/14) I(4,0) + (5/14) I(3,2) = 0.694

Here (5/14) I(2,3) means that "age <= 30" covers 5 out of the 14 samples, with 2 yes's and 3 no's. Hence

Gain(age) = I(p, n) - E(age) = 0.246

Similarly,
Gain(income) = 0.029
Gain(student) = 0.151
Gain(credit_rating) = 0.048

Since age has the highest information gain among the attributes, it is selected as the test attribute.
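A few lines of Python reproduce the arithmetic above (the function I simply mirrors the slide's notation; no new data is introduced):

import math

def I(p, n):
    # Expected information of a node holding p "yes" and n "no" samples.
    total = p + n
    return -sum((c / total) * math.log2(c / total) for c in (p, n) if c)

print(I(9, 5))                 # 0.9402..., the 0.940 above

E_age = 5/14 * I(2, 3) + 4/14 * I(4, 0) + 5/14 * I(3, 2)
print(E_age)                   # 0.6935..., the 0.694 above

print(I(9, 5) - E_age)         # 0.2467; the slide rounds the intermediates
                               # (0.940 - 0.694) and reports 0.246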
Exercise 1
The following table consists of training data from an employee
database.
Let status be the class attribute. Use the ID3 algorithm to construct a
decision tree from the given data.
Solution 1
Other Attribute Selection Measures
Gini Index (IBM IntelligentMiner)
If a data set T contains examples from n classes, the gini index gini(T) is defined as

gini(T) = 1 - Σ (pj)^2, summed over j = 1, ..., n

where pj is the relative frequency of class j in T.

If a data set T is split into two subsets T1 and T2 with sizes N1 and N2 respectively (N = N1 + N2), the gini index of the split data is defined as

gini_split(T) = (N1/N) gini(T1) + (N2/N) gini(T2)

The attribute that provides the smallest gini_split(T) is chosen to split the node (one needs to enumerate all possible splitting points for each attribute).
Exercise 2
Solution 2
SPLIT: Age <= 50
            | High | Low | Total
------------|------|-----|------
S1 (left)   |    8 |  11 |    19
S2 (right)  |   11 |  10 |    21
For S1: P(high) = 8/19 = 0.42 and P(low) = 11/19 = 0.58
For S2: P(high) = 11/21 = 0.52 and P(low) = 10/21 = 0.48
Gini(S1) = 1-[0.42x0.42 + 0.58x0.58] = 1-[0.18+0.34] = 1-0.52 = 0.48
Gini(S2) = 1-[0.52x0.52 + 0.48x0.48] = 1-[0.27+0.23] = 1-0.5 = 0.5
Gini-Split(Age<=50) = 19/40 x 0.48 + 21/40 x 0.5 = 0.23 + 0.26 = 0.49
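The same computation in a few lines of Python, using only the counts from the table above; the exact values differ slightly from the slide's because the slide rounds its intermediates:

def gini(high, low):
    # Gini index of a partition with `high` and `low` class counts.
    total = high + low
    return 1.0 - (high / total) ** 2 - (low / total) ** 2

g1 = gini(8, 11)                     # S1: 0.4875...  (0.48 on the slide)
g2 = gini(11, 10)                    # S2: 0.4988...  (0.5 on the slide)
gini_split = 19/40 * g1 + 21/40 * g2
print(gini_split)                    # 0.4934..., about 0.49, matching the slide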
Exercise 3
In the previous exercise, which of the two split points gives a better split of the data? Why?
Solution 3
Intuitively, Salary <= 65K is a better split point since it produces relatively "pure" partitions, as opposed to Age <= 50, which results in more mixed partitions (i.e., just look at the distribution of Highs and Lows in S1 and S2).
On the other hand, if the classes are totally mixed, i.e., both classes have equal probability, then
gini(S) = 1 - [0.5x0.5 + 0.5x0.5] = 1 - [0.25 + 0.25] = 0.5.
In other words, the closer the gini value is to 0, the better the partition is. Since Salary has the lower gini, it is the better split.
Avoid Overfitting in Classification
Overfitting: An induced tree may overfit the training data
Too many branches, some may reflect anomalies due to noise or outliers
Poor accuracy for unseen samples
Two approaches to avoid overfitting
Prepruning: Halt tree construction early, i.e., do not split a node if this would result in the goodness measure falling below a threshold (a minimal sketch follows below)
Difficult to choose an appropriate threshold
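A minimal sketch of the prepruning check described above, assuming information gain as the goodness measure and an arbitrary MIN_GAIN threshold; as noted, choosing that threshold well is the hard part.

from collections import Counter

MIN_GAIN = 0.05   # assumed threshold; too high underfits, too low barely prunes

def split_or_leaf(best_gain, best_attribute, labels):
    # Prepruning: halt tree construction early when the best available split's
    # goodness measure (here, information gain) falls below the threshold.
    if best_gain < MIN_GAIN:
        majority = Counter(labels).most_common(1)[0][0]
        return ("leaf", majority)          # stop growing; label with majority class
    return ("split", best_attribute)       # otherwise split as usual

labels = ["yes"] * 9 + ["no"] * 5
print(split_or_leaf(0.246, "age", labels))   # ('split', 'age')
print(split_or_leaf(0.010, "age", labels))   # ('leaf', 'yes')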