Classification Tree 2
Gini Index
\[ GINI(t) = 1 - \sum_{j} [p(j \mid t)]^2 \]
Entropy
\[ Entropy(t) = -\sum_{j} p(j \mid t) \log p(j \mid t) \]
Misclassification error
\[ Error(t) = 1 - \max_{i} P(i \mid t) \]
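The three impurity measures can be sketched in a few lines of Python. This is a minimal illustration, not a reference implementation: the function names and the choice of per-class record counts as input are assumptions made here, the logarithm is assumed to be base 2, and 0 log 0 is treated as 0. The node is assumed to hold at least one record.

```python
import math

def class_probs(counts):
    """Relative class frequencies p(j | t) from per-class record counts."""
    total = sum(counts)
    return [c / total for c in counts]

def gini(counts):
    """GINI(t) = 1 - sum_j p(j | t)^2."""
    return 1.0 - sum(p * p for p in class_probs(counts))

def entropy(counts):
    """Entropy(t) = -sum_j p(j | t) log2 p(j | t); empty classes are skipped."""
    return -sum(p * math.log2(p) for p in class_probs(counts) if p > 0)

def misclassification_error(counts):
    """Error(t) = 1 - max_i P(i | t)."""
    return 1.0 - max(class_probs(counts))

counts = [1, 5]  # one record of class C1, five of class C2
print(round(gini(counts), 3))                     # 0.278
print(round(entropy(counts), 3))                  # 0.65
print(round(misclassification_error(counts), 3))  # 0.167
```

All three measures are zero for a pure node and largest when the classes are evenly mixed.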
Node    1        2        3        4
C1      0        1        2        3
C2      6        5        4        3
Gini    0.000    0.278    0.444    0.500
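As a worked check of the Gini definition, take the node with C1 = 1 and C2 = 5. Its class frequencies are 1/6 and 5/6, so:

\[ GINI = 1 - \left(\tfrac{1}{6}\right)^2 - \left(\tfrac{5}{6}\right)^2 = 1 - \tfrac{1}{36} - \tfrac{25}{36} = \tfrac{10}{36} \approx 0.278 \]

The same calculation gives 16/36 ≈ 0.444 for the (2, 4) node and 18/36 = 0.500 for the evenly mixed (3, 3) node, which is the maximum for two classes.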
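\[ GINI_{split} = \sum_{i=1}^{k} \frac{n_i}{n} \, GINI(i) \]
where n_i is the number of records at child i and n is the number of records at the parent node.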
Candidate split   1      2      3      4      5      6      7      8      9      10     11
No (left)         0      1      2      3      3      3      3      4      5      6      7
No (right)        7      6      5      4      4      4      4      3      2      1      0
Gini              0.420  0.400  0.375  0.343  0.417  0.400  0.300  0.343  0.375  0.400  0.420
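The Gini value of a candidate split is the size-weighted average of the Gini index of its partitions. The sketch below shows that calculation under stated assumptions: the function names are hypothetical, and the "Yes" counts in the example are assumed, since only the "No" counts appear in the table. Three "Yes" records in total is the value consistent with the 0.420 reported when one partition is empty.

```python
def gini(counts):
    """Gini index of one partition from its per-class counts."""
    total = sum(counts)
    if total == 0:
        return 0.0  # an empty partition contributes nothing to the split
    return 1.0 - sum((c / total) ** 2 for c in counts)

def gini_split(children):
    """Weighted Gini of a split: sum_i (n_i / n) * GINI(i)."""
    n = sum(sum(child) for child in children)
    return sum(sum(child) / n * gini(child) for child in children)

# (Yes, No) counts per partition. The No counts (3 left, 4 right) come from
# the table; the Yes counts (0 left, 3 right) are an assumption.
left = (0, 3)
right = (3, 4)
print(round(gini_split([left, right]), 3))  # 0.343
```

The best split is the candidate with the lowest weighted Gini, which in the table is 0.300.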
Gain Ratio:
\[ GainRATIO_{split} = \frac{GAIN_{split}}{SplitINFO} \]
\[ SplitINFO = -\sum_{i=1}^{k} \frac{n_i}{n} \log \frac{n_i}{n} \]
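A short sketch of the gain ratio calculation follows. The function names are hypothetical and the logarithm is assumed to be base 2. The gain of the split is passed in as a number, since its computation is not defined in this section.

```python
import math

def split_info(sizes):
    """SplitINFO = -sum_i (n_i / n) log2(n_i / n) over the k partition sizes."""
    n = sum(sizes)
    return -sum((s / n) * math.log2(s / n) for s in sizes if s > 0)

def gain_ratio(gain, sizes):
    """GainRATIO_split = GAIN_split / SplitINFO."""
    info = split_info(sizes)
    if info == 0:
        # Every record fell into one partition, so the ratio is undefined.
        raise ValueError("SplitINFO is zero: the split has a single non-empty partition")
    return gain / info

print(split_info([5, 5]))                # 1.0
print(split_info([2, 2, 2, 2]))          # 2.0
print(gain_ratio(0.5, [2, 2, 2, 2]))     # 0.25
```

Because SplitINFO grows with the number of partitions, dividing by it penalizes splits that break the data into many small partitions.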