Example_Classification

[Figure: example decision tree for the classification task; leaf labels: no, yes, yes]
Attribute Selection: Information Gain

• Class P: buys_computer = “yes” (9 tuples)
• Class N: buys_computer = “no” (5 tuples)

Info(D) = I(9, 5) = −(9/14) log₂(9/14) − (5/14) log₂(5/14) = 0.940
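To sanity-check this arithmetic, here is a minimal Python sketch (not from the slides; the helper name info is illustrative):

```python
from math import log2

def info(counts):
    """Expected information (entropy) Info(D) for the class counts in D."""
    n = sum(counts)
    return -sum(c / n * log2(c / n) for c in counts if c > 0)

print(info([9, 5]))  # Info(D) = I(9, 5) ≈ 0.940
```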
Binary Attributes: Computing Gini Index

• Splits into two partitions.
• Gini index of a data set D: gini(D) = 1 − Σ_{j=1}^{n} p_j², where p_j is the relative frequency of class j in D.
• Effect of weighting partitions: larger and purer partitions are preferred.

Example: the parent node holds C1 = 6 and C2 = 6 tuples, so Gini(Parent) = 1 − (6/12)² − (6/12)² = 0.500. A binary split on attribute B sends 7 tuples to node N1 (C1 = 5, C2 = 2) and 5 tuples to node N2 (C1 = 1, C2 = 4):

Gini(N1) = 1 − (5/7)² − (2/7)² = 0.408
Gini(N2) = 1 − (1/5)² − (4/5)² = 0.320
Gini(Children) = (7/12) × 0.408 + (5/12) × 0.320 = 0.371
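A small sketch that reproduces these numbers, assuming per-node class counts as input (the helper names gini and gini_split are illustrative, not from the slides):

```python
def gini(counts):
    """Gini index for a node with the given class counts."""
    n = sum(counts)
    return 1.0 - sum((c / n) ** 2 for c in counts)

def gini_split(partitions):
    """Weighted Gini of a candidate split, given class counts per partition."""
    n = sum(sum(p) for p in partitions)
    return sum(sum(p) / n * gini(p) for p in partitions)

print(round(gini([6, 6]), 3))                  # parent node: 0.5
print(round(gini_split([[5, 2], [1, 4]]), 3))  # children of split on B: 0.371
```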
Categorical Attributes: Computing Gini Index
¨ For each distinct value, gather counts for each class in the dataset
¨ Use the count matrix to make decisions
Multi-way split Two-way split
(find best partition of values)
¤ Use Binary Decisions based on one splitting value 1 Yes Single 125K No
2 No Married 100K No
¤ Number of possible splitting values = Number of distinct values 4 Yes Married 120K No
-1
¤ Typically, the midpoint between each pair of adjacent values is 5 No Divorced 95K Yes
considered as a
possible split point 6 No Married 60K No
7 Yes Divorced 220K No
n (ai+ai+1)/2 is the midpoint between the values of ai and ai+1 8 No Single 85K Yes
¨ Each splitting value has a count matrix associated with it 9 No Married 75K No
10 No Single 90K Yes
26
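As an illustration of the count-matrix idea for a categorical attribute, the following sketch tallies class counts per distinct Marital Status value from the table above and scores a multi-way split (the resulting Gini of 0.3 is a computed illustration, not a number from the slides):

```python
from collections import Counter, defaultdict

# (Marital Status, Cheat) pairs from rows 1-10 of the table above.
rows = [("Single", "No"), ("Married", "No"), ("Single", "No"), ("Married", "No"),
        ("Divorced", "Yes"), ("Married", "No"), ("Divorced", "No"),
        ("Single", "Yes"), ("Married", "No"), ("Single", "Yes")]

# Count matrix: class counts for each distinct attribute value.
matrix = defaultdict(Counter)
for value, label in rows:
    matrix[value][label] += 1

def gini(counts):
    n = sum(counts)
    return 1.0 - sum((c / n) ** 2 for c in counts)

# Multi-way split: one partition per distinct value, weighted by size.
n = len(rows)
g = sum(sum(c.values()) / n * gini(list(c.values())) for c in matrix.values())
print(dict(matrix), round(g, 3))  # Gini of the multi-way split: 0.3
```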
Continuous Attributes: Computing Gini Index or Expected Information Requirement

First decide the splitting value to discretize the attribute. For efficient computation, for each attribute:
Step 1: Sort the attribute on its values.
Step 2: Linearly scan these values, each time updating the count matrix.

Sorted values of Taxable Income: 60, 70, 75, 85, 90, 95, 100, 120, 125, 220
Candidate splitting values (midpoints between adjacent values, plus the two endpoints):

          55      65      72      80      87      92      97      110     122     172     230
        <=   >  <=   >  <=   >  <=   >  <=   >  <=   >  <=   >  <=   >  <=   >  <=   >  <=   >
Yes      0   3   0   3   0   3   0   3   1   2   2   1   3   0   3   0   3   0   3   0   3   0
No       0   7   1   6   2   5   3   4   3   4   3   4   3   4   4   3   5   2   6   1   7   0
Gini    0.420   0.400   0.375   0.343   0.417   0.400   0.300   0.343   0.375   0.400   0.420

For each splitting value v, its count matrix records how many data tuples have: (a) Taxable Income <= v with class label “Yes”, (b) Taxable Income <= v with class label “No”, (c) Taxable Income > v with class label “Yes”, (d) Taxable Income > v with class label “No”. For example, the matrix for v = 65 is the second column above; as the scan advances to the next candidate (72, 80, …, 230), the matrix is updated incrementally.
Step 3: Compute the Gini index at each candidate split and choose the split position with the least Gini index. For each splitting value v (e.g., 65), compute

gini_{Taxable Income}(D) = (|D1| / |D|) × gini(D1) + (|D2| / |D|) × gini(D2)

where D1 and D2 are the two partitions based on v: D1 contains the tuples with Taxable Income <= v and D2 those with Taxable Income > v.
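Plugging in the count matrix for the split v = 97 (6 tuples with Taxable Income <= 97: 3 Yes, 3 No; 4 tuples above: 0 Yes, 4 No):

gini(D1) = 1 − (3/6)² − (3/6)² = 0.500
gini(D2) = 1 − (0/4)² − (4/4)² = 0
gini_{Taxable Income}(D) = (6/10) × 0.500 + (4/10) × 0 = 0.300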
Choose the splitting value with the least Gini index (= 97, Gini = 0.300) to discretize Taxable Income.
If information gain is used for attribute selection, then similarly to the Gini index calculation, compute for each splitting value the expected information requirement

Info_{Taxable Income}(D) = Σ_{j=1}^{2} (|Dj| / |D|) × Info(Dj)

and choose the split position with the least value.
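For instance, at the same split v = 97 (a computation derived from the count matrix above, not shown on the slides):

Info(D1) = I(3, 3) = 1 bit, Info(D2) = I(0, 4) = 0, so
Info_{Taxable Income}(D) = (6/10) × 1 + (4/10) × 0 = 0.6 bits.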
At each level of the decision tree, attribute selection thus proceeds in two steps: (1) first, discretize a continuous attribute by deciding its splitting value; (2) then, compare the discretized attribute with the other attributes in terms of Gini index reduction or information gain.
Note that for each attribute, the data tuples are scanned only once.
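A minimal sketch of the whole sort-and-scan procedure, assuming the ten (Taxable Income, Cheat) pairs and the candidate split list from the table above (function and variable names are illustrative):

```python
def gini(yes, no):
    """Gini index of a partition holding `yes` + `no` tuples."""
    n = yes + no
    if n == 0:
        return 0.0
    return 1.0 - (yes / n) ** 2 - (no / n) ** 2

data = [(125, "No"), (100, "No"), (70, "No"), (120, "No"), (95, "Yes"),
        (60, "No"), (220, "No"), (85, "Yes"), (75, "No"), (90, "Yes")]

data.sort()                                   # Step 1: sort on attribute values
total_yes = sum(1 for _, c in data if c == "Yes")
total_no = len(data) - total_yes

candidates = [55, 65, 72, 80, 87, 92, 97, 110, 122, 172, 230]
best_v, best_gini = None, float("inf")
yes_le = no_le = 0                            # count matrix: class counts with value <= v
i = 0
for v in candidates:                          # Step 2: one linear scan, updating counts
    while i < len(data) and data[i][0] <= v:
        if data[i][1] == "Yes":
            yes_le += 1
        else:
            no_le += 1
        i += 1
    n_le = yes_le + no_le
    n_gt = len(data) - n_le
    g = (n_le * gini(yes_le, no_le)
         + n_gt * gini(total_yes - yes_le, total_no - no_le)) / len(data)
    if g < best_gini:                         # Step 3: keep the split with least Gini
        best_v, best_gini = v, g

print(best_v, best_gini)                      # expect v = 97, Gini = 0.300
```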
Naïve Bayes Classifier: Training Dataset

Classes:
C1: buys_computer = ‘yes’
C2: buys_computer = ‘no’

Data to be classified: X = (age <= 30, income = medium, student = yes, credit_rating = fair)

age     income   student  credit_rating  buys_computer
<=30    high     no       fair           no
<=30    high     no       excellent      no
31…40   high     no       fair           yes
>40     medium   no       fair           yes
>40     low      yes      fair           yes
>40     low      yes      excellent      no
31…40   low      yes      excellent      yes
<=30    medium   no       fair           no
<=30    low      yes      fair           yes
>40     medium   yes      fair           yes
<=30    medium   yes      excellent      yes
31…40   medium   no       excellent      yes
31…40   high     yes      fair           yes
>40     medium   no       excellent      no
Naïve Bayes Classifier: An Example

• Prior probability P(Ci):
P(buys_computer = “yes”) = 9/14 = 0.643
P(buys_computer = “no”) = 5/14 = 0.357

• Compute P(X|Ci) for each class, where X = (age <= 30, income = medium, student = yes, credit_rating = fair).
• Compute the conditional probabilities P(Xi|Ci):
P(age = “<=30” | buys_computer = “yes”) = 2/9 = 0.222
P(age = “<=30” | buys_computer = “no”) = 3/5 = 0.600
P(income = “medium” | buys_computer = “yes”) = 4/9 = 0.444
P(income = “medium” | buys_computer = “no”) = 2/5 = 0.400
P(student = “yes” | buys_computer = “yes”) = 6/9 = 0.667
P(student = “yes” | buys_computer = “no”) = 1/5 = 0.200
P(credit_rating = “fair” | buys_computer = “yes”) = 6/9 = 0.667
P(credit_rating = “fair” | buys_computer = “no”) = 2/5 = 0.400

• Multiply to get P(X|Ci):
P(X | buys_computer = “yes”) = 0.222 × 0.444 × 0.667 × 0.667 = 0.044
P(X | buys_computer = “no”) = 0.600 × 0.400 × 0.200 × 0.400 = 0.019

• Take the prior probabilities into account, P(X|Ci) × P(Ci):
P(X | buys_computer = “yes”) × P(buys_computer = “yes”) = 0.044 × 0.643 = 0.028
P(X | buys_computer = “no”) × P(buys_computer = “no”) = 0.019 × 0.357 = 0.007
• Prediction: since 0.028 > 0.007, the classifier predicts buys_computer = “yes” for X = (age <= 30, income = medium, student = yes, credit_rating = fair).
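A compact sketch of the whole computation, assuming the 14-tuple table above (the tuple layout and variable names are illustrative):

```python
# Naïve Bayes over the buys_computer table
# (attribute order: age, income, student, credit_rating; class label last).
D = [("<=30","high","no","fair","no"), ("<=30","high","no","excellent","no"),
     ("31…40","high","no","fair","yes"), (">40","medium","no","fair","yes"),
     (">40","low","yes","fair","yes"), (">40","low","yes","excellent","no"),
     ("31…40","low","yes","excellent","yes"), ("<=30","medium","no","fair","no"),
     ("<=30","low","yes","fair","yes"), (">40","medium","yes","fair","yes"),
     ("<=30","medium","yes","excellent","yes"), ("31…40","medium","no","excellent","yes"),
     ("31…40","high","yes","fair","yes"), (">40","medium","no","excellent","no")]

X = ("<=30", "medium", "yes", "fair")

scores = {}
for c in ("yes", "no"):
    rows = [t for t in D if t[-1] == c]
    prior = len(rows) / len(D)                 # P(Ci)
    likelihood = 1.0
    for k, v in enumerate(X):                  # product of P(Xi|Ci)
        likelihood *= sum(1 for t in rows if t[k] == v) / len(rows)
    scores[c] = prior * likelihood             # P(X|Ci) * P(Ci)

print(scores)                        # yes ≈ 0.028, no ≈ 0.007
print(max(scores, key=scores.get))   # -> "yes"
```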
ROC Calculation

Test instances, ranked by decreasing classifier score (“Prediction”), with their actual classes:

Instance  Prediction  Actual
x1        0.85        Yes
x2        0.75        No
x3        0.65        Yes
x4        0.4         No
x5        0.3         No
Sweep a threshold t from above the highest score down past the lowest; at each step, instances with score >= t are classified as Yes. At each threshold, compute the true positive rate TPR = TP / (TP + FN) and the false positive rate FPR = FP / (FP + TN), and plot the point (FPR, TPR). For example, at t = 0.7 the predicted positives are x1 (a true positive) and x2 (a false positive), giving TPR = 1/2 = 0.5 and FPR = 1/3 ≈ 0.334. Once the threshold drops below the lowest score (0.3), every instance is classified as Yes, so TPR = FPR = 1.0. Connecting the points obtained at successive thresholds traces the ROC curve.
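A short sketch of this sweep over the five instances above (assuming the scores are already sorted in decreasing order; names are illustrative):

```python
# Compute the (FPR, TPR) points of the ROC curve by lowering the threshold
# one instance at a time.
scored = [(0.85, "Yes"), (0.75, "No"), (0.65, "Yes"), (0.4, "No"), (0.3, "No")]
P = sum(1 for _, y in scored if y == "Yes")   # number of actual positives
N = len(scored) - P                           # number of actual negatives

points = [(0.0, 0.0)]                         # threshold above every score
tp = fp = 0
for score, label in scored:                   # sorted by decreasing score
    if label == "Yes":
        tp += 1
    else:
        fp += 1
    points.append((fp / N, tp / P))           # threshold just below `score`

print(points)  # includes (0.333…, 0.5) at threshold ≈ 0.7; ends at (1.0, 1.0)
```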