08 Tree Classification
Agenda
Regression versus Classification trees
Interpretation of classification trees
How to grow a classification tree?
Classification error rate

E = 1 - \max_k(\hat{p}_{mk})    (1)

where \hat{p}_{mk} is the proportion of training observations in the mth region that are from the kth class.
The Gini index
G = \sum_{k=1}^{K} \hat{p}_{mk} (1 - \hat{p}_{mk})    (2)
The Gini index
The Gini index takes on a small value if all of the p̂mk are close to 0 or 1.
Because of this, the Gini index is called a measure of node purity: a small value of the Gini index indicates that a node contains predominantly observations from a single class.
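As a quick illustration (not from the slides), the Gini index of a single node can be computed directly from its class proportions; gini_index below is a hypothetical helper, not part of the lab code.

# hypothetical helper: Gini index of one node, given its class proportions p̂mk
gini_index <- function(p) sum(p * (1 - p))

gini_index(c(0.5, 0.5))   # maximally impure two-class node: 0.5
gini_index(c(0.9, 0.1))   # nearly pure node: 0.18
gini_index(c(1, 0))       # pure node: 0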
The cross entropy
D = -\sum_{k=1}^{K} \hat{p}_{mk} \log(\hat{p}_{mk})    (3)
Since 0 ≤ p̂mk ≤ 1, it follows that 0 ≤ −p̂mk log(p̂mk).
Exercise: show that the cross-entropy will take on a value near 0 if the p̂mk’s are all near 0 or 1.
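A matching sketch for the cross-entropy (cross_entropy is again a hypothetical helper; zero proportions are dropped, since 0 · log(0) is taken to be 0):

# hypothetical helper: cross-entropy of one node, given its class proportions p̂mk
cross_entropy <- function(p) {
  p <- p[p > 0]              # drop zeros so that log(0) never occurs
  -sum(p * log(p))
}

cross_entropy(c(0.5, 0.5))   # maximally impure two-class node: log(2), about 0.69
cross_entropy(c(0.9, 0.1))   # nearly pure node: about 0.33
cross_entropy(c(1, 0))       # pure node: 0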
Gini index, cross entropy, and classification error rate
Like the Gini index, the cross-entropy will take on a small value if node m is pure.
In fact, it turns out that the Gini index and the cross-entropy are quite similar numerically.
When building a classification tree, either the Gini index or the cross-entropy is typically used to evaluate the quality of a particular split, since these two measures are more sensitive to node purity than the classification error rate is.
Any of these three approaches might be used when pruning the tree, but the classification error rate is preferable if prediction accuracy of the final pruned tree is the goal.
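The numerical similarity is easy to see in the two-class case, where each measure is a function of the proportion p of class 1 alone. The following sketch (an illustration of mine, not slide code) plots all three:

# impurity of a two-class node as a function of p = p̂m1
p <- seq(0.001, 0.999, length.out = 200)
gini      <- 2 * p * (1 - p)
entropy   <- -(p * log(p) + (1 - p) * log(1 - p))
class_err <- 1 - pmax(p, 1 - p)

matplot(p, cbind(gini, entropy, class_err), type = "l", lty = 1,
        xlab = "proportion of class 1", ylab = "impurity")
legend("top", legend = c("Gini", "cross-entropy", "class. error"),
       lty = 1, col = 1:3)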
Deviance
-2 \sum_{m} \sum_{k} n_{mk} \log(\hat{p}_{mk})    (4)

where n_{mk} is the number of observations in the mth terminal node that belong to the kth class.
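As a hedged illustration, one terminal node's contribution to (4) can be computed from its class counts; the counts below are made up. For a tree fitted with the tree package, summing these contributions over all terminal nodes gives the deviance that summary() uses when reporting the residual mean deviance.

# contribution of one terminal node m to (4), with hypothetical counts n_mk
n_mk <- c(Yes = 30, No = 10)       # class counts in node m (made up)
p_mk <- n_mk / sum(n_mk)           # fitted proportions p̂mk
-2 * sum(n_mk * log(p_mk))         # about 45; summing over all nodes gives (4)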
Application to Carseats Dataset
Task 1
Solution Task 1 (a,b,c)
library(ISLR)
library(tree)
attach(Carseats)
# create a binary response: "Yes" if Sales > 8, "No" otherwise;
# storing it as a factor makes tree() fit a classification tree
High <- factor(ifelse(Sales <= 8, "No", "Yes"))
# merge High with the rest of Carseats
Carseats <- data.frame(Carseats, High)
# fit a classification tree, excluding Sales from the predictors
tree.carseats <- tree(High ~ . - Sales, Carseats)
Task 2
How well does the tree fit? Plot your tree and explain your interpretation.
Solution Task 2
summary(tree.carseats)
##
## Classification tree:
## tree(formula = High ~ . - Sales, data = Carseats)
## Variables actually used in tree construction:
## [1] "ShelveLoc" "Price" "Income" "CompPrice
## [6] "Advertising" "Age" "US"
## Number of terminal nodes: 27
## Residual mean deviance: 0.4575 = 170.7 / 373
## Misclassification error rate: 0.09 = 36 / 400
Solution Task 2 (continued)

[Tree plot omitted: the top split is on ShelveLoc (Bad, Medium vs. Good); terminal nodes are labeled Yes or No.]
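A minimal sketch of how such a plot is typically produced with the tree package, assuming tree.carseats from Task 1:

# draw the tree; text() adds node labels, and pretty = 0 keeps full category names
plot(tree.carseats)
text(tree.carseats, pretty = 0)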
Solution to Task 3
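The Task 3 statement and its solution are not reproduced in these slides. Assuming it follows the corresponding ISLR lab step (estimate the test error by fitting the tree on a training half and predicting on a held-out half), a sketch could look like this; the seed and the 200/200 split are assumptions:

set.seed(2)
train <- sample(1:nrow(Carseats), 200)       # assumed training indices
Carseats.test <- Carseats[-train, ]
High.test <- High[-train]
# refit on the training observations only
tree.carseats <- tree(High ~ . - Sales, Carseats, subset = train)
tree.pred <- predict(tree.carseats, Carseats.test, type = "class")
table(tree.pred, High.test)                  # test-set confusion matrix
mean(tree.pred == High.test)                 # test-set accuracy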
Task 4
Given Task 3, now prune the tree to see whether you get improved results.
Note: use the argument FUN = prune.misclass to indicate that the classification error rate should guide the cross-validation and pruning process, rather than the deviance, which is the default for the cv.tree() function.
Solution to Task 4
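The extracted slide is blank here. A minimal sketch that follows the note in Task 4, assuming the training-set tree and the test split from Task 3:

set.seed(3)
# cross-validate, guided by the misclassification rate instead of the deviance
cv.carseats <- cv.tree(tree.carseats, FUN = prune.misclass)
cv.carseats                # $dev holds CV misclassification counts for each $size
# prune to the size with the smallest cross-validated error
best <- cv.carseats$size[which.min(cv.carseats$dev)]
prune.carseats <- prune.misclass(tree.carseats, best = best)
plot(prune.carseats)
text(prune.carseats, pretty = 0)
# re-evaluate on the test set from Task 3
tree.pred <- predict(prune.carseats, Carseats.test, type = "class")
mean(tree.pred == High.test)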