
RandomForest2324 CR - 4p

The document discusses decision trees, which are a machine learning method for classification and regression. Decision trees work by recursively splitting the predictor variable space into mutually exclusive regions called nodes. Each region or node is assigned a prediction, such as a class for classification trees or a continuous value for regression trees. The splits are determined by choosing a predictor variable and cut point that minimize the residual sum of squares at each step. This allows the tree to be built efficiently in a top-down, greedy manner. While interpretable, decision trees often have lower predictive accuracy than other methods, so ensemble methods like bagging and random forests are discussed which combine many decision trees to improve performance. An example decision tree for a heart disease dataset is also presented.


Sources

• Decision Trees. Sumanta Basu, September 28, 2017.
• Decision Trees (Lecture 11). David Sontag, New York University. Slides adapted from Luke Zettlemoyer, Carlos Guestrin, and Andrew Moore.
• Machine Learning 10-601, Recitation 8. Oznur Tastan, Oct 21, 2009.
Acknowledgement

Some of the figures in this presentation are taken from "An Introduction to Statistical Learning, with applications in R" (Springer, 2013) with permission from the authors: G. James, D. Witten, T. Hastie and R. Tibshirani.

Regression trees, more formally

Regression: response $y \in \mathbb{R}$, predictors $X = (X_1, \ldots, X_p) \in \mathbb{R}^p$.

At a high level, a two-step process:
• Divide the predictor space $X_1, X_2, \ldots, X_p$ into $J$ non-overlapping regions $R_1, R_2, \ldots, R_J$.
• Every observation in a region $R_j$ gets the same prediction, i.e. $\hat{y}_{R_j} = \frac{1}{n} \sum_{i \in R_j} y_i$, where $n$ is the number of training observations in $R_j$.

A closer look at Step 1:
• We want to find boxes $R_1, R_2, \ldots, R_J$ that minimize the RSS, given by
  $$\mathrm{RSS} = \sum_{j=1}^{J} \sum_{i \in R_j} (y_i - \hat{y}_{R_j})^2.$$
• Unfortunately, this minimization problem is computationally intensive.
• Method: recursive binary splitting.
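To make the two-step recipe concrete, here is a minimal Python sketch (our own illustration on toy data, not something from the slides) that assigns observations to regions defined by fixed cutpoints on a single predictor, predicts the region mean, and reports the resulting RSS:

```python
import numpy as np

def region_rss(x, y, cutpoints):
    """Split a single predictor x at the given cutpoints, predict the mean of
    y in each region, and return the total residual sum of squares."""
    region = np.searchsorted(np.sort(cutpoints), x)  # region index per point
    rss = 0.0
    for j in np.unique(region):
        y_j = y[region == j]
        y_hat = y_j.mean()                 # constant prediction in region R_j
        rss += np.sum((y_j - y_hat) ** 2)
    return rss

# Toy data: a noisy step function of one predictor.
rng = np.random.default_rng(0)
x = rng.uniform(0, 10, size=200)
y = np.where(x < 4, 1.0, 3.0) + rng.normal(scale=0.3, size=200)

print(region_rss(x, y, cutpoints=[4.0]))   # small RSS: cut matches the step
print(region_rss(x, y, cutpoints=[8.0]))   # larger RSS: poorly placed cut
```

A cut placed at the true step gives a much smaller RSS than a poorly placed one, which is exactly the criterion the tree uses to choose splits.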

Recursive binary splitting

• Top-down recursive greedy approach.
• Top-down recursive: start with all observations and split into two branches at each level of the tree.
• Greedy: the best split is made at each step without looking ahead.
• Choose a predictor $X_j$ and a cutpoint $s$ that minimize the RSS for the resulting tree:
  $$R_1(j, s) = \{X \mid X_j < s\}, \qquad R_2(j, s) = \{X \mid X_j \geq s\},$$
  $$\mathrm{RSS} = \sum_{x_i \in R_1(j,s)} (y_i - \hat{y}_{R_1})^2 + \sum_{x_i \in R_2(j,s)} (y_i - \hat{y}_{R_2})^2.$$
• This minimization problem can be solved efficiently!

Prediction

Recall the two-step process:
• Divide the predictor space $X_1, X_2, \ldots, X_p$ into $J$ non-overlapping regions $R_1, R_2, \ldots, R_J$.
• Every observation in a region $R_j$ gets the same prediction, i.e. $\hat{y}_{R_j} = \frac{1}{n} \sum_{i \in R_j} y_i$, where $n$ is the number of training observations in $R_j$.
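The greedy split search is short enough to write out directly. The sketch below is our own brute-force illustration (real CART implementations use a much faster sorted sweep); it scans every predictor $j$ and candidate cutpoint $s$ and returns the pair with the smallest RSS:

```python
import numpy as np

def best_split(X, y):
    """Greedy search for the single split (predictor j, cutpoint s) that
    minimizes RSS over the two resulting half-spaces R1 and R2."""
    n, p = X.shape
    best = (None, None, np.inf)            # (j, s, rss)
    for j in range(p):
        # Candidate cutpoints: midpoints between consecutive distinct values.
        values = np.unique(X[:, j])
        for s in (values[:-1] + values[1:]) / 2:
            left = X[:, j] < s
            right = ~left
            rss = (np.sum((y[left] - y[left].mean()) ** 2)
                   + np.sum((y[right] - y[right].mean()) ** 2))
            if rss < best[2]:
                best = (j, s, rss)
    return best

rng = np.random.default_rng(1)
X = rng.uniform(size=(100, 3))
y = np.where(X[:, 1] < 0.6, 0.0, 2.0) + rng.normal(scale=0.2, size=100)
print(best_split(X, y))   # should recover a cutpoint near 0.6 on predictor 1
```

Recursive binary splitting simply repeats this search inside each of the two half-spaces, and so on down the tree.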


Prediction with a Regression Tree

[Figure: top left, a partition of the (X1, X2) plane that could not result from recursive binary splits; top right, a partition produced by recursive binary splits with cutpoints t1, ..., t4 and regions R1, ..., R5; bottom left, the associated regression tree; bottom right, a perspective plot of the response surface of the regression tree.]

Pros and Cons

• Tree-based methods are simple and useful for interpretation.
• However, they typically are not competitive with the best supervised learning approaches in terms of prediction accuracy.
• Hence we also discuss bagging, random forests, and boosting. These methods grow multiple trees which are then combined to yield a single consensus prediction.
• Combining a large number of trees can often result in dramatic improvements in prediction accuracy, at the expense of some loss in interpretation.
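The correspondence between a fitted tree and a rectangular partition can be inspected with any CART implementation; here is a brief sketch using scikit-learn (our choice of tool, not the slides') on simulated data with two predictors:

```python
import numpy as np
from sklearn.tree import DecisionTreeRegressor, export_text

rng = np.random.default_rng(2)
X = rng.uniform(size=(300, 2))                      # two predictors X1, X2
y = np.sin(3 * X[:, 0]) + X[:, 1] ** 2 + rng.normal(scale=0.1, size=300)

# A shallow tree: each leaf is one rectangular region R_j of the (X1, X2) plane.
tree = DecisionTreeRegressor(max_depth=2).fit(X, y)
print(export_text(tree, feature_names=["X1", "X2"]))  # the split rules / cutpoints
print(tree.predict([[0.2, 0.9]]))                     # constant leaf prediction
```

The printed split rules are exactly the cutpoints t1, t2, ... of the partition, and each leaf value is the mean response of the training observations in that region.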

Example: heart data

• These data contain a binary outcome HD for 303 patients who presented with chest pain.
• An outcome value of Yes indicates the presence of heart disease based on an angiographic test, while No means no heart disease.
• There are 13 predictors including Age, Sex, Chol (a cholesterol measurement), and other heart and lung function measurements.
• Cross-validation yields a tree with six terminal nodes. See next figure.

[Figure: the full unpruned classification tree for the Heart data, with splits on Thal, Ca, MaxHR, ChestPain, Slope, Oldpeak, Age, RestECG, RestBP, Chol and Sex; training, cross-validation and test error as a function of tree size; and the pruned tree with six terminal nodes.]
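The prune-by-cross-validation step in the figure can be reproduced in scikit-learn via cost-complexity pruning. The sketch below uses a synthetic stand-in for the Heart data (the real data would be read from the ISLR Heart.csv file); it illustrates the recipe, not the actual numbers:

```python
import numpy as np
from sklearn.datasets import make_classification
from sklearn.model_selection import cross_val_score
from sklearn.tree import DecisionTreeClassifier

# Stand-in for the Heart data: binary outcome, 13 predictors, 303 observations.
X, y = make_classification(n_samples=303, n_features=13, n_informative=6,
                           random_state=0)

# Grow a large tree, then cross-validate over the cost-complexity pruning path
# to pick the subtree size (the slides report six terminal nodes for Heart).
path = DecisionTreeClassifier(random_state=0).cost_complexity_pruning_path(X, y)
cv_scores = [cross_val_score(DecisionTreeClassifier(ccp_alpha=a, random_state=0),
                             X, y, cv=5).mean()
             for a in path.ccp_alphas]
best_alpha = path.ccp_alphas[int(np.argmax(cv_scores))]
pruned = DecisionTreeClassifier(ccp_alpha=best_alpha, random_state=0).fit(X, y)
print(pruned.get_n_leaves(), "terminal nodes")
```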
Trees Versus Linear Models

[Figure: top row, a true linear decision boundary; bottom row, a true non-linear boundary. Left column: a linear model; right column: a tree-based model.]

Advantages and Disadvantages of Trees

✓ Trees are very easy to explain to people. In fact, they are even easier to explain than linear regression!
✓ Some people believe that decision trees more closely mirror human decision-making than do the regression and classification approaches seen in previous chapters.
✓ Trees can be displayed graphically, and are easily interpreted even by a non-expert (especially if they are small).
✓ Trees can easily handle qualitative predictors without the need to create dummy variables.
✗ Unfortunately, trees generally do not have the same level of predictive accuracy as some of the other regression and classification approaches seen in this book.

However, by aggregating many decision trees, the predictive performance of trees can be substantially improved. We introduce these concepts next.
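A quick simulation makes the figure's point quantitatively. This is our own construction: a linear boundary versus an XOR-style boundary, which are assumptions standing in for the boundaries drawn in the figure:

```python
import numpy as np
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import train_test_split
from sklearn.tree import DecisionTreeClassifier

rng = np.random.default_rng(3)
X = rng.normal(size=(2000, 2))

for name, y in [("linear boundary    ", (X[:, 0] + X[:, 1] > 0).astype(int)),
                ("non-linear boundary", ((X[:, 0] > 0) ^ (X[:, 1] > 0)).astype(int))]:
    X_tr, X_te, y_tr, y_te = train_test_split(X, y, random_state=0)
    lin = LogisticRegression().fit(X_tr, y_tr).score(X_te, y_te)
    tree = DecisionTreeClassifier(max_depth=4).fit(X_tr, y_tr).score(X_te, y_te)
    print(f"{name}: logistic={lin:.2f}  tree={tree:.2f}")
```

The linear model wins when the true boundary is linear, while the tree wins decisively on the axis-aligned non-linear boundary.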

Bias/Variance Tradeoff; Reduce Variance Without Increasing Bias

• Averaging reduces variance: $\mathrm{Var}(\bar{X}) = \mathrm{Var}(X)/N$ (when the predictions are independent).
• Average models to reduce model variance.
• One problem: we have only one training set. Where do multiple models come from?

Hastie, Tibshirani, Friedman, "Elements of Statistical Learning", 2001
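A two-line numerical check of the variance claim (a sketch with synthetic independent "predictions"; the variable names are ours):

```python
import numpy as np

rng = np.random.default_rng(4)
N, trials = 25, 100_000

single = rng.normal(size=trials)                       # one model per trial
averaged = rng.normal(size=(trials, N)).mean(axis=1)   # average of N independent models

print(single.var())    # close to 1
print(averaged.var())  # close to 1/N = 0.04
```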


Bagging

• Bagging, or bootstrap aggregation, is a technique for reducing the variance of an estimated prediction function.
• For classification, a committee of trees each cast a vote for the predicted class.

Bootstrap

The basic idea: randomly draw datasets with replacement from the training data, each sample the same size as the original training set.

The 0.632 Bootstrap

• The previous methods do not allow replacement of examples (an example that has been selected cannot be selected again).
• Form a training set by sampling (with replacement) n times from the base of n example instances.
• Consequence: some examples are drawn several times and others not at all.
• The examples that are never drawn are put into the test set.

After A. OSMANI, Cours Apprentissage symbolique

An aside: Leave-one-out

Special case of k-fold cross-validation where k = the number of examples.
• Advantages:
  • each iteration uses a large number of data points for the training phase;
  • deterministic: no variance.
• Disadvantages:
  • the learning algorithm is run n times;
  • no guarantee that the examples are stratified.
  => used when n < 100.

After A. OSMANI, Cours Apprentissage symbolique

The 0.632 Bootstrap (continued)

For large data sets, 36.8% of the instances will appear in the test set.

Proof:
• an example has probability 1/n of being drawn into the training set at each draw, and therefore probability (1 - 1/n) of not being drawn;
• the probability that a particular instance is never drawn is $(1 - 1/n)^n \approx e^{-1} \approx 0.368$.

Training and testing with the 0.632 Bootstrap

• Start from the data set E.
• Repeat with different random samplings:
  • generate a bootstrap training set: E_train, with E_test = E - E_train;
  • build a classifier from the training data;
  • estimate the error rate as 0.632 · e_test + 0.368 · e_train.
• Average the results.

After A. OSMANI, Cours Apprentissage symbolique
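A sketch of the 0.632 bootstrap estimate described above, in Python with scikit-learn (the data set and the choice of classifier are placeholders of ours):

```python
import numpy as np
from sklearn.datasets import make_classification
from sklearn.tree import DecisionTreeClassifier

X, y = make_classification(n_samples=200, n_features=10, random_state=0)
n = len(y)
rng = np.random.default_rng(5)
estimates = []

for _ in range(50):                                  # repeat with different resamples
    train_idx = rng.integers(0, n, size=n)           # draw n times with replacement
    test_mask = np.ones(n, dtype=bool)
    test_mask[train_idx] = False                     # never-drawn examples -> test set
    clf = DecisionTreeClassifier(random_state=0).fit(X[train_idx], y[train_idx])
    e_train = 1 - clf.score(X[train_idx], y[train_idx])
    e_test = 1 - clf.score(X[test_mask], y[test_mask])
    estimates.append(0.632 * e_test + 0.368 * e_train)

print(np.mean(estimates))                            # averaged 0.632 bootstrap estimate
```

The training error of an unpruned tree is essentially zero here, which is precisely why the estimate down-weights it to 0.368 and leans on the out-of-sample term.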

Bagging

• Bootstrap aggregation, or bagging, is a general-purpose procedure for reducing the variance of a statistical learning method; we introduce it here because it is particularly useful and frequently used in the context of decision trees.
• Recall that given a set of n independent observations $Z_1, \ldots, Z_n$, each with variance $\sigma^2$, the variance of the mean $\bar{Z}$ of the observations is given by $\sigma^2/n$.
• In other words, averaging a set of observations reduces variance. Of course, this is not practical because we generally do not have access to multiple training sets.

Bagging (continued)

• Instead, we can bootstrap, by taking repeated samples from the (single) training data set.
• In this approach we generate B different bootstrapped training data sets. We then train our method on the bth bootstrapped training set in order to get $\hat{f}^{*b}(x)$, the prediction at a point x. We then average all the predictions to obtain
  $$\hat{f}_{\mathrm{bag}}(x) = \frac{1}{B} \sum_{b=1}^{B} \hat{f}^{*b}(x).$$
  This is called bagging.
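A direct transcription of the averaging formula (a sketch; the regression function and the query point x are made up for illustration):

```python
import numpy as np
from sklearn.tree import DecisionTreeRegressor

rng = np.random.default_rng(6)
X = rng.uniform(size=(300, 5))
y = np.sin(4 * X[:, 0]) + rng.normal(scale=0.3, size=300)
x_new = rng.uniform(size=(1, 5))                      # a point x at which to predict

B = 100
preds = []
for b in range(B):
    idx = rng.integers(0, len(y), size=len(y))        # b-th bootstrapped training set
    f_b = DecisionTreeRegressor().fit(X[idx], y[idx]) # fitted \hat{f}^{*b}
    preds.append(f_b.predict(x_new)[0])

f_bag = np.mean(preds)   # \hat{f}_bag(x) = (1/B) * sum_b \hat{f}^{*b}(x)
print(f_bag)
```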
Bagging classification trees

• The above prescription applied to regression trees.
• For classification trees: for each test observation, we record the class predicted by each of the B trees, and take a majority vote: the overall prediction is the most commonly occurring class among the B predictions.

Out-of-Bag Error Estimation

• It turns out that there is a very straightforward way to estimate the test error of a bagged model.
• Recall that the key to bagging is that trees are repeatedly fit to bootstrapped subsets of the observations. One can show that on average, each bagged tree makes use of around two-thirds of the observations.
• The remaining one-third of the observations not used to fit a given bagged tree are referred to as the out-of-bag (OOB) observations.
• We can predict the response for the ith observation using each of the trees in which that observation was OOB. This will yield around B/3 predictions for the ith observation, which we average.
• This estimate is essentially the LOO cross-validation error for bagging, if B is large.
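In scikit-learn the OOB machinery is built in; a brief sketch with a synthetic data set standing in for real data (BaggingClassifier defaults to decision trees as base learners):

```python
from sklearn.datasets import make_classification
from sklearn.ensemble import BaggingClassifier
from sklearn.model_selection import train_test_split

X, y = make_classification(n_samples=500, n_features=13, random_state=0)
X_tr, X_te, y_tr, y_te = train_test_split(X, y, random_state=0)

# Bagged classification trees; each tree votes and the majority wins.
bag = BaggingClassifier(n_estimators=200, oob_score=True,
                        random_state=0).fit(X_tr, y_tr)

print("OOB accuracy :", bag.oob_score_)     # estimated from the ~1/3 held-out points
print("Test accuracy:", bag.score(X_te, y_te))
```

With enough trees the OOB accuracy tracks the held-out test accuracy closely, which is the practical content of the last bullet above.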

Bagging the heart data

[Figure: bagging and random forest results for the Heart data; test error and OOB error plotted against the number of trees (0 to 300), with traces for Test: Bagging, Test: RandomForest, OOB: Bagging and OOB: RandomForest.]

Details of previous figure

• The test error (black and orange) is shown as a function of B, the number of bootstrapped training sets used.
• Random forests were applied with $m = \sqrt{p}$.
• The dashed line indicates the test error resulting from a single classification tree.
• The green and blue traces show the OOB error, which in this case is considerably lower.
Bagging (schematic)

From N examples with M features, draw bootstrap samples, grow a tree on each sample, and take the majority vote over the trees.

Bagging: a simulated example

• Generated a sample of size N = 30, with two classes and p = 5 features, each having a standard Gaussian distribution with pairwise correlation 0.95.
• The response Y was generated according to Pr(Y = 1 | x1 ≤ 0.5) = 0.2 and Pr(Y = 1 | x1 > 0.5) = 0.8.
• Notice that the bootstrap trees are different from the original tree.
• Treat the voting proportions as probabilities.

Hastie, Elements of Statistical Learning, Example 8.7.1: http://www-stat.stanford.edu/~hastie/Papers/ESLII.pdf
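The simulated setup can be reproduced in a few lines. This is a sketch: the tree depth and the inspection of the root split are our own choices, intended only to show how the highly correlated features make the bootstrap trees disagree with one another:

```python
import numpy as np
from sklearn.tree import DecisionTreeClassifier

rng = np.random.default_rng(7)
N, p, rho = 30, 5, 0.95
cov = np.full((p, p), rho) + (1 - rho) * np.eye(p)   # unit variance, pairwise corr 0.95
X = rng.multivariate_normal(np.zeros(p), cov, size=N)
prob1 = np.where(X[:, 0] <= 0.5, 0.2, 0.8)           # Pr(Y = 1) depends only on x1
y = rng.binomial(1, prob1)

# Original tree vs. trees grown on bootstrap resamples of the same 30 points.
original = DecisionTreeClassifier(max_depth=2).fit(X, y)
for b in range(3):
    idx = rng.integers(0, N, size=N)
    boot = DecisionTreeClassifier(max_depth=2).fit(X[idx], y[idx])
    # tree_.feature[0] is the index of the feature used at the root split.
    print("bootstrap tree", b, "splits first on feature", boot.tree_.feature[0])
print("original tree splits first on feature", original.tree_.feature[0])
```

Because all five features are nearly interchangeable, different resamples often pick different root splits, which is exactly the instability that bagging exploits.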


Random Forests

• Random forests provide an improvement over bagged trees by way of a small tweak that decorrelates the trees. This reduces the variance when we average the trees.
• As in bagging, we build a number of decision trees on bootstrapped training samples.
• But when building these decision trees, each time a split in a tree is considered, a random selection of m predictors is chosen as split candidates from the full set of p predictors. The split is allowed to use only one of those m predictors.
• A fresh selection of m predictors is taken at each split, and typically we choose $m \approx \sqrt{p}$, that is, the number of predictors considered at each split is approximately equal to the square root of the total number of predictors (4 out of the 13 for the Heart data).

Random Forests Algorithm

[Algorithm box figure, not reproduced in this transcription.]
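In scikit-learn terms, m corresponds to max_features; a sketch comparing bagging (m = p) with the square-root rule on a synthetic data set of our choosing:

```python
from sklearn.datasets import make_classification
from sklearn.ensemble import RandomForestClassifier
from sklearn.model_selection import cross_val_score

X, y = make_classification(n_samples=500, n_features=13, n_informative=6,
                           random_state=0)

# Bagging corresponds to m = p (max_features=None); a random forest typically
# uses m ~ sqrt(p) (max_features="sqrt"), i.e. about 4 of 13 predictors per split.
bagging = RandomForestClassifier(n_estimators=300, max_features=None, random_state=0)
forest = RandomForestClassifier(n_estimators=300, max_features="sqrt", random_state=0)

print("bagging (m = p)        :", cross_val_score(bagging, X, y, cv=5).mean())
print("random forest (m ~ √p) :", cross_val_score(forest, X, y, cv=5).mean())
```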

Random Forest Classifier

• Create a decision tree from each bootstrap sample of the N examples (each with M features).
• At each node, choose the split feature from only m < M randomly selected features.
• Take the majority vote over the trees.

Example: gene expression data

• We applied random forests to a high-dimensional biological data set consisting of expression measurements of 4,718 genes measured on tissue samples from 349 patients.
• There are around 20,000 genes in humans, and individual genes have different levels of activity, or expression, in particular cells, tissues, and biological conditions.
• Each of the patient samples has a qualitative label with 15 different levels: either normal or one of 14 different types of cancer.
• We use random forests to predict cancer type based on the 500 genes that have the largest variance in the training set.
• We randomly divided the observations into a training and a test set, and applied random forests to the training set for three different values of the number of splitting variables m.
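The gene expression experiment cannot be reproduced here without the data, but the recipe translates directly. The sketch below uses a randomly generated matrix of the same shape purely to show the pipeline (top-500-variance filter, then three values of m); the results it prints are meaningless for random labels:

```python
import numpy as np
from sklearn.ensemble import RandomForestClassifier
from sklearn.model_selection import train_test_split

rng = np.random.default_rng(8)
n_patients, n_genes, n_classes = 349, 4718, 15
X = rng.normal(size=(n_patients, n_genes))       # placeholder expression matrix
y = rng.integers(0, n_classes, size=n_patients)  # placeholder 15-level label

X_tr, X_te, y_tr, y_te = train_test_split(X, y, random_state=0)

# Keep the 500 genes with the largest variance in the training set.
top = np.argsort(X_tr.var(axis=0))[-500:]
X_tr, X_te = X_tr[:, top], X_te[:, top]

p = X_tr.shape[1]
for m in (p, p // 2, int(np.sqrt(p))):           # the three values of m in the slides
    rf = RandomForestClassifier(n_estimators=300, max_features=m, random_state=0)
    err = 1 - rf.fit(X_tr, y_tr).score(X_te, y_te)
    print(f"m = {m:3d}: test error = {err:.3f}")
```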

Results: gene expression data

[Figure: test classification error versus number of trees (0 to 500), with one curve for each of m = p, m = p/2 and m = √p.]

Details of previous figure

• Results from random forests for the fifteen-class gene expression data set with p = 500 predictors.
• The test error is displayed as a function of the number of trees. Each colored line corresponds to a different value of m, the number of predictors available for splitting at each interior tree node.
• Random forests (m < p) lead to a slight improvement over bagging (m = p). A single classification tree has an error rate of 45.7%.
Variable importance measure

• For bagged/RF regression trees, we record the total amount that the RSS is decreased due to splits over a given predictor, averaged over all B trees. A large value indicates an important predictor.
• Similarly, for bagged/RF classification trees, we add up the total amount that the Gini index is decreased by splits over a given predictor, averaged over all B trees.

[Figure: variable importance plot for the Heart data over the 13 predictors (Fbs, RestECG, ExAng, Sex, Slope, Chol, Age, RestBP, MaxHR, Oldpeak, ChestPain, Ca, Thal), on a 0-100 importance scale.]

Random forest: to read more

http://www-stat.stanford.edu/~hastie/Papers/ESLII.pdf
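The importance numbers behind such a plot are exposed directly by scikit-learn; a sketch on a synthetic stand-in for the 13 Heart predictors (the generic names X0..X12 are placeholders, not the real variable names):

```python
import numpy as np
from sklearn.datasets import make_classification
from sklearn.ensemble import RandomForestClassifier

# Stand-in for the 13 Heart predictors (Thal, Ca, ChestPain, MaxHR, ...).
X, y = make_classification(n_samples=303, n_features=13, n_informative=5,
                           random_state=0)
names = [f"X{j}" for j in range(13)]

rf = RandomForestClassifier(n_estimators=500, random_state=0).fit(X, y)

# feature_importances_ is the mean decrease in Gini impurity from splits on each
# predictor, averaged over the trees; rescale so the top predictor reads 100,
# as in the slide's plot.
imp = 100 * rf.feature_importances_ / rf.feature_importances_.max()
for name, value in sorted(zip(names, imp), key=lambda t: -t[1]):
    print(f"{name:4s} {value:6.1f}")
```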

Summary

• Decision trees are simple and interpretable models for regression and classification.
• However, they are often not competitive with other methods in terms of prediction accuracy.
• Bagging, random forests and boosting are good methods for improving the prediction accuracy of trees. They work by growing many trees on the training data and then combining the predictions of the resulting ensemble of trees.
• The latter two methods, random forests and boosting, are among the state-of-the-art methods for supervised learning. However, their results can be difficult to interpret.
