
From tree to forest

1 / 19
Decision tree
● Decision trees are very popular supervised classification
algorithms:
– They perform quite well on classification problems
– The decision path is relatively easy to interpret
– The algorithm to build (train) them is fast and simple

● A decision tree is a flowchart-like structure made of nodes and branches:
– At each node, a split on the data is performed based on one of the input features, generating two or more branches.
– More and more splits are made in the upcoming nodes to partition the original data.
– This continues until a node is generated where all or almost all of the data belong to the same class.
2 / 19
Example: Sailing plan

3 / 19
Building a decision tree
● There are several automatic procedures (like C4.5, ID3 or the
CART algorithm) to extract the rules from the data to build a
decision tree.

● These algorithms partition the training set into subsets until each partition is either “pure” in terms of target class or sufficiently small:
– A pure subset is a subset that contains only samples of one class.
– Each partitioning operation is implemented by a rule that splits the incoming data based on the values of one of the input features.
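
As a concrete illustration (not part of the original slides), the minimal sketch below trains one such algorithm, the CART implementation available in scikit-learn; the iris dataset and the parameter choices are my own assumptions.

# Minimal sketch: training a CART-style decision tree with scikit-learn.
# The dataset and parameters are illustrative assumptions.
from sklearn.datasets import load_iris
from sklearn.model_selection import train_test_split
from sklearn.tree import DecisionTreeClassifier, export_text

X, y = load_iris(return_X_y=True)
X_train, X_test, y_train, y_test = train_test_split(X, y, random_state=42)

# Fit a tree; the split quality measure ("criterion") can be gini or entropy.
tree = DecisionTreeClassifier(criterion="entropy", random_state=42)
tree.fit(X_train, y_train)

print(tree.score(X_test, y_test))   # accuracy on unseen data
print(export_text(tree))            # the learned split rules as text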

4 / 19
Split rules

● How does an algorithm decide which feature to use at each point to split the input subset?
– At each step, the algorithm uses the feature that leads to the purest output subsets.
– Therefore, we need a metric to measure the purity of a split:
● Information gain
● Gini index
● Gain ratio

5 / 19
Entropy
● Entropy is used to measure purity, information, or disorder:

\[ \mathrm{Entropy}(p) = -\sum_{i=1}^{N} p_i \log_2 p_i \]

where p is the whole dataset, N is the number of classes, and p_i is the frequency of class i in the dataset.
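
A small sketch (not from the slides) of how this entropy could be computed from an array of class labels; the function name and the NumPy-based implementation are my own:

import numpy as np

def entropy(labels):
    """Entropy (in bits) of a 1-D array-like of class labels."""
    _, counts = np.unique(labels, return_counts=True)
    p = counts / counts.sum()                 # class frequencies p_i
    return float(-np.sum(p * np.log2(p)))     # -sum_i p_i * log2(p_i)

# A perfectly balanced two-class node is maximally "confused":
print(entropy(["yes"] * 4 + ["no"] * 4))      # 1.0
# A pure node (only one class) has zero entropy.
print(entropy(["yes"] * 8))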

6 / 19
Entropy based splits

The goal of each split in a decision tree is to move from a confused dataset to two (or more) purer subsets with lower entropy.

Ideally, the split should lead to subsets with an entropy of 0.0.

7 / 19
Information Gain (ID3)

● In order to evaluate how good a feature is for splitting, the difference in entropy before and after the split is calculated:

\[ \mathrm{Gain} = \mathrm{Entropy}(\mathrm{before}) - \sum_{j=1}^{K} \frac{N_j}{N} \, \mathrm{Entropy}(j, \mathrm{after}) \]

where “before” is the dataset before the split, K is the number of subsets generated by the split, (j, after) is subset j after the split, and N_j / N is the fraction of the samples that falls into subset j.

● We choose to split the data on the feature with the highest value of information gain.
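
A sketch (mine, not from the slides) of the size-weighted information gain described above; it reuses the entropy function from the previous example:

import numpy as np

def entropy(labels):
    _, counts = np.unique(labels, return_counts=True)
    p = counts / counts.sum()
    return float(-np.sum(p * np.log2(p)))

def information_gain(parent_labels, child_label_subsets):
    """Entropy(before) minus the size-weighted entropy of the K subsets."""
    n = len(parent_labels)
    weighted_after = sum(len(c) / n * entropy(c) for c in child_label_subsets)
    return entropy(parent_labels) - weighted_after

# Splitting a mixed node into two pure children recovers all the entropy.
before = ["yes"] * 4 + ["no"] * 4
print(information_gain(before, [["yes"] * 4, ["no"] * 4]))   # 1.0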

8 / 19
Gain Ratio (C4.5)
The information gained by a balanced split is higher than the information gained by an unbalanced split. To correct for this bias, C4.5 uses the gain ratio: the information gain divided by the split information, i.e. the entropy of the subset size distribution.
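
A sketch of the gain ratio as used in C4.5 (the helper names and the toy split below are my own assumptions):

import numpy as np

def entropy(labels):
    _, counts = np.unique(labels, return_counts=True)
    p = counts / counts.sum()
    return float(-np.sum(p * np.log2(p)))

def gain_ratio(parent_labels, child_label_subsets):
    """Information gain divided by the entropy of the subset sizes."""
    n = len(parent_labels)
    weights = np.array([len(c) / n for c in child_label_subsets])
    gain = entropy(parent_labels) - sum(
        w * entropy(c) for w, c in zip(weights, child_label_subsets))
    split_info = float(-np.sum(weights * np.log2(weights)))  # entropy of subset sizes
    return gain / split_info if split_info > 0 else 0.0

before = ["yes"] * 6 + ["no"] * 6
balanced   = [before[:6], before[6:]]      # two subsets of size 6
unbalanced = [before[:1], before[1:]]      # subsets of size 1 and 11
print(gain_ratio(before, balanced), gain_ratio(before, unbalanced))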

9 / 19
Gini index
● Gini impurity is a measure of how often a randomly chosen element from the set would be incorrectly labeled:

\[ \mathrm{Gini}(p) = \sum_{i=1}^{N} p_i (1 - p_i) = 1 - \sum_{i=1}^{N} p_i^2 \]

Proof: since \( \sum_{i=1}^{N} p_i = 1 \), we have \( \sum_i p_i (1 - p_i) = \sum_i p_i - \sum_i p_i^2 = 1 - \sum_i p_i^2 \).

● The Gini index of a split:

\[ \mathrm{Gini}_{\mathrm{split}} = \sum_{j=1}^{K} \frac{N_j}{N} \, \mathrm{Gini}(j, \mathrm{after}) \]

where K is the number of subsets generated by the split, (j, after) is subset j after the split, and N_j / N is the fraction of the samples that falls into subset j.

10 / 19
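
A sketch of the Gini impurity and the size-weighted Gini index of a split (the function names are my own):

import numpy as np

def gini_impurity(labels):
    """1 - sum_i p_i^2 for the class frequencies p_i of the labels."""
    _, counts = np.unique(labels, return_counts=True)
    p = counts / counts.sum()
    return float(1.0 - np.sum(p ** 2))

def gini_index(child_label_subsets):
    """Size-weighted average Gini impurity of the K subsets of a split."""
    n = sum(len(c) for c in child_label_subsets)
    return sum(len(c) / n * gini_impurity(c) for c in child_label_subsets)

print(gini_impurity(["yes"] * 4 + ["no"] * 4))   # 0.5: maximally impure two-class node
print(gini_index([["yes"] * 4, ["no"] * 4]))     # 0.0: both subsets are pure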
Split candidates

Nominal features:
– We can create a child node for each possible value (a wider tree).
– We can make a binary split, grouping the values into two sets (a deeper tree).
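
A small sketch of how the two options could be enumerated for a nominal feature; the feature values are made up for illustration:

from itertools import combinations

values = ["sunny", "overcast", "rainy"]   # hypothetical values of a nominal feature

# Option 1: one child node per possible value (a wider tree).
multiway_split = [{v} for v in values]

# Option 2: candidate binary splits, each grouping the values into two sets
# (a deeper tree). For three values, subsets of size 1 already give every
# distinct two-way grouping.
binary_splits = [(set(group), set(values) - set(group))
                 for group in combinations(values, 1)]

print(multiway_split)
print(binary_splits)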

11 / 19
Split candidates

Numerical features:
– In principle, all numerical values could be split candidates (computationally expensive).
– In practice, the candidate split points are taken in between every two consecutive values of the selected numerical feature, and the binary split producing the best quality measure is adopted.
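
A sketch of this procedure: candidate thresholds are the midpoints between consecutive sorted values, and the one with the best (here: lowest size-weighted Gini) quality is kept. The helper names and the toy data are my own assumptions:

import numpy as np

def gini_impurity(labels):
    _, counts = np.unique(labels, return_counts=True)
    p = counts / counts.sum()
    return float(1.0 - np.sum(p ** 2))

def best_numeric_split(feature, labels):
    """Try the midpoints between consecutive distinct feature values and
    return the threshold with the lowest size-weighted Gini impurity."""
    x, y = np.asarray(feature, dtype=float), np.asarray(labels)
    distinct = np.unique(x)
    candidates = (distinct[:-1] + distinct[1:]) / 2.0   # midpoints
    def quality(t):
        left, right = y[x <= t], y[x > t]
        return (len(left) * gini_impurity(left) +
                len(right) * gini_impurity(right)) / len(y)
    return min(candidates, key=quality)

temperature = [12, 18, 21, 25, 30, 31]
go_sailing  = ["no", "no", "yes", "yes", "yes", "yes"]
print(best_numeric_split(temperature, go_sailing))   # a threshold between 18 and 21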

12 / 19
Size and Overfitting

● Trees that are too deep can lead to models that are too detailed and don’t generalize to new data.
● On the other hand, trees that are too shallow might lead to overly simple models that can’t fit the data.

13 / 19
Pruning

● Pruning is a way to avoid overfitting.

● Pruning is applied to a decision tree after the training phase.

● Basically, we let the tree grow as much as allowed by its settings, without applying any explicit restrictions. At the end, we proceed to cut those branches that are not sufficiently populated.

14 / 19
Reduced error pruning

At each iteration,
– a sparsely populated branch is pruned;
– the pruned tree is applied again to the training data;
– if the pruning of the branch doesn’t decrease the accuracy on the training set, the branch is removed.
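
Reduced error pruning is not available in scikit-learn; as a loosely related illustration of post-training pruning, the sketch below uses scikit-learn's cost-complexity pruning, which likewise cuts back a fully grown tree after training. The dataset and the selection by held-out accuracy are my own choices:

from sklearn.datasets import load_iris
from sklearn.model_selection import train_test_split
from sklearn.tree import DecisionTreeClassifier

X, y = load_iris(return_X_y=True)
X_train, X_test, y_train, y_test = train_test_split(X, y, random_state=0)

# Grow the tree freely, then compute the pruning path.
full_tree = DecisionTreeClassifier(random_state=0).fit(X_train, y_train)
path = full_tree.cost_complexity_pruning_path(X_train, y_train)

# Refit with increasing pruning strength and keep the best tree on held-out data.
best = max(
    (DecisionTreeClassifier(random_state=0, ccp_alpha=a).fit(X_train, y_train)
     for a in path.ccp_alphas),
    key=lambda t: t.score(X_test, y_test),
)
print(best.get_n_leaves(), best.score(X_test, y_test))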

15 / 19
Early Stopping

● Another option to avoid overfitting is early stopping, based on a stopping criterion.
● One common stopping criterion is the minimum number of samples per node:
– a higher value of this minimum number leads to shallower trees,
– while a smaller value leads to deeper trees.
● What other criteria?
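
As an illustration (not from the slides), scikit-learn's decision tree exposes several stopping criteria as hyperparameters; the particular values below are arbitrary:

from sklearn.datasets import load_iris
from sklearn.tree import DecisionTreeClassifier

X, y = load_iris(return_X_y=True)

# Each parameter is a stopping criterion: growth stops when a split would
# violate the minimum number of samples, exceed the maximum depth, or not
# decrease the impurity by at least the given threshold.
tree = DecisionTreeClassifier(
    min_samples_leaf=10,          # minimum number of samples per leaf
    max_depth=4,                  # maximum tree depth
    min_impurity_decrease=0.01,   # minimum quality improvement required by a split
).fit(X, y)

print(tree.get_depth(), tree.get_n_leaves())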

17 / 19
Random forest
● Many is better than one.
– Several decision trees together can produce more accurate
predictions than just one single decision tree by itself.
● The random forest algorithm builds N slightly differently trained decision trees and merges them together to get more accurate and stable predictions.
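
A minimal sketch of training such an ensemble with scikit-learn's RandomForestClassifier; the dataset and N = 100 are arbitrary choices of mine:

from sklearn.datasets import load_iris
from sklearn.ensemble import RandomForestClassifier
from sklearn.model_selection import cross_val_score

X, y = load_iris(return_X_y=True)

# N = 100 slightly differently trained trees, each fit on a bootstrap sample
# and considering a random subset of features at every split.
forest = RandomForestClassifier(n_estimators=100, random_state=0)
print(cross_val_score(forest, X, y, cv=5).mean())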

18 / 19
Bootstrapping of Training Sets
In a random forest, N decision trees are trained, each on a subset of the original training set obtained via bootstrapping of the original dataset (random sampling with replacement).
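
A sketch of drawing one bootstrapped training subset with NumPy (the sizes are arbitrary):

import numpy as np

rng = np.random.default_rng(seed=0)
n_samples = 10                        # size of the original training set
original_indices = np.arange(n_samples)

# Random sampling *with replacement*: some rows appear several times,
# others are left out of this particular bootstrap sample.
bootstrap_indices = rng.choice(original_indices, size=n_samples, replace=True)
print(np.sort(bootstrap_indices))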

19 / 19
The Majority Rule

● The N slightly differently trained trees will produce N slightly different predictions for the same input vector.
● Usually, the majority rule is applied to make the final
decision.
● The prediction offered by the majority of the N trees is
adopted as the final one.
● While the predictions from a single tree are highly
sensitive to noise in the training set, predictions from
the majority of many trees are not (if trees are diverse).
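
A sketch of the majority rule applied to the N per-tree predictions for a single input vector; the vote counts below are made up:

from collections import Counter

# Predictions of N = 7 slightly different trees for the same input vector.
tree_predictions = ["go sailing", "stay home", "go sailing", "go sailing",
                    "stay home", "go sailing", "go sailing"]

# Majority rule: the class predicted by most trees becomes the final prediction.
final_prediction, votes = Counter(tree_predictions).most_common(1)[0]
print(final_prediction, votes)      # go sailing 5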

20 / 19
