
Tree Building – Regression, Classification, Overfitting, Pruning and Complexity, Multiple Decision Trees
Introduction to Tree Building

Decision trees are a versatile model used for both classification and regression tasks.
They work by splitting the data into subsets based on feature values.
This presentation explores key concepts in tree building, including overfitting and pruning.
What is a Decision Tree?

A decision tree is a flowchart-like structure used for decision-making.
It consists of internal nodes (tests on features) and leaves (outcomes).
Decision trees are easy to visualize, which makes them intuitive to interpret.
Types of Decision Trees

Decision trees fall into two categories: classification trees and regression trees.
Classification trees predict categorical outcomes.
Regression trees predict continuous numeric values from the input features.
Decision Tree for Classification

In classification, the goal is to assign class labels to instances based on their features.
Each internal node tests an attribute, with a branch for each outcome of the test.
The leaves hold the predicted classes for the inputs that reach them (see the sketch below).
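A minimal sketch of a classification tree, assuming scikit-learn; the Iris dataset and the max_depth setting are illustrative choices, not part of the slides.

```python
from sklearn.datasets import load_iris
from sklearn.model_selection import train_test_split
from sklearn.tree import DecisionTreeClassifier, export_text

data = load_iris()
X_train, X_test, y_train, y_test = train_test_split(data.data, data.target, random_state=0)

clf = DecisionTreeClassifier(max_depth=3, random_state=0)  # each internal node tests one feature
clf.fit(X_train, y_train)

print(clf.score(X_test, y_test))                            # accuracy on held-out data
print(export_text(clf, feature_names=data.feature_names))   # text view of the learned splits
```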
Decision Tree for Regression

Regression trees predict a continuous value rather than a class label.
Splits are chosen to minimize the variance of the target variable within the resulting nodes.
Each leaf predicts the average of the target values of the training instances that fall in it (sketched below).
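A minimal sketch of a regression tree, again assuming scikit-learn; the diabetes dataset and the depth limit are illustrative.

```python
from sklearn.datasets import load_diabetes
from sklearn.model_selection import train_test_split
from sklearn.tree import DecisionTreeRegressor

X, y = load_diabetes(return_X_y=True)
X_train, X_test, y_train, y_test = train_test_split(X, y, random_state=0)

reg = DecisionTreeRegressor(max_depth=4, random_state=0)  # splits chosen to reduce squared error
reg.fit(X_train, y_train)

print(reg.predict(X_test[:5]))  # each prediction is the mean target of a leaf, so values repeat
```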
How Trees Split Data

Trees are grown with algorithms such as CART (Classification and Regression Trees).
The splitting criterion is typically Gini impurity (or entropy) for classification and Mean Squared Error for regression.
The objective at each split is to maximize information gain, i.e. to minimize the impurity of the child nodes (a worked example follows).
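A hand-rolled illustration of the two impurity measures named above; the small arrays are made up purely for demonstration.

```python
import numpy as np

def gini(labels):
    """Gini impurity: 1 minus the sum of squared class proportions."""
    _, counts = np.unique(labels, return_counts=True)
    p = counts / counts.sum()
    return 1.0 - np.sum(p ** 2)

def mse(values):
    """Mean squared error around the node mean (the node's variance)."""
    return np.mean((values - values.mean()) ** 2)

labels = np.array([0, 0, 1, 1, 1, 1])
print(gini(labels))                         # impurity of a mixed classification node
print(gini(labels[:2]), gini(labels[2:]))   # a perfect split yields pure children -> 0.0

values = np.array([1.0, 1.2, 5.0, 5.3])
print(mse(values), mse(values[:2]), mse(values[2:]))  # the split reduces variance in each child
```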
Overfitting in Decision Trees

Overfitting occurs when a tree learns the noise in the training data rather than the underlying pattern.
A tree that is too deep can memorize outliers, leading to poor generalization.
The key is to balance model complexity against predictive performance on unseen data.
Signs of Overfitting

High accuracy on the training data but poor performance on validation data is the clearest sign of overfitting.
A very complex tree with a large number of splits is another warning sign.
Inspecting the tree's depth and structure can help identify it (compare the train and validation scores in the sketch below).
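A sketch that surfaces the classic overfitting signature, assuming scikit-learn: training accuracy keeps climbing with depth while validation accuracy stalls or drops. The synthetic dataset is illustrative.

```python
from sklearn.datasets import make_classification
from sklearn.model_selection import train_test_split
from sklearn.tree import DecisionTreeClassifier

X, y = make_classification(n_samples=1000, n_features=20, flip_y=0.1, random_state=0)
X_train, X_val, y_train, y_val = train_test_split(X, y, random_state=0)

for depth in (2, 4, 8, 16, None):  # None lets the tree grow until the leaves are pure
    tree = DecisionTreeClassifier(max_depth=depth, random_state=0).fit(X_train, y_train)
    print(depth, tree.score(X_train, y_train), tree.score(X_val, y_val))
```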
Pruning Techniques

Pruning reduces the size of the tree to improve generalization.
There are two main types: pre-pruning (stopping growth early) and post-pruning (removing nodes after the tree is built).
Pruning eliminates branches that contribute little, thereby reducing complexity.
Pre-Pruning Strategies

Pre-pruning sets conditions that halt tree growth while the tree is being built.
Common conditions include limiting the maximum depth or the minimum number of samples per leaf.
This yields a simpler model before overfitting can occur (see the sketch below).
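A sketch of pre-pruning with scikit-learn: growth is halted by constructor-level constraints. The specific limits and the dataset are illustrative, not recommendations.

```python
from sklearn.datasets import load_breast_cancer
from sklearn.tree import DecisionTreeClassifier

X, y = load_breast_cancer(return_X_y=True)

pre_pruned = DecisionTreeClassifier(
    max_depth=5,                 # cap the tree depth
    min_samples_leaf=10,         # every leaf keeps at least 10 samples
    min_impurity_decrease=1e-3,  # require a minimum impurity reduction per split
    random_state=0,
).fit(X, y)

print(pre_pruned.get_depth(), pre_pruned.get_n_leaves())
```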
Post-Pruning Strategies

Post-pruning trims the tree after it has been fully grown.
Techniques such as cost-complexity pruning weigh each subtree's contribution against its size.
This can significantly improve the model's ability to generalize to unseen data (sketched below).
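A sketch of post-pruning via cost-complexity pruning, assuming scikit-learn's ccp_alpha parameter; the dataset and the simple validation-based choice of alpha are illustrative.

```python
from sklearn.datasets import load_breast_cancer
from sklearn.model_selection import train_test_split
from sklearn.tree import DecisionTreeClassifier

X, y = load_breast_cancer(return_X_y=True)
X_train, X_val, y_train, y_val = train_test_split(X, y, random_state=0)

# Effective alphas at which successive subtrees of the fully grown tree are pruned away.
path = DecisionTreeClassifier(random_state=0).cost_complexity_pruning_path(X_train, y_train)

# Refit one tree per alpha and keep the one that scores best on the validation split.
best = max(
    (DecisionTreeClassifier(random_state=0, ccp_alpha=a).fit(X_train, y_train)
     for a in path.ccp_alphas),
    key=lambda t: t.score(X_val, y_val),
)
print(best.get_n_leaves(), best.score(X_val, y_val))
```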
Complexity of Decision Trees

The complexity of a decision tree is determined by its depth and its number of leaves.
More complex trees can capture more patterns but are more prone to overfitting.
Understanding the trade-off between complexity and performance is essential for effective modeling (illustrated below).
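A small illustration of how tree size grows with the allowed depth, assuming scikit-learn; the synthetic data is only there to make the sketch runnable.

```python
from sklearn.datasets import make_classification
from sklearn.tree import DecisionTreeClassifier

X, y = make_classification(n_samples=500, n_features=10, random_state=0)

for depth in (2, 4, 8, None):  # None = grow until the leaves are pure
    t = DecisionTreeClassifier(max_depth=depth, random_state=0).fit(X, y)
    print(depth, t.get_depth(), t.get_n_leaves())  # deeper trees mean more leaves, more complexity
```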
Evaluating Tree Performance

For classification trees, common metrics are accuracy, precision, recall, and the F1 score.
For regression trees, metrics such as Mean Absolute Error (MAE) and R-squared are used.
Performance should be evaluated on both the training and validation datasets (see the sketch below).
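A sketch of the metrics named above, assuming scikit-learn; the two bundled datasets are illustrative.

```python
from sklearn.datasets import load_breast_cancer, load_diabetes
from sklearn.metrics import classification_report, mean_absolute_error, r2_score
from sklearn.model_selection import train_test_split
from sklearn.tree import DecisionTreeClassifier, DecisionTreeRegressor

# Classification metrics: accuracy, precision, recall, F1.
Xc, yc = load_breast_cancer(return_X_y=True)
Xc_tr, Xc_te, yc_tr, yc_te = train_test_split(Xc, yc, random_state=0)
clf = DecisionTreeClassifier(max_depth=4, random_state=0).fit(Xc_tr, yc_tr)
print(classification_report(yc_te, clf.predict(Xc_te)))

# Regression metrics: MAE and R-squared.
Xr, yr = load_diabetes(return_X_y=True)
Xr_tr, Xr_te, yr_tr, yr_te = train_test_split(Xr, yr, random_state=0)
reg = DecisionTreeRegressor(max_depth=4, random_state=0).fit(Xr_tr, yr_tr)
print(mean_absolute_error(yr_te, reg.predict(Xr_te)), r2_score(yr_te, reg.predict(Xr_te)))
```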
Ensemble Methods

Ensemble methods combine multiple decision trees to improve performance.
Random Forests and Gradient Boosted Trees are popular examples.
These methods reduce the variance of the predictions and enhance generalization.
Random Forests Overview

A random forest builds many decision trees on random subsets of the data and of the features.
The trees vote (or are averaged) to produce a prediction that is more robust than any single tree's.
This aggregation mitigates the overfitting that individual deep trees are prone to (sketched below).
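A sketch of a random forest, assuming scikit-learn; the number of trees, the feature-subsampling setting, and the dataset are illustrative.

```python
from sklearn.datasets import load_breast_cancer
from sklearn.ensemble import RandomForestClassifier
from sklearn.model_selection import cross_val_score

X, y = load_breast_cancer(return_X_y=True)

# 200 trees, each grown on a bootstrap sample and a random subset of features per split.
forest = RandomForestClassifier(n_estimators=200, max_features="sqrt", random_state=0)
print(cross_val_score(forest, X, y, cv=5).mean())  # averaged cross-validated accuracy
```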
Gradient Boosting Trees

Gradient boosting builds trees sequentially, with each new tree correcting the errors of the ensemble so far.
The method directly optimizes a loss function, which often leads to high accuracy.
Boosted trees can be more susceptible to overfitting than random forests, so careful tuning is required (sketched below).
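A sketch of gradient boosting with scikit-learn: shallow trees fitted one after another to the residual errors of the current ensemble. The learning rate, tree count, and dataset shown are illustrative tuning choices.

```python
from sklearn.datasets import load_breast_cancer
from sklearn.ensemble import GradientBoostingClassifier
from sklearn.model_selection import cross_val_score

X, y = load_breast_cancer(return_X_y=True)

# Shallow trees added sequentially; learning_rate and n_estimators are the usual knobs to tune.
gbt = GradientBoostingClassifier(n_estimators=300, learning_rate=0.05, max_depth=3, random_state=0)
print(cross_val_score(gbt, X, y, cv=5).mean())
```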
Handling Missing Values

Some decision-tree implementations can handle missing values directly during splitting.
Classical CART does this with surrogate splits; otherwise, imputation is applied before training.
This makes tree-based models a practical choice for datasets with incomplete records (one common approach is sketched below).
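A sketch of one common way to cope with missing values before fitting a tree: imputation inside a pipeline. Note that scikit-learn's trees do not expose CART-style surrogate splits, though recent versions can route NaN inputs natively; the tiny array here is made up for illustration.

```python
import numpy as np
from sklearn.impute import SimpleImputer
from sklearn.pipeline import make_pipeline
from sklearn.tree import DecisionTreeClassifier

X = np.array([[1.0, np.nan], [2.0, 3.0], [np.nan, 4.0], [3.0, 5.0]])
y = np.array([0, 0, 1, 1])

# Fill missing entries with the column median, then fit the tree on the completed data.
model = make_pipeline(SimpleImputer(strategy="median"), DecisionTreeClassifier(random_state=0))
model.fit(X, y)
print(model.predict(np.array([[2.5, np.nan]])))
```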
Practical Applications

Decision trees are widely used in domains such as finance, healthcare, and marketing.
Typical applications include customer segmentation, credit scoring, and predictive analytics.
Their interpretability makes them a favored choice for applications requiring clear decision paths.
Summary of Key Concepts

Decision trees are powerful tools for both classification and regression tasks.
Managing complexity through pruning and using ensemble methods can enhance performance.
Understanding the balance between bias and variance is crucial in tree building.