
DECISION TREE ALGORITHM

DECISION TREE

Decision trees are a type of supervised
learning in which the data is repeatedly split
according to certain parameters.

The tree has two kinds of nodes: decision
nodes and leaf nodes.

Decision nodes are where the data is split,
and leaves hold the final decisions or
outcomes.

Decision trees are used for both classification
and regression.

Two key measures guide the splits:

1. Entropy
2. Information gain

Entropy is a measure of impurity or uncertainty
in a dataset, and it is a key quantity in decision
trees: it tells the algorithm how mixed the data
at a node is, so splits can be chosen to produce
purer subsets.

Formula: Entropy(S) = -Σ_i p_i log2(p_i),
where p_i is the proportion of examples in S
that belong to class i.
Information gain measures how much splitting
on a feature reduces entropy; it is used to decide
which feature to split on at each internal node of
the decision tree.

Formula: Gain(S, A) = Entropy(S) - Σ_v (|S_v| / |S|) Entropy(S_v),
where S_v is the subset of S in which attribute A
takes the value v.
EXAMPLE
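As a worked example, here is a minimal Python sketch (not from the original slides; the toy "outlook" data and labels are hypothetical) that evaluates the two formulas above:

```python
# Minimal sketch: entropy and information gain on hypothetical toy data.
from collections import Counter
from math import log2

def entropy(labels):
    """Entropy(S) = -sum(p_i * log2(p_i)) over the classes in `labels`."""
    n = len(labels)
    return -sum((c / n) * log2(c / n) for c in Counter(labels).values())

def information_gain(rows, labels, attr):
    """Gain(S, A) = Entropy(S) - sum(|S_v|/|S| * Entropy(S_v)) over values v."""
    n = len(labels)
    gain = entropy(labels)
    for value in set(row[attr] for row in rows):
        subset = [lab for row, lab in zip(rows, labels) if row[attr] == value]
        gain -= (len(subset) / n) * entropy(subset)
    return gain

# Hypothetical "play tennis"-style data: one attribute, binary labels.
rows = [{"outlook": "sunny"}, {"outlook": "sunny"},
        {"outlook": "rain"}, {"outlook": "rain"}]
labels = ["no", "no", "yes", "yes"]
print(entropy(labels))                            # 1.0 (perfectly mixed)
print(information_gain(rows, labels, "outlook"))  # 1.0 (a perfect split)
```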
Gini index:

The Gini index is a statistical measure of
inequality or impurity in a dataset.
Purpose: measures how "pure" or "impure" a
dataset is. Pure data means all elements belong
to one category; impure data means they are
spread across multiple categories.
Range: values run from 0 toward 1:
0: perfectly pure (all elements belong to one
class).
Near 1: maximum impurity (elements evenly
distributed across all classes; for k classes the
maximum is 1 - 1/k, e.g. 0.5 for two classes).

Formula: Gini(S) = 1 - Σ_i p_i²,
where p_i is the proportion of examples in S
that belong to class i.

A higher Gini index indicates more impurity;
a lower one indicates more purity.
Example:
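A minimal sketch (toy labels are hypothetical, not from the slides) that evaluates the formula above:

```python
# Minimal sketch: Gini(S) = 1 - sum(p_i^2) over the classes in `labels`.
from collections import Counter

def gini(labels):
    n = len(labels)
    return 1.0 - sum((c / n) ** 2 for c in Counter(labels).values())

print(gini(["A"] * 6))              # 0.0  -> perfectly pure
print(gini(["A"] * 4 + ["B"] * 2))  # 1 - (4/6)^2 - (2/6)^2 ≈ 0.444
print(gini(["A", "B"]))             # 0.5  -> maximum impurity for 2 classes
```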
ALGORITHMS:

1. CART (Classification and Regression Trees)
Use Case: both classification and regression.
Split Criterion:
 Gini index for classification.
 Mean squared error (MSE) for regression.
Output: binary tree (each node splits into
exactly two branches).
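For context, scikit-learn's decision trees are a CART-style implementation (binary splits, Gini or squared error). A minimal sketch, assuming scikit-learn is installed:

```python
# Minimal sketch of CART via scikit-learn (binary splits).
from sklearn.datasets import load_iris
from sklearn.tree import DecisionTreeClassifier, DecisionTreeRegressor

X, y = load_iris(return_X_y=True)

# Classification with the Gini index criterion.
clf = DecisionTreeClassifier(criterion="gini", max_depth=3).fit(X, y)
print(clf.predict(X[:2]))

# Regression with the MSE criterion (named "squared_error" in
# recent scikit-learn versions).
reg = DecisionTreeRegressor(criterion="squared_error", max_depth=3).fit(X, y)
print(reg.predict(X[:2]))
```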
2. ID3 (Iterative Dichotomiser 3)
ID3 is one of the earliest decision tree
algorithms, developed by Ross Quinlan, and
is used for classification tasks.
Use Case: classification tasks.
Split Criterion: information gain (computed
from entropy).
Limitations:
 Does not handle continuous data directly.
 Prone to overfitting.
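ID3's core step, picking the attribute with the highest information gain and recursing, can be sketched as follows (toy data and attribute names are hypothetical, not from the slides):

```python
# Toy sketch of ID3's attribute selection by information gain.
from collections import Counter
from math import log2

def entropy(labels):
    n = len(labels)
    return -sum((c / n) * log2(c / n) for c in Counter(labels).values())

def best_attribute(rows, labels, attrs):
    def gain(attr):
        g = entropy(labels)
        for v in set(r[attr] for r in rows):
            sub = [lab for r, lab in zip(rows, labels) if r[attr] == v]
            g -= len(sub) / len(labels) * entropy(sub)
        return g
    # ID3 splits on this attribute, then recurses on each subset.
    return max(attrs, key=gain)

rows = [{"outlook": "sunny", "windy": "yes"},
        {"outlook": "sunny", "windy": "no"},
        {"outlook": "rain",  "windy": "yes"},
        {"outlook": "rain",  "windy": "no"}]
labels = ["no", "no", "yes", "yes"]
print(best_attribute(rows, labels, ["outlook", "windy"]))  # "outlook"
```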
3. C4.5
C4.5 is an advanced decision tree algorithm
developed by Ross Quinlan as an
improvement over ID3.
Use Case: classification tasks.
Split Criterion: gain ratio (information gain
normalized by split information).
Features:
 Handles continuous and missing data.
 Prunes trees to avoid overfitting.
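A hedged sketch of C4.5's gain ratio, which divides information gain by the attribute's split information to penalize attributes with many distinct values (toy data, hypothetical names):

```python
# Toy sketch: GainRatio(S, A) = Gain(S, A) / SplitInfo(A).
from collections import Counter
from math import log2

def entropy(labels):
    n = len(labels)
    return -sum((c / n) * log2(c / n) for c in Counter(labels).values())

def gain_ratio(rows, labels, attr):
    n = len(labels)
    values = [r[attr] for r in rows]
    gain = entropy(labels)
    for v in set(values):
        sub = [lab for r, lab in zip(rows, labels) if r[attr] == v]
        gain -= len(sub) / n * entropy(sub)
    # SplitInfo(A) = -sum(|S_v|/|S| * log2(|S_v|/|S|)), i.e. the entropy
    # of the attribute's value distribution.
    split_info = entropy(values)
    return gain / split_info if split_info else 0.0

rows = [{"outlook": "sunny"}, {"outlook": "sunny"},
        {"outlook": "rain"}, {"outlook": "rain"}]
labels = ["no", "no", "yes", "yes"]
print(gain_ratio(rows, labels, "outlook"))  # 1.0 / 1.0 = 1.0
```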
4. CHAID (Chi-squared Automatic
Interaction Detection)
Use Case: primarily categorical data.
Split Criterion: chi-square test of
independence.
Features:
 Produces multi-way splits (not restricted to
binary splits).
 Often used in marketing and survey data analysis.
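CHAID's split test can be sketched with SciPy's chi-square test of independence; the contingency counts below are hypothetical survey-style data:

```python
# Sketch of CHAID's split criterion: chi-square test of independence
# between a candidate attribute and the class label.
from scipy.stats import chi2_contingency

# Rows: attribute categories; columns: class counts (e.g. buy / no-buy).
table = [[30, 10],   # category A
         [12, 28],   # category B
         [20, 20]]   # category C
chi2, p, dof, expected = chi2_contingency(table)
print(f"chi2={chi2:.2f}, p={p:.4f}")  # a small p-value favors splitting here
```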
Evaluating the decision tree:

 Split Data
Divide the dataset into training and test sets to evaluate
performance on unseen data.
 Make Predictions
Use the decision tree to predict outcomes for the test set.
 Compare Predictions
Compare the tree’s predictions with the actual outcomes
from the test set.
 Calculate Metrics
Measure performance using metrics like accuracy, precision,
recall, or F1 score.
 Analyze and Improve
Check for overfitting, adjust parameters (such as
tree depth), and retrain if needed.
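These steps can be run end-to-end with scikit-learn; a minimal sketch, not prescribed by the slides:

```python
# Minimal sketch of the evaluation workflow above.
from sklearn.datasets import load_iris
from sklearn.model_selection import train_test_split
from sklearn.tree import DecisionTreeClassifier
from sklearn.metrics import accuracy_score, f1_score

X, y = load_iris(return_X_y=True)

# 1. Split the data into training and test sets.
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.3, random_state=0)

# 2. Fit the tree and predict outcomes for the unseen test set.
tree = DecisionTreeClassifier(max_depth=3, random_state=0).fit(X_train, y_train)
y_pred = tree.predict(X_test)

# 3-4. Compare predictions with actual outcomes and compute metrics.
print("accuracy:", accuracy_score(y_test, y_pred))
print("f1 (macro):", f1_score(y_test, y_pred, average="macro"))

# 5. To check for overfitting, compare training vs. test accuracy
# and tune parameters such as max_depth.
print("train accuracy:", tree.score(X_train, y_train))
```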
THANK YOU
