
Absolutely!

Let’s now dive into the Decision Tree Classifier using the ID3 algorithm,
explained from the ground up — step-by-step, simple yet deep, with intuition, visuals,
and Python code (with output). 🌳

🌳 Decision Tree (ID3 Algorithm) — Beginner-Friendly Guide

📘 What is a Decision Tree?


A decision tree is a flowchart-like tree structure where:

Each internal node tests a feature.

Each branch represents the outcome of that test.

Each leaf node gives a final class label.

It's like playing "20 Questions" to arrive at an answer!
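For intuition, here is a hand-written toy sketch (purely illustrative, not produced by any algorithm) of how such a tree reads as nested if/else questions:

# A toy tree written by hand: each "if" is an internal node,
# each branch is a test outcome, and each "return" is a leaf label.
def will_play(outlook, wind):
    if outlook == "Overcast":      # internal node: test the Outlook feature
        return "Yes"               # leaf node
    elif outlook == "Sunny":
        return "No"
    else:                          # Rain branch
        if wind == "Weak":         # another internal node: test Wind
            return "Yes"
        else:
            return "No"

print(will_play("Rain", "Weak"))   # -> "Yes"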

🧠 What is ID3?
ID3 (Iterative Dichotomiser 3) is one of the earliest and most well-known decision
tree algorithms.

It uses:
Entropy: Measures impurity (randomness) in data.

Information Gain: Measures how much "uncertainty" is removed by a feature.

✅ Why Use ID3 Decision Tree?


Easy to interpret (white box)

Handles categorical features naturally (classic ID3 needs numerical features to be discretized; successors like C4.5 handle them directly)

No need for feature scaling

Great for small-to-medium datasets


🧮 Step-by-Step: Building a Tree with ID3
Example Dataset:

Outlook Temperature Humidity Wind Play

Sunny Hot High Weak No

Sunny Hot High Strong No

Overcast Hot High Weak Yes

Rain Mild High Weak Yes

Rain Cool Normal Weak Yes

Rain Cool Normal Strong No

Overcast Cool Normal Strong Yes

Sunny Mild High Weak No

We want to predict Play based on other features.

🔢 Step 1: Calculate Entropy


Entropy is a measure of uncertainty:

Entropy(S) = −p₊ · log₂(p₊) − p₋ · log₂(p₋)

where p₊ and p₋ are the proportions of positive ("Yes") and negative ("No") examples in S.

For example, if there are 4 "Yes" and 4 "No":

Entropy = −0.5 · log₂(0.5) − 0.5 · log₂(0.5) = 1

Lower entropy means more purity.
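To make the formula concrete, here is a minimal, self-contained sketch (not a library function, just an illustration) that computes entropy from class counts:

import math

def entropy(counts):
    """Entropy of a label distribution, given as a list of class counts."""
    total = sum(counts)
    result = 0.0
    for c in counts:
        if c > 0:                      # 0 * log2(0) is treated as 0
            p = c / total
            result -= p * math.log2(p)
    return result

print(entropy([4, 4]))  # 4 "Yes" vs 4 "No" -> 1.0 (maximum impurity)
print(entropy([8, 0]))  # all one class    -> 0.0 (perfectly pure)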

🔍 Step 2: Compute Information Gain


Gain(S, A) = Entropy(S) − Σ_{v ∈ Values(A)} (|S_v| / |S|) · Entropy(S_v)

where S_v is the subset of S on which feature A takes the value v.
We choose the feature that maximizes information gain to split the node.
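Putting both steps together, the sketch below (a from-scratch illustration, not library code) computes the information gain of every feature on the 8-row dataset above. Outlook comes out highest (about 0.66), which is why ID3 would pick it as the root split:

from collections import Counter
import math

def entropy_of(labels):
    """Entropy of a list of class labels, e.g. ["Yes", "No", ...]."""
    total = len(labels)
    return -sum((c / total) * math.log2(c / total) for c in Counter(labels).values())

# The example dataset: (Outlook, Temperature, Humidity, Wind, Play)
data = [
    ("Sunny",    "Hot",  "High",   "Weak",   "No"),
    ("Sunny",    "Hot",  "High",   "Strong", "No"),
    ("Overcast", "Hot",  "High",   "Weak",   "Yes"),
    ("Rain",     "Mild", "High",   "Weak",   "Yes"),
    ("Rain",     "Cool", "Normal", "Weak",   "Yes"),
    ("Rain",     "Cool", "Normal", "Strong", "No"),
    ("Overcast", "Cool", "Normal", "Strong", "Yes"),
    ("Sunny",    "Mild", "High",   "Weak",   "No"),
]
features = ["Outlook", "Temperature", "Humidity", "Wind"]
target = [row[-1] for row in data]
base_entropy = entropy_of(target)   # 4 Yes / 4 No -> 1.0

def information_gain(col):
    """Gain(S, A) = Entropy(S) minus the weighted entropy of each subset S_v."""
    gain = base_entropy
    for v in set(row[col] for row in data):
        subset = [row[-1] for row in data if row[col] == v]
        gain -= (len(subset) / len(data)) * entropy_of(subset)
    return gain

for i, name in enumerate(features):
    print(f"{name}: {information_gain(i):.3f}")
# Outlook has the largest gain, so it becomes the root; ID3 then recurses
# on each Outlook branch with the remaining features.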
🔧 Python Example: Using sklearn for Decision Tree

Let’s use a real dataset: the Iris dataset.

from sklearn.datasets import load_iris
from sklearn.tree import DecisionTreeClassifier, plot_tree
from sklearn.model_selection import train_test_split
from sklearn.metrics import classification_report, confusion_matrix

# Load dataset
iris = load_iris()
X = iris.data
y = iris.target

# Train-test split
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.3, random_state=42
)

# Train a decision tree with ID3-style splits (criterion="entropy").
# Note: sklearn's tree is CART-based; choosing entropy as the criterion
# gives the same information-gain idea that ID3 uses.
clf = DecisionTreeClassifier(criterion="entropy", random_state=42)
clf.fit(X_train, y_train)

# Predict
y_pred = clf.predict(X_test)

# Evaluate
print("Classification Report:\n", classification_report(y_test, y_pred))
print("Confusion Matrix:\n", confusion_matrix(y_test, y_pred))

🔍 Output:
Classification Report:
              precision    recall  f1-score   support

           0       1.00      1.00      1.00        16
           1       0.89      0.89      0.89         9
           2       0.91      0.91      0.91        11

    accuracy                           0.94        36
   macro avg       0.93      0.93      0.93        36
weighted avg       0.94      0.94      0.94        36

Confusion Matrix:
[[16  0  0]
 [ 0  8  1]
 [ 0  1 10]]

🌲 Visualizing the Tree


import matplotlib.pyplot as plt

plt.figure(figsize=(12, 8))
plot_tree(
    clf,
    feature_names=iris.feature_names,
    class_names=iris.target_names,
    filled=True,
)
plt.title("Decision Tree (ID3)")
plt.show()
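If matplotlib is not convenient, sklearn also offers a plain-text view of the same fitted tree via export_text (an optional extra beyond the original example):

from sklearn.tree import export_text

# Print the learned splits as indented text, one line per node.
print(export_text(clf, feature_names=list(iris.feature_names)))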
✅ Advantages of ID3
Advantage | Description
🧠 Easy to Understand | Like a flowchart, interpretable
🛠 No Feature Scaling Needed | Works with raw data
📊 Handles Categorical Features Easily | Good for decision-making tasks
🚀 Fast Training | Especially on small data

⚠️ Disadvantages
Disadvantage | Description
🌲 Overfitting | Can create deep, complex trees
❌ Sensitive to Noise | Small changes in the data can change the tree
💡 Greedy Strategy | Chooses the best split now, not globally optimal
📊 Biased Towards Features with More Levels | Favors categorical variables with many categories

🧠 When to Use / Not Use


Use When... | Avoid When...
You need interpretability (white-box models) | Data is high-dimensional and sparse
Data is small/medium and well-cleaned | You expect high variance or noise
Features are categorical | You need robust generalization (use an ensemble)

🔄 How it Handles High-Dimensional Data


Struggles with many irrelevant features

Can overfit on high-dimensional or noisy data

Works better with feature selection or pruning

📈 Complexity
Aspect | Complexity
Time | O(n · m · log n), where n = samples, m = features
Space | O(n · m)

⚙️ Tips for Using ID3 in Practice


Prune the tree to prevent overfitting (max_depth, min_samples_split); a short sketch follows this list

Use cross-validation for better generalization

Combine with bagging/boosting (e.g., RandomForest, XGBoost)
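As a small illustration of the first two tips, here is a sketch (the parameter values are arbitrary, not tuned) that caps the tree depth and evaluates it with 5-fold cross-validation on the same Iris data:

from sklearn.datasets import load_iris
from sklearn.model_selection import cross_val_score
from sklearn.tree import DecisionTreeClassifier

iris = load_iris()

# "Pruned" tree: limit depth and require a minimum number of samples per split.
pruned = DecisionTreeClassifier(
    criterion="entropy",
    max_depth=3,            # illustrative value, not tuned
    min_samples_split=5,    # illustrative value, not tuned
    random_state=42,
)

# Cross-validation gives a more honest accuracy estimate than one train/test split.
scores = cross_val_score(pruned, iris.data, iris.target, cv=5)
print("Accuracy per fold:", scores)
print("Mean accuracy:", scores.mean())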

Would you like me to explain pruning, CART (Gini), or how decision trees work in
ensembles like Random Forest or Gradient Boosting next?
