Decision Tree Using ID3 Algorithm
Compiled by,
Dr. Shashank Shetty
DECISION TREE REPRESENTATION
• Decision trees classify instances by sorting them down the tree from
the root to some leaf node, which provides the classification of the
instance.
• Each node in the tree specifies a test of some attribute of the
instance, and each branch descending from that node corresponds to
one of the possible values for this attribute.
• An instance is classified by starting at the root node of the tree,
testing the attribute specified by this node, and then moving down the
tree branch corresponding to the value of the attribute in the given
example. This process is then repeated for the subtree rooted at the
new node (a code sketch of this walk appears after this list).
• Decision trees represent a disjunction of conjunctions of constraints
on the attribute values of instances.
• Each path from the tree root to a leaf corresponds to a conjunction of
attribute tests, and the tree itself to a disjunction of these
conjunctions. For example, the decision tree for the classic PlayTennis
problem corresponds to the expression
(Outlook = Sunny ∧ Humidity = Normal) ∨
(Outlook = Overcast) ∨
(Outlook = Rain ∧ Wind = Weak)
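As a concrete illustration of this top-down walk, here is a minimal Python sketch. It hand-encodes the tree for the expression above as nested dicts; the encoding and the function name classify are illustrative choices, not part of ID3 itself.

tree = {
    "Outlook": {
        "Sunny": {"Humidity": {"High": "No", "Normal": "Yes"}},
        "Overcast": "Yes",
        "Rain": {"Wind": {"Weak": "Yes", "Strong": "No"}},
    }
}

def classify(node, instance):
    # Walk from the root: test the attribute at this node, follow the branch
    # matching the instance's value for that attribute, repeat until a leaf.
    while isinstance(node, dict):
        attribute = next(iter(node))                 # attribute tested here
        node = node[attribute][instance[attribute]]  # descend one branch
    return node

print(classify(tree, {"Outlook": "Sunny", "Humidity": "Normal", "Wind": "Weak"}))  # Yes
print(classify(tree, {"Outlook": "Rain", "Humidity": "High", "Wind": "Strong"}))   # No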
Appropriate Problems for Decision Tree Learning:
• Decision tree learning is generally best suited to problems with the
following characteristics:
1. Instances are represented by attribute-value pairs – Instances are
described by a fixed set of attributes and their values (a small example
appears after this list).
2. The target function has discrete output values – The decision tree assigns
a Boolean classification (e.g., yes or no) to each example. Decision tree
methods easily extend to learning functions with more than two possible
output values.
3. Disjunctive descriptions may be required – As noted above, decision trees
naturally represent disjunctive expressions.
4. The training data may contain errors – Decision tree learning methods are
robust to errors, both errors in classifications of the training examples and
errors in the attribute values that describe these examples.
5. The training data may contain missing attribute values – Decision tree
methods can be used even when some training examples have unknown
values.
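To make characteristics 1 and 2 concrete, a single training example in attribute-value form with a discrete target label might look as follows. The attribute names follow the classic PlayTennis example; the Python dict encoding is merely one convenient choice.

example = {
    "Outlook": "Sunny",      # attribute-value pairs describing the instance
    "Temperature": "Hot",
    "Humidity": "High",
    "Wind": "Weak",
    "PlayTennis": "No",      # discrete (Boolean-style) target value
}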
What is ID3?
• A mathematical algorithm for building decision trees.
• Invented by J. Ross Quinlan in 1979.
• Uses Information Theory invented by Shannon in 1948.
• Builds the tree from the top down, with no backtracking.
• Information Gain is used to select the most useful attribute for
classification.
Entropy
• A measure of the homogeneity (purity) of a sample.
• A completely homogeneous sample has entropy of 0.
• An equally divided sample has entropy of 1.
• Entropy(S) = −p₊ log₂(p₊) − p₋ log₂(p₋) for a sample S with a proportion
p₊ of positive and p₋ of negative elements (a term with a zero proportion
is taken to be 0).
• More generally, for c classes the formula for entropy is:
Entropy(S) = −Σᵢ pᵢ log₂(pᵢ), where pᵢ is the proportion of examples in S
belonging to class i and the sum runs over the c classes.
Entropy Example
For a sample S containing 14 examples, 9 positive and 5 negative:
Entropy(S) = −(9/14) log₂(9/14) − (5/14) log₂(5/14) ≈ 0.940
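A small Python sketch of this calculation (the function signature is an illustrative choice); it reproduces the 0.940 above and the two boundary cases:

import math

def entropy(p_pos, p_neg):
    # Entropy(S) = -p+ log2(p+) - p- log2(p-); a zero proportion
    # contributes nothing (that term is taken to be 0).
    return -sum(p * math.log2(p) for p in (p_pos, p_neg) if p > 0)

print(round(entropy(9/14, 5/14), 3))  # 0.94  -> the example above
print(entropy(1.0, 0.0))              # 0.0   -> completely homogeneous
print(entropy(0.5, 0.5))              # 1.0   -> equally divided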
Information Gain (IG)
• The information gain is based on the decrease in entropy after a dataset is split on an
attribute.
• Which attribute creates the most homogeneous branches?
• First the entropy of the total dataset is calculated.
• The dataset is then split on the different attributes.
• The entropy for each branch is calculated, then added in proportion to the
branch's share of the examples, to get the total entropy for the split.
• The resulting entropy is subtracted from the entropy before the split.
• The result is the Information Gain, or decrease in entropy:
Gain(S, A) = Entropy(S) − Σᵥ (|Sᵥ|/|S|) Entropy(Sᵥ), where the sum runs over
the values v of attribute A and Sᵥ is the subset of S with A = v.
• The attribute that yields the largest IG is chosen for the decision node.
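A hedged Python sketch of this procedure on a toy split; the four-example dataset and all names are invented for illustration:

import math
from collections import Counter

def entropy(labels):
    # Entropy over the class proportions of a list of labels.
    n = len(labels)
    return -sum((c / n) * math.log2(c / n) for c in Counter(labels).values())

def information_gain(examples, attribute, target):
    # Gain(S, A): entropy before the split minus the proportion-weighted
    # entropy of the branches produced by splitting on the attribute.
    before = entropy([ex[target] for ex in examples])
    after = 0.0
    for value in {ex[attribute] for ex in examples}:
        branch = [ex[target] for ex in examples if ex[attribute] == value]
        after += (len(branch) / len(examples)) * entropy(branch)
    return before - after

examples = [
    {"Wind": "Weak", "Play": "Yes"},
    {"Wind": "Weak", "Play": "Yes"},
    {"Wind": "Strong", "Play": "No"},
    {"Wind": "Strong", "Play": "Yes"},
]
print(round(information_gain(examples, "Wind", "Play"), 3))  # 0.311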
Information Gain (cont’d)
• A branch set with entropy of 0 is a leaf node.
• Otherwise, the branch needs further splitting to classify its dataset.
• The ID3 algorithm is run recursively on the non-leaf branches, until all data
is classified.
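Putting the pieces together, here is a compact sketch of the recursive procedure: a simplified ID3 on a toy dataset, with the nested-dict tree encoding and all names chosen for illustration.

import math
from collections import Counter

def entropy(labels):
    n = len(labels)
    return -sum((c / n) * math.log2(c / n) for c in Counter(labels).values())

def gain(examples, attr, target):
    # Decrease in entropy from splitting the examples on attr.
    split = 0.0
    for v in {ex[attr] for ex in examples}:
        branch = [ex[target] for ex in examples if ex[attr] == v]
        split += (len(branch) / len(examples)) * entropy(branch)
    return entropy([ex[target] for ex in examples]) - split

def id3(examples, attributes, target):
    labels = [ex[target] for ex in examples]
    if len(set(labels)) == 1:            # branch entropy 0 -> leaf node
        return labels[0]
    if not attributes:                   # no tests left -> majority-label leaf
        return Counter(labels).most_common(1)[0][0]
    best = max(attributes, key=lambda a: gain(examples, a, target))
    children = {}
    for v in {ex[best] for ex in examples}:
        subset = [ex for ex in examples if ex[best] == v]
        rest = [a for a in attributes if a != best]
        children[v] = id3(subset, rest, target)   # recurse on each branch
    return {best: children}

examples = [
    {"Outlook": "Sunny", "Wind": "Weak", "Play": "No"},
    {"Outlook": "Sunny", "Wind": "Strong", "Play": "No"},
    {"Outlook": "Overcast", "Wind": "Weak", "Play": "Yes"},
    {"Outlook": "Rain", "Wind": "Weak", "Play": "Yes"},
    {"Outlook": "Rain", "Wind": "Strong", "Play": "No"},
]
print(id3(examples, ["Outlook", "Wind"], "Play"))
# e.g. {'Outlook': {'Sunny': 'No', 'Overcast': 'Yes',
#                   'Rain': {'Wind': {'Weak': 'Yes', 'Strong': 'No'}}}}
# (branch order may vary)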
Input Parameters:
• Examples: The training examples with known attribute values and corresponding class labels.