MODULE 3
CHAPTER 6
DECISION TREE LEARNING
6.1 Introduction
The decision tree learning model, one of the most popular supervised predictive learning models, classifies data instances with high accuracy and consistency. The model performs inductive inference, that is, it reaches a general conclusion from observed examples. This model is widely used for solving complex classification applications.
A decision tree is a concept tree that summarizes the information contained in the training dataset in the form of a tree structure. Once the concept model is built, test data can be easily classified.
6.1.1 Structure of a Decision Tree
A decision tree is a structure that includes a root node, branches, and leaf nodes. Each internal node denotes a test on an attribute, each branch denotes the outcome of a test, and each leaf node holds a class label. The topmost node in the tree is the root node.
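A minimal sketch of this structure in Python follows; the names Node, attribute, branches and label are illustrative choices, not taken from the text.

from dataclasses import dataclass, field
from typing import Optional

@dataclass
class Node:
    """One node of a decision tree (sketch)."""
    attribute: Optional[str] = None                # attribute tested at an internal node
    branches: dict = field(default_factory=dict)   # test outcome -> child Node
    label: Optional[str] = None                    # class label held by a leaf node

    def is_leaf(self):
        return self.label is not None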
Goal Construct a decision tree with the given training dataset. The tree is constructed in a top-down fashion, starting from the root node. At every level of tree construction, we need to find the best split attribute, or best decision node, among all attributes. This process is recursive and continues until we reach the last level of the tree or find a leaf node that cannot be split further. The tree construction is complete when all test conditions lead to a leaf node. The leaf node contains the target class or output of the classification.
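The top-down construction described above can be sketched as a recursive procedure. The sketch below reuses the Node type shown earlier; choose_attribute is an assumed placeholder for whichever splitting criterion an algorithm uses (Information Gain, Gain Ratio, Gini Index), not a prescribed function.

from collections import Counter

def build_tree(dataset, attributes, choose_attribute):
    """Top-down recursive construction of a decision tree (sketch).

    dataset          : list of (attribute_value_dict, class_label) pairs
    attributes       : attribute names still available for splitting
    choose_attribute : function implementing the splitting criterion
    """
    labels = [label for _, label in dataset]

    # Create a leaf node when all instances share one class
    # or when no attribute is left to test.
    if len(set(labels)) == 1 or not attributes:
        return Node(label=Counter(labels).most_common(1)[0][0])

    # Find the best split attribute among the remaining attributes.
    best = choose_attribute(dataset, attributes)
    node = Node(attribute=best)

    # Grow one branch for each outcome of the test on the chosen attribute.
    for value in set(row[best] for row, _ in dataset):
        subset = [(row, label) for row, label in dataset if row[best] == value]
        remaining = [a for a in attributes if a != best]
        node.branches[value] = build_tree(subset, remaining, choose_attribute)
    return node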
Goal Given a test instance, infer the target class to which it belongs.
Classification Inferring the target class of the test instance or object is based on inductive inference over the constructed decision tree. To classify an object, we start traversing the tree from the root. At each decision node, we evaluate the test condition on the test object's attribute value and walk to the branch corresponding to the test's outcome. This process is repeated until we end up in a leaf node, which contains the target class of the test object.
Output Target label of the test instance.
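A sketch of this traversal, again using the assumed Node type from Section 6.1.1, is given below; it does not handle attribute values that were never seen during training.

def classify(node, instance):
    """Infer the target class of a test instance by traversing the tree (sketch).

    instance : dict mapping attribute name -> attribute value
    """
    # Start at the root and walk down until a leaf node is reached.
    while not node.is_leaf():
        outcome = instance[node.attribute]   # evaluate the test condition
        node = node.branches[outcome]        # walk to the matching branch
    return node.label                        # the leaf holds the target class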
4. Can model a high degree of nonlinearity in the relationship between the target variables
and the predictor variables
5. Quick to train
Let P be the probability distribution of the data instances over classes 1 to n, as shown in Eq. (6.2):
P = (P1, P2, ..., Pn)    (6.2)
The entropy of P is the information measure of this probability distribution, given in Eq. (6.3):
Entropy_Info(P) = Entropy_Info(P1, P2, ..., Pn)
               = -(P1 log2(P1) + P2 log2(P2) + ... + Pn log2(Pn))    (6.3)
where P1 is the probability of data instances classified as class 1, P2 is the probability of data instances classified as class 2, and so on.
P1 = |Number of data instances belonging to class 1| / |Total number of data instances in the training dataset|
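A small Python sketch of Eq. (6.3), computing the entropy from a list of class labels, is given below; the function name entropy_info is only an illustrative choice.

import math
from collections import Counter

def entropy_info(labels):
    """Entropy of a set of class labels, following Eq. (6.3) (sketch)."""
    total = len(labels)
    # Each Pi is the fraction of instances belonging to class i.
    probabilities = [count / total for count in Counter(labels).values()]
    # Terms with Pi = 0 contribute nothing, so they are skipped.
    return -sum(p * math.log2(p) for p in probabilities if p > 0)

# Example: 9 instances of one class and 5 of another.
print(round(entropy_info(['yes'] * 9 + ['no'] * 5), 4))   # prints 0.9403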
There are many decision tree algorithms, such as ID3, C4.5, CART, CHAID, QUEST, GUIDE, CRUISE, and CTREE, that are used for classification in real-time environments. The most commonly used are ID3 (Iterative Dichotomizer 3), developed by J. R. Quinlan in 1986, and C4.5, an advancement of ID3 presented by the same author in 1993. CART, which stands for Classification and Regression Trees, is another algorithm, developed by Breiman et al. in 1984.
The accuracy of the tree constructed depends upon the selection of the best split attribute.
Different algorithms are used for building decision trees which use different measures to
decide on the splitting criterion. Algorithms such as ID3, C4.5 and CART are popular
algorithms used in the construction of decision trees. The algorithm ID3 uses 'Information
Gain' as the splitting criterion whereas the algorithm C4.5 uses 'Gain Ratio' as the splitting
criterion. The CART algorithm is widely used for classifying both categorical and continuous-valued target variables. CART uses the Gini Index to construct a decision tree.
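For comparison, a sketch of the Gini Index computation used as CART's splitting measure is shown below; the function name gini_index is illustrative.

from collections import Counter

def gini_index(labels):
    """Gini Index of a set of class labels (sketch): 1 - sum of Pi squared."""
    total = len(labels)
    return 1.0 - sum((count / total) ** 2 for count in Counter(labels).values())

# Example: 9 instances of one class and 5 of another.
print(round(gini_index(['yes'] * 9 + ['no'] * 5), 4))   # prints 0.4592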
Regression trees are a variant of decision trees where the target feature is a continuous-valued variable. These trees can be constructed using an approach called reduction in variance, which uses the standard deviation to choose the best splitting attribute.
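A sketch of the reduction-in-variance idea follows: the standard deviation of the target values before a split is compared with the weighted standard deviation of the subsets after the split, and the attribute giving the largest reduction is preferred. The function name std_reduction and the dataset layout assumed here are illustrative.

import statistics
from collections import defaultdict

def std_reduction(dataset, attribute):
    """Reduction in standard deviation from splitting on an attribute (sketch).

    dataset : list of (attribute_value_dict, numeric_target) pairs
    """
    targets = [t for _, t in dataset]
    before = statistics.pstdev(targets)      # standard deviation before the split

    # Group the target values by the outcome of the test on the attribute.
    groups = defaultdict(list)
    for row, t in dataset:
        groups[row[attribute]].append(t)

    # Weighted standard deviation of the subsets after the split.
    after = sum(len(g) / len(dataset) * statistics.pstdev(g) for g in groups.values())
    return before - after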