Lecture 7.1 - Decision Tree Classification

Uploaded by

suryapratp369

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

12 views15 pages

Lecture 7.1 - Decision Tree Classification

Uploaded by

suryapratp369

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 15

• Decision Tree Classification

Algorithm

Dr. Jagendra Singh

What is Decision Tree Classification
Algorithm?

o Decision Tree is a Supervised learning technique that can

be used for both classification and Regression problems.
o It is a tree-structured classifier, where internal nodes
represent the features of a dataset
o branches represent the decision rules and
o each leaf node represents the outcome.
o In a Decision tree, there are two nodes, which are
the Decision Node and Leaf Node.
o Decision nodes are used to make any decision and have
multiple branches.
What is Decision Tree
Classification Algorithm?
o The decisions or the test are performed on the basis
of features of the given dataset.
o It is called a decision tree because, similar to a tree, it
starts with the root node, which expands on further
branches and constructs a tree-like structure.
o In order to build a tree, we use the CART
algorithm, which stands for Classification and
Regression Tree algorithm.
o A decision tree simply asks a question, and based on
the answer (Yes/No), it further split the tree into
subtrees.
Decision Tree Classification
Algorithm
o This diagram explains the
general structure of a decision
tree:
• Note: A decision tree can contain
categorical data (YES/NO) as
well as numeric data.
Why use Decision Trees?
Below are the two reasons for using the
Decision tree:
o Decision Trees usually mimic human thinking
ability while making a decision, so it is easy to
understand.
o The logic behind the decision tree can be
easily understood because it shows a tree-like
structure.
The complete process can be better understood using the
below algorithm:
o Step1: the tree with the root node, says S, which
contains the complete dataset
o Step-2: Find the best attribute in the dataset
How does the using Attribute Selection Measure (ASM).
Step-3: Divide the S into subsets that contains possible
Decision Tree
o

values for the best attributes.

Step-4: Generate the decision tree node, which contains
algorithm Work? o

the best attribute.

o Step-5: Recursively make new decision trees using the
subsets of the dataset created in step -3.
Continue this process until a stage is reached where you
cannot further classify the nodes and called the final node
as a leaf node.
Example:
Suppose there is a candidate who has a job offer
and wants to decide whether he should accept the
offer or Not. So, to solve this problem, the decision
tree starts with the root node (Salary attribute by
ASM). The root node splits further into the next
decision node (distance from the office) and one
leaf node based on the corresponding labels. The
next decision node further gets split into one
decision node (Cab facility) and one leaf node.
Finally, the decision node splits into two leaf nodes
(Accepted offers and Declined offer).

Consider this diagram:

• While implementing a Decision tree, the main
issue arises that how to select the best attribute
for the root node and for sub-nodes.

Attribute • So, to solve such problems there is a technique

which is called as Attribute selection measure
or ASM.

Selection • By this measurement, we can easily select the

best attribute for the nodes of the tree. There are

Measures o
two popular techniques for ASM, which are:
Information Gain
o Gini Index
o According to the value of information gain,
we split the node and build the decision tree.
o A decision tree algorithm always tries to

Information maximize the value of information gain, and

a node/attribute having the highest
information gain is split first.

Gain o It can be calculated using the below formula:

Information Gain= Entropy(S)-
[(Weighted Avg) *Entropy(each feature)
• Entropy: Entropy is a metric to measure
the impurity in a given attribute. It
specifies randomness in data. Entropy can
be calculated as:

Information
• Entropy(s)= -P(yes)log2 P(yes)-
P(no) log2 P(no)

Gain • Where,
o S= Total number of samples
o P(yes)= probability of yes
o P(no)= probability of no
Gini Index
o Gini index is a measure of impurity or purity used while creating a
decision tree in the CART(Classification and Regression Tree) algorithm.
o An attribute with the low Gini index should be preferred as compared to
the high Gini index.
o It only creates binary splits, and the CART algorithm uses the Gini index to
create binary splits.
o Gini index can be calculated using the below formula:
• Gini Index= 1- ∑jPj2
Advantages of the Decision Tree
o It is simple to understand as it follows the same process which a
human follow while making any decision in real-life.
o It can be very useful for solving decision-related problems.
o It helps to think about all the possible outcomes for a problem.
o There is less requirement of data cleaning compared to other
algorithms.
Disadvantages of the Decision Tree
o The decision tree contains lots of layers, which makes it
complex.
o It may have an overfitting issue, which can be resolved using
the Random Forest algorithm.
o For more class labels, the computational complexity of the
decision tree may increase.
Python Implementation of Decision Tree

• Now we will implement the Decision tree using Python. For this, we will use the dataset
"user_data.csv,"

Steps will also remain the same, which are given below:
o Data Pre-processing step
o Fitting a Decision-Tree algorithm to the Training set
o Predicting the test result
o Test accuracy of the result(Creation of Confusion matrix)
o Visualizing the test set result.
Thank you

Decision Tree Algorithm in Machine Learning
No ratings yet
Decision Tree Algorithm in Machine Learning
17 pages
Decision Tree in Machine Learning
No ratings yet
Decision Tree in Machine Learning
16 pages
Decision Trees
No ratings yet
Decision Trees
61 pages
Tafj Cache
50% (2)
Tafj Cache
37 pages
Primary Mathematics Quiz 2022
100% (1)
Primary Mathematics Quiz 2022
4 pages
Supervised Decision TreeRandom Forest
No ratings yet
Supervised Decision TreeRandom Forest
39 pages
My Decision Tree Algorithm
No ratings yet
My Decision Tree Algorithm
21 pages
unit-4[1].docx ML
No ratings yet
unit-4[1].docx ML
42 pages
DMDW 04
No ratings yet
DMDW 04
10 pages
chapter 04
No ratings yet
chapter 04
48 pages
Python Decision Tree Classification
No ratings yet
Python Decision Tree Classification
14 pages
Stagnation Properties
67% (3)
Stagnation Properties
25 pages
Decision Tree (Autosaved)
No ratings yet
Decision Tree (Autosaved)
14 pages
Decision Tree Classification Algorithm
No ratings yet
Decision Tree Classification Algorithm
10 pages
Deciosn_tree_(1)
No ratings yet
Deciosn_tree_(1)
5 pages
Decision Tree Classification Algorithm
No ratings yet
Decision Tree Classification Algorithm
30 pages
Naive Bayes and Decision Tree Classification
No ratings yet
Naive Bayes and Decision Tree Classification
21 pages
Day48 Decision Trees
No ratings yet
Day48 Decision Trees
5 pages
Decision Tree
No ratings yet
Decision Tree
35 pages
Decision Trees
No ratings yet
Decision Trees
3 pages
Session 5b Classification by Decision Tree Induction (1)
No ratings yet
Session 5b Classification by Decision Tree Induction (1)
42 pages
Decision tree
No ratings yet
Decision tree
16 pages
Ml Unit 2 Final_iii Yr
No ratings yet
Ml Unit 2 Final_iii Yr
72 pages
S&ML Unit 6- Q & A
No ratings yet
S&ML Unit 6- Q & A
12 pages
Supervised Learning Algorithm DT
No ratings yet
Supervised Learning Algorithm DT
15 pages
CSL0777 L25
No ratings yet
CSL0777 L25
39 pages
ML for ME S17 Decision Trees
No ratings yet
ML for ME S17 Decision Trees
12 pages
Machine_Learning_Lecture_08_Decision Tree Learning (1)
No ratings yet
Machine_Learning_Lecture_08_Decision Tree Learning (1)
67 pages
NOTES
No ratings yet
NOTES
18 pages
DS Unit - 4
No ratings yet
DS Unit - 4
76 pages
Decision Tree
No ratings yet
Decision Tree
11 pages
Unit 3 (A) NGP
No ratings yet
Unit 3 (A) NGP
78 pages
UNIT 2 - Groups (Decision Tree) (1)
No ratings yet
UNIT 2 - Groups (Decision Tree) (1)
20 pages
ML CLASS 6 Decision Tree Algorithm
No ratings yet
ML CLASS 6 Decision Tree Algorithm
21 pages
AI - Mod 5. Part 2
No ratings yet
AI - Mod 5. Part 2
40 pages
Lab 2
No ratings yet
Lab 2
3 pages
Module 4 Lecture -2
No ratings yet
Module 4 Lecture -2
65 pages
Unit IV Da Online - PPTX 2 82
No ratings yet
Unit IV Da Online - PPTX 2 82
81 pages
Types of Pruning Techniques
No ratings yet
Types of Pruning Techniques
10 pages
Decision Tree (1)
No ratings yet
Decision Tree (1)
7 pages
DECSION TREE
No ratings yet
DECSION TREE
6 pages
Uvobjs
No ratings yet
Uvobjs
289 pages
2179-Unit-3
No ratings yet
2179-Unit-3
29 pages
Decision Tree
No ratings yet
Decision Tree
5 pages
Decision Tree Classification Algorithm (2)
No ratings yet
Decision Tree Classification Algorithm (2)
11 pages
Water Resources
No ratings yet
Water Resources
7 pages
Unit 4
No ratings yet
Unit 4
33 pages
Decision Tree Algorithm
No ratings yet
Decision Tree Algorithm
5 pages
Chapter 03
No ratings yet
Chapter 03
30 pages
Chapter 03
No ratings yet
Chapter 03
30 pages
U4 ML Updated
No ratings yet
U4 ML Updated
32 pages
Cours #4—Decision Tree
No ratings yet
Cours #4—Decision Tree
18 pages
Tree
No ratings yet
Tree
7 pages
Decision Tree in Machine Learning
No ratings yet
Decision Tree in Machine Learning
11 pages
Lecture Note #5_PEC-CS701E
No ratings yet
Lecture Note #5_PEC-CS701E
16 pages
Decision Tree Classification Algorithm
No ratings yet
Decision Tree Classification Algorithm
14 pages
Decision Tree
No ratings yet
Decision Tree
31 pages
Lecture Notes 3
No ratings yet
Lecture Notes 3
11 pages
Unit-3 Introduction To Machine Learning Algorithms
No ratings yet
Unit-3 Introduction To Machine Learning Algorithms
18 pages
08 Decision - Tree
No ratings yet
08 Decision - Tree
9 pages
Decision Tree Algorithm, Explained-1-22
No ratings yet
Decision Tree Algorithm, Explained-1-22
22 pages
Chemical Coordination in Plants
No ratings yet
Chemical Coordination in Plants
10 pages
Chapter 6 Price Determination
No ratings yet
Chapter 6 Price Determination
9 pages
Chapter 4classification and Prediction
No ratings yet
Chapter 4classification and Prediction
19 pages
Lecture 6.2 - Polynomial Regression
No ratings yet
Lecture 6.2 - Polynomial Regression
56 pages
MCA 2023 Syllabus - 27-10-2023
No ratings yet
MCA 2023 Syllabus - 27-10-2023
107 pages
decision tree
No ratings yet
decision tree
13 pages
CHAPTER 12 - Kinetics of Particles Newton's Second Law
No ratings yet
CHAPTER 12 - Kinetics of Particles Newton's Second Law
43 pages
Decisiontree
No ratings yet
Decisiontree
6 pages
Dtic Ada128624
No ratings yet
Dtic Ada128624
212 pages
Genetics Revision Notes
No ratings yet
Genetics Revision Notes
2 pages
Applied Physics M3 M4 2021RVN
No ratings yet
Applied Physics M3 M4 2021RVN
35 pages
Introduction To Telephoney: R.K.Gupta de (Training)
No ratings yet
Introduction To Telephoney: R.K.Gupta de (Training)
43 pages
DD00005302 Avoiding EMI and ESD in Camera Installationss
No ratings yet
DD00005302 Avoiding EMI and ESD in Camera Installationss
30 pages
11TH Math Parabola Assignment-3 With Key
No ratings yet
11TH Math Parabola Assignment-3 With Key
2 pages
GE 103 Lecture 3
No ratings yet
GE 103 Lecture 3
80 pages
M3 Receiver Installation Manual
No ratings yet
M3 Receiver Installation Manual
21 pages
4th Assignment BCA-202 DS
No ratings yet
4th Assignment BCA-202 DS
2 pages
D49-Calculation of Road Traffic Noise
No ratings yet
D49-Calculation of Road Traffic Noise
100 pages
GMN BallBearingCatalog 4000.0911
No ratings yet
GMN BallBearingCatalog 4000.0911
100 pages
Automatic Radar Plotting Aid (ARPA) : Vasile Radu Adrian ET32
100% (1)
Automatic Radar Plotting Aid (ARPA) : Vasile Radu Adrian ET32
4 pages
Signal Generation and Plotting: Performance Objectives
No ratings yet
Signal Generation and Plotting: Performance Objectives
2 pages
FSL Ebook Designing
No ratings yet
FSL Ebook Designing
24 pages
Silo Guide
100% (2)
Silo Guide
14 pages
Osmosis and Diffusion Lab
No ratings yet
Osmosis and Diffusion Lab
6 pages
Separator Sizing Spreadsheet Main Menu: File Separp1
100% (1)
Separator Sizing Spreadsheet Main Menu: File Separp1
23 pages
AGM-Handbook, Part 2, Edition 8, June 2013 PDF
No ratings yet
AGM-Handbook, Part 2, Edition 8, June 2013 PDF
49 pages
Lecture Notes 1: Brief Review of Basic Probability (Casella and Berger Chapters 1-4)
100% (1)
Lecture Notes 1: Brief Review of Basic Probability (Casella and Berger Chapters 1-4)
14 pages
Drafting Conventions
No ratings yet
Drafting Conventions
20 pages
Calculation Manufacturing Process
No ratings yet
Calculation Manufacturing Process
18 pages
FACP Methodology
No ratings yet
FACP Methodology
6 pages
Class 11th Physics Notes Motion by ATS
No ratings yet
Class 11th Physics Notes Motion by ATS
2 pages
Off-Highway Tyre Solutions
No ratings yet
Off-Highway Tyre Solutions
9 pages
Decision Tree Pruning: Fundamentals and Applications
From Everand
Decision Tree Pruning: Fundamentals and Applications
Fouad Sabry
No ratings yet