Decision Trees
Decision Trees
● Typical application domains:
○ Medicine
○ Finances
Decision Trees
● The two most popular algorithms for forming the trees are CART and C4.5.
● Pure decision trees are rarely used on their own today.
● However, they often form the basis of larger systems, and their ensembles can even work better than neural networks.
● When you google something, that's precisely a bunch of these simple trees looking for a range of answers for you.
● Search engines love them because they're fast.
Decision Trees
The CART Algorithm
[Diagram: the dataset is split into a training set and a test set]
● CART first splits the training set in two using a single feature k and a threshold t_k, choosing the pair that produces the purest subsets.
● Once it has successfully split the training set in two, it splits the subsets using the same logic, then the sub-subsets, and so on, recursively.
● CART is a greedy algorithm: it greedily searches for an optimum split at the top level, then repeats the process at each level.
● It does not check whether or not the split will lead to the lowest possible impurity several levels down (see the training sketch below).
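As a concrete illustration, here is a minimal training sketch using scikit-learn, whose DecisionTreeClassifier implements an optimized version of CART; the iris dataset and parameter values are arbitrary example choices, not part of the original slides.

```python
from sklearn.datasets import load_iris
from sklearn.model_selection import train_test_split
from sklearn.tree import DecisionTreeClassifier

# Example data: the iris dataset (any labeled dataset would do).
X, y = load_iris(return_X_y=True)
X_train, X_test, y_train, y_test = train_test_split(X, y, random_state=42)

# At each node the learner greedily picks the (feature, threshold) split
# that minimizes impurity, then recurses on each resulting subset.
tree = DecisionTreeClassifier(criterion="gini", random_state=42)
tree.fit(X_train, y_train)
print("Test accuracy:", tree.score(X_test, y_test))
```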
Important Terminology related to Decision
Trees
● Root Node: Represents the entire population or sample; it further gets divided into two or more homogeneous sets.
● Splitting: The process of dividing a node into two or more sub-nodes.
● Decision Node: A sub-node that splits into further sub-nodes.
● Leaf / Terminal Node: A node that does not split any further.
● Pruning: Removing sub-nodes of a decision node; the opposite of splitting.
● Branch / Sub-Tree: A subsection of the entire tree.
● Parent and Child Node: A node that is divided into sub-nodes is called the parent node of those sub-nodes, whereas the sub-nodes are its children (see the printed tree below).
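To make the terminology concrete, one can print the structure of a small fitted tree; export_text is a standard scikit-learn helper, and the dataset and depth limit here are just example choices.

```python
from sklearn.datasets import load_iris
from sklearn.tree import DecisionTreeClassifier, export_text

iris = load_iris()
tree = DecisionTreeClassifier(max_depth=2, random_state=42)
tree.fit(iris.data, iris.target)

# The first test printed is the root node; further tests are decision
# nodes; lines ending in "class: ..." are leaf/terminal nodes.
print(export_text(tree, feature_names=list(iris.feature_names)))
```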
Decision Trees
Pruning
● At this instant, you are the yellow car and you have 2 choices:
○ Take a left and overtake the other 2 cars quickly
○ Keep moving in the present lane at the same speed
● The first would be the optimum choice if your objective is to maximize the distance covered in the next, say, 10 seconds. Greedy you!
● With the latter choice, you sail through at the same speed, cross the trucks, and then overtake later, depending on the situation ahead.
● This is exactly the difference between a normal decision tree and pruning.
● A decision tree with constraints won't see the truck ahead, so it adopts the greedy approach and takes a left.
● On the other hand, if we use pruning, we in effect look a few steps ahead before making the choice, as sketched below.
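The slides describe pruning informally; one concrete realization is minimal cost-complexity pruning, which scikit-learn exposes through the ccp_alpha parameter. A minimal sketch, assuming the breast-cancer demo dataset; for simplicity it selects on the held-out split, though a proper validation set or cross-validation would be better in practice.

```python
from sklearn.datasets import load_breast_cancer
from sklearn.model_selection import train_test_split
from sklearn.tree import DecisionTreeClassifier

X, y = load_breast_cancer(return_X_y=True)
X_train, X_test, y_train, y_test = train_test_split(X, y, random_state=0)

# Compute the pruning path: each ccp_alpha corresponds to pruning away
# sub-trees whose accuracy gain does not justify their complexity.
path = DecisionTreeClassifier(random_state=0).cost_complexity_pruning_path(
    X_train, y_train
)

# Fit one tree per effective alpha and keep the one that generalizes best.
best = max(
    (DecisionTreeClassifier(random_state=0, ccp_alpha=a).fit(X_train, y_train)
     for a in path.ccp_alphas),
    key=lambda t: t.score(X_test, y_test),
)
print("Best pruned tree has", best.get_n_leaves(), "leaves")
```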
Advantages
Easy to Understand
● Decision tree output is very easy to understand, even for people from a non-analytical background.
● Reading and interpreting it does not require any statistical knowledge.
● Its graphical representation is very intuitive, and users can easily relate it to their own hypotheses.
Advantages
● A decision tree is one of the fastest ways to identify the most significant variables and the relations between two or more variables.
● With the help of decision trees, we can create new variables/features that have better power to predict the target variable.
● It can also be used in the data exploration stage. For example, when working on a problem with information spread across hundreds of variables, a decision tree will help identify the most significant ones, as sketched below.
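As a sketch of using a tree for exactly this kind of exploration, the snippet below ranks variables by the tree's impurity-based feature importances; the breast-cancer dataset is just an example choice.

```python
from sklearn.datasets import load_breast_cancer
from sklearn.tree import DecisionTreeClassifier

data = load_breast_cancer()
tree = DecisionTreeClassifier(random_state=0).fit(data.data, data.target)

# feature_importances_ measures how much each feature reduces impurity
# across the whole tree; higher means more significant for this model.
ranked = sorted(zip(data.feature_names, tree.feature_importances_),
                key=lambda pair: pair[1], reverse=True)
for name, score in ranked[:5]:
    print(f"{name}: {score:.3f}")
```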
Disadvantages
Overfitting
● Overfitting is one of the most practical difficulties for decision tree models.
● This problem can be addressed by setting constraints on model parameters and by pruning, as sketched below.
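A minimal sketch of such constraints (often called pre-pruning) with scikit-learn; the specific values are purely illustrative, not recommendations.

```python
from sklearn.tree import DecisionTreeClassifier

# Constraining the tree limits how far it can fit noise: a shallower
# depth, larger leaves, and a minimum split size all act as regularizers.
constrained = DecisionTreeClassifier(
    max_depth=4,           # cap the number of levels in the tree
    min_samples_split=20,  # a node needs this many samples to split
    min_samples_leaf=10,   # every leaf keeps at least this many samples
    random_state=0,
)
```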
Disadvantages
● While working with continuous numerical variables, a decision tree loses information when it discretizes them into categories: each split reduces a continuous range to a yes/no comparison against a threshold.
Entropy
● The concept of entropy originated in thermodynamics as a measure
of molecular disorder.
● Entropy approaches zero when molecules are still and well ordered.
● In decision trees, class impurity plays the same role; we use the Gini index as the cost function to evaluate splits in the dataset.
● A Gini score gives an idea of how good a split is by how mixed the classes are in the two groups created by the split.
● A perfect separation results in a Gini score of 0, while the worst-case split (a 50/50 class mix in each group, for a two-class problem) results in a score of 0.5.
● CART (Classification and Regression Trees) uses the Gini method to create binary splits, as in the worked example below.
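As a worked example, the snippet below computes both measures directly from their definitions, Gini = 1 - sum_k p_k^2 and entropy = -sum_k p_k * log2(p_k), on made-up label lists.

```python
from collections import Counter
from math import log2

def gini(labels):
    # Gini impurity: 1 - sum of squared class proportions.
    # 0 for a pure node, 0.5 for a perfectly mixed two-class node.
    n = len(labels)
    return 1.0 - sum((c / n) ** 2 for c in Counter(labels).values())

def entropy(labels):
    # Shannon entropy: -sum p * log2(p). Also 0 for a pure node.
    n = len(labels)
    return -sum((c / n) * log2(c / n) for c in Counter(labels).values())

print(gini(["a"] * 10))                # 0.0 -> perfect separation
print(gini(["a"] * 5 + ["b"] * 5))     # 0.5 -> worst two-class split
print(entropy(["a"] * 5 + ["b"] * 5))  # 1.0 bit -> maximally mixed
```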
Performance Metrics

● Precision & Recall: compare predicted labels against actual labels (precision: how many predicted positives are truly positive; recall: how many actual positives are found). The closer to 1, the better.
● Confusion Matrix: identifies a class that is constantly mistaken for some other class. If the classifier is perfect, you'll obtain non-zero values only on the main diagonal.
● Area Under ROC Curve (AUC): how well the model is capable of distinguishing between classes. The closer to 1, the better.
AUC and ROC Curve
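As a sketch of computing these metrics for a tree classifier, assuming the breast-cancer demo dataset and an arbitrary depth limit:

```python
from sklearn.datasets import load_breast_cancer
from sklearn.metrics import (confusion_matrix, precision_score,
                             recall_score, roc_auc_score, roc_curve)
from sklearn.model_selection import train_test_split
from sklearn.tree import DecisionTreeClassifier

X, y = load_breast_cancer(return_X_y=True)
X_train, X_test, y_train, y_test = train_test_split(X, y, random_state=0)

tree = DecisionTreeClassifier(max_depth=4, random_state=0).fit(X_train, y_train)
y_pred = tree.predict(X_test)
scores = tree.predict_proba(X_test)[:, 1]  # probability of the positive class

# Precision and recall: both should be close to 1 for a good classifier.
print("Precision:", precision_score(y_test, y_pred))
print("Recall:   ", recall_score(y_test, y_pred))

# A perfect classifier has non-zero counts only on the main diagonal.
print("Confusion matrix:\n", confusion_matrix(y_test, y_pred))

# The ROC curve traces true-positive vs. false-positive rate as the
# decision threshold varies; AUC summarizes it (closer to 1 is better).
fpr, tpr, _ = roc_curve(y_test, scores)
print("AUC:", roc_auc_score(y_test, scores))
```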