
Unit 2.4
Decision Tree

Disclaimer
The content is curated from online/offline resources and is used for educational purposes only.



Learning Objectives

You will learn in this lesson:


• Concept of Decision Tree
• Use of Decision Tree to classify data
• Basic algorithm to build Decision Tree
• Some illustrations
• Concept of Entropy
• Basic concept of entropy in information theory
• Mathematical formulation of entropy
• Calculation of entropy of a training set
• Decision Tree induction algorithms
• ID3

Introduction
• The basic idea behind building a decision tree is to map all possible decision paths in the form of a tree.
• It is an efficient machine learning algorithm.
• The tree must be rebuilt once substantially new data is seen.
• The splitting conditions are derived from the data (data-driven programming) rather than hand-coded.

Weather Prediction

Decision Tree Important Terminology

Root Node: The root node is where the decision tree starts. It represents the entire dataset, which is then divided into two or more homogeneous sets.
Leaf Node: Leaf nodes are the final output nodes; the tree cannot be split any further once a leaf node is reached.
Splitting: Splitting is the process of dividing a decision node (or the root node) into sub-nodes according to the given conditions.
Branch/Sub-Tree: A subtree formed by splitting a node of the tree.
Pruning: Pruning is the process of removing unwanted branches from the tree.
Parent/Child Node: A node that is divided into sub-nodes is the parent of those sub-nodes, and the sub-nodes are its children.
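To make these terms concrete, here is a minimal sketch of a tree node as a data structure. The `Node` class and its field names are illustrative, not part of the slides:

```python
from dataclasses import dataclass, field
from typing import Optional

@dataclass
class Node:
    feature: Optional[str] = None     # attribute tested at this node (None for a leaf)
    prediction: Optional[str] = None  # final output if this is a leaf node
    children: dict = field(default_factory=dict)  # branch value -> child Node

    def is_leaf(self) -> bool:
        # A leaf node cannot be segregated further: it has no children.
        return not self.children

# The root node represents the entire dataset; splitting it creates
# branches (subtrees) whose nodes are children of their parent node.
root = Node(feature="Sex")
root.children["F"] = Node(feature="Cholesterol")  # child decision node
root.children["M"] = Node(prediction="Drug B")    # child leaf node
```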

Case Study: Drug Prediction


• A medical researcher is compiling data for a study.
• During the course of treatment, each patient responded to one of two medications; we'll call them Drug A and Drug B.
• The job is to build a model that finds which drug might be appropriate for a future patient with the same illness.
• Feature set: Age, Gender, Blood Pressure, and Cholesterol.
• Target: the drug that each patient responded to.

Patient Drug Data



How does the Decision Tree algorithm Work?

The complete process can be better understood using the algorithm below:

Step 1: Begin the tree with the root node, say S, which contains the complete dataset.

Step 2: Find the best attribute in the dataset using an Attribute Selection Measure (ASM).

Step 3: Divide S into subsets, one for each possible value of the best attribute.

Step 4: Generate the decision tree node that tests the best attribute.
Step 5: Recursively build new subtrees using the subsets created in Step 3.
Continue until a stage is reached where the nodes cannot be classified any further;
these final nodes are called leaf nodes. A runnable sketch of these steps follows.
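As a rough illustration, the five steps can be written as a short recursive function in Python. This is a sketch under our own naming (rows are dicts, and information gain stands in for the ASM), not the exact lab code:

```python
import math
from collections import Counter

def entropy(labels):
    # Shannon entropy (base 2) of a list of class labels.
    n = len(labels)
    return -sum(c / n * math.log2(c / n) for c in Counter(labels).values())

def info_gain(rows, attr, target):
    # Entropy of the node minus the weighted entropy of the subsets it splits into.
    gain = entropy([r[target] for r in rows])
    for value in {r[attr] for r in rows}:
        subset = [r[target] for r in rows if r[attr] == value]
        gain -= len(subset) / len(rows) * entropy(subset)
    return gain

def build_tree(rows, attributes, target="Drug"):
    labels = [r[target] for r in rows]
    # Leaf node: the subset is pure, or there are no attributes left to test.
    if len(set(labels)) == 1 or not attributes:
        return Counter(labels).most_common(1)[0][0]
    best = max(attributes, key=lambda a: info_gain(rows, a, target))  # Step 2 (ASM)
    tree = {best: {}}                                                 # Step 4
    for value in {r[best] for r in rows}:                             # Step 3
        subset = [r for r in rows if r[best] == value]
        rest = [a for a in attributes if a != best]
        tree[best][value] = build_tree(subset, rest, target)          # Step 5
    return tree
```

The returned nested dict maps each tested attribute to its branch values, ending in class labels at the leaves.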

How to Select the Effective Attribute ?


• The algorithm chooses the most predictive feature to split the data on.
• The root node can test any valid feature.
• Splitting the root node creates the first branches.

Patient Drug Data



Effective Attribute Quest!


• Let's begin our quest by picking "Cholesterol" as the first attribute to split the data on.
• This is an example of a bad attribute choice for splitting the data.
• Bad in terms of purity: the resulting nodes remain impure.
• Each branch still contains a mix of both classes.


Effective Attribute Quest!


• Let's try the "Sex" attribute of the patients.
• The split is not perfect; however, it is a better choice than the "Cholesterol" attribute.
• Reason: the resulting nodes are more pure, i.e., nodes that contain mostly Drug A or mostly Drug B.
• Thus, it is more predictive than the other attribute.


Effective Attribute Quest!


• Predictiveness is based on the decrease in the "impurity" of the nodes.
• The Gender-based feature is a good candidate here because it produces nearly pure groups of patients.
• Within a branch, we test "Cholesterol" again.
• As you can see, this results in even purer leaves.
• So we can easily make a decision at these nodes.

Pure Node


Intuition of Node Impurity


• The method uses recursive partitioning to minimize the "impurity" at each step.
• The "impurity" of a node is measured by the "entropy" of the data in the node.
• So, what is "entropy"?
• Entropy is the amount of information disorder, or the amount of randomness, in the data.
• In decision trees, we look for splits that yield the smallest entropy in the resulting nodes.
• The lower the entropy, the less uniform the class distribution and the purer the node!

Entropy
• To calculate entropy, the formula (with logarithms taken in base 2) is:

Entropy = −p(A)·log₂(p(A)) − p(B)·log₂(p(B))

• p is the proportion (ratio) of a category, such as Drug A or Drug B.

Let's calculate the entropy of the dataset in our case, before splitting it.
• We have 9 occurrences of Drug B and 5 of Drug A.
• Entropy = 0.530 + 0.410 ≈ 0.940

Entropy in Decision Tree
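As a quick sanity check, the 0.940 figure can be reproduced in a few lines of Python (the counts 9 and 5 are read from the patient table):

```python
import math

def entropy(counts):
    # Entropy (base 2) from raw class counts, e.g. [9, 5] for Drug B vs Drug A.
    total = sum(counts)
    return -sum(c / total * math.log2(c / total) for c in counts if c > 0)

print(round(entropy([9, 5]), 3))  # -> 0.94
```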



Entropy at each Node (for each Attribute)


• Consider all the attributes, calculate the "entropy" after splitting on each, and then choose the best attribute.
• Calculate the node entropy for the Cholesterol feature.
• A separate test is conducted for each valid feature.


Entropy at Each Node (for each Attribute)


• Entropy of the parent node (the full dataset, before splitting on Sex):
−(9/14·log₂(9/14) + 5/14·log₂(5/14)) = 0.940

• Entropy of branch F:
−(3/7·log₂(3/7) + 4/7·log₂(4/7)) = 0.985

• Entropy of branch M (used in the gain calculation on the next slide):
−(1/7·log₂(1/7) + 6/7·log₂(6/7)) = 0.592


Information Gain (ID3)


• Before taking our splitting decision, let's understand information gain!
• Information gain is the increase in certainty about the class after a split.
• As entropy, or the amount of randomness, decreases, the information gain, or amount of certainty, increases, and vice versa.
• So, constructing a decision tree is all about finding the attributes that return the highest information gain.
• Information Gain = entropy before the split − weighted sum of the branch entropies after the split

Comparison of Attributes

Information Gain (Sex)
0.940 − (7/14 × 0.985) − (7/14 × 0.592)
= 0.151

Information Gain (Cholesterol)
0.940 − (8/14 × 0.811) − (6/14 × 1)
= 0.048
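Both gains can be checked in Python. The per-branch (Drug A, Drug B) counts below are our inference from the entropies quoted above; they are not printed on the slide:

```python
import math

def entropy(counts):
    total = sum(counts)
    return -sum(c / total * math.log2(c / total) for c in counts if c > 0)

def info_gain(total_counts, branches):
    # Gain = entropy before the split minus the weighted entropy of each branch.
    n = sum(total_counts)
    return entropy(total_counts) - sum(sum(b) / n * entropy(b) for b in branches)

# Inferred (Drug A, Drug B) counts per branch, consistent with the entropies above.
print(round(info_gain([5, 9], [(4, 3), (1, 6)]), 3))  # Sex: F, M branches      -> 0.152
print(round(info_gain([5, 9], [(2, 6), (3, 3)]), 3))  # Cholesterol: HIGH, NORMAL -> 0.048
```

The exact value for Sex is 0.152; the slide's 0.151 comes from rounding the branch entropies to three decimals before combining them. Either way, Sex clearly wins.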


Question?
• Between the Cholesterol and Sex
attributes, which one is a better choice?
• Which one is better as the first attribute to
divide the dataset into 2 branches?
• Which attribute results in more pure nodes
for our drugs?
• Answer: “Sex” attribute


Repeat!
• So, we select the "Sex" attribute as the first splitter.
• Now, what is the next attribute after branching by the "Sex" attribute?
• We repeat the process for each branch, testing each of the remaining attributes, until we reach the purest possible leaves.
• This is how you build a decision tree!

Patient Drug Data



Lab 1 – Implement Decision Tree Machine Learning Algorithm
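One possible starting point for this lab is scikit-learn's DecisionTreeClassifier. The toy table below is an illustrative stand-in, not the actual lab dataset:

```python
import pandas as pd
from sklearn.preprocessing import LabelEncoder
from sklearn.tree import DecisionTreeClassifier, export_text

# Tiny stand-in for the patient drug table.
df = pd.DataFrame({
    "Age":         [23, 47, 47, 28, 61, 22],
    "Sex":         ["F", "M", "F", "F", "M", "M"],
    "BP":          ["HIGH", "LOW", "LOW", "NORMAL", "HIGH", "NORMAL"],
    "Cholesterol": ["HIGH", "HIGH", "NORMAL", "HIGH", "NORMAL", "HIGH"],
    "Drug":        ["Drug A", "Drug B", "Drug B", "Drug A", "Drug B", "Drug A"],
})

# Encode the categorical features as integers for scikit-learn.
X = df.drop(columns="Drug").apply(
    lambda col: LabelEncoder().fit_transform(col) if col.dtype == object else col
)
y = df["Drug"]

# criterion="entropy" matches the information-gain criterion used in this unit.
clf = DecisionTreeClassifier(criterion="entropy", random_state=0).fit(X, y)
print(export_text(clf, feature_names=list(X.columns)))
```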



Summary
• A decision tree is a supervised learning technique that can be used for both classification and regression problems, but it is mostly preferred for solving classification problems.
• It is a tree-structured classifier, where internal nodes represent the features of a dataset, branches represent the decision rules, and each leaf node represents the outcome.
• Entropy is the amount of information disorder, or the amount of randomness, in the data. The entropy of a node depends on how mixed the data in that node is, and it is calculated for each node.
• Information gain is the increase in certainty about the class after splitting.
• As entropy, or the amount of randomness, decreases, the information gain, or amount of certainty, increases, and vice versa.

Quiz
1) Decision trees are also known as CART. What is CART?
(A) Classification and Regression Trees
(B) Customer Analysis and Research Tool
(C) Communication Access Real-time Translation
(D) Computerized Automatic Rating Technique

(A) Classification and Regression Trees



Quiz

2) Decision tree can be used for ______.


(A) classification
(B) regression
(C) Both
(D) None of these

(C) Both

Quiz

3) Decision tree is a ______ algorithm.


(A) supervised learning
(B) unsupervised learning
(C) Both
(D) None of these

(A) supervised learning



Quiz
4) Suppose your target variable is whether a passenger will survive or not, and you are using a
decision tree. What type of tree do you need to predict the target variable?
(A) classification tree
(B) regression tree
(C) clustering tree
(D) dimensionality reduction tree

(A) Classification tree



Quiz
5) Suppose your target variable is the price of a house, and you are using a decision tree. What
type of tree do you need to predict the target variable?
(A) classification tree
(B) regression tree
(C) clustering tree
(D) dimensionality reduction tree

(B) regression tree. The target variable (the price of a house) is continuous in this case, so a
regression tree would be used to predict it; classification trees are used when the target variable is categorical.



Reference

https://kawsar34.medium.com/machine-learning-quiz-05-decision-tree-part-1-3ea71fa312e5
https://www.javatpoint.com
https://www.tutorialspoint.com
www.towardsdatascience.com
Mehmet Toprak, "How Decision Tree Works!", Medium

Thank you...!
