Machine Learning: Lecture 3

Decision Tree Learning


(Based on Chapter 3 of Mitchell, T.,
Machine Learning, 1997)

1
Decision Tree Representation

[Figure: decision tree with root node Outlook and branches Sunny, Overcast, and
Rain. The Sunny branch leads to a Humidity node (branches High and Normal);
the Rain branch leads to a Wind node (branches Strong and Weak).]

A Decision Tree for the concept PlayTennis
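To make the representation concrete, here is a minimal sketch (not from the slides) of the tree above as a classification function. The slide shows only the internal nodes and branch values; the Yes/No leaf labels below follow Mitchell's PlayTennis example and should be read as assumptions of this sketch.

```python
# Minimal sketch of the PlayTennis tree above as a classification function.
# The Yes/No leaf labels are taken from Mitchell's PlayTennis example (assumed here,
# since the slide only shows the internal nodes and branch values).
def classify(instance):
    """instance: dict with keys 'Outlook', 'Humidity', 'Wind'."""
    if instance["Outlook"] == "Sunny":
        return "Yes" if instance["Humidity"] == "Normal" else "No"
    if instance["Outlook"] == "Overcast":
        return "Yes"
    # Outlook == "Rain"
    return "Yes" if instance["Wind"] == "Weak" else "No"

print(classify({"Outlook": "Sunny", "Humidity": "High", "Wind": "Weak"}))  # -> No
```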


2
Appropriate Problems for
Decision Tree Learning
 Instances are represented by discrete attribute-value pairs
(though the basic algorithm has been extended to real-valued
attributes as well); a small illustration follows this list
 The target function has discrete output values (can have
more than two possible output values --> classes)
 Disjunctive hypothesis descriptions may be required
 The training data may contain errors
 The training data may contain missing attribute values
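The sketch below illustrates the first and last points: instances as discrete attribute-value pairs with a discrete target. The attribute names follow the PlayTennis example; the specific instances and the use of None for a missing value are illustrative assumptions.

```python
# Illustrative only: training instances as discrete attribute-value pairs, with the
# class label ("PlayTennis") as the discrete target. A missing attribute value
# (tolerated by the algorithm, per the last point above) is marked None.
training_examples = [
    {"Outlook": "Sunny",    "Humidity": "High",   "Wind": "Weak",   "PlayTennis": "No"},
    {"Outlook": "Rain",     "Humidity": None,     "Wind": "Strong", "PlayTennis": "No"},
    {"Outlook": "Overcast", "Humidity": "Normal", "Wind": "Weak",   "PlayTennis": "Yes"},
]
```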

3
ID3: The Basic Decision Tree
Learning Algorithm
Training database of examples D1-D14, see [Mitchell, p. 59].

What is the "best" attribute?

Answer: Outlook
["best" = the attribute with the highest information gain]
4
ID3 (Cont’d)
[Figure: the root split on Outlook partitions the examples D1-D14 among its
branches Sunny, Overcast, and Rain.]

What are the "best" attributes for the Sunny and Rain branches?

Answer: Humidity and Wind
5
What Attribute to choose to
“best” split a node?
 Choose the attribute that minimizes the Disorder (or Entropy)
in the subtree rooted at a given node.
 Disorder and Information are related as follows: the more
disorderly a set, the more information is required to correctly
guess an element of that set.
 Information: What is the best strategy for guessing a number
from a finite set of possible numbers? I.e., how many questions
do you need to ask in order to know the answer (we are
looking for the minimal number of questions)? Answer:
Log_2(|S|) questions, where S is the set of numbers and |S| its cardinality.

E.g.: guessing a number from {0, 1, 2, 3, 4, 5, 6, 7, 8, 9, 10}
 Q1: is it smaller than 5?
 Q2: is it smaller than 2?

6
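As a sanity check on the Log_2(|S|) claim, here is a small sketch (not from the slides) counting the yes/no questions a halving strategy needs for a set of a given size:

```python
import math

# A halving (binary-search) strategy needs ceil(log2 |S|) yes/no questions to
# pin down one element of a set S -- the Log_2(|S|) figure quoted above.
def questions_needed(set_size):
    return math.ceil(math.log2(set_size))

print(questions_needed(11))  # the {0, ..., 10} example above -> 4 questions
print(questions_needed(8))   # a set of 8 candidates -> 3 questions
```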
What Attribute to choose to
“best” split a node? (Cont’d)
 Log_2 |S| can also be thought of as the information value of
being told x (the number to be guessed) instead of having to
guess it.
 Let U be a subset of S. What is the information value of being
told x after finding out whether or not x ∈ U? Ans: Log_2|S| -
[P(x ∈ U) Log_2|U| + P(x ∉ U) Log_2|S-U|]
 Let S = P ∪ N (positive and negative data). The information
value of being told x after finding out whether x ∈ P or x ∈ N
is I({P,N}) = Log_2|S| - (|P|/|S|) Log_2|P| - (|N|/|S|) Log_2|N|
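A minimal sketch of I({P,N}) as defined above, assuming P and N are supplied simply as counts of positive and negative examples (the function name is illustrative):

```python
import math

# I({P,N}) = Log_2|S| - (|P|/|S|) Log_2|P| - (|N|/|S|) Log_2|N|, with S = P ∪ N.
# p and n are the counts |P| and |N|; an empty class contributes 0 by convention.
def information_value(p, n):
    s = p + n
    term = lambda k: (k / s) * math.log2(k) if k > 0 else 0.0
    return math.log2(s) - term(p) - term(n)

print(information_value(9, 5))  # e.g. 9 positive / 5 negative examples -> ~0.94
print(information_value(7, 0))  # a pure set carries no extra information -> 0.0
```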

7
What Attribute to choose to
“best” split a node? (Cont’d)
We want to use this measure to choose an attribute
that minimizes the disorder in the partitions it
creates. Let {S_i | 1 ≤ i ≤ n} be the partition of S
resulting from a particular attribute (one subset per
attribute value). The disorder associated with this
partition is:

V({S_i | 1 ≤ i ≤ n}) = Σ_i (|S_i|/|S|) · I({P(S_i), N(S_i)})

where P(S_i) is the set of positive examples in S_i
and N(S_i) is the set of negative examples in S_i.
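The sketch below implements the disorder measure V and the attribute choice it implies (minimizing V is equivalent to maximizing information gain). It reuses information_value from the previous sketch; the target name "PlayTennis" and label "Yes" are illustrative assumptions.

```python
from collections import defaultdict

# Weighted disorder of a partition: V({S_i}) = sum_i |S_i|/|S| * I({P(S_i), N(S_i)}),
# reusing information_value from the previous sketch.
def disorder_of_partition(subsets, target="PlayTennis", positive="Yes"):
    total = sum(len(s) for s in subsets)
    v = 0.0
    for s in subsets:
        p = sum(1 for ex in s if ex[target] == positive)   # positives in S_i
        v += (len(s) / total) * information_value(p, len(s) - p)
    return v

# ID3's choice: the attribute whose induced partition has the lowest disorder
# (equivalently, the highest information gain).
def best_attribute(examples, attributes):
    def partition(attr):
        groups = defaultdict(list)
        for ex in examples:
            groups[ex[attr]].append(ex)
        return list(groups.values())
    return min(attributes, key=lambda a: disorder_of_partition(partition(a)))
```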
8
Hypothesis Space Search in
Decision Tree Learning
 Hypothesis Space: the set of possible decision trees (i.e., the complete
space of finite discrete-valued functions).
 Search Method: Simple-to-Complex Hill-Climbing Search (only
a single current hypothesis is maintained, unlike the candidate-
elimination method). No Backtracking!!! (A compressed sketch follows this list.)
 Evaluation Function: the Information Gain Measure.
 Batch Learning: ID3 uses all training examples at each step to
make statistically-based decisions (unlike the candidate-elimination
method, which makes decisions incrementally) ==> the search is
less sensitive to errors in individual training examples.
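The compressed sketch below shows the greedy, simple-to-complex search in code. It reuses best_attribute from the previous sketch; the stopping rules and names are simplified assumptions, not Mitchell's full ID3 pseudocode.

```python
# Compressed sketch of the greedy top-down search: pick the attribute with the
# highest information gain, split, and recurse -- with no backtracking.
def id3(examples, attributes, target="PlayTennis"):
    labels = [ex[target] for ex in examples]
    if len(set(labels)) == 1:            # pure node -> leaf
        return labels[0]
    if not attributes:                   # no attributes left -> majority-class leaf
        return max(set(labels), key=labels.count)
    a = best_attribute(examples, attributes)          # greedy, irrevocable choice
    tree = {a: {}}
    for value in {ex[a] for ex in examples}:
        subset = [ex for ex in examples if ex[a] == value]
        tree[a][value] = id3(subset, [x for x in attributes if x != a], target)
    return tree
```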

9
Inductive Bias in Decision Tree
Learning
 ID3’s Inductive Bias: Shorter trees are preferred over
longer trees. Trees that place high information gain
attributes close to the root are preferred over those that
do not.
 Note: this type of bias is different from the type of bias
used by Candidate-Elimination: the inductive bias of ID3
follows from its search strategy (preference or search
bias) whereas the inductive bias of the Candidate-
Elimination algorithm follows from the definition of its
hypothesis space (restriction or language bias).

10
Why Prefer Short Hypotheses?
 Occam’s razor: Prefer the
simplest hypothesis that fits the data [William of Occam
(Philosopher), circa 1320]
 Scientists seem to do that: e.g., physicists seem to prefer simple explanations
for the motion of the planets over more complex ones.
 Argument: Since there are fewer short hypotheses than long ones, it is less
likely that one will find a short hypothesis that coincidentally fits the training
data.
 Problem with this argument: it can be made about many other constraints.
Why is the “short description” constraint more relevant than others?
 Nevertheless: Occam’s razor was shown experimentally to be a successful
strategy!

11
Issues in Decision Tree Learning:
I. Avoiding Overfitting the Data
 Definition: Given a hypothesis space H, a hypothesis h ∈ H is
said to overfit the training data if there exists some alternative
hypothesis h' ∈ H, such that h has smaller error than h' over the
training examples, but h' has a smaller error than h over the
entire distribution of instances. (See curves in [Mitchell, p. 67])
 There are two approaches for overfitting avoidance in Decision
Trees:

 Stop growing the tree before it perfectly fits the data.
 Allow the tree to overfit the data, and then post-prune it (a rough pruning
sketch follows).
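Reduced-error pruning is one common way to realize the second option. The rough sketch below assumes the dict-shaped trees produced by the earlier id3 sketch and a held-out validation set; it is a simplification for illustration, not Mitchell's exact procedure.

```python
# Rough sketch of reduced-error post-pruning on the dict-shaped trees built by the
# id3 sketch above. Assumes every attribute value met during pruning also occurs
# in the tree; the target name is an illustrative assumption.
def classify_with(tree, ex):
    while isinstance(tree, dict):                  # descend until a leaf is reached
        attr = next(iter(tree))
        tree = tree[attr][ex[attr]]
    return tree

def prune(tree, validation, target="PlayTennis"):
    """Bottom-up: replace a subtree by a majority-class leaf whenever that does
    not hurt accuracy on the validation examples routed to that subtree."""
    if not isinstance(tree, dict) or not validation:
        return tree
    attr = next(iter(tree))
    for value, sub in tree[attr].items():
        routed = [ex for ex in validation if ex[attr] == value]
        tree[attr][value] = prune(sub, routed, target)
    labels = [ex[target] for ex in validation]
    leaf = max(set(labels), key=labels.count)      # majority class at this node
    leaf_acc = sum(l == leaf for l in labels) / len(labels)
    tree_acc = sum(classify_with(tree, ex) == ex[target] for ex in validation) / len(validation)
    return leaf if leaf_acc >= tree_acc else tree
```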

12
Issues in Decision Tree Learning:
II. Other Issues
 Incorporating Continuous-Valued Attributes
 Alternative Measures for Selecting Attributes
 Handling Training Examples with Missing Attribute Values
 Handling Attributes with Differing Costs

13
