Lecture 07
Dr. Samana Batool
DECISION TREES
PARAMETRIC ML ALGORITHMS
Assumptions can greatly simplify the learning process, but can also limit what can be learned.
Algorithms that simplify the function to a known form are called parametric machine learning algorithms.
These algorithms involve two steps:
1. Select a form for the function.
2. Learn the coefficients for the function from the training data.
Examples: Logistic Regression, Linear Regression, Linear Discriminant Analysis, Perceptron, Naive
Bayes, Simple Neural Networks
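To make these two steps concrete, here is a minimal sketch (assuming scikit-learn is available; the toy feature matrix X and labels y are hypothetical, not from the lecture): the functional form is fixed to a logistic model, and only its coefficients are learned from the training data.

```python
# A minimal sketch of a parametric learner: the functional form is fixed
# in advance (a linear decision boundary through a sigmoid), and learning
# only estimates its coefficients. X and y are hypothetical toy data.
import numpy as np
from sklearn.linear_model import LogisticRegression

X = np.array([[1.0], [2.0], [3.0], [4.0], [5.0], [6.0]])  # one feature
y = np.array([0, 0, 0, 1, 1, 1])                          # binary labels

model = LogisticRegression()          # step 1: choose the form sigmoid(w*x + b)
model.fit(X, y)                       # step 2: learn w and b from the data

print(model.coef_, model.intercept_)  # the learned coefficients
print(model.predict([[2.5], [4.5]]))  # predictions from the fixed form
```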
Benefits of Parametric Machine Learning Algorithms:
•Simpler: Easier to understand and interpret results.
•Speed: Very fast to learn from data.
•Less Data: Do not require as much training data and can work well even if the fit is not perfect.
Limitations of Parametric Machine Learning Algorithms:
•Constrained: By choosing a functional form these methods are highly constrained to the specified
form.
•Limited Complexity: The methods are more suited to simpler problems.
•Poor Fit: In practice the methods are unlikely to match the underlying mapping function.
NON-PARAMETRIC ML ALGORITHMS
Algorithms that do not make strong assumptions about the form of the mapping function are called
nonparametric machine learning algorithms. By not making assumptions, they are free to learn any
functional form from the training data.
Nonparametric methods are good when you have a lot of data and no prior knowledge, and when you
don’t want to worry too much about choosing just the right features.
Examples: k-Nearest Neighbors, Decision Trees, Support Vector Machines
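For contrast, a minimal sketch of a nonparametric learner under the same assumptions (scikit-learn, hypothetical toy data): no functional form is fixed in advance, and the structure of the tree is grown from the training data.

```python
# A minimal sketch of a nonparametric learner: no functional form is fixed
# beforehand; the tree's structure (splits, depth) is learned from the data.
# X and y are hypothetical toy data.
import numpy as np
from sklearn.tree import DecisionTreeClassifier

X = np.array([[1.0], [2.0], [3.0], [4.0], [5.0], [6.0]])
y = np.array([0, 0, 1, 1, 0, 0])   # a pattern a single linear boundary cannot fit

tree = DecisionTreeClassifier()    # no coefficients or form chosen up front
tree.fit(X, y)                     # the tree structure comes from the data

print(tree.get_depth(), tree.get_n_leaves())  # learned structure
print(tree.predict([[3.5], [5.5]]))           # expected predictions: [1, 0]
```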
Benefits of Nonparametric Machine Learning Algorithms:
•Flexibility: Capable of fitting a large number of functional forms.
•Power: No assumptions (or weak assumptions) about the underlying function.
•Performance: Can result in higher performance models for prediction.
Limitations of Nonparametric Machine Learning Algorithms:
•More Data: Require a lot more training data to estimate the mapping function.
•Slower: A lot slower to train as they often have far more parameters to train.
•Overfitting: More of a risk to overfit the training data, and it is harder to explain why specific predictions are made.
CLASSIFICATION
The classification of an unknown input vector is done by traversing the tree from
the root node to a leaf node.
A record enters the tree at the root node.
At the root node, a test is applied to determine which child node the record will
encounter next.
This process is repeated until the record arrives at a leaf node.
All the records that end up at a given leaf of the tree are classified in the same way.
There is a unique path from the root to each leaf.
The path is a rule which is used to classify the records.
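A minimal sketch of this root-to-leaf traversal, using a small hand-built tree (the attribute names, thresholds, and class labels are purely illustrative):

```python
# A minimal sketch of classifying a record by walking a decision tree from
# the root node down to a leaf node. The tree below is hand-built and
# hypothetical: internal nodes hold a test, leaves hold a class label.

class Leaf:
    def __init__(self, label):
        self.label = label

class Node:
    def __init__(self, attribute, threshold, left, right):
        self.attribute = attribute   # feature tested at this node
        self.threshold = threshold   # test: record[attribute] <= threshold
        self.left = left             # child followed when the test is true
        self.right = right           # child followed when the test is false

def classify(node, record):
    """Follow the unique root-to-leaf path determined by the record."""
    while isinstance(node, Node):
        node = node.left if record[node.attribute] <= node.threshold else node.right
    return node.label                # every record reaching this leaf gets this label

# Hypothetical tree: first test "experience", then "hours_practiced".
root = Node("experience", 5,
            Leaf("no"),
            Node("hours_practiced", 10, Leaf("no"), Leaf("yes")))

print(classify(root, {"experience": 8, "hours_practiced": 12}))  # -> "yes"
```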
[Figure: example split of a parent node with class counts 40/60 into child nodes with counts 28/42 and 12/18]
$\mathrm{Entropy}(T) = -\sum_{l=1}^{k} p_l \log_2 p_l$
Min Entropy = 0 (no impurity)
Max Entropy = 1 (max impurity for binary classes)
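A short sketch of this entropy computation from class counts (the counts used below are illustrative):

```python
# A minimal sketch of Entropy(T) = -sum over classes of p_l * log2(p_l),
# computed from the class counts at a node. Counts below are illustrative.
import math

def entropy(counts):
    total = sum(counts)
    ent = 0.0
    for c in counts:
        if c > 0:                    # a class with zero samples contributes 0
            p = c / total
            ent -= p * math.log2(p)
    return ent

print(entropy([50, 50]))   # 1.0    -> maximum impurity for two balanced classes
print(entropy([100, 0]))   # 0.0    -> pure node, no impurity
print(entropy([30, 10]))   # ~0.811 -> somewhere in between
```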
$IG = I - \frac{N_{left}}{N} I_{left} - \frac{N_{right}}{N} I_{right}$
IG – Information Gain
I – Impurity calculated on parent node (Gini or Entropy)
Ileft – Impurity calculated on left child node
Iright – Impurity calculated on right child node
N – Total no. of samples
Nleft – No. of samples at left child node
Nright – No. of samples at right child node
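A minimal sketch of this information-gain formula, using the Gini index as the impurity measure I (the helper names gini and information_gain are illustrative):

```python
# A minimal sketch of IG = I - (N_left / N) * I_left - (N_right / N) * I_right,
# with the Gini index 1 - sum of p_l^2 as the impurity at each node.

def gini(counts):
    """Gini impurity of a node given its class counts."""
    total = sum(counts)
    return 1.0 - sum((c / total) ** 2 for c in counts)

def information_gain(parent_counts, left_counts, right_counts):
    """Impurity of the parent minus the weighted impurity of the children."""
    n = sum(parent_counts)
    n_left, n_right = sum(left_counts), sum(right_counts)
    return (gini(parent_counts)
            - (n_left / n) * gini(left_counts)
            - (n_right / n) * gini(right_counts))
```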
INFORMATION GAIN FOR A1
At root node: $I = 1 - \left(\frac{29}{64}\right)^2 - \left(\frac{35}{64}\right)^2 = 0.496$
At left node: $I_{left} = 1 - \left(\frac{21}{26}\right)^2 - \left(\frac{5}{26}\right)^2 = 0.310$
At right node: $I_{right} = 1 - \left(\frac{8}{38}\right)^2 - \left(\frac{30}{38}\right)^2 = 0.332$
$IG = I - \frac{N_{left}}{N} I_{left} - \frac{N_{right}}{N} I_{right}$
$IG = 0.496 - \frac{26}{64}(0.310) - \frac{38}{64}(0.332)$
$IG = 0.496 - 0.324$
$IG = 0.172$
INFORMATION GAIN FOR A2
At root node: $I = 1 - \left(\frac{29}{64}\right)^2 - \left(\frac{35}{64}\right)^2 = 0.496$
At left node: $I_{left} = 1 - \left(\frac{18}{51}\right)^2 - \left(\frac{33}{51}\right)^2 = 0.457$
At right node: $I_{right} = 1 - \left(\frac{11}{13}\right)^2 - \left(\frac{2}{13}\right)^2 = 0.260$
$IG = I - \frac{N_{left}}{N} I_{left} - \frac{N_{right}}{N} I_{right}$
$IG = 0.496 - \frac{51}{64}(0.457) - \frac{13}{64}(0.260)$
$IG = 0.496 - 0.417$
$IG = 0.079$
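Assuming the gini and information_gain helpers sketched after the formula above, the two worked examples can be reproduced and compared as follows:

```python
# Parent node: 29 and 35 samples of the two classes.
# Split A1 gives children 21/5 and 8/30; split A2 gives children 18/33 and 11/2.
print(round(information_gain([29, 35], [21, 5], [8, 30]), 3))   # A1: 0.172
print(round(information_gain([29, 35], [18, 33], [11, 2]), 3))  # A2: 0.079
# A1 yields the larger information gain, so it is the better split at this node.
```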
[Figure: plot of error rate (from 0%) on training data vs. evaluation data against performance]
[Figure: Experience axis (0 years, 5 years, 20 years) with regions R1 and R3 marked]
TRAINING DATA EXAMPLE: THE GOAL IS TO PREDICT WHETHER THE PLAYER WILL PLAY TENNIS