
Module 2

Supervised
Learning
Copyright © 2018 McGraw Hill Education, All Rights Reserved.

Inductive learning
Task of inductive learning:
• Given a collection of examples of a function f, return
a function h that approximates f.
• The approximating function h is called the hypothesis
function.
• The unknown true function f correctly maps the input
space X (of the entire data) to the output space Y.
• The central aim of designing h is to suggest decisions for
unseen patterns.
• A better approximation of f leads to better generalization.
• Generalization performance is the fundamental problem
in inductive learning.
• Off-training-set error (the error on points not in the
training set) is used as a measure of generalization
performance.
• Inductive learning assumes that the best hypothesis
regarding unseen patterns is the one induced by the
observed training set.
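A minimal sketch of this setting (synthetic data and a hypothetical model choice), in which a hypothesis h is induced from labeled examples of an unknown f and then judged by its off-training-set error:

```python
# Minimal sketch: induce a hypothesis h from examples of an unknown f,
# then evaluate it on points not seen during training (off-training-set error).
# The data and model choice here are purely illustrative assumptions.
import numpy as np

rng = np.random.default_rng(0)
x = rng.uniform(-3, 3, size=50)
y = np.sin(x) + 0.1 * rng.normal(size=50)    # noisy samples of the unknown f

x_train, y_train = x[:35], y[:35]            # observed training set
x_test,  y_test  = x[35:], y[35:]            # "unseen" patterns

h = np.poly1d(np.polyfit(x_train, y_train, deg=3))   # hypothesis function h

train_err = np.mean((h(x_train) - y_train) ** 2)
test_err  = np.mean((h(x_test)  - y_test)  ** 2)     # off-training-set error
print(train_err, test_err)
```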
Occam’s Razor Principle
• A simpler algorithm can be expected to perform better
on a test set.
• “Simpler” may stand for fewer parameters, less
training time, fewer features, and so forth.
• Generally, the search for a design is stopped when the
solution is “good enough” rather than optimal.
• Occam’s razor principle recommends hypothesis
functions that avoid overfitting of the training data.
Overfitting
• When, as model complexity increases, performance on the
training set keeps improving while performance on the test
set deteriorates, overfitting has occurred.

The accuracy of the classifier over the training examples
increases monotonically as the classifier grows in
complexity. However, the accuracy over the independent
test examples first increases and then decreases.
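This behaviour can be reproduced with a minimal sketch (illustrative synthetic data; polynomial degree stands in for classifier complexity):

```python
# Illustrative sketch: as model complexity (polynomial degree) grows,
# training error keeps falling while error on held-out data eventually rises.
import numpy as np

rng = np.random.default_rng(1)
x = rng.uniform(-3, 3, size=60)
y = np.sin(x) + 0.2 * rng.normal(size=60)

x_tr, y_tr = x[:40], y[:40]
x_te, y_te = x[40:], y[40:]

for degree in (1, 3, 6, 12):
    h = np.poly1d(np.polyfit(x_tr, y_tr, degree))
    tr = np.mean((h(x_tr) - y_tr) ** 2)
    te = np.mean((h(x_te) - y_te) ** 2)
    print(f"degree={degree:2d}  train MSE={tr:.3f}  test MSE={te:.3f}")
```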
Heuristic Search in Inductive
Learning
Goal of machine learning:
• Not to learn an exact representation of the training data.
• To build a statistical model of the process that generates
the data.
Success of learning: Depends on hypothesis space
complexity and sample complexity.
Search problem: finding a hypothesis function of appropriate
complexity that is consistent with the given training data.
The machine learning community depends on tools that
appear to be heuristic, trial-and-error tools.
Estimating Generalization
Errors
Holdout method and random subsampling
• A certain amount of data is reserved for testing and the
rest is used for training.
• To partition the dataset D, randomly sample a set of training
examples from D and use the rest for testing.
• For time-series data, use the earlier part for training
and the later for testing.
• Usually, one-third of the data is used for testing.
• This way of partitioning time-series data is suitable
because, when the learning machine is used in the real
world, unseen data come from the future.
• Samples used for training and testing should have the
same distribution.
• Whether a sample is representative cannot be verified,
since the underlying distribution is unknown.
• Check: In classification problems, each class should be
represented in about the right proportion in the training
and test sets.
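A minimal sketch of the holdout method with a stratified split, using scikit-learn (an assumed dependency; the data are synthetic and purely illustrative):

```python
# Minimal sketch of the holdout method with stratification.
import numpy as np
from sklearn.model_selection import train_test_split

X = np.random.rand(300, 5)                              # 300 samples, 5 features
y = np.random.choice([0, 1], size=300, p=[0.9, 0.1])    # unbalanced labels

# Reserve one-third of the data for testing; stratify so that each class is
# represented in about the right proportion in both training and test sets.
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=1/3, stratify=y, random_state=0)

print(y_train.mean(), y_test.mean())                    # similar class proportions
```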
K-Fold Cross-Validation
• Data is randomly partitioned into K mutually exclusive
subsets or “folds”, each of approximately equal size.
• In iteration k, the k-th fold is used as the test set and the
remaining folds are collectively used to train the model.
• The error estimates obtained from the K iterations are
averaged to yield an overall error estimate.
• If stratification is adopted, the procedure is called
stratified K-fold cross-validation for classification.
• K = 10 folds is the standard number used for predicting
the error rate of a learning technique.
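A minimal sketch of stratified 10-fold cross-validation with scikit-learn (an assumed dependency; the data and classifier are illustrative choices):

```python
# Minimal sketch: stratified K-fold cross-validation, K = 10.
import numpy as np
from sklearn.model_selection import StratifiedKFold
from sklearn.linear_model import LogisticRegression

X = np.random.rand(200, 4)
y = np.random.choice([0, 1], size=200)

skf = StratifiedKFold(n_splits=10, shuffle=True, random_state=0)
errors = []
for train_idx, test_idx in skf.split(X, y):
    model = LogisticRegression().fit(X[train_idx], y[train_idx])
    errors.append(1.0 - model.score(X[test_idx], y[test_idx]))

print("estimated error rate:", np.mean(errors))   # averaged over the K folds
```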


Assessing Regression
Accuracy
Mean Square Error
• Most commonly used metric: MSE = (1/N) Σ (y(i) − ŷ(i))²,
the average squared prediction error over the N data points.
Root Mean Square Error
• RMSE = √MSE; it has the same dimensions as the predicted
value itself.
Sum-of-Error-Squares
• SSE = Σ (y(i) − ŷ(i))² = N · MSE; a simple mathematical
manipulation of MSE.
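A minimal sketch computing these metrics with NumPy (the target and prediction arrays are illustrative):

```python
# Minimal sketch: regression accuracy metrics.
import numpy as np

y_true = np.array([3.0, -0.5, 2.0, 7.0])
y_pred = np.array([2.5,  0.0, 2.0, 8.0])

errors = y_true - y_pred
sse  = np.sum(errors ** 2)          # sum-of-error-squares
mse  = sse / len(y_true)            # mean square error
rmse = np.sqrt(mse)                 # root mean square error, same units as y

print(sse, mse, rmse)
```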
Assessing Classification
Accuracy
Misclassification Error
• The basic metric for assessing the accuracy of classification
algorithms is the number of samples misclassified by the
model.
• For binary classification problems, the error is the fraction
of data points whose predicted class label differs from the
true class label.
• For 0% error, the predicted label must equal the true label
for all data points.
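A minimal sketch of this error rate (the label vectors are illustrative):

```python
# Minimal sketch: misclassification error for a binary problem.
import numpy as np

y_true = np.array([1, 0, 1, 1, 0, 0, 1, 0])
y_pred = np.array([1, 0, 0, 1, 0, 1, 1, 0])

error_rate = np.mean(y_pred != y_true)   # fraction of misclassified samples
print(error_rate)                        # 0.0 means every data point is correct
```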


Confusion Matrix
• Decisions based on the misclassification error rate alone
lead to poor performance when the data are unbalanced.
• For example, in the case of financial fraud detection, the
proportion of fraud cases is extremely small.
• In such classification problems, the interest is mainly in
the minority cases.
• The class that the user is interested in is commonly
called the positive class, and the rest the negative class.
• A single prediction on the test set has four possible
outcomes.
1. The true positive (TP) and true negative (TN) are
correct classifications.
2. A false positive (FP) occurs when the outcome is
incorrectly predicted as positive when it is actually
negative.
3. A false negative (FN) occurs when the outcome is
incorrectly predicted as negative when it is actually
positive.

Confusion matrix:
                             Hypothesized class (prediction)
                             Classified +ve    Classified -ve
Actual class    Actual +ve        TP                FN
(observation)   Actual -ve        FP                TN
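A minimal sketch of computing these counts with scikit-learn (an assumed dependency; the labels are illustrative, with 1 as the positive minority class):

```python
# Minimal sketch: confusion-matrix counts for a binary problem.
from sklearn.metrics import confusion_matrix

y_true = [1, 0, 1, 1, 0, 0, 1, 0, 0, 0]
y_pred = [1, 0, 0, 1, 0, 1, 1, 0, 0, 0]

# With labels=[0, 1], ravel() returns the counts in the order TN, FP, FN, TP.
tn, fp, fn, tp = confusion_matrix(y_true, y_pred, labels=[0, 1]).ravel()
print(tp, fn, fp, tn)
```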
Misclassification Rate
• Misclassification rate = (FP + FN) / (TP + TN + FP + FN).
• FP = FN = 0 is desired.
True Positive Rate (tp rate)
• tp rate (sensitivity) = TP / (TP + FN).
• Determines the sensitivity in the detection of abnormal
events.
• A classification method with high sensitivity would rarely
miss an abnormal event.
True Negative Rate
• tn rate (specificity) = TN / (TN + FP).
• Determines the specificity in the detection of the abnormal
event.
• High specificity results in a low rate of false alarms caused
by classifying a normal event as abnormal.
• Simultaneously high sensitivity and high specificity are
desired.
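A minimal sketch of these rates computed from confusion-matrix counts (the counts are illustrative values):

```python
# Minimal sketch: misclassification rate, sensitivity (tp rate) and
# specificity (tn rate) from confusion-matrix counts.
tp, fn, fp, tn = 40, 10, 5, 945

misclassification_rate = (fp + fn) / (tp + tn + fp + fn)
sensitivity = tp / (tp + fn)   # true positive rate: how rarely abnormal events are missed
specificity = tn / (tn + fp)   # true negative rate: how rarely false alarms are raised

print(misclassification_rate, sensitivity, specificity)
```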
ROC Curves
• When a classifier algorithm is applied to a test set, it
yields a confusion matrix, which corresponds to one point
in ROC space.
• An ROC curve is created by thresholding the classifier
with respect to its complexity.
• Each level of complexity in the space of the hypothesis
class produces a different point in the ROC space.
• Two learning schemes are compared by analyzing their
ROC curves in the same ROC space.
A sample ROC curve
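In practice, a common way to trace out an ROC curve is to sweep the decision threshold on a scoring classifier; the following is a minimal sketch of that approach with scikit-learn (an assumed dependency; the data and model are illustrative), rather than the complexity sweep described above:

```python
# Minimal sketch: ROC curve by sweeping the decision threshold of a
# scoring classifier.
import numpy as np
from sklearn.linear_model import LogisticRegression
from sklearn.metrics import roc_curve, auc

rng = np.random.default_rng(0)
X = rng.normal(size=(300, 4))
y = (X[:, 0] + 0.5 * rng.normal(size=300) > 0).astype(int)

X_tr, X_te, y_tr, y_te = X[:200], X[200:], y[:200], y[200:]
scores = LogisticRegression().fit(X_tr, y_tr).predict_proba(X_te)[:, 1]

fpr, tpr, thresholds = roc_curve(y_te, scores)   # each threshold gives one ROC point
print("area under the ROC curve:", auc(fpr, tpr))
```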
An Overview of the Design
Cycle

An overview of the design cycle
