
MACHINE LEARNING (CS 403/603)

Bias-variance; Linear Regression and Ridge Regression

By: Dr. Puneet Gupta


Overfitting
Overfitting means fitting the training set "too well", so that performance on the test set degrades.
Underfitting refers to a model that can neither model the training data nor generalize to new data.
● As the model keeps learning, the error on both the training and testing data decreases.
● If learning goes on too long, overfitting starts due to noise and less relevant attributes, and the performance of the model on the test set decreases.
● For a good model, we stop at the point just before the test error starts increasing, i.e., the point where the model performs well on both the training data and the unseen testing data.
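One way to implement this stopping rule is sketched below. This is a hedged sketch, not the lecture's code: `train_step`, `eval_error`, `state`, and `load_state` are hypothetical placeholders for whatever iterative learner is being trained.

```python
# Hypothetical early-stopping sketch: keep training while validation error
# improves, and stop (restoring the best snapshot) once it starts rising,
# i.e., just before overfitting sets in. All model methods are placeholders.

def fit_with_early_stopping(model, train_data, val_data,
                            max_epochs=100, patience=5):
    best_err = float("inf")
    best_state = model.state()          # snapshot of current parameters
    epochs_without_improvement = 0

    for epoch in range(max_epochs):
        model.train_step(train_data)    # one pass of learning
        err = model.eval_error(val_data)

        if err < best_err:              # validation error still decreasing
            best_err = err
            best_state = model.state()
            epochs_without_improvement = 0
        else:                           # error started increasing
            epochs_without_improvement += 1
            if epochs_without_improvement >= patience:
                break                   # stop just past the minimum

    model.load_state(best_state)        # restore the best model seen
    return model
```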
K-Nearest Neighbor
(Figure: decision boundaries for 1-NN, i.e., K=1, and 5-NN, i.e., K=5.)
● In one-nearest-neighbor (1-NN), the label of x is given by the label of its nearest neighbor in the training data.
● Distance measures are used to find the nearest neighbor.
● Better results can be expected when more (K > 1) neighbors are utilized.
● Classification: use majority voting.
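A minimal k-NN classifier matching this description, using Euclidean distance and majority voting (the data here is made up for illustration, not from the lecture):

```python
import numpy as np

def knn_predict(X_train, y_train, x, k=5):
    # Euclidean distance from x to every training point
    dists = np.linalg.norm(X_train - x, axis=1)
    # indices of the k nearest neighbors
    nearest = np.argsort(dists)[:k]
    # majority vote over their labels (ties broken by label order)
    votes = np.bincount(y_train[nearest])
    return np.argmax(votes)

# Example: with k=1 the label is simply that of the single nearest neighbor.
X_train = np.array([[0.0, 0.0], [1.0, 1.0], [0.9, 1.1], [0.1, -0.2]])
y_train = np.array([0, 1, 1, 0])
print(knn_predict(X_train, y_train, np.array([0.8, 0.9]), k=1))  # -> 1
print(knn_predict(X_train, y_train, np.array([0.8, 0.9]), k=3))  # -> 1
```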
Mitigating Overfitting by Holdout Model Selection Techniques

(Figure: the data is split into a training set and a test/validation set. Each candidate algorithm, ML Algo. 1 through ML Algo. q, is trained on the training set to produce Model 1 through Model q. Each model's error, or average error, on the validation set gives Error_1 through Error_q, and the model with minimum error is selected as the final model.)

Different ML algorithms are designed by varying hyperparameters; a sketch of the procedure is given below.
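A hedged, runnable sketch of holdout selection, where the synthetic data and candidate models are ours, not the lecture's: each "ML algorithm" is polynomial regression with a different degree (the hyperparameter), and we keep the degree with the minimum validation error.

```python
import numpy as np

# Synthetic 1-D regression data
rng = np.random.default_rng(0)
x = rng.uniform(-1, 1, size=60)
y = np.sin(3 * x) + 0.1 * rng.normal(size=60)

# Holdout split: train on 40 points, validate on the remaining 20
x_tr, y_tr, x_val, y_val = x[:40], y[:40], x[40:], y[40:]

best_degree, best_err = None, float("inf")
for degree in [1, 3, 5, 9]:                      # candidate hyperparameters
    coeffs = np.polyfit(x_tr, y_tr, degree)      # train Model_i
    err = np.mean((np.polyval(coeffs, x_val) - y_val) ** 2)  # Error_i
    if err < best_err:                           # select minimum-error model
        best_degree, best_err = degree, err

print(best_degree, best_err)
```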


Overfitting vs Underfitting: Bias and Variance

● Variance is an error from sensitivity to small fluctuations in the training set.
● Bias describes how much the model's prediction, averaged over datasets, deviates from the value of the underlying target function.

(Figure: the classic dartboard diagram with four panels arranged by Low/High Bias and Low/High Variance; the low-bias, high-variance corner corresponds to overfitting, and the high-bias, low-variance corner to underfitting.)
Expected Prediction Error and Bias-variance Tradeoff

What is expected prediction error?
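For squared loss, the standard answer is the bias-variance decomposition, written out below in our own notation (the slide poses the question without showing the derivation):

```latex
% Assume y = f(x) + \varepsilon with \mathbb{E}[\varepsilon] = 0 and
% \operatorname{Var}(\varepsilon) = \sigma^2, and let \hat{f} be a model
% trained on a random training set. The expected squared error at x is:
\mathbb{E}\big[(y - \hat{f}(x))^2\big]
  = \underbrace{\big(\mathbb{E}[\hat{f}(x)] - f(x)\big)^2}_{\text{bias}^2}
  + \underbrace{\mathbb{E}\Big[\big(\hat{f}(x) - \mathbb{E}[\hat{f}(x)]\big)^2\Big]}_{\text{variance}}
  + \underbrace{\sigma^2}_{\text{irreducible noise}}
```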


Example
A person with high bias is someone who starts to answer before you can even finish asking. A person with high variance is someone who can think of all sorts of crazy answers. Combining these gives:
● High bias/low variance: this is someone who usually gives you
the same answer, no matter what you ask, and is always wrong
about it.
● High bias/high variance: someone who takes wild guesses, all of
which are sort of wrong; he might be right sometimes due to
chance.
● Low bias/high variance: a knowledgeable person who listens to you and tries to answer the best they can, but who daydreams a lot and may say something totally crazy.
● Low bias/low variance: a person who listens to you very carefully
and gives you good answers pretty much all the time.
Basic Maths refresher
Previous example of Supervised Learning
(Figure: a binary classification example with class labels -1 and +1.)
Linear Regression
● Linear models are used in SVMs, deep learning, etc.
● Defining a rule by hand is not always feasible.
● How do we learn the weights (or unknowns) from data?
● Formulate learning as an optimization problem w.r.t. the weights.

Parametric ML
Equation of a line:
y = mx + c

c can be considered as the bias; then m is the weight.

For simplicity, we consider the following:

w = [m c]^T
x = [x 1]^T
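A quick numeric check of this trick (the values are made up for illustration): absorbing the bias c into the weight vector turns y = mx + c into a single dot product.

```python
import numpy as np

# Appending a constant 1 to the input absorbs the bias into w,
# so y = m*x + c becomes y = w^T x.
m, c = 2.0, -1.0
w = np.array([m, c])        # w = [m c]^T
x = np.array([3.0, 1.0])    # x = [x 1]^T, here with x = 3
print(w @ x)                # 5.0, the same as m*3 + c
```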
(Figure: 1-D pictorial example of linear regression.)
Linear Regression
● Squared loss is chosen for simplicity.
● The best w minimizes the training error w.r.t. w.
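In symbols (our notation, consistent with the augmented x above, not copied from the slide), the training objective is:

```latex
% Squared training loss over N examples (x_n, y_n), with the bias
% absorbed into w as on the previous slide:
L(\mathbf{w}) = \sum_{n=1}^{N} \big(y_n - \mathbf{w}^{\top}\mathbf{x}_n\big)^2,
\qquad
\hat{\mathbf{w}} = \arg\min_{\mathbf{w}} L(\mathbf{w})
```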
Linear Regression

In w, w_d denotes the importance of the d-th input feature for predicting y.

Problems with the closed-form solution:
● Outliers or noise
● (X^T X) may not be invertible
● Overfitting: based solely on minimizing the training error
● Expensive inversion for large D
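For concreteness, a minimal NumPy sketch of the standard closed form, w = (X^T X)^{-1} X^T y, on synthetic data (the data and names are ours, not the lecture's); note that it inherits the problems listed above.

```python
import numpy as np

# np.linalg.solve avoids forming the inverse explicitly, but it still
# fails when X^T X is singular and costs O(D^3) for D features.
def fit_linear(X, y):
    return np.linalg.solve(X.T @ X, X.T @ y)

rng = np.random.default_rng(0)
X = np.c_[rng.normal(size=(50, 2)), np.ones(50)]  # two features + bias column
true_w = np.array([2.0, -3.0, 0.5])
y = X @ true_w + 0.1 * rng.normal(size=50)
print(fit_linear(X, y))  # close to [2.0, -3.0, 0.5]
```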


Ridge Regression, or Regularized Linear Regression
Why l2 regularization?
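A minimal NumPy sketch of the standard ridge closed form, w = (X^T X + λI)^{-1} X^T y, which addresses the invertibility and overfitting problems above. The regularization strength λ (written `lam` below) is a hyperparameter one could tune with the holdout procedure from earlier.

```python
import numpy as np

# Ridge regression closed form: w = (X^T X + lam * I)^{-1} X^T y.
# The lam * I term shrinks the weights (l2 regularization) and makes the
# matrix invertible even when X^T X alone is singular.
def fit_ridge(X, y, lam=1.0):
    D = X.shape[1]
    return np.linalg.solve(X.T @ X + lam * np.eye(D), X.T @ y)

# With lam = 0 this reduces to ordinary least squares (when X^T X is
# invertible); larger lam trades extra bias for lower variance.
```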
Linear and Ridge Regression
Reference

A Course in Machine Learning by Hal Daumé III.
Link: http://ciml.info/dl/v0_99/ciml-v0_99-all.pdf
