Lecture 4 and 5

MACHINE LEARNING (CS 403/603)

Bias-variance; Linear Regression and Ridge Regression

By: Dr. Puneet Gupta


Overfitting
Overfitting means fitting the training set "too well", so that performance
on the test set degrades.
Underfitting refers to a model that can neither model the training data nor
generalize to new data.

As the model keeps learning, the error on both the training and testing
data decreases. If learning goes on too long, overfitting sets in because
the model starts fitting noise and less relevant attributes, and the
performance of the model on the test set decreases. For a good model, we
stop at the point just before the test error starts increasing, i.e., the
point where the model performs well on both the training data and the
unseen testing data.
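This stopping rule is a form of early stopping. As a rough illustration
(not from the slides), here is a minimal sketch of monitoring a held-out
error during training; train_one_epoch and validation_error are
hypothetical placeholders for whatever model and data are in use:

import math

# Hypothetical helpers (assumed, not from the lecture):
# train_one_epoch(model) updates the model on the training set;
# validation_error(model) returns the error on held-out data.

def train_with_early_stopping(model, max_epochs=100, patience=5):
    best_err = math.inf
    epochs_since_best = 0
    for epoch in range(max_epochs):
        train_one_epoch(model)
        err = validation_error(model)
        if err < best_err:           # still improving on unseen data
            best_err = err
            epochs_since_best = 0
        else:                        # held-out error started rising
            epochs_since_best += 1
            if epochs_since_best >= patience:
                break                # stop before overfitting worsens
    return model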
K-Nearest Neighbor
[Figure: decision boundaries for 1-NN (K = 1) and 5-NN (K = 5)]

● In one-nearest-neighbor (1-NN), the label of x is given by the label of
  its nearest neighbor in the training data.
● Distance measures are used to find the nearest neighbor.
● Better results can be expected when more (K > 1) neighbors are utilized.
● Classification: use majority voting among the K neighbors (see the
  sketch below).
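A minimal K-NN classifier sketch in Python (an illustration, not code from
the lecture), using Euclidean distance and majority voting over NumPy
arrays:

import numpy as np
from collections import Counter

def knn_predict(X_train, y_train, x, k=5):
    # Euclidean distance from x to every training point
    dists = np.linalg.norm(X_train - x, axis=1)
    # indices of the k nearest neighbors
    nearest = np.argsort(dists)[:k]
    # majority vote over the neighbors' labels
    return Counter(y_train[nearest]).most_common(1)[0][0]

With k=1 this reduces to 1-NN; larger k smooths the decision boundary, as
in the 5-NN panel of the figure.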
Mitigating Overfitting by Holdout Model Selection Techniques

[Figure: holdout model-selection pipeline]
● Split the data into a training set, a validation set, and a test set.
● Design q different ML algorithms (Model 1, ..., Model q) by varying
  hyperparameters.
● Train each model on the training set and compute Error 1, ..., Error q
  (or average errors) on the validation set.
● Select the model with the minimum validation error as the final model,
  and report its error on the test set (see the sketch below).
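A minimal holdout-selection sketch in Python (an assumed illustration;
make_model, the hyperparameter list, and the sklearn-style fit/predict
interface are hypothetical):

import numpy as np

def holdout_select(X, y, hyperparams, make_model, seed=0):
    # shuffle and split: 60% train, 20% validation, 20% test
    rng = np.random.default_rng(seed)
    idx = rng.permutation(len(X))
    n_tr, n_va = int(0.6 * len(X)), int(0.2 * len(X))
    tr, va, te = idx[:n_tr], idx[n_tr:n_tr + n_va], idx[n_tr + n_va:]

    def mse(model, rows):
        return np.mean((model.predict(X[rows]) - y[rows]) ** 2)

    # train one model per hyperparameter setting, score on validation set
    models = [make_model(h).fit(X[tr], y[tr]) for h in hyperparams]
    errors = [mse(m, va) for m in models]
    best = int(np.argmin(errors))            # minimum validation error
    return models[best], mse(models[best], te)  # final test-set error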


Overfitting vs Underfitting
Bias and Variance

● Variance is the error from sensitivity to small fluctuations in the
  training set.
● Bias describes how far the model's fit, averaged over datasets, i.e.,
  its expected prediction, deviates from the value of the underlying
  target function.

[Figure: dartboard illustration of the four combinations of low/high bias
and low/high variance]
Expected Prediction Error and Bias-Variance Tradeoff

What is expected prediction error?
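The decomposition itself did not survive extraction; the standard answer
for squared loss, assuming y = f(x) + ε with E[ε] = 0 and Var(ε) = σ²
(expectations taken over training sets), is:

\[
\mathbb{E}\big[(y - \hat{f}(x))^2\big]
= \underbrace{\big(\mathbb{E}[\hat{f}(x)] - f(x)\big)^2}_{\text{Bias}^2}
+ \underbrace{\mathbb{E}\big[(\hat{f}(x) - \mathbb{E}[\hat{f}(x)])^2\big]}_{\text{Variance}}
+ \underbrace{\sigma^2}_{\text{irreducible noise}}
\]

Simple models tend to have high bias and low variance; flexible models
have low bias but high variance, hence the tradeoff.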


Example
A person with high bias is someone who starts to answer before you can
even finish asking. A person with high variance is someone who can think
of all sorts of crazy answers. Combining these gives:
● High bias/low variance: someone who usually gives you the same answer,
  no matter what you ask, and is always wrong about it.
● High bias/high variance: someone who takes wild guesses, all of which
  are sort of wrong; he might be right sometimes due to chance.
● Low bias/high variance: a knowledgeable person who listens to you and
  tries to answer the best they can, but who daydreams a lot and may say
  something totally crazy.
● Low bias/low variance: a person who listens to you very carefully and
  gives you good answers pretty much all the time.
Basic Maths refresher
Previous Example of Supervised Learning

[Figure: two-class dataset with labels −1 and +1]
Linear Regression
● Linear models are used in SVMs, deep learning, etc.
● Defining a rule by hand is not always feasible.
● How do we learn their weights (or unknowns) from data?
● Formulate learning as an optimization problem w.r.t. the weights.

Parametric ML
Equation of a line: y = mx + c.

If c is considered the bias, then m is the weight.

For simplicity, we absorb the bias into the weight vector:
● w = [m, c]^T
● x = [x, 1]^T
so that y = w^T x.
[Figure: 1-D pictorial example of linear regression]


Linear Regression
● Squared loss is chosen for simplicity.
● The best w minimizes the training error w.r.t. w.

Closed form solution for w:
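The formulas did not survive extraction; the standard least-squares
objective and its closed-form minimizer (consistent with the (X^T X) term
mentioned below) are:

\[
L(\mathbf{w}) = \sum_{n=1}^{N} \big(y_n - \mathbf{w}^{\top}\mathbf{x}_n\big)^2
= \|\mathbf{y} - \mathbf{X}\mathbf{w}\|^2,
\qquad
\mathbf{w}^{*} = (\mathbf{X}^{\top}\mathbf{X})^{-1}\mathbf{X}^{\top}\mathbf{y}
\]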


Linear Regression

In w, w_d denotes the importance of the d-th input feature for
predicting y.

Problems with the closed form solution (see the sketch after this list):
● Sensitivity to outliers or noise.
● (X^T X) may not be invertible.
● Overfitting: the solution is based solely on minimizing the training
  error.
● Matrix inversion is expensive for large D.
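As a rough illustration (not from the slides), a closed-form fit in NumPy;
np.linalg.lstsq is used instead of an explicit inverse for numerical
stability when X^T X is ill-conditioned:

import numpy as np

def fit_linear(X, y):
    # append a constant-1 column so the bias c is absorbed into w
    Xb = np.hstack([X, np.ones((len(X), 1))])
    # solve min_w ||y - Xb w||^2; lstsq avoids explicitly inverting
    # X^T X (which may be singular)
    w, *_ = np.linalg.lstsq(Xb, y, rcond=None)
    return w

# usage: noisy 1-D line y = 2x + 1
rng = np.random.default_rng(0)
X = rng.uniform(-1, 1, size=(100, 1))
y = 2 * X[:, 0] + 1 + 0.1 * rng.standard_normal(100)
print(fit_linear(X, y))   # approximately [2.0, 1.0]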
Ridge Regression or Regularized Linear Regression

Why l2 regularization?
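The slide's formulas are missing from the extraction; the standard ridge
objective and its closed-form solution (a hedged reconstruction) are:

\[
L_{\text{ridge}}(\mathbf{w})
= \sum_{n=1}^{N} \big(y_n - \mathbf{w}^{\top}\mathbf{x}_n\big)^2
+ \lambda \|\mathbf{w}\|^2,
\qquad
\mathbf{w}^{*} = (\mathbf{X}^{\top}\mathbf{X} + \lambda \mathbf{I})^{-1}\mathbf{X}^{\top}\mathbf{y}
\]

The l2 penalty shrinks the weights toward zero, discouraging overfitting,
and adding λI makes X^T X + λI invertible, addressing two of the problems
listed above.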
Linear and Ridge Regression
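A minimal sketch comparing the two closed-form fits (illustrative, not
from the slides; λ = 0 recovers plain linear regression):

import numpy as np

def fit_ridge(X, y, lam=1.0):
    # closed form: w = (X^T X + lam * I)^(-1) X^T y
    D = X.shape[1]
    A = X.T @ X + lam * np.eye(D)
    return np.linalg.solve(A, X.T @ y)

# usage: ridge shrinks weights relative to the unregularized fit
rng = np.random.default_rng(1)
X = rng.standard_normal((50, 3))
y = X @ np.array([1.0, -2.0, 0.5]) + 0.1 * rng.standard_normal(50)
print(fit_ridge(X, y, lam=0.0))   # ordinary least squares
print(fit_ridge(X, y, lam=10.0))  # shrunk ridge weights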
