
ML Lec8

This lecture focuses on linear models for regression, discussing both batch methods like Ordinary Least Squares (OLS) and Maximum Likelihood Estimates, as well as sequential methods such as Least Mean Squares (LMS) and Recursive Least Squares (RLS). It emphasizes the importance of modeling the relationship between input variables and target outputs, using parametric regression techniques. Additionally, the lecture covers the concepts of basis functions and the Mean Squared Error (MSE) in the context of regression analysis.


Lecture 8 Linear Model for Regression

► Regression and linear models

► Batch methods
  ► Ordinary least squares (OLS)
  ► Maximum likelihood estimates

► Sequential methods
  ► Least mean squares (LMS)
  ► Recursive (sequential) least squares (RLS)

Problem Setup

Given a set of N labeled examples $\{(\mathbf{x}_n, y_n)\}_{n=1}^{N}$, with $\mathbf{x}_n \in \mathbb{R}^d$ and $y_n \in \mathbb{R}$, the goal is to learn a mapping $f: \mathbf{x} \mapsto y$ which associates x with y, such that we can make a prediction about y when a new input x is provided.

What is regression analysis?

► Parametric regression: Assume a functional form for f(x) (e.g., linear models).
► Nonparametric regression: Do not assume a functional form for f(x).

In this lecture we focus on parametric regression.
Regression

► Regression aims at modeling the dependence of a response Y on a covariate X. In other words, the goal of regression is to predict the value of one or more continuous target variables y given the value of input vector x.
► The regression model is described by
$$y = f(\mathbf{x}) + \epsilon.$$
► Terminology:
  ► x: input, independent variable, predictor, regressor, covariate
  ► y: output, dependent variable, response
► The dependence of a response on a covariate is captured via a conditional probability distribution, $p(y \mid \mathbf{x})$.
► Depending on f(x),
  ► Linear regression with basis functions: $f(\mathbf{x}) = \mathbf{w}^\top \boldsymbol{\phi}(\mathbf{x})$.
  ► Linear regression with kernels: $f(\mathbf{x}) = \sum_{n=1}^{N} w_n\, k(\mathbf{x}, \mathbf{x}_n)$.

Regression Function: Conditional Mean

We consider the mean squared error and find the MMSE estimate:
$$f^{\ast} = \arg\min_{f}\ \mathbb{E}\big[(y - f(\mathbf{x}))^{2}\big] \quad\Longrightarrow\quad f^{\ast}(\mathbf{x}) = \mathbb{E}[y \mid \mathbf{x}].$$
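The conditional-mean result can be justified with the standard decomposition below (my own restatement of the usual argument, not taken verbatim from the slides):

$$\mathbb{E}\big[(y - f(\mathbf{x}))^{2}\big] = \mathbb{E}\big[(y - \mathbb{E}[y \mid \mathbf{x}])^{2}\big] + \mathbb{E}\big[(\mathbb{E}[y \mid \mathbf{x}] - f(\mathbf{x}))^{2}\big],$$

since the cross term vanishes once we condition on x. Only the second term depends on f, and it is minimized by choosing $f(\mathbf{x}) = \mathbb{E}[y \mid \mathbf{x}]$.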

Why Linear Models?

► Built on well-developed linear transformations.
► Can be solved analytically.
► Yield some interpretability (in contrast to deep learning).

Linear Regression
Linear regression refers to a model in which the conditional mean of y, given the value of x, is an affine function of the inputs (or, more generally, of fixed basis functions of the inputs):
$$f(\mathbf{x}) = w_0 + \sum_{j=1}^{M-1} w_j\,\phi_j(\mathbf{x}) = \mathbf{w}^\top \boldsymbol{\phi}(\mathbf{x}),$$
where $\phi_j(\mathbf{x})$ are known as basis functions and $\phi_0(\mathbf{x}) = 1$, so that $w_0$ acts as a bias term.

By using nonlinear basis functions, we allow the function f(x) to be a nonlinear function of the input vector x (but a linear function of $\mathbf{w}$).

Polynomial Regression: $\phi_j(x) = x^{j}$

[Figure: polynomial fits of increasing order to a toy data set. Figure source: Bishop's PRML]
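To make the basis-function idea concrete, here is a small NumPy sketch (my own illustration, not from the slides; the function name polynomial_design_matrix and the toy weights are assumptions) that builds a polynomial feature matrix and evaluates f(x) = w^T phi(x):

import numpy as np

def polynomial_design_matrix(x, degree):
    """Stack basis functions phi_j(x) = x**j, j = 0..degree, as columns of Phi."""
    # Column j is x**j; column 0 is all ones, playing the role of phi_0(x) = 1 (the bias).
    return np.vander(x, N=degree + 1, increasing=True)

# Toy 1-D inputs and a model that is linear in w but cubic (nonlinear) in x.
x = np.linspace(-1.0, 1.0, 5)
Phi = polynomial_design_matrix(x, degree=3)      # shape (5, 4)
w = np.array([0.5, -1.0, 2.0, 0.3])              # weights for 1, x, x^2, x^3
f = Phi @ w                                      # f(x) = w^T phi(x) evaluated at each x
print(Phi.shape, f)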

Basis Functions

► Polynomial regression: $\phi_j(x) = x^{j}$.
► Gaussian basis functions: $\phi_j(x) = \exp\!\big(-(x - \mu_j)^2 / (2s^2)\big)$ (see the sketch after this list).
► Spline basis functions: Piecewise polynomials (divide the input space up into regions and fit a different polynomial in each region).
► Many other possible basis functions: sigmoidal basis functions, hyperbolic tangent basis functions, Fourier basis, wavelet basis, and so on.
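As a companion sketch for the Gaussian entry in the list above (again my own illustration; the centers mu_j and width s are arbitrary choices), one way to form Gaussian basis features is:

import numpy as np

def gaussian_design_matrix(x, centers, s):
    """Columns are phi_j(x) = exp(-(x - mu_j)^2 / (2 s^2)), plus a leading bias column of ones."""
    phi = np.exp(-(x[:, None] - centers[None, :])**2 / (2.0 * s**2))
    return np.hstack([np.ones((x.size, 1)), phi])

x = np.linspace(0.0, 1.0, 20)
centers = np.linspace(0.0, 1.0, 9)                # centers mu_j spread evenly over the input range
Phi = gaussian_design_matrix(x, centers, s=0.1)
print(Phi.shape)                                  # (20, 10): bias column plus 9 Gaussian bumps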

Ordinary Least Squares

Loss function view
Least Squares Method

Given a set of training data $\{(\mathbf{x}_n, y_n)\}_{n=1}^{N}$, we determine the weight vector $\mathbf{w}$ which minimizes
$$E(\mathbf{w}) = \frac{1}{2}\sum_{n=1}^{N}\big(y_n - \mathbf{w}^\top\boldsymbol{\phi}(\mathbf{x}_n)\big)^{2} = \frac{1}{2}\,\big\|\mathbf{y} - \boldsymbol{\Phi}\mathbf{w}\big\|^{2},$$
where $\mathbf{y} = [y_1, \ldots, y_N]^\top$ and $\boldsymbol{\Phi}$, the $N \times M$ matrix whose n-th row is $\boldsymbol{\phi}(\mathbf{x}_n)^\top$, is known as the design matrix.

Find the estimate $\widehat{\mathbf{w}}$ such that
$$\widehat{\mathbf{w}} = \arg\min_{\mathbf{w}}\ E(\mathbf{w}),$$
where both $\mathbf{y}$ and $\boldsymbol{\Phi}$ are given.

How do you find the minimizer $\widehat{\mathbf{w}}$? Set the gradient $\nabla_{\mathbf{w}} E(\mathbf{w})$ to zero and solve for w.

Note that
$$\nabla_{\mathbf{w}} E(\mathbf{w}) = -\boldsymbol{\Phi}^\top\big(\mathbf{y} - \boldsymbol{\Phi}\mathbf{w}\big).$$
Therefore, $\nabla_{\mathbf{w}} E(\mathbf{w}) = 0$ leads to the normal equation, which is of the form
$$\boldsymbol{\Phi}^\top\boldsymbol{\Phi}\,\mathbf{w} = \boldsymbol{\Phi}^\top\mathbf{y}.$$
Thus, the LS estimate of w is given by
$$\widehat{\mathbf{w}}_{LS} = \big(\boldsymbol{\Phi}^\top\boldsymbol{\Phi}\big)^{-1}\boldsymbol{\Phi}^\top\mathbf{y}.$$
Then, we have
$$\widehat{\mathbf{w}}_{LS} = \boldsymbol{\Phi}^{\dagger}\mathbf{y},$$
where $\boldsymbol{\Phi}^{\dagger} = \big(\boldsymbol{\Phi}^\top\boldsymbol{\Phi}\big)^{-1}\boldsymbol{\Phi}^\top$ is known as the Moore-Penrose pseudo-inverse.
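A minimal numerical sketch of the closed-form OLS solution above (my own illustration; the synthetic data and variable names are assumptions). In practice, np.linalg.lstsq or the pseudo-inverse is preferred over explicitly inverting Phi^T Phi:

import numpy as np

rng = np.random.default_rng(0)

# Synthetic data: y = Phi w_true + noise, with a polynomial design matrix.
x = rng.uniform(-1.0, 1.0, size=50)
Phi = np.vander(x, N=4, increasing=True)          # basis functions 1, x, x^2, x^3
w_true = np.array([1.0, -2.0, 0.5, 3.0])
y = Phi @ w_true + 0.1 * rng.standard_normal(50)

# Normal equations: (Phi^T Phi) w = Phi^T y, i.e. w_LS = (Phi^T Phi)^{-1} Phi^T y.
w_ls = np.linalg.solve(Phi.T @ Phi, Phi.T @ y)

# Equivalent (and numerically safer) pseudo-inverse / least-squares routes.
w_pinv = np.linalg.pinv(Phi) @ y                  # Moore-Penrose pseudo-inverse
w_lstsq, *_ = np.linalg.lstsq(Phi, y, rcond=None)

print(np.allclose(w_ls, w_pinv), np.allclose(w_ls, w_lstsq))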
Least Squares

Probabilistic model view with MLE

Maximum Likelihood

We consider a linear model where the target variable $y_n$ is assumed to be generated by a deterministic function $f(\mathbf{x}_n) = \mathbf{w}^\top\boldsymbol{\phi}(\mathbf{x}_n)$ with additive Gaussian noise:
$$y_n = \mathbf{w}^\top\boldsymbol{\phi}(\mathbf{x}_n) + \epsilon_n,$$
for $n = 1, \ldots, N$ and $\epsilon_n \sim \mathcal{N}(0, \sigma^2)$.

In a compact form, we have
$$\mathbf{y} = \boldsymbol{\Phi}\mathbf{w} + \boldsymbol{\epsilon}.$$

In other words, we model the conditional distribution as
$$p(y_n \mid \mathbf{x}_n, \mathbf{w}, \sigma^2) = \mathcal{N}\big(y_n \,\big|\, \mathbf{w}^\top\boldsymbol{\phi}(\mathbf{x}_n),\ \sigma^2\big).$$

The log-likelihood is given by
$$\log p(\mathbf{y} \mid \mathbf{w}, \sigma^2) = -\frac{N}{2}\log\big(2\pi\sigma^2\big) - \frac{1}{2\sigma^2}\sum_{n=1}^{N}\big(y_n - \mathbf{w}^\top\boldsymbol{\phi}(\mathbf{x}_n)\big)^2.$$
The MLE is given by
$$\widehat{\mathbf{w}}_{ML} = \arg\max_{\mathbf{w}}\ \log p(\mathbf{y} \mid \mathbf{w}, \sigma^2),$$
leading to
$$\widehat{\mathbf{w}}_{ML} = \big(\boldsymbol{\Phi}^\top\boldsymbol{\Phi}\big)^{-1}\boldsymbol{\Phi}^\top\mathbf{y},$$
which is exactly the least squares estimate, now arrived at under a Gaussian noise assumption.
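As a quick sanity check of the statement that the Gaussian MLE coincides with the least-squares estimate, the following sketch (my own; the synthetic data, the fixed sigma, and the tolerance are assumptions) numerically minimizes the negative log-likelihood and compares it with the closed-form solution:

import numpy as np
from scipy.optimize import minimize

rng = np.random.default_rng(1)

# Synthetic data from the assumed model y_n = w^T phi(x_n) + eps_n, eps_n ~ N(0, sigma^2).
x = rng.uniform(-1.0, 1.0, size=100)
Phi = np.vander(x, N=3, increasing=True)          # basis functions 1, x, x^2
w_true = np.array([0.5, 2.0, -1.5])
sigma = 0.2
y = Phi @ w_true + sigma * rng.standard_normal(100)

def neg_log_likelihood(w):
    # Up to constants, -log p(y | w, sigma^2) = (1 / (2 sigma^2)) * ||y - Phi w||^2.
    r = y - Phi @ w
    return 0.5 * np.dot(r, r) / sigma**2

w_mle = minimize(neg_log_likelihood, x0=np.zeros(3)).x
w_ls = np.linalg.lstsq(Phi, y, rcond=None)[0]
print(np.allclose(w_mle, w_ls, atol=1e-4))        # the two estimates coincide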

Sequential Methods

LMS and RLS
Online Learning

A method of machine learning in which data becomes available in a sequential order and is used to update our best predictor for future data at each step, as opposed to batch learning techniques which generate the best predictor by learning on the entire training data set at once. [Source: Wikipedia]

Mean Squared Error (MSE)

We are interested in the MMSE estimate:
$$\mathbf{w}^{\ast} = \arg\min_{\mathbf{w}}\ \mathbb{E}\big[\big(y - \mathbf{w}^\top\boldsymbol{\phi}(\mathbf{x})\big)^2\big].$$
Sample average: $\dfrac{1}{N}\sum_{n=1}^{N}\big(y_n - \mathbf{w}^\top\boldsymbol{\phi}(\mathbf{x}_n)\big)^2$.
Instantaneous squared error: $\big(y_n - \mathbf{w}^\top\boldsymbol{\phi}(\mathbf{x}_n)\big)^2$.

Least Mean Squares (LMS)

Approximate the expected squared error by the instantaneous squared error.

LMS is a gradient-descent method which minimizes the instantaneous squared error
$$E_n(\mathbf{w}) = \frac{1}{2}\big(y_n - \mathbf{w}^\top\boldsymbol{\phi}(\mathbf{x}_n)\big)^2.$$
The gradient descent method leads to the updating rule for w that is of the form
$$\mathbf{w}_n = \mathbf{w}_{n-1} + \eta\,\big(y_n - \mathbf{w}_{n-1}^\top\boldsymbol{\phi}(\mathbf{x}_n)\big)\,\boldsymbol{\phi}(\mathbf{x}_n),$$
where η > 0 is the learning rate (a small numerical sketch follows below). [Widrow and Hoff, 1960]
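A minimal LMS sketch following the update rule above (my own illustration; the data-generating model, the basis functions, and the learning rate eta = 0.05 are assumptions):

import numpy as np

rng = np.random.default_rng(2)

# Streaming data from y_n = w^T phi(x_n) + noise, presented one sample at a time.
N, eta = 2000, 0.05
w_true = np.array([1.0, -0.5, 2.0])
w = np.zeros(3)                                   # initial guess w_0

for n in range(N):
    x = rng.uniform(-1.0, 1.0)
    phi = np.array([1.0, x, x**2])                # basis functions 1, x, x^2
    y = w_true @ phi + 0.1 * rng.standard_normal()
    e = y - w @ phi                               # instantaneous error
    w = w + eta * e * phi                         # LMS update: w_n = w_{n-1} + eta * e_n * phi_n

print(w, w_true)                                  # w should approach w_true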

Recursive (Sequential) LS

We introduce the forgetting factor λ to de-emphasize old samples, leading to the following error function
$$E_n(\mathbf{w}) = \sum_{i=1}^{n}\lambda^{\,n-i}\,\big(y_i - \mathbf{w}^\top\boldsymbol{\phi}(\mathbf{x}_i)\big)^2,$$
where $0 < \lambda \le 1$. Solving for $\mathbf{w}_n$ leads to
$$\mathbf{w}_n = \Big(\sum_{i=1}^{n}\lambda^{\,n-i}\,\boldsymbol{\phi}(\mathbf{x}_i)\boldsymbol{\phi}(\mathbf{x}_i)^\top\Big)^{-1}\sum_{i=1}^{n}\lambda^{\,n-i}\,y_i\,\boldsymbol{\phi}(\mathbf{x}_i).$$
We define
$$R_n = \sum_{i=1}^{n}\lambda^{\,n-i}\,\boldsymbol{\phi}(\mathbf{x}_i)\boldsymbol{\phi}(\mathbf{x}_i)^\top, \qquad \mathbf{b}_n = \sum_{i=1}^{n}\lambda^{\,n-i}\,y_i\,\boldsymbol{\phi}(\mathbf{x}_i), \qquad P_n = R_n^{-1}.$$
With these definitions, we have
$$\mathbf{w}_n = P_n\,\mathbf{b}_n.$$
The core idea of RLS is to apply the matrix inversion lemma
$$\big(A + \mathbf{u}\mathbf{v}^\top\big)^{-1} = A^{-1} - \frac{A^{-1}\mathbf{u}\mathbf{v}^\top A^{-1}}{1 + \mathbf{v}^\top A^{-1}\mathbf{u}}$$
to develop the sequential algorithm without matrix inversion.

The recursion for $P_n$ is given by
$$P_n = \frac{1}{\lambda}\left(P_{n-1} - \frac{P_{n-1}\,\boldsymbol{\phi}_n\boldsymbol{\phi}_n^\top\,P_{n-1}}{\lambda + \boldsymbol{\phi}_n^\top P_{n-1}\,\boldsymbol{\phi}_n}\right), \qquad \boldsymbol{\phi}_n \equiv \boldsymbol{\phi}(\mathbf{x}_n).$$

Thus, the updating rule for w is given by
$$\mathbf{w}_n = \mathbf{w}_{n-1} + P_n\,\boldsymbol{\phi}_n\,\big(y_n - \boldsymbol{\phi}_n^\top\mathbf{w}_{n-1}\big).$$
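A sketch of the RLS recursion described above (my own illustration; the forgetting factor lambda = 0.99, the initialization P_0 = delta^{-1} I with delta = 10^{-2}, and the toy streaming data are assumptions). Note that P_n phi_n equals the gain vector k_n used below:

import numpy as np

rng = np.random.default_rng(3)

lam = 0.99                                        # forgetting factor lambda
d = 3
P = (1.0 / 1e-2) * np.eye(d)                      # P_0 = delta^{-1} I, a common initialization
w = np.zeros(d)
w_true = np.array([1.0, -0.5, 2.0])

for n in range(500):
    x = rng.uniform(-1.0, 1.0)
    phi = np.array([1.0, x, x**2])                # basis functions 1, x, x^2
    y = w_true @ phi + 0.1 * rng.standard_normal()

    # Gain vector and rank-one update of P via the matrix inversion lemma.
    Pphi = P @ phi
    k = Pphi / (lam + phi @ Pphi)                 # k_n = P_{n-1} phi_n / (lambda + phi_n^T P_{n-1} phi_n)
    P = (P - np.outer(k, Pphi)) / lam             # P_n = (P_{n-1} - k_n phi_n^T P_{n-1}) / lambda
    w = w + k * (y - phi @ w)                     # w_n = w_{n-1} + k_n (y_n - phi_n^T w_{n-1})

print(w, w_true)                                  # w should track w_true closely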
