Lecture 02: Linear models
Advanced Machine Learning
Sandjai Bhulai
8 September 2023
sin x = x − x^3/3! + x^5/5! − x^7/7! + x^9/9! − ⋯
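A quick numerical illustration (not from the slides, Python with the standard library assumed; taylor_sin is a name invented for this sketch) that the truncated series already approximates sin x well on a small interval:

import math

def taylor_sin(x, terms=5):
    """Partial sum x - x^3/3! + x^5/5! - ... with the given number of terms."""
    return sum((-1) ** k * x ** (2 * k + 1) / math.factorial(2 * k + 1) for k in range(terms))

for x in (0.5, 1.0, 2.0):
    print(x, taylor_sin(x), math.sin(x))   # the degree-9 polynomial is already close on this range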
Polynomial curve fitting
▪ Polynomial curve: y(x, w) = w_0 + w_1 x + w_2 x^2 + ⋯ + w_M x^M = ∑_{j=0}^{M} w_j x^j
▪ Performance is measured by
  E(w) = (1/2) ∑_{n=1}^{N} {y(x_n, w) − t_n}^2
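A minimal sketch of fitting polynomials of different orders by least squares and evaluating E(w). Assumptions not in the slides: NumPy, the usual sin(2πx)-plus-Gaussian-noise running example as data, and the invented helper names fit_poly and sse:

import numpy as np

rng = np.random.default_rng(0)
x = np.linspace(0, 1, 10)                                  # N = 10 inputs
t = np.sin(2 * np.pi * x) + rng.normal(0, 0.3, x.size)     # noisy targets (assumed running example)

def fit_poly(x, t, M):
    """Least-squares fit of y(x, w) = sum_{j=0}^{M} w_j x^j."""
    X = np.vander(x, M + 1, increasing=True)               # columns x^0, x^1, ..., x^M
    w, *_ = np.linalg.lstsq(X, t, rcond=None)
    return w

def sse(w, x, t):
    """E(w) = 1/2 * sum_n (y(x_n, w) - t_n)^2."""
    y = np.polyval(w[::-1], x)                             # polyval expects highest power first
    return 0.5 * np.sum((y - t) ** 2)

for M in (0, 1, 3, 9):
    w = fit_poly(x, t, M)
    print(M, sse(w, x, t))                                 # training error shrinks as M grows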
Polynomial curve fitting: order 1
[Plot: the fitted order-1 polynomial y(x, w) = ∑_{j=0}^{M} w_j x^j against the training data.]
Polynomial curve fitting: order 3
[Plot: the fitted order-3 polynomial against the training data.]
Polynomial curve fitting: order 9
[Plot: the fitted order-9 polynomial against the training data.]
Overfitting
▪ Root mean square (RMS) error: E_RMS = √(2E(w*)/N)
▪ Regularized error:
  Ẽ(w) = (1/2) ∑_{n=1}^{N} {y(x_n, w) − t_n}^2 + (λ/2) ∥w∥^2
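A sketch of the RMS error and the quadratically regularized fit in closed form. Assumptions: NumPy, the same assumed sin(2πx)-plus-noise data as above, invented names fit_ridge_poly and rms_error, and ln λ = −18 chosen only as an illustrative value:

import numpy as np

rng = np.random.default_rng(0)
x = np.linspace(0, 1, 10)
t = np.sin(2 * np.pi * x) + rng.normal(0, 0.3, x.size)

def fit_ridge_poly(x, t, M, lam):
    """Closed-form minimizer of 1/2 sum_n (y(x_n, w) - t_n)^2 + lam/2 * ||w||^2."""
    X = np.vander(x, M + 1, increasing=True)
    return np.linalg.solve(lam * np.eye(M + 1) + X.T @ X, X.T @ t)

def rms_error(w, x, t):
    """E_RMS = sqrt(2 E(w*) / N)."""
    y = np.polyval(w[::-1], x)
    return np.sqrt(np.sum((y - t) ** 2) / x.size)

for ln_lam in (-18, 0):
    w = fit_ridge_poly(x, t, M=9, lam=np.exp(ln_lam))
    print(ln_lam, rms_error(w, x, t))      # larger lambda shrinks the weights and tames overfitting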
Linear basis function models
▪ General model is
  y(x, w) = ∑_{j=0}^{M−1} w_j φ_j(x) = w^⊤ φ(x)
▪ For example, sigmoidal basis functions:
  φ_j(x) = σ((x − μ_j)/s), where σ(a) = 1/(1 + exp(−a))
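A sketch of building the design matrix Φ, with Φ[n, j] = φ_j(x_n), so that y(x, w) = w^⊤ φ(x) becomes a matrix-vector product. Assumptions not in the extracted slide text: NumPy, the invented helper design_matrix, hand-chosen centres mu and scale s, and a Gaussian-bump variant included only for comparison:

import numpy as np

def sigmoid(a):
    return 1.0 / (1.0 + np.exp(-a))

def design_matrix(x, mu, s, kind="sigmoid"):
    """Return Phi with Phi[n, j] = phi_j(x_n); column 0 is the bias phi_0(x) = 1."""
    if kind == "sigmoid":
        feats = sigmoid((x[:, None] - mu[None, :]) / s)                       # phi_j(x) = sigma((x - mu_j)/s)
    else:
        feats = np.exp(-((x[:, None] - mu[None, :]) ** 2) / (2 * s ** 2))     # Gaussian bumps (assumed variant)
    return np.hstack([np.ones((x.size, 1)), feats])

x = np.linspace(0, 1, 10)
mu = np.linspace(0, 1, 9)          # 9 basis-function centres, so M = 10 including the bias
Phi = design_matrix(x, mu, s=0.1)
print(Phi.shape)                   # (10, 10)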
▪ Note that 𝒩(x | μ, σ^2) = (1/(2πσ^2)^{1/2}) exp{−(x − μ)^2/(2σ^2)}
▪ The precision is β = 1/σ^2
▪ 𝒩(x | μ, σ^2) > 0 and ∫_{−∞}^{∞} 𝒩(x | μ, σ^2) dx = 1
Maximum likelihood
▪ Assume observations from a deterministic function with added Gaussian noise:
  t = y(x, w) + ϵ, where p(ϵ | β) = 𝒩(ϵ | 0, β^{−1})
▪ Moments of the Gaussian:
  𝔼[x] = ∫_{−∞}^{∞} x 𝒩(x | μ, σ^2) dx = μ
  𝔼[x^2] = ∫_{−∞}^{∞} x^2 𝒩(x | μ, σ^2) dx = μ^2 + σ^2
  var[x] = 𝔼[x^2] − 𝔼[x]^2 = σ^2
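A small numerical check (NumPy assumed; mu and beta are arbitrary illustrative values) that samples from the assumed noise model reproduce the stated moments:

import numpy as np

rng = np.random.default_rng(1)
mu, beta = 1.5, 4.0                      # arbitrary mean and precision; the variance is 1/beta
xs = rng.normal(mu, np.sqrt(1.0 / beta), size=200_000)

print(xs.mean())                         # ~ mu,            matching E[x] = mu
print((xs ** 2).mean())                  # ~ mu^2 + 1/beta, matching E[x^2] = mu^2 + sigma^2
print(xs.var())                          # ~ 1/beta,        matching var[x] = sigma^2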
Maximum likelihood
▪ Recall: y(x, w) = ∑_{j=0}^{M−1} w_j φ_j(x) = w^⊤ φ(x)
▪ Under this noise model the log-likelihood of the targets is
  ln p(t | w, β) = (N/2) ln β − (N/2) ln(2π) − β E_D(w),
  where E_D(w) = (1/2) ∑_{n=1}^{N} {t_n − w^⊤ φ(x_n)}^2
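A sketch of evaluating E_D(w) and the log-likelihood for a candidate w. Assumptions: NumPy, the invented helper log_likelihood, and a random design matrix and targets standing in for real data:

import numpy as np

def log_likelihood(w, Phi, t, beta):
    """ln p(t | w, beta) = N/2 * ln(beta) - N/2 * ln(2*pi) - beta * E_D(w)."""
    N = t.size
    E_D = 0.5 * np.sum((t - Phi @ w) ** 2)     # sum-of-squares error E_D(w)
    return 0.5 * N * np.log(beta) - 0.5 * N * np.log(2 * np.pi) - beta * E_D

rng = np.random.default_rng(2)
Phi = rng.normal(size=(10, 4))                 # stand-in N x M design matrix
t = rng.normal(size=10)                        # stand-in targets
print(log_likelihood(np.zeros(4), Phi, t, beta=1.0))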
Maximum likelihood
▪ Computing the gradient and setting it to zero yields
  ∇_w ln p(t | w, β) = β ∑_{n=1}^{N} {t_n − w^⊤ φ(x_n)} φ(x_n)^⊤ = 0
▪ Solving for w, we get
  w_ML = (Φ^⊤ Φ)^{−1} Φ^⊤ t,
  where (Φ^⊤ Φ)^{−1} Φ^⊤ is the Moore-Penrose pseudo-inverse of Φ
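A sketch of the maximum-likelihood solution via the pseudo-inverse (NumPy assumed; the design matrix and targets are random stand-ins). np.linalg.lstsq is used as a cross-check and is the numerically safer route in practice:

import numpy as np

rng = np.random.default_rng(3)
Phi = rng.normal(size=(20, 5))                    # arbitrary N x M design matrix
t = rng.normal(size=20)

w_ml = np.linalg.pinv(Phi) @ t                    # Moore-Penrose pseudo-inverse solution
w_lstsq, *_ = np.linalg.lstsq(Phi, t, rcond=None)
print(np.allclose(w_ml, w_lstsq))                 # True: both give the least-squares w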
▪ Geometric interpretation: y ∈ 𝒮 ⊆ 𝒯, where 𝒯 is the N-dimensional space containing t and 𝒮 is an M-dimensional subspace
▪ 𝒮 is spanned by φ_1, …, φ_M
▪ w_ML minimizes the distance between t and its orthogonal projection onto 𝒮, i.e., y
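The projection view can be checked numerically: the residual t − y is orthogonal to every column of Φ. This sketch (NumPy assumed, random stand-in data, not slide material) verifies exactly that:

import numpy as np

rng = np.random.default_rng(4)
Phi = rng.normal(size=(20, 5))
t = rng.normal(size=20)

w_ml, *_ = np.linalg.lstsq(Phi, t, rcond=None)
y = Phi @ w_ml                                        # orthogonal projection of t onto span{phi_1, ..., phi_M}
print(np.allclose(Phi.T @ (t - y), 0, atol=1e-9))     # residual is orthogonal to the subspace S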
Regularization
▪ Consider the error function
  (1/2) ∑_{n=1}^{N} {t_n − w^⊤ φ(x_n)}^2 + (λ/2) ∑_{j=1}^{M} |w_j|^q
▪ q = 1 gives the lasso penalty; q = 2 gives the quadratic (ridge) penalty
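For q = 2 the minimizer has the closed form w = (λI + Φ^⊤Φ)^{−1} Φ^⊤ t; the sketch below (NumPy assumed, ridge_fit is an invented name) implements that case. For q = 1 (lasso) there is no closed form and the problem is usually solved iteratively, e.g. by coordinate descent:

import numpy as np

def ridge_fit(Phi, t, lam):
    """Closed-form minimizer for q = 2: w = (lam*I + Phi^T Phi)^{-1} Phi^T t.
    For simplicity this penalizes all components, including any bias term."""
    M = Phi.shape[1]
    return np.linalg.solve(lam * np.eye(M) + Phi.T @ Phi, Phi.T @ t)

rng = np.random.default_rng(5)
Phi = rng.normal(size=(20, 5))     # stand-in design matrix
t = rng.normal(size=20)            # stand-in targets
print(ridge_fit(Phi, t, lam=0.1))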