Lecture 2

This document discusses linear discrete inverse problems and parameter estimation using least squares methods. It provides background on least squares problems, how they relate to maximum likelihood estimates, and how to assess goodness of fit. It explains that for over-determined problems with independent Gaussian errors, minimizing the weighted prediction error gives the least squares solution. It also discusses how the chi-square statistic can be used to test if a model fit is consistent with the estimated data uncertainties and describes how data errors propagate into uncertainties on the estimated model parameters.

Linear discrete inverse problems

(parameter estimation)

Least squares and all that....


Least squares problems

Least squares is the basis of many parameter estimation and data fitting procedures.

A concise tutorial can be found in Chapter 15 of the book Numerical Recipes (Press et al., 1992, Cambridge Univ. Press), available for free online at http://www.nrbook.com

A good explanation of the essentials is given in Aster et al. (2005).

2
Linear discrete inverse problems

Can a and b be resolved?

Under-determined

Over-determined

Even-determined

3
Over-determined: Linear discrete inverse problem

To find the best-fit model we can minimize the prediction error of the solution.

But the data contain errors. Assuming these are independent and normally distributed, we weight each residual inversely by the standard deviation of the corresponding (known) error distribution.

We can then obtain a least squares solution by minimizing the weighted prediction error of the solution.

4
Over-determined: Linear discrete inverse problem

We seek the model vector m which minimizes the weighted prediction error (compare with the maximum likelihood estimate).

Note that this is a quadratic function of the model vector.

Solution: differentiate with respect to m and solve for the model vector which gives a zero gradient. This gives…

This is the least-squares solution.

A solution to the normal equations:
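The equations on this slide were images in the original; a standard reconstruction, consistent with the generalized inverse quoted later in the deck:

    $$ E(m) = (d - Gm)^T C_D^{-1} (d - Gm), \qquad \nabla_m E = -2\,G^T C_D^{-1}(d - Gm) = 0, $$
    $$ G^T C_D^{-1} G\, m_{LS} = G^T C_D^{-1} d \quad\Longrightarrow\quad m_{LS} = (G^T C_D^{-1} G)^{-1} G^T C_D^{-1} d. $$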

5
Over-determined: Linear discrete inverse problem

How does the least-squares solution compare to the standard equations of linear regression?

Given N data yi with independent, normally distributed errors of standard deviation σi, what are the expressions for the model parameters m = [a, b]T?
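The expressions were given as images in the original; the standard weighted-regression formulas (assuming the straight-line model yi = a + b xi, cf. Numerical Recipes ch. 15) are:

    $$ w_i = 1/\sigma_i^2,\quad S = \sum w_i,\; S_x = \sum w_i x_i,\; S_y = \sum w_i y_i,\; S_{xx} = \sum w_i x_i^2,\; S_{xy} = \sum w_i x_i y_i, $$
    $$ a = \frac{S_{xx} S_y - S_x S_{xy}}{S\,S_{xx} - S_x^2}, \qquad b = \frac{S\,S_{xy} - S_x S_y}{S\,S_{xx} - S_x^2}. $$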

6
Linear discrete inverse problem: Least squares

What happens in the under- and even-determined cases?

Under-determined, N=1:

The matrix has a zero determinant and a zero eigenvalue. What is m? An infinite number of solutions exist.

Even-determined, N=2:

What is the prediction error r = d − Gm? The prediction error is zero!


7
Example: Over-determined, Linear discrete inverse
problem

The Ballistics example

Given the data and noise, calculate G.

Is the data fit good enough?

And how do errors in the data propagate into the solution?
8
The two questions in parameter
estimation

We have our fitted model parameters

…but we are far from finished!

We need to:

1. Assess the quality of the data fit. Goodness of fit: does the model fit the data to within the statistical uncertainty of the noise?

2. Estimate how errors in the data propagate into the model. What are the errors on the model parameters?

9
Goodness of fit

Once we have our least squares solution mLS, how do we know whether the fit is good enough given the errors in the data?

Use the prediction error at the least squares solution!

If the data errors are Gaussian, this is a chi-square statistic.
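The statistic itself was an image in the original slides; the standard form is:

    $$ \chi^2 = \sum_{i=1}^{N} \frac{\bigl(d_i - (G\,m_{LS})_i\bigr)^2}{\sigma_i^2}. $$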

10
Why do we always assume errors are Gaussian?
Probability density functions of 5 random variables

[Figure: empirical probability density functions of five random variables X1–X5, each estimated from 100000 deviates; the distribution of the sum X = Σ Xi approaches a Gaussian, illustrating the Central Limit Theorem.]
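A quick numerical illustration of the figure's point (a sketch with my own variable names, not from the original slides):

    % Central Limit Theorem demo: the sum of n independent uniform
    % deviates approaches a Gaussian shape as n grows.
    ndev = 100000;                      % deviates per variable, as in the figure
    ns = [1 2 5];                       % how many uniforms to sum
    for k = 1:numel(ns)
        X = sum(rand(ns(k), ndev), 1);  % sum of ns(k) U(0,1) variables
        subplot(1, 3, k);
        histogram(X, 50, 'Normalization', 'pdf');
        title(sprintf('sum of %d uniform deviates', ns(k)));
    end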

11
Mathematical Background:
Probability distributions

A random variable x follows a Normal or Gaussian probability density function; a random vector follows its multivariate analogue (both are written out below).

If the data covariance matrix is diagonal, then the data errors are independent.
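The density formulas were images in the original slides; the standard forms are:

    $$ f(x) = \frac{1}{\sqrt{2\pi}\,\sigma}\exp\!\left(-\frac{(x-\mu)^2}{2\sigma^2}\right), $$
    $$ f(\mathbf{d}) = \frac{1}{(2\pi)^{N/2}|C_D|^{1/2}}\exp\!\left(-\tfrac{1}{2}(\mathbf{d}-\boldsymbol{\mu})^T C_D^{-1} (\mathbf{d}-\boldsymbol{\mu})\right). $$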

12
Mathematical Background:
Probability distributions

Expectation operator

Expectation of a Gaussian random variable

Variance

Covariance

If X and Y are independent, then Cov(X, Y) = 0.
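The definitions themselves were images in the original slides; the standard forms, including the Gaussian case, are:

    $$ E[X] = \int x\,f(x)\,dx, \qquad E[X] = \mu \ \text{for } X \sim N(\mu, \sigma^2), $$
    $$ \mathrm{Var}(X) = E\bigl[(X - E[X])^2\bigr] = \sigma^2, \qquad \mathrm{Cov}(X, Y) = E\bigl[(X - E[X])(Y - E[Y])\bigr]. $$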

13
Mathematical Background:
Probability distributions

Multi-dimensional Gaussian

Expectation of a Gaussian random vector

Covariance matrix

$$ C = \begin{bmatrix} \sigma_{1,1} & \sigma_{1,2} & \sigma_{1,3} & \sigma_{1,4} \\ \sigma_{2,1} & \sigma_{2,2} & \sigma_{2,3} & \sigma_{2,4} \\ \sigma_{3,1} & \sigma_{3,2} & \sigma_{3,3} & \sigma_{3,4} \\ \sigma_{4,1} & \sigma_{4,2} & \sigma_{4,3} & \sigma_{4,4} \end{bmatrix} $$

Correlation matrix (see below)

Independence between X and Y

Positive or negative correlation
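For reference (a standard definition, not shown in the original slides), the correlation matrix has entries

    $$ \rho_{i,j} = \frac{\sigma_{i,j}}{\sqrt{\sigma_{i,i}\,\sigma_{j,j}}} \in [-1, 1], $$

so ρij = 0 for independent variables, and its sign indicates positive or negative correlation.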

14
Background: Chi-square distribution
If x follows a standard Normal distribution, what distribution does the square of x follow?

Answer: a chi-square distribution with 1 degree of freedom.

If x1 and x2 are standard Normal, what distribution does y = x1² + x2² follow?

Answer: a chi-square distribution with 2 degrees of freedom.
15
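For reference, the χ²ν probability density (a standard formula, not shown in the original slides):

    $$ f(y;\nu) = \frac{y^{\nu/2-1} e^{-y/2}}{2^{\nu/2}\,\Gamma(\nu/2)}, \qquad y \ge 0. $$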
Goodness of fit
For Gaussian data errors the weighted prediction error is a sum of squares of Gaussian random variables, hence it has a chi-square probability density function with N − M degrees of freedom.

[Figure: χ² probability density with the p = 0.95 and p = 0.05 tail points marked.]

ndf   χ²(5%)   χ²(50%)   χ²(95%)
  5     1.15     4.35     11.07
 10     3.94     9.34     18.31
 20    10.85    19.34     31.41
 50    34.76    49.33     67.50
100    77.93    99.33    124.34

The test provides a means of testing the assumptions that went into producing the least squares solution. It gives the likelihood that the fit actually achieved is reasonable.
16
Example: Goodness of fit

The Ballistics problem

Given data and noise

How many degrees of freedom? ν = N − M = 10 − 3 = 7

In practice, p-values between 0.1 and 0.9 are plausible.
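A sketch of the corresponding test in base MATLAB (the χ² value is an assumed example; gammainc(x/2, ν/2) is the χ² CDF, so no toolbox is needed):

    % p-value for a chi-square statistic with nu degrees of freedom.
    nu   = 7;                          % N - M = 10 - 3 for the ballistics problem
    chi2 = 8.0;                        % assumed example value of the statistic
    p    = 1 - gammainc(chi2/2, nu/2); % probability of exceeding chi2 by chance
    fprintf('p = %.3f\n', p);          % plausible fit if roughly 0.1 < p < 0.9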


17
Variance ratio

Another way of looking at the chi-square statistic is as the variance ratio of two distributions.

Given two sets of random samples, did they come from the same distribution or not?

The ratio of the sample variance to the assumed variance (scaled by the number of samples) follows a chi-square distribution.

Chi-square tables then tell us the likelihood that the two sets of observables come from the same distribution.

18
Goodness of fit

For Gaussian data errors the chi-square statistic has a chi-square distribution with ν = N − M degrees of freedom.

ndf   χ²(5%)   χ²(50%)   χ²(95%)
  5     1.15     4.35     11.07
 10     3.94     9.34     18.31
 20    10.85    19.34     31.41
 50    34.76    49.33     67.50
100    77.93    99.33    124.34

Exercise:

If I fit 7 data points with a straight line and get χ² = …, what would you conclude?

If I fit 102 data points with a straight line and get χ² = …, what would you conclude?

If I fit 52 data points with a straight line and get χ² = …, what would you conclude?

19
Goodness of fit

For Gaussian data errors the chi-square statistic has a chi-square distribution with ν = N − M degrees of freedom.

ndf   χ²(5%)   χ²(50%)   χ²(95%)
  5     1.15     4.35     11.07
 10     3.94     9.34     18.31
 20    10.85    19.34     31.41
 50    34.76    49.33     67.50
100    77.93    99.33    124.34

What could be the cause if:

the prediction error is much too large? (poor data fit)
- Truly unlikely data errors
- Errors in the forward theory
- Under-estimated data errors

the prediction error is too small? (too good a data fit)
- Truly unlikely data errors
- Over-estimated data errors
- Fraud!

20
Solution error
Once we have our least squares solution mLS and we know that the data fit is acceptable, how do we find the likely errors in the model parameters arising from errors in the data?

mLS = G−g d

The data set we actually observed is only one realization of


the many that could have been observed

d′ = d + ε
m′LS = mLS + εm

m′LS = G−g d′
mLS + εm = G−g (d + ε)

The effect of adding noise to the data is to add noise to the solution:

εm = G−g ε

The model noise is a linear combination of the data noise!
21
Solution error: Model Covariance

Multivariate Gaussian data error distribution.

How do we turn this into a probability distribution for the model errors?

We know that the solution error is a linear combination of the data error:

εm = G−g ε

The covariance of any linear combination Ad of Gaussian-distributed random variables d is Cov(Ad) = A CD AT.

So we have the covariance of the model parameters:
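The resulting expression (a standard reconstruction; it follows by substituting G−g for A above):

    $$ C_M = G^{-g} C_D \,(G^{-g})^T = (G^T C_D^{-1} G)^{-1}. $$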

22
Solution error: Model Covariance

The model covariance for a least squares problem depends on the data errors and not on the data itself! G is controlled by the design of the experiment.

mLS = G−g d is the least squares solution.

The data error distribution gives a model error distribution!

23
Solution error: Model Covariance

For the special case of independent data errors, CD = σ²I.

Independent data errors → correlated model errors.

For the linear regression problem:
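A standard reconstruction of this special case (with CD = σ²I the weighted expression reduces to the unweighted one):

    $$ C_M = \sigma^2 (G^T G)^{-1}. $$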

24
Confidence intervals by projection (1-D)
The model covariance CM is a symmetric M × M matrix.

In the multi-dimensional model space, the value of Δ² follows a χ² distribution with M degrees of freedom.

Projecting onto the mi axis, the 1-D confidence interval becomes mi ± Δ √((CM)ii), where Δ² follows a χ²₁ distribution.

e.g. for 90% interval on m1
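The interval itself was an image in the original; the standard form, using the 90% point of χ²₁ (Δ² = 2.71, so Δ ≈ 1.645):

    $$ m_1 \pm 1.645\,\sqrt{(C_M)_{11}}. $$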

Note this is the 90% confidence interval on m1 alone. The joint (m1, m2) 90% confidence ellipse is wider than this.
25
Example: Model Covariance and confidence intervals

For the ballistics problem:

95% confidence interval for parameter i: mi ± 1.96 √((CM)ii)

26
Confidence intervals by projection
The M-dimensional confidence ellipsoid can be projected onto any subset (or combination) of ν parameters to obtain the corresponding confidence ellipsoid.

Full M-dimensional ellipsoid

Projected ν-dimensional ellipsoid

Projected model vector

Projected covariance matrix

Chosen percentage point of the χ²ν distribution

Example: find the 90% confidence ellipse for (x, y) from a 3-D (x, y, z) ellipsoid.

Can you see that this procedure gives the same formula for the 1-D case obtained previously?
27
Recap: Goodness of fit and model covariance

Once a best-fit solution has been obtained, we test goodness of fit with a chi-square test (assuming Gaussian statistics).

If the model passes the goodness-of-fit test we may proceed to evaluating model covariance (if not, then your estimated data errors are probably too small).

Evaluate the model covariance matrix.

Plot the model, or projections of it onto chosen subsets of parameters.

Calculate confidence intervals using the projection equation, where Δ² follows a χ²₁ distribution.


28
Robust data fitting with the L1 norm

Least squares solutions are not robust to outliers.

Minimize the L1 misfit ||d − Gm||1 instead.

We can calculate an L1 solution with the IRLS (iteratively reweighted least squares) algorithm, in which a diagonal weighting matrix that depends on the current model is updated at each iteration; see the sketch below.

See section 2.4 of Aster et al. (2005).
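A minimal IRLS sketch (my own function and variable names; the 1/|ri| weighting with a small tolerance is the usual scheme):

    % Iteratively Reweighted Least Squares for min ||d - G*m||_1.
    function m = irls_l1(G, d, niter, tol)
        m = (G' * G) \ (G' * d);             % start from the L2 solution
        for k = 1:niter
            r = d - G * m;                    % current residuals
            w = 1 ./ max(abs(r), tol);        % L1 weights, guarded near r = 0
            W = diag(w);                      % diagonal weighting matrix
            m = (G' * W * G) \ (G' * W * d);  % weighted least squares update
        end
    end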

29
Monte Carlo error propagation

It is possible to define an approximate p statistic for the L1 solution and hence test the goodness of fit of the solution. However, there is no analytical solution for the error propagation.

…but Monte Carlo error propagation can be used.

1. Calculate the data prediction from the solution

2. Add a random realization of noise to the data and repeat the IRLS algorithm

3. Repeat Q times and generate difference vectors (a sketch follows below)
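A sketch of the procedure (assumes the hypothetical irls_l1 from the previous slide, an L1 solution mL1, and a vector sigma of data standard deviations):

    % Monte Carlo error propagation for the L1/IRLS solution.
    dpre = G * mL1;                             % 1. data predicted by the solution
    Q = 1000;                                   % number of noise realizations
    dm = zeros(numel(mL1), Q);
    for q = 1:Q
        dq = dpre + sigma .* randn(size(dpre)); % 2. add a noise realization
        mq = irls_l1(G, dq, 50, 1e-8);          %    and repeat IRLS
        dm(:, q) = mq - mL1;                    % 3. difference vectors
    end
    Cm_mc = cov(dm');                           % empirical model covariance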

30
Monte Carlo error propagation

For the ballistics problem we get

Compare to the LS solution without the outlier.

31
What if we do not know the errors on the data?

Both chi-square goodness-of-fit tests and model covariance calculations require knowledge of the variance of the data.

What can we do if we do not know σ?

Consider the case of independent data errors:

CD = σ²I

An estimate of σ² can then be calculated from the least squares solution itself:
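The estimator itself was an image in the original; the standard unbiased choice is

    $$ s^2 = \frac{1}{N-M}\sum_{i=1}^{N}\bigl(d_i - (G\,m_{LS})_i\bigr)^2, \qquad C_M \approx s^2 (G^T G)^{-1}. $$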

So we can still estimate model errors using the calculated data errors, but we can no longer claim anything about goodness of fit.

32
Example: Over-determined, Linear discrete inverse
problem

MATLAB exercise
Generate data with Gaussian noise for a linear regression problem and invert for the best-fitting gradient and intercept.

1. Generate xi points randomly between 0 and 10

2. Calculate data yi

3. Add N[0,σ] noise to data yi

4. Calculate G matrix

5. Use MATLAB matrix routines to solve the normal equations

6. Plot the data, the data errors, and the least squares solution (a solution sketch follows below)
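One possible solution sketch (parameter values are placeholders):

    % Linear regression y = a + b*x with synthetic Gaussian noise.
    N = 50; a = 1.0; b = 2.0; sigma = 0.5;    % placeholder values
    x = 10 * rand(N, 1);                       % 1. random x in [0, 10]
    y = a + b * x;                             % 2. noise-free data
    y = y + sigma * randn(N, 1);               % 3. add N(0, sigma) noise
    G = [ones(N, 1), x];                       % 4. G matrix for m = [a; b]
    m = (G' * G) \ (G' * y);                   % 5. solve the normal equations
    errorbar(x, y, sigma * ones(N, 1), 'o');   % 6. data with error bars
    hold on;
    xf = [0; 10];
    plot(xf, m(1) + m(2) * xf, 'r-');          %    and the least squares fit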
33
Model Resolution matrix

If we obtain a solution to an inverse problem, we can ask what its relationship is to the true solution.

mest = G−g d
But we know

d = Gmtrue
and hence

mest = G−g Gmtrue = Rmtrue

The matrix R = G−g G measures how 'good an inverse' G−g is.

The matrix R shows how the elements of mest are built from linear combinations of the true model mtrue. Hence R measures the amount of blurring produced by the inverse operator.
For the least squares solution we have

$$ G^{-g} = (G^T C_D^{-1} G)^{-1} G^T C_D^{-1} \quad\Longrightarrow\quad R = I $$

34
Example: Model resolution in a tomographic
experiment

mest = Rmtrue

If the calculated model resolution matrix looked like this:

$$ R = \begin{bmatrix} 0.75 & -0.25 & 0.25 & 0.25 \\ -0.25 & 0.75 & 0.25 & 0.25 \\ 0.25 & 0.25 & 0.75 & -0.25 \\ 0.25 & 0.25 & -0.25 & 0.75 \end{bmatrix} $$

What units do the elements of R have?

Spike test: [Figure: true model and recovered model]

35
Data Resolution matrix

If we obtain a solution to an inverse problem, we can ask how its predictions compare to the data.

dpre = Gmest
But we know

mest = G−g dobs


and hence

dpre = GG−g dobs = Ddobs

The matrix D is analogous to the model resolution matrix R, but measures how well the model produced by G−g can reproduce the data. If D = I then the data are fit exactly and the prediction error d − Gm is zero.

36
Recap: Linear discrete inverse problems

The least squares solution minimizes the prediction error.

Goodness-of-fit criteria tell us whether the least squares model adequately fits the data, given the level of noise: chi-square with N − M degrees of freedom.

The covariance matrix describes how noise propagates from the data to the estimated model: chi-square with M degrees of freedom gives confidence intervals.

The resolution matrix describes how the estimated model relates to the true model.
37
