
Parametric Inference

Math & Stat for Data Science

Graduate School of Data Science


Seoul National University
Parametric Inference
• We consider the following model:
X1, …, Xn ~ f(x; θ), θ ∈ Θ

• Θ ⊂ R^k : parameter space
• θ = (θ1, …, θk) : parameter

• Inference now amounts to estimating the parameter θ


• If the parametric assumption is wrong, parametric
inference can be inaccurate
Parametric Inference
• But parametric inference has many strengths
• Computationally tractable
• Can provide analytical solutions

• Can provide more efficient estimators (ex. lower
standard error)

• Parameters can provide a direct interpretation

• Ex. how much disease risk increases with a genetic variant
Parameter of Interest
• Usually we are interested in a subset of the
parameters, or in some function of the parameters
• Parameter of interest: T(θ)

• The other parameters are called nuisance parameters

• Ex. Normal(𝜇,𝜎)
• Usually interested in mean
• 𝜇 is the parameter of interest
• 𝜎 is the nuisance parameter
Estimation
• Suppose we want to estimate (𝜇,𝜎)
• Assume X1, …, Xn ~ N(𝜇,𝜎)

• There can be numerous ways to estimate the
parameters…

• The likelihood-based approach is the most commonly used.


Maximum Likelihood
• Likelihood function: the joint density of the data,
Ln(θ) = ∏ f(Xi; θ), product over i = 1, …, n

• But it is viewed as a function of the parameter, not of the data!
• It represents how likely each parameter value is, given the observed data.
Maximum Likelihood Estimation (MLE)

• Find the parameter values under which the observed
data are most likely
• Very intuitive idea
• Note that maximizing the likelihood is the same as
maximizing the log-likelihood, shown in symbols below

• Multiplicative constants in the likelihood do not affect the MLE
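In symbols (a standard formulation using the notation above; the display is mine, not copied from the slides):

```latex
\hat{\theta}_n = \arg\max_{\theta \in \Theta} \mathcal{L}_n(\theta)
              = \arg\max_{\theta \in \Theta} \ell_n(\theta),
\qquad
\ell_n(\theta) = \log \mathcal{L}_n(\theta) = \sum_{i=1}^{n} \log f(X_i; \theta).
```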


MLE
• Example: X1,…, Xn ~ Bernoulli(𝑝). MLE of p?
[Figure: likelihood (left) and log-likelihood (right) as functions of p, for simulated data with p = 0.3 and n = 40. Both curves peak at the same point, the MLE = 0.285.]
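A minimal sketch of the computation behind such a figure; the seed and grid are my own illustrative choices, not from the slides:

```python
import numpy as np

# Simulate Bernoulli data with true p = 0.3 and n = 40, as in the figure.
rng = np.random.default_rng(0)
x = rng.binomial(1, 0.3, size=40)

# Evaluate the log-likelihood (and likelihood) on a grid of p values.
p = np.linspace(0.001, 0.999, 999)
s, n = x.sum(), x.size
loglik = s * np.log(p) + (n - s) * np.log(1 - p)
lik = np.exp(loglik)

# Both curves peak at the same point; analytically the MLE is the sample mean.
print("grid maximizer:", p[np.argmax(loglik)], " sample mean:", x.mean())
```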
MLE
• Example: X1,…, Xn ~ Poisson(λ). MLE of λ?
[Figure: likelihood as a function of λ, for simulated data with λ = 3 and n = 40; the maximum is at the MLE = 2.84.]
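A worked answer (a standard calculation, consistent with the figure):

```latex
\ell_n(\lambda) = \sum_{i=1}^{n} \left( X_i \log\lambda - \lambda - \log X_i! \right),
\qquad
\frac{\partial \ell_n(\lambda)}{\partial \lambda}
  = \frac{\sum_{i=1}^{n} X_i}{\lambda} - n = 0
\;\Rightarrow\;
\hat{\lambda} = \bar{X}.
```

With true λ = 3, the sample mean of the simulated data lands near 3, matching the MLE = 2.84 in the figure.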
MLE
• Example: X1,…, Xn ~ Normal(𝜇,𝜎). MLE of (𝜇,𝜎)?
[Figure: contour plot of the log-likelihood over (mu, sigma), for simulated data with mu = 1, sigma = 2, n = 300; the maximum is at the MLE mu = 1.047, sigma = 1.94.]
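The analytical solution, for reference (a standard result, not from the slides):

```latex
\hat{\mu} = \bar{X} = \frac{1}{n}\sum_{i=1}^{n} X_i,
\qquad
\hat{\sigma}^2 = \frac{1}{n}\sum_{i=1}^{n} (X_i - \bar{X})^2 .
```

Note that the MLE of σ² divides by n, not by the unbiased n − 1.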


Properties of Estimation
• Properties of an estimator?
• Bias

• Consistency

• Variance

• Distribution
Properties of MLE
• Consistent
• Equivariant (next slide)
• Asymptotically normal
• Asymptotically optimal (efficient)
Equivariance of MLE

• If τ = g(θ), then the MLE of τ is τ̂ = g(θ̂)
• Convenient for finding the MLE of a transformed parameter

• Ex. MLE of exp(θ)? By equivariance, it is exp(θ̂)
Score and Fisher information
Asymptotic Distribution
Score function and Fisher Information

• Score function: first derivative of log likelihood


function
• Fisher Information: variance of score function
Fisher Information

• Equivalently, the Fisher information (FI) is the negative of the
expected second derivative of the log-likelihood:
I(θ) = −E[ ∂² log f(X; θ) / ∂θ² ]
• Beware the notation: I(θ) represents the FI of a single
observation and In(θ) of n observations, where In(θ) = n I(θ)
Score and Fisher Information
• Example: X1,…, Xn ~ Poisson(λ). Find the score and
Fisher information of λ
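A worked solution (a standard calculation):

```latex
\log f(x;\lambda) = x\log\lambda - \lambda - \log x!,
\qquad
s(x;\lambda) = \frac{x}{\lambda} - 1,
```
```latex
I(\lambda) = \mathrm{Var}\!\left(\frac{X}{\lambda} - 1\right)
           = \frac{\mathrm{Var}(X)}{\lambda^2} = \frac{1}{\lambda},
\qquad
I_n(\lambda) = \frac{n}{\lambda}.
```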
Score and Fisher Information
• Example: X1,…, Xn ~ Normal(μ,σ). Suppose σ is
known. Find the score and Fisher information of μ
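A worked solution (a standard calculation):

```latex
\log f(x;\mu) = -\frac{(x-\mu)^2}{2\sigma^2} + \text{const},
\qquad
s(x;\mu) = \frac{x-\mu}{\sigma^2},
```
```latex
I(\mu) = \mathrm{Var}\!\left(\frac{X-\mu}{\sigma^2}\right) = \frac{1}{\sigma^2},
\qquad
I_n(\mu) = \frac{n}{\sigma^2}.
```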
Why is the second derivative information?
• The second derivative measures the curvature of the log-likelihood
around its peak
• High curvature: a sharply peaked likelihood, so the data pin down θ precisely
• Low curvature: a flat likelihood, so many values of θ explain the data almost equally well
Asymptotic Distribution

1. The MLE asymptotically follows a normal distribution

2. Its variance is the inverse of the Fisher information:
θ̂ ≈ N(θ, 1 / In(θ))
Asymptotic Confidence Interval
• An approximate (1 − α) confidence interval is
θ̂ ± z(α/2) · ŝe, where ŝe = 1 / √In(θ̂)

• P-values can also be calculated using asymptotic
normality.
Example
• Example: X1,…, Xn ~ Poisson(λ). Distribution of the MLE
of λ?
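A worked answer, combining the earlier results (a standard calculation):

```latex
\hat{\lambda} = \bar{X},
\qquad
I_n(\lambda) = \frac{n}{\lambda}
\;\Rightarrow\;
\hat{\lambda} \;\approx\; N\!\left(\lambda,\; \frac{\lambda}{n}\right).
```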
Example
• Example: X1,…, Xn ~ Normal(𝜇,𝜎). Suppose 𝜎 is
known. Distribution of MLE of 𝜇?
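A worked answer (a standard calculation):

```latex
\hat{\mu} = \bar{X},
\qquad
I_n(\mu) = \frac{n}{\sigma^2}
\;\Rightarrow\;
\hat{\mu} \;\approx\; N\!\left(\mu,\; \frac{\sigma^2}{n}\right),
```

which here is not just asymptotic: X̄ is exactly normal when the data are normal.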
Computing MLE
• Usually the MLE is found by solving for the point where the
score function equals 0
• Find θ which makes
∂ℓn(θ) / ∂θ = 0
• This works when the log-likelihood is a concave function

• It is possible that we cannot obtain an analytic
solution
• Need to use a numerical method
• Gradient descent, Newton–Raphson, etc. (see the sketch below)
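A minimal sketch of a numerical MLE, using a Gamma model whose shape parameter has no closed-form MLE; the data, seed, and use of scipy.optimize.minimize are my own illustrative choices, not from the slides:

```python
import numpy as np
from scipy import optimize, stats

# Simulate data from a Gamma(shape=2, scale=1.5) model.
rng = np.random.default_rng(1)
x = rng.gamma(shape=2.0, scale=1.5, size=200)

# Negative log-likelihood; parameters are kept on the log scale
# so the optimizer can search over all of R^2.
def neg_loglik(params):
    shape, scale = np.exp(params)
    return -np.sum(stats.gamma.logpdf(x, a=shape, scale=scale))

# Numerically minimize the negative log-likelihood
# (equivalent to setting the score to zero at an interior optimum).
res = optimize.minimize(neg_loglik, x0=np.zeros(2), method="Nelder-Mead")
shape_hat, scale_hat = np.exp(res.x)
print("MLE:", shape_hat, scale_hat)
```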
Optimality and Delta method
Optimality
• Suppose X1,…, Xn ~ Normal(μ,σ)
• Two different estimators of μ:
• MLE (same as the sample mean)
• Median
Which one is optimal?

• The MLE satisfies
√n (X̄ − μ) ⇝ N(0, σ²)
• The median satisfies
√n (median − μ) ⇝ N(0, πσ²/2)
Optimality
• More generally, consider two different estimators,
Tn ≈ N(θ, t²/n) and Un ≈ N(θ, u²/n)

• Asymptotic relative efficiency:
• ARE(Un, Tn) = t² / u²
• In the normal case, ARE(Median, MLE) = 2/π ≈ 0.63
• The median effectively uses 63% of the data compared to the MLE
Optimality

• The MLE is the optimal estimator (under some regularity conditions):
no other estimator has a smaller asymptotic variance

• This is the reason why the MLE dominates parametric inference.
Delta method
• Suppose τ = g(θ), and g(θ) is a smooth function
(ex. exp(θ))
• Equivariance shows that the MLE of τ is τ̂ = g(θ̂)
• Distribution of τ̂ ?
Delta method
• If g is differentiable and g′(θ) ≠ 0, then
τ̂ ≈ N(τ, g′(θ̂)² / In(θ̂))
Delta method
• Example: X1,…, Xn ~ Bernoulli(p). MLE distribution
of log(p / (1 − p))?
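A worked solution (a standard calculation):

```latex
\hat{p} = \bar{X}, \qquad I_n(p) = \frac{n}{p(1-p)}, \qquad
g(p) = \log\frac{p}{1-p}, \quad g'(p) = \frac{1}{p(1-p)},
```
```latex
\hat{\tau} = \log\frac{\hat{p}}{1-\hat{p}}
\;\approx\;
N\!\left(\log\frac{p}{1-p},\; \frac{g'(p)^2}{I_n(p)}\right)
= N\!\left(\log\frac{p}{1-p},\; \frac{1}{n\,p(1-p)}\right).
```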
Multiparameter Models
Parametric Bootstrap
Multiparameter Models
• Now we consider multiple parameters:
• θ = (θ1, …, θk)
• MLE
• θ̂ = (θ̂1, …, θ̂k)
• Fisher Information: now a k × k matrix In(θ), with entries
[In(θ)]jl = −E[ ∂²ℓn(θ) / ∂θj ∂θl ]
Multiparameter Models
• θ̂ = (θ̂1, …, θ̂k) asymptotically follows a multivariate
normal distribution:
θ̂ ≈ N(θ, Jn)
• Note: Jn = In(θ)⁻¹
Multiparameter Models
• Example: Let X1,…, Xn ~ Normal(μ,σ) with unknown
σ. MLE distribution of (μ̂, σ̂)?
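A worked answer (a standard result):

```latex
I(\mu,\sigma) = \begin{pmatrix} 1/\sigma^2 & 0 \\ 0 & 2/\sigma^2 \end{pmatrix}
\;\Rightarrow\;
(\hat{\mu}, \hat{\sigma}) \;\approx\;
N\!\left( (\mu,\sigma),\; \mathrm{diag}\!\left(\frac{\sigma^2}{n},\; \frac{\sigma^2}{2n}\right) \right),
```

so μ̂ and σ̂ are asymptotically independent.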
Multiparameter Models
• Let τ = g(θ1, …, θk)
• Gradient: ∇g = (∂g/∂θ1, …, ∂g/∂θk)ᵀ
• Multiparameter delta method: τ̂ = g(θ̂) ≈ N(τ, ∇gᵀ Jn ∇g)
Parametric Bootstrap
• A resampling approach can be effective when
• the sample size is small, so asymptotics do not work well
• the distribution is difficult to calculate analytically

• In the previous (nonparametric) bootstrap, we simulated
X1*, …, Xn* from the empirical CDF
• Does not use any distributional assumption
• Often called the nonparametric bootstrap

• If we know the parametric form of the distribution, it
can be used in the bootstrap
Parametric Bootstrap
• Suppose
X1, …, Xn ~ f(x; θ)

• Estimate θ (using the MLE θ̂)

• Now treat f(x; θ̂) as if it were the true distribution
function

• Generate B bootstrap samples from it:
X1*, …, Xn* ~ f(x; θ̂)
• For each sample, calculate the statistic T(X1*, …, Xn*)
Parametric Bootstrap
• Example: Let x1, …, xn be observed and assumed to
follow Exponential(β = 1). Find the distribution of the MLE of
log(β) (a simulation sketch follows below)
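A minimal simulation sketch of the parametric bootstrap for this example, with β taken as the mean of the exponential so that the MLE is the sample mean; the sample size (n = 100, matching the figure below), seed, and B are my own choices:

```python
import numpy as np

rng = np.random.default_rng(2)

# Observed data: n draws from an Exponential with mean beta = 1.
n = 100
x = rng.exponential(scale=1.0, size=n)

# MLE of the mean parameter is the sample mean, so the
# plug-in estimate of log(beta) is log(x.mean()).
beta_hat = x.mean()

# Parametric bootstrap: resample from the *fitted* model f(x; beta_hat).
B = 2000
boot = rng.exponential(scale=beta_hat, size=(B, n))
log_beta_star = np.log(boot.mean(axis=1))

# Bootstrap standard error of log(beta_hat).
print("bootstrap SE of log(beta_hat):", log_beta_star.std(ddof=1))
```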
Parametric Bootstrap, when n=100
Parametric Bootstrap
• The parametric bootstrap can work better than the
asymptotic approach when the sample size is small

• In the previous example, if we reduce the sample
size to 3:
• Note: the true SD is around 0.625
Summary
• Parametric inference
• Estimate the parameter 𝜃
• Estimation methods
• Method of Moments
• Maximum Likelihood Estimator (MLE)
• MLE
• Properties
• Score function and Fisher Information
• Asymptotic distribution
• Delta method, Parametric Bootstrap
