STAT 135 Lab 2 Confidence Intervals, MLE and The Delta Method

This document provides an overview of confidence intervals, maximum likelihood estimation (MLE), the method of moments (MOM), and the delta method. It defines what a confidence interval is and how it is calculated, explains how MLE finds the value of a parameter θ that maximizes the likelihood function, describes how MOM estimates parameters by equating theoretical moments of a distribution to sample moments, and introduces the delta method for approximating the variance of a function of an estimator.


STAT 135 Lab 2

Confidence Intervals, MLE and the Delta Method

February 2, 2015
Confidence Intervals
Confidence intervals

What is a confidence interval?

- A confidence interval is calculated in such a way that the
  interval contains the true value of θ with some specified
  probability (the coverage probability).

What kind of parameters can θ correspond to?

- θ = µ from N(µ, σ²)
- θ = p from Binomial(n, p)

θ typically corresponds to a parameter of a distribution, F, from
which we are sampling:

    Xi ∼ F(θ)
Confidence intervals

- We usually write the coverage probability in the form 1 − α.
- If the coverage probability is 95%, then α = 0.05.
- Let qα be the number such that P(Z < qα) = 1 − α, where Z ∼ N(0, 1).
- By the symmetry of the normal distribution, we also have qα = −q(1−α).
Confidence intervals

qα is the number such that

    P(Z < qα) = 1 − α

where Z ∼ N(0, 1).

Note also that by the symmetry of the normal distribution,

    1 − α = P(Z < qα) = P(−qα/2 < Z < qα/2)

For a 95% CI, we have:

    q0.05/2 = q0.025 = 1.96, because P(−1.96 < Z < 1.96) = 0.95

Confidence intervals

Suppose that our estimate θ̂n of θ asymptotically satisfies

    (θ̂n − θ) / σθ̂n ∼ N(0, 1)

So in all of the equations on the previous slides, we can replace Z
with (θ̂n − θ) / σθ̂n and rearrange so that θ is the subject.
Confidence intervals

Recall that

    1 − α = P(−qα/2 < Z < qα/2)

Given that (θ̂n − θ) / σθ̂n ∼ N(0, 1), we also have the result that

    1 − α = P(−qα/2 < (θ̂n − θ) / σθ̂n < qα/2)

Rearranging to make θ the subject, we have

    1 − α = P(θ̂n − qα/2 σθ̂n < θ < θ̂n + qα/2 σθ̂n)
Confidence intervals

We have that

    1 − α = P(θ̂n − qα/2 σθ̂n < θ < θ̂n + qα/2 σθ̂n)

Recall that if we're looking for a 95% confidence interval (CI), then
we are looking for an interval (a, b) such that P(a < θ < b) = 0.95.

Thus, the 95% CI for θ can be found from

    0.95 = P(θ̂n − q0.025 σθ̂n < θ < θ̂n + q0.025 σθ̂n)

For a general 100(1 − α)% CI, the interval

    [θ̂n − qα/2 σθ̂n, θ̂n + qα/2 σθ̂n]

contains θ with probability 1 − α.
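As an illustration, a hypothetical helper for a normal-approximation CI for a mean (the function and data below are made up; the sample standard deviation estimates σθ̂n):

```python
import math
from statistics import NormalDist, mean, stdev

def normal_ci(sample, alpha=0.05):
    """(1 - alpha) CI for the mean: theta_hat ± q_{alpha/2} * SE,
    with the sample standard deviation estimating sigma."""
    n = len(sample)
    theta_hat = mean(sample)
    se = stdev(sample) / math.sqrt(n)         # estimated standard error
    q = NormalDist().inv_cdf(1 - alpha / 2)   # q_{alpha/2} in the slides' notation
    return (theta_hat - q * se, theta_hat + q * se)

# Toy data: the interval is centered at the sample mean (here 10.05)
lo, hi = normal_ci([9.8, 10.1, 10.4, 9.9, 10.2, 10.0, 9.7, 10.3])
print(lo < 10.05 < hi)  # True
```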


Exercise 1

Confidence intervals - exercise

CI exercises:
1. In R, generate 1000 random samples, x1, x2, ..., x1000, from a
   (continuous) Uniform(5, 15) distribution.
2. From the 1000 numbers you have just generated, draw 100
   simple random samples (without replacement!), X1, ..., X100.
   Repeat this 1000 times, so that we have 1000 samples of size
   100.
3. For each sample of size 100, compute the sample mean, and
   produce a histogram (preferably using ggplot()) of the 1000
   sample means calculated above. What distribution does the
   sample mean (approximately) follow, and why?
4. For each sample, calculate the 95% confidence interval for the
   population mean.
5. Of the 1000 confidence intervals, what proportion of them
   cover the true mean µ = (15 + 5)/2 = 10?
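The exercise asks for R (with ggplot); as a sketch of the same logic, here is a standard-library Python version (the seed is arbitrary; sample sizes and the Uniform(5, 15) population follow the exercise):

```python
import math
import random
from statistics import NormalDist, mean, stdev

random.seed(1)  # for reproducibility

# Step 1: 1000 draws from Uniform(5, 15) -- the "population"
population = [random.uniform(5, 15) for _ in range(1000)]

q = NormalDist().inv_cdf(0.975)  # ~1.96
reps = 1000
covered = 0
for _ in range(reps):
    # Steps 2 and 4: SRS of size 100 without replacement, then a 95% CI
    sample = random.sample(population, 100)
    xbar = mean(sample)
    se = stdev(sample) / math.sqrt(100)
    if xbar - q * se < 10 < xbar + q * se:  # true mean is (15 + 5)/2 = 10
        covered += 1

print(covered / reps)  # a proportion close to 0.95
```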
Maximum likelihood estimation

Maximum likelihood estimation

- Confidence interval for θ: calculate a range of values in
  which the true value of the parameter θ lies with some
  specified probability.
- Maximum likelihood estimator for θ: calculate a single
  value which estimates the true value of θ by maximizing the
  likelihood function with respect to θ,
  i.e. find the value of θ that maximizes the likelihood of
  observing the given data.
Maximum likelihood estimation

What is the likelihood function?

- The likelihood function, lik(θ), is a function of θ which
  corresponds to the probability of observing our sample for
  various values of θ.

How do we find the value of θ that maximizes the likelihood function?
Maximum likelihood estimation

Assume that we have observed i.i.d. random variables X1 , ...., Xn


and that their distribution has density/frequency function fθ .
Suppose that the observed value of Xi is xi for each i = 1, 2, ..., n
How do we write down the likelihood function? The (non-rigorous)
idea:

lik(θ) = P(X1 = x1 , ..., Xn = xn )


= P(X1 = x1 )...P(Xn = xn )
Yn
= fθ (Xi )
i=1

(Note that this proof is not rigorous for continuous variables since
they take on specific values with probability 0)
Maximum likelihood estimation

There are 4 main steps in calculating the MLE, θ̂MLE, of θ:

1. Write down the likelihood function, lik(θ) = ∏_{i=1}^n fθ(Xi).
2. Calculate the log-likelihood function ℓ(θ) = log(lik(θ)).
   (Note: this is because it is often much easier to find the
   maximum of the log-likelihood function than of the likelihood
   function.)
3. Differentiate the log-likelihood function with respect to θ.
4. Set the derivative to 0, and solve for θ.
Maximum likelihood estimation - example

Example: Suppose Xi ∼ Bernoulli(p), so that

    fp(x) = p^x (1 − p)^(1−x)

Step 1: Write down the likelihood function:

    lik(p) = ∏_{i=1}^n fp(Xi)
           = ∏_{i=1}^n p^(Xi) (1 − p)^(1−Xi)
           = p^(∑_{i=1}^n Xi) (1 − p)^(∑_{i=1}^n (1−Xi))
Maximum likelihood estimation - example

Example: Suppose Xi ∼ Bernoulli(p), so that

    fp(x) = p^x (1 − p)^(1−x)

Step 1: lik(p) = p^(∑_{i=1}^n Xi) (1 − p)^(∑_{i=1}^n (1−Xi))

Step 2: Calculate the log-likelihood function:

    ℓ(p) = log(lik(p)) = (∑_{i=1}^n Xi) log(p) + (∑_{i=1}^n (1 − Xi)) log(1 − p)
Maximum likelihood estimation - example

Example: Suppose Xi ∼ Bernoulli(p), so that

    fp(x) = p^x (1 − p)^(1−x)

Step 2: ℓ(p) = (∑_{i=1}^n Xi) log(p) + (∑_{i=1}^n (1 − Xi)) log(1 − p)

Step 3: Differentiate the log-likelihood function with respect
to p:

    dℓ(p)/dp = (∑_{i=1}^n Xi)/p − (∑_{i=1}^n (1 − Xi))/(1 − p)
Maximum likelihood estimation - example

Example: Suppose Xi ∼ Bernoulli(p), so that

    fp(x) = p^x (1 − p)^(1−x)

Step 3: dℓ(p)/dp = (∑_{i=1}^n Xi)/p − (∑_{i=1}^n (1 − Xi))/(1 − p)

Step 4: Set the derivative to 0, and solve for p:

    dℓ(p)/dp = 0  ⟹  p̂MLE = (∑_{i=1}^n Xi)/n = X̄

So the MLE for p where Xi ∼ Bernoulli(p) is just the sample mean.
Method of Moments (MOM)

Method of Moments

- Confidence interval for θ: calculate a range of values in
  which the true value of the parameter θ lies with some
  specified probability.
- Maximum likelihood estimator for θ: calculate a single
  value which estimates the true value of θ by maximizing the
  likelihood function with respect to θ.
- Method of moments estimator for θ: by equating the
  theoretical moments to the empirical (sample) moments,
  derive equations that relate the theoretical moments to θ.
  The equations are then solved for θ.

Suppose X follows some distribution. The kth moment of the
distribution is defined to be

    µk = E[X^k] = gk(θ)

which will be some function of θ.
Method of Moments

MOM works by equating the theoretical moments (which will be a
function of θ) to the empirical moments.

    Moment           Theoretical moment    Empirical moment
    first moment     E[X]                  (1/n) ∑_{i=1}^n Xi
    second moment    E[X²]                 (1/n) ∑_{i=1}^n Xi²
    third moment     E[X³]                 (1/n) ∑_{i=1}^n Xi³
Method of Moments

MOM is perhaps best described by example.
Suppose that X ∼ Bernoulli(p). Then the first moment is given by

    E[X] = 0 × P(X = 0) + 1 × P(X = 1) = p

Moreover, we can estimate E[X] by taking a sample X1, ..., Xn
and calculating the sample mean:

    X̄ = (1/n) ∑_{i=1}^n Xi

We approximate the first theoretical moment, E[X], by the first
empirical moment, X̄, i.e.

    p̂MOM = X̄

which is the same as the MLE estimator! (Note that this is not
always the case...)
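For a two-parameter distribution, two moments are needed. A minimal sketch in Python, assuming a N(µ, σ²) model and made-up data (this example is not from the slides):

```python
from statistics import mean

# Made-up sample (illustration only)
x = [4.2, 5.1, 3.8, 4.9, 5.6, 4.4, 5.0, 4.7]

# Empirical first and second moments
m1 = mean(x)                  # estimates E[X]
m2 = mean(v * v for v in x)   # estimates E[X^2]

# For N(mu, sigma^2): E[X] = mu and E[X^2] = sigma^2 + mu^2,
# so equating theoretical to empirical moments and solving:
mu_mom = m1
sigma2_mom = m2 - m1 * m1

print(mu_mom, sigma2_mom)
```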
Exercise 2

Exercise – Question 43, Chapter 8 (page 320) from John Rice

The file gamma-arrivals contains a set of gamma-ray data
consisting of the times between arrivals (interarrival times) of
3,935 photons (units are seconds).

1. Make a histogram of the interarrival times. Does it appear
   that a gamma distribution would be a plausible model?
2. Fit the parameters by the method of moments and by
   maximum likelihood. How do the estimates compare?
3. Plot the two fitted gamma densities on top of the histogram.
   Do the fits look reasonable?

Hint 1: the gamma density can be written as

    fα,β(x) = (β^α / Γ(α)) x^(α−1) e^(−βx)
Hint 2: the MLE for α has no closed-form solution, so α̂MLE must be
found numerically.
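As a sketch of the method-of-moments part of this exercise, here is a Python version on made-up data (the numbers below are illustrative, NOT the gamma-arrivals file; for the parameterization in Hint 1, E[X] = α/β and Var(X) = α/β² are the standard moment formulas):

```python
from statistics import mean

# Made-up interarrival-like data (NOT the gamma-arrivals file)
x = [0.8, 1.3, 0.2, 2.1, 0.9, 1.7, 0.5, 1.1, 0.4, 1.5]

m1 = mean(x)                  # empirical first moment
m2 = mean(v * v for v in x)   # empirical second moment

# For Gamma(alpha, beta) as in Hint 1: E[X] = alpha/beta and
# Var(X) = m2 - m1^2 = alpha/beta^2. Solving the two equations:
var = m2 - m1 * m1
alpha_mom = m1 * m1 / var
beta_mom = m1 / var

print(alpha_mom, beta_mom)
# The fitted parameters reproduce the empirical mean exactly:
print(abs(alpha_mom / beta_mom - m1) < 1e-9)  # True
```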
The δ-method

The δ-method

Recall that the CLT says

    √n (X̄n − µ) → N(0, σ²)

What if we have some general function g(·)?

    √n (g(X̄n) − g(µ)) → ?
The δ-method

The δ-method tells us that

    √n (g(X̄n) − g(µ)) → N(0, σ² (g′(µ))²)

For a proof of the general case, see
http://en.wikipedia.org/wiki/Delta_method

This method can be used to approximate the variance of a function of
our random variables!
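A simulation sketch in Python (the Uniform(0, 1) population and g(x) = eˣ are arbitrary illustrative choices, not from the slides):

```python
import math
import random
from statistics import mean, pvariance

random.seed(0)

n = 1000      # size of each sample mean
reps = 2000   # number of simulated sample means
mu, sigma2 = 0.5, 1.0 / 12.0  # mean and variance of Uniform(0, 1)
g = math.exp                  # g(x) = e^x, so g'(mu) = e^mu

# Simulate g(X-bar_n) many times
vals = [g(mean(random.uniform(0, 1) for _ in range(n))) for _ in range(reps)]

# Delta method prediction: Var(g(X-bar_n)) ~ sigma^2 * g'(mu)^2 / n
empirical = pvariance(vals)
predicted = sigma2 * math.exp(mu) ** 2 / n
print(empirical / predicted)  # a ratio close to 1
```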
