
STA732

Statistical Inference
Lecture 09: Bayesian estimation

Yuansi Chen
Spring 2023
Duke University

https://www2.stat.duke.edu/courses/Spring23/sta732.01/

1
Recap from Lecture 08

1. Construct the minimum risk equivariant (MRE) estimator via conditioning on maximal invariant statistics
2. Pitman estimator of location
3. MRE for location is unbiased under squared error loss
4. MRE usually admissible

2
Where we are

• We have finished the first approach to arguing for “the best” estimator in point estimation: restricting to a small class of estimators
• Unbiased estimators
• Equivariant estimators
• We begin the second approach: global measure of optimality
• average risk
• minimax risk

3
Goal of Lecture 09

1. Bayes risk, Bayes estimator


2. Examples
3. Bayes estimators are usually biased
4. Bayes estimators are usually admissible

Chap. 7 in Keener or Chap. 4 in Lehmann and Casella

4
Bayes risk, Bayes estimator
Recall the components of a decision problem

• Data 𝑋
• Model family P = {𝑃𝜃 ∶ 𝜃 ∈ Ω}, a collection of probability
distributions on the sample space
• Loss function 𝐿: 𝐿(𝜃, 𝑑) measures the loss incurred by decision 𝑑 when the true parameter is 𝜃
• Risk function 𝑅, 𝑅(𝜃, 𝛿) = 𝔼𝜃 [𝐿(𝜃, 𝛿)]

5
The frequentist motivation of the Bayesian setup

Motivation
It is in general hard to find a uniformly minimum risk estimator; oftentimes, the risk functions of different estimators cross. This difficulty does not arise if performance is measured by a single number.

Def. Bayes risk


The Bayes risk is the average-case risk: the risk integrated w.r.t. some measure Λ on Ω, called the prior.

Remark
For now, assume Λ(Ω) = 1 (Λ is a probability measure). Later we may deal with improper priors.

6
Bayes risk

𝑅Bayes(Λ, 𝛿) = ∫_Ω 𝑅(𝜃, 𝛿) 𝑑Λ(𝜃) = 𝔼[𝑅(Θ, 𝛿)]

where Θ is the random variable with distribution Λ.

𝔼[𝑅(Θ, 𝛿)] = 𝔼[𝔼[𝐿(Θ, 𝛿(𝑋)) ∣ 𝑋]]

Both 𝑋 and Θ are considered random.


The frequentist understanding: average risk makes sense without believing the
parameter is random
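
A minimal numerical illustration of the average-risk idea (not from the slides; the normal model, the prior scale 𝜏, and the estimator 𝛿(𝑥) = 𝑥 are all illustrative choices):

```python
import numpy as np

rng = np.random.default_rng(0)

# Illustrative setup: Theta ~ N(0, tau^2) prior, X | Theta = theta ~ N(theta, 1),
# squared error loss, and the estimator delta(x) = x.
tau = 2.0
n_draws = 200_000

theta = rng.normal(0.0, tau, size=n_draws)   # draws of Theta from the prior Lambda
x = rng.normal(loc=theta, scale=1.0)         # X | Theta = theta ~ P_theta
delta = x                                    # the estimator under evaluation
bayes_risk = np.mean((delta - theta) ** 2)   # Monte Carlo estimate of E[ R(Theta, delta) ]
print(bayes_risk)                            # about 1, since R(theta, delta) = 1 for every theta here
```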

7
Bayes estimator

An estimator 𝛿 which minimizes the average risk 𝑅Bayes(Λ, ⋅) is a Bayes estimator.

8
Construct Bayes estimator

Thm 7.1 in Keener


Suppose Θ ∼ Λ, 𝑋 ∣ Θ = 𝜃 ∼ 𝑃𝜃 , and 𝐿(𝜃, 𝑑) ≥ 0 for all 𝜃 ∈ Ω and
all 𝑑. If

• 𝔼[𝐿(Θ, 𝛿0 )] < ∞ for some 𝛿0


• for a.e. 𝑥, there exists a 𝛿Λ (𝑥) minimizing

𝔼[𝐿(Θ, 𝑑) ∣ 𝑋 = 𝑥]

with respect to 𝑑

Then 𝛿Λ is a Bayes estimator.

In words: the Bayes estimator can be found by minimizing the posterior expected loss 𝔼[𝐿(Θ, 𝑑) ∣ 𝑋 = 𝑥] over 𝑑, one 𝑥 at a time.

9
proof of Thm 7.1
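
A sketch of the standard argument (the slide left space for it): for any estimator 𝛿, smoothing gives

\[
\mathbb{E}\,L(\Theta, \delta(X)) = \mathbb{E}\big[\mathbb{E}[L(\Theta, \delta(X)) \mid X]\big]
\ge \mathbb{E}\big[\mathbb{E}[L(\Theta, \delta_\Lambda(X)) \mid X]\big]
= \mathbb{E}\,L(\Theta, \delta_\Lambda(X)),
\]

since 𝛿Λ(𝑥) minimizes the inner conditional expectation for a.e. 𝑥. The assumption that some 𝛿0 has finite Bayes risk ensures the Bayes risk of 𝛿Λ is finite as well, so 𝛿Λ attains the minimum.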

10
Posterior

Def. Posterior
The conditional distribution of Θ given 𝑋, written ℒ(Θ ∣ 𝑋), is called the posterior distribution.

Remark
• Λ is usually interpreted as prior belief about Θ before seeing
the data
• ℒ(Θ ∣ 𝑋) is the belief after seeing the data

11
Posterior calculation with densities

Suppose the prior has density 𝜆(𝜃) and the likelihood is 𝑝𝜃(𝑥). Then the posterior density is

𝜆(𝜃 ∣ 𝑥) = 𝜆(𝜃)𝑝𝜃(𝑥) / 𝑞(𝑥)

where 𝑞(𝑥) = ∫_Ω 𝜆(𝜃)𝑝𝜃(𝑥) 𝑑𝜃 is the marginal density of 𝑋.


Then the Bayes estimator has the form

𝛿Λ(𝑥) = arg min_𝑑 ∫_Ω 𝐿(𝜃, 𝑑)𝜆(𝜃 ∣ 𝑥) 𝑑𝜃
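
A small numerical sketch of this recipe (the Binomial/Beta model, the absolute error loss, and the grid below are illustrative assumptions, not from the slides):

```python
import numpy as np
from scipy import stats

# Illustrative model: X | Theta = theta ~ Binomial(n, theta), Theta ~ Beta(2, 2),
# absolute error loss L(theta, d) = |theta - d|; everything is discretized on a grid over (0, 1).
n, x = 10, 7
theta_grid = np.linspace(0.001, 0.999, 999)

prior = stats.beta.pdf(theta_grid, 2, 2)     # lambda(theta)
lik = stats.binom.pmf(x, n, theta_grid)      # p_theta(x)
post = prior * lik
post /= post.sum()                           # discretized posterior lambda(theta | x)

# Bayes estimate at this x: minimize the posterior expected loss over candidate decisions d
d_grid = theta_grid
exp_loss = np.array([np.sum(np.abs(theta_grid - d) * post) for d in d_grid])
delta_x = d_grid[np.argmin(exp_loss)]
print(delta_x)   # close to the posterior median of Beta(2 + x, 2 + n - x)
```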

12
Posterior mean is Bayes estimator for squared error loss

Suppose 𝐿(𝜃, 𝑑) = (𝑔(𝜃) − 𝑑)². Then the Bayes estimator is the posterior mean 𝔼[𝑔(Θ) ∣ 𝑋].
proof:
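
The proof space was left blank on the slide; a sketch of the standard argument, assuming 𝔼[𝑔(Θ)²] < ∞: for fixed 𝑥, the posterior expected loss is a quadratic in 𝑑,

\[
\mathbb{E}\big[(g(\Theta) - d)^2 \mid X = x\big]
= \mathbb{E}[g(\Theta)^2 \mid X = x] - 2d\,\mathbb{E}[g(\Theta) \mid X = x] + d^2,
\]

which is minimized at 𝑑 = 𝔼[𝑔(Θ) ∣ 𝑋 = 𝑥]. By Thm 7.1, minimizing the posterior expected loss one 𝑥 at a time yields a Bayes estimator, so the posterior mean is Bayes.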

13
Examples
Binomial model with Beta prior

Suppose 𝑋 ∣ Θ = 𝜃 ∼ Binomial(𝑛, 𝜃) with density (𝑛 choose 𝑥) 𝜃^𝑥 (1 − 𝜃)^(𝑛−𝑥), and
Θ ∼ Beta(𝛼, 𝛽) with density [Γ(𝛼 + 𝛽) / (Γ(𝛼)Γ(𝛽))] 𝜃^(𝛼−1) (1 − 𝜃)^(𝛽−1). Find the
Bayes estimator under squared error loss.
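
The worked solution is not in the extracted slides; a sketch of the standard conjugacy computation:

\[
\lambda(\theta \mid x) \propto \theta^{x}(1-\theta)^{n-x}\,\theta^{\alpha-1}(1-\theta)^{\beta-1}
= \theta^{x+\alpha-1}(1-\theta)^{n-x+\beta-1},
\]

so Θ ∣ 𝑋 = 𝑥 ∼ Beta(𝑥 + 𝛼, 𝑛 − 𝑥 + 𝛽) and the Bayes estimator under squared error loss is the posterior mean

\[
\delta_\Lambda(x) = \frac{x + \alpha}{n + \alpha + \beta},
\]

a weighted average of the sample proportion 𝑥/𝑛 and the prior mean 𝛼/(𝛼 + 𝛽).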

14
Weighted squared error loss

Suppose 𝐿(𝜃, 𝑑) = 𝑤(𝜃)(𝑔(𝜃) − 𝑑)². Find a Bayes estimator.
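
A sketch of the standard answer (not shown on the extracted slide): for fixed 𝑥, differentiating 𝔼[𝑤(Θ)(𝑔(Θ) − 𝑑)² ∣ 𝑋 = 𝑥] in 𝑑 and setting the derivative to zero gives

\[
\delta_\Lambda(x) = \frac{\mathbb{E}[w(\Theta)\,g(\Theta) \mid X = x]}{\mathbb{E}[w(\Theta) \mid X = x]},
\]

a 𝑤-weighted posterior mean of 𝑔(Θ), assuming the relevant conditional moments are finite and 𝔼[𝑤(Θ) ∣ 𝑋 = 𝑥] > 0.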

15
Normal mean estimation

𝑋 ∣ Θ = 𝜃 ∼ 𝒩(𝜃, 𝜎²),
Θ ∼ 𝒩(𝜇, 𝜏²).
Find the Bayes estimator of the mean under squared error loss.
What if we have 𝑛 i.i.d. data points?
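
A sketch of the standard conjugate-normal answer (the worked solution is not in the extracted slides):

\[
\Theta \mid X = x \sim \mathcal{N}\!\left(\frac{\tau^2 x + \sigma^2 \mu}{\tau^2 + \sigma^2},\; \frac{\sigma^2 \tau^2}{\sigma^2 + \tau^2}\right),
\]

so the Bayes estimator under squared error loss is the posterior mean, a precision-weighted average of the observation 𝑥 and the prior mean 𝜇. With 𝑛 i.i.d. observations, the sample mean is sufficient and the same formula applies with 𝑥 replaced by the sample mean and 𝜎² by 𝜎²/𝑛.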

16
Binary classification

Suppose the parameter space Ω = {0, 1}.


ℙ(𝑋 = 𝑥 ∣ Θ = 0) = 𝑓0 (𝑥) and ℙ(𝑋 = 𝑥 ∣ Θ = 1) = 𝑓1 (𝑥). The
prior is 𝜋(1) = 𝑝, 𝜋(0) = 1 − 𝑝.

Determine a Bayes estimator under the 0-1 loss 𝐿(𝜃, 𝑑) = 1{𝑑 ≠ 𝜃}, i.e. 𝐿(𝜃, 𝑑) = 0 if 𝑑 = 𝜃 and 𝐿(𝜃, 𝑑) = 1 if 𝑑 ≠ 𝜃.
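
A sketch of the standard answer (not on the extracted slide): under 0-1 loss, the posterior expected loss of deciding 𝑑 is ℙ(Θ ≠ 𝑑 ∣ 𝑋 = 𝑥), so a Bayes rule picks the value with the larger posterior probability:

\[
\delta_\Lambda(x) =
\begin{cases}
1 & \text{if } p\, f_1(x) > (1 - p)\, f_0(x), \\
0 & \text{if } p\, f_1(x) < (1 - p)\, f_0(x),
\end{cases}
\]

with ties broken arbitrarily; equivalently, decide 1 when the likelihood ratio 𝑓1(𝑥)/𝑓0(𝑥) exceeds (1 − 𝑝)/𝑝.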

17
Bayes estimators are usually biased
Unbiased estimator under squared error loss is not Bayes

Thm Lehmann Casella 4.2.3


If 𝛿 is unbiased for 𝑔(𝜃) with 𝑅Bayes(Λ, 𝛿) < ∞, then 𝛿 is not Bayes under squared error loss unless its average risk is zero:

𝔼[(𝛿(𝑋) − 𝑔(Θ))²] = 0

18
proof:
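
The proof space was left blank; a sketch of the standard argument: suppose 𝛿 is unbiased for 𝑔(𝜃) and also Bayes under squared error loss, so 𝛿(𝑋) = 𝔼[𝑔(Θ) ∣ 𝑋] a.s. Computing the cross moment two ways by conditioning,

\[
\mathbb{E}[\delta(X)\, g(\Theta)] = \mathbb{E}\big[g(\Theta)\,\mathbb{E}[\delta(X) \mid \Theta]\big] = \mathbb{E}[g(\Theta)^2]
\quad \text{(unbiasedness)},
\]
\[
\mathbb{E}[\delta(X)\, g(\Theta)] = \mathbb{E}\big[\delta(X)\,\mathbb{E}[g(\Theta) \mid X]\big] = \mathbb{E}[\delta(X)^2]
\quad \text{(Bayes)},
\]

hence 𝔼[(𝛿(𝑋) − 𝑔(Θ))²] = 𝔼[𝛿(𝑋)²] − 2𝔼[𝛿(𝑋)𝑔(Θ)] + 𝔼[𝑔(Θ)²] = 0, i.e. the average risk is zero.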

19
Bayes estimators are usually admissible
Uniqueness of Bayes estimator under strictly convex loss

Thm. Lehmann Casella 4.1.4


Let 𝑄 be the marginal distribution of 𝑋, i.e.,
𝑄(𝐸) = ∫_Ω 𝑃𝜃(𝐸) 𝑑Λ(𝜃). Suppose 𝐿 is strictly convex. If

1. 𝑅Bayes (Λ, 𝛿Λ ) < ∞,


2. 𝑄(𝐸) = 0 implies 𝑃𝜃 (𝐸) = 0, ∀𝜃,

then the Bayes estimator 𝛿Λ is unique (a.e. with respect to 𝑃𝜃 for all
𝜃).

20
proof: Use the following lemma
Lem. Lehmann Casella exercise 1.7.26
Let 𝜙 be a strictly convex function over an interval 𝐼. If there exists a
value 𝑎0 ∈ 𝐼 minimizing 𝜙, then 𝑎0 is unique.
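
A sketch of the lemma's proof (filled in here; not from the extracted slides): if 𝑎0 ≠ 𝑎1 were two minimizers, strict convexity would give

\[
\phi\!\left(\tfrac{a_0 + a_1}{2}\right) < \tfrac{1}{2}\phi(a_0) + \tfrac{1}{2}\phi(a_1) = \min_{a \in I} \phi(a),
\]

a contradiction. Applying the lemma to 𝜙(𝑑) = 𝔼[𝐿(Θ, 𝑑) ∣ 𝑋 = 𝑥], which inherits strict convexity from 𝐿, shows that the minimizer 𝛿Λ(𝑥) is unique for 𝑄-a.e. 𝑥; condition 2 then upgrades uniqueness from a.e. 𝑄 to a.e. 𝑃𝜃 for every 𝜃.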

21
A unique Bayes estimator is admissible

Thm. Lehmann Casella 5.2.4


A unique Bayes estimator (a.s. for all 𝑃𝜃 ) is admissible.

22
proof:
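
The proof space was left blank; a sketch of the standard argument: suppose 𝛿Λ is a unique Bayes estimator but inadmissible, so some 𝛿′ satisfies 𝑅(𝜃, 𝛿′) ≤ 𝑅(𝜃, 𝛿Λ) for all 𝜃, with strict inequality for some 𝜃. Integrating against Λ,

\[
R_{\mathrm{Bayes}}(\Lambda, \delta') = \int_\Omega R(\theta, \delta')\, d\Lambda(\theta)
\le \int_\Omega R(\theta, \delta_\Lambda)\, d\Lambda(\theta) = R_{\mathrm{Bayes}}(\Lambda, \delta_\Lambda),
\]

so 𝛿′ is also Bayes. Uniqueness forces 𝛿′ = 𝛿Λ a.s. under every 𝑃𝜃, hence 𝑅(𝜃, 𝛿′) = 𝑅(𝜃, 𝛿Λ) for all 𝜃, contradicting the strict improvement.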

23
Summary

• The Bayes estimator is defined as the minimizer of the average risk over a prior on 𝜃
• The Bayes estimator can be constructed by minimizing the posterior expected loss for each 𝑥
• Bayes estimators are usually biased under squared error loss
• Bayes estimators are usually admissible under strictly convex loss

24
What is next?

• Where do priors come from?


• Pros and cons of Bayes

25
Thank you

26