Statistics 512 Notes 26: Decision Theory Continued
Posterior Analysis
We now develop a method for finding the Bayes rule. The Bayes risk for a prior distribution $\pi(\theta)$ is the expected loss of a decision rule $d(X)$ when $X$ is generated from the following probability model: First, the state of nature $\theta$ is generated according to the prior distribution $\pi(\theta)$. Then, the data $X$ is generated according to the distribution $f(X; \theta)$, which we will denote by $f(X \mid \theta)$.
Under this probability model (call it the Bayes model), the marginal distribution of $X$ is (for the continuous case)
$$f_X(x) = \int f(x \mid \theta)\, \pi(\theta)\, d\theta.$$
(We have used the relations
$$f(x \mid \theta)\, \pi(\theta) = f(x, \theta) = f_X(x)\, h(\theta \mid x),$$
where $h(\theta \mid x)$ denotes the posterior density of $\theta$ given $X = x$.)
Under the Bayes model, the Bayes risk of a rule $d$ is
$$B(d) = \int \left[ \int l(\theta, d(x))\, f(x \mid \theta)\, \pi(\theta)\, d\theta \right] dx = \int f_X(x) \left[ \int l(\theta, d(x))\, h(\theta \mid x)\, d\theta \right] dx.$$
Now the inner integral is the posterior risk, and since $f_X(x)$ is nonnegative, $B(d)$ can be minimized by choosing $d(x) = d^*(x)$, where $d^*(x)$ minimizes the posterior risk separately for each $x$. That is, the action $a^*$ that minimizes the posterior risk
$$\int l(\theta, a)\, h(\theta \mid x)\, d\theta$$
is the Bayes rule action: $d^*(x) = a^*$.
Example: Consider again the engineering example. Suppose that we observe $X = x_2 = 45$. In the notation of that example, the prior distribution is $\pi(\theta_1) = .8$, $\pi(\theta_2) = .2$.
We first calculate the posterior distribution:
$$h(\theta_1 \mid x_2) = \frac{f(x_2 \mid \theta_1)\, \pi(\theta_1)}{\sum_{i=1}^{2} f(x_2 \mid \theta_i)\, \pi(\theta_i)} = \frac{.3 \times .8}{.3 \times .8 + .2 \times .2} \approx .86.$$
Hence, $h(\theta_2 \mid x_2) \approx .14$.
We next calculate the posterior risk (PR) for $a_1$ and $a_2$:
$$PR(a_1) = l(\theta_1, a_1)\, h(\theta_1 \mid x_2) + l(\theta_2, a_1)\, h(\theta_2 \mid x_2) = 0 + 400 \times .14 = 56$$
and
$$PR(a_2) = l(\theta_1, a_2)\, h(\theta_1 \mid x_2) + l(\theta_2, a_2)\, h(\theta_2 \mid x_2) = 100 \times .86 + 0 = 86.$$
Since $PR(a_1) < PR(a_2)$, the Bayes rule action for $X = x_2$ is $a_1$.

Bayes Rules for Squared Error Loss
Suppose that we want to estimate $\theta$ based on a sample $X = (X_1, \ldots, X_n)$ and that the loss is squared error, so that the risk of a rule $d$ is
$$R(\theta, d) = E_\theta\left[ (d(X) - \theta)^2 \right].$$
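As a quick numeric check of the engineering example, here is a minimal sketch in Python (the likelihood values $f(x_2 \mid \theta_1) = .3$ and $f(x_2 \mid \theta_2) = .2$ and the losses are the ones appearing in the calculations above):

```python
import numpy as np

# Prior pi(theta_i), likelihoods f(x2 | theta_i), and losses l(theta_i, a_j),
# taken from the engineering example above.
prior = np.array([0.8, 0.2])
lik = np.array([0.3, 0.2])
loss = np.array([[0.0, 100.0],    # l(theta_1, a_1), l(theta_1, a_2)
                 [400.0, 0.0]])   # l(theta_2, a_1), l(theta_2, a_2)

posterior = prior * lik / np.sum(prior * lik)  # h(theta | x2) = (6/7, 1/7)
post_risk = posterior @ loss                   # PR(a_1), PR(a_2) ~ (57.1, 85.7)
print(posterior, post_risk)
print("Bayes action: a%d" % (np.argmin(post_risk) + 1))  # a1, as found above
```

The exact posterior is $(6/7, 1/7) \approx (.857, .143)$; the values 56 and 86 above come from rounding the posterior to $(.86, .14)$.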
Suppose that a Bayesian approach is taken and the prior distribution on $\theta$ is $\pi(\theta)$. Then, from the above analysis, the Bayes rule for squared error loss can be found by minimizing the posterior risk, which is
$$E\left[ (\theta - d(x))^2 \mid X = x \right].$$
We have
$$E\left[ (\theta - d(x))^2 \mid X = x \right] = Var(\theta \mid X = x) + \left( E[\theta \mid X = x] - d(x) \right)^2.$$
The first term of this last expression does not depend on $d(x)$, and the second term is minimized by $d(x) = E[\theta \mid X = x]$.
Thus, the Bayes rule for squared error loss is the mean of the posterior distribution of $\theta$:
$$d^*(x) = E[\theta \mid X = x] = \int \theta\, h(\theta \mid x)\, d\theta.$$
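As a sanity check on this result, the following sketch discretizes a posterior density on a grid and confirms that the posterior risk $E[(\theta - a)^2 \mid X = x]$ is minimized at the posterior mean (the particular density used here is an arbitrary choice for illustration):

```python
import numpy as np

# Discretize a posterior h(theta | x) on [0, 1]; the shape theta^2 (1 - theta)
# is an arbitrary choice for illustration (a Beta(3, 2) density).
theta = np.linspace(0.0, 1.0, 2001)
dt = theta[1] - theta[0]
h = theta**2 * (1.0 - theta)
h /= h.sum() * dt                     # normalize so it integrates to 1

post_mean = (theta * h).sum() * dt    # E[theta | x] = 3/5 for Beta(3, 2)

# Posterior risk E[(theta - a)^2 | x] for each candidate action a
actions = np.linspace(0.0, 1.0, 2001)
post_risk = [((theta - a)**2 * h).sum() * dt for a in actions]

print(post_mean, actions[np.argmin(post_risk)])  # agree up to grid resolution
```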
Example: A (possibly) biased coin is thrown once, and we want to estimate the probability $\theta$ of the coin landing heads on future tosses based on this single toss. Suppose that we have no idea how biased the coin is; to reflect this state of knowledge, we use a uniform prior distribution on $\theta$:
$$g(\theta) = 1, \quad 0 \le \theta \le 1.$$
Let $X = 1$ if a head appears, and let $X = 0$ if a tail appears. The distribution of $X$ given $\theta$ is
$$f(x \mid \theta) = \begin{cases} \theta, & x = 1 \\ 1 - \theta, & x = 0. \end{cases}$$
In particular, the posterior densities are
$$h(\theta \mid X = 1) = \frac{\theta}{\int_0^1 \theta\, d\theta} = 2\theta, \qquad h(\theta \mid X = 0) = \frac{1 - \theta}{\int_0^1 (1 - \theta)\, d\theta} = 2(1 - \theta),$$
so the Bayes estimates are the posterior means
$$E[\theta \mid X = 1] = \int_0^1 \theta \cdot 2\theta\, d\theta = \frac{2}{3}, \qquad E[\theta \mid X = 0] = \int_0^1 \theta \cdot 2(1 - \theta)\, d\theta = \frac{1}{3}.$$
Note that these estimates differ from the classical maximum likelihood estimates, which are 1 (for $X = 1$) and 0 (for $X = 0$).
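These posterior calculations are easy to verify numerically; the following minimal sketch integrates over a grid of $\theta$ values:

```python
import numpy as np

theta = np.linspace(0.0, 1.0, 100001)
dt = theta[1] - theta[0]

# Under the uniform prior, the posterior is proportional to the likelihood.
for x, lik in [(1, theta), (0, 1.0 - theta)]:
    post = lik / (lik.sum() * dt)           # h(theta | x): 2*theta or 2*(1-theta)
    bayes_est = (theta * post).sum() * dt   # posterior mean
    print(x, bayes_est)                     # 2/3 for x = 1, 1/3 for x = 0
```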
Comparison of the risk functions of the Bayes estimate and the MLE:
The Bayes estimate can be written as $d_{Bayes}(X) = (X + 1)/3$, and its risk function is
$$R(\theta, d_{Bayes}) = E_\theta\left[ (\theta - d_{Bayes}(X))^2 \right] = \left( \theta - \frac{1}{3} \right)^2 P_\theta(X = 0) + \left( \theta - \frac{2}{3} \right)^2 P_\theta(X = 1)$$
$$= \left( \theta - \frac{1}{3} \right)^2 (1 - \theta) + \left( \theta - \frac{2}{3} \right)^2 \theta = \frac{1}{9} - \frac{1}{3}\theta + \frac{1}{3}\theta^2.$$
The risk function of the MLE $d_{MLE}(X) = X$ is
$$R(\theta, d_{MLE}) = E_\theta\left[ (\theta - X)^2 \right] = (0 - \theta)^2 P_\theta(X = 0) + (1 - \theta)^2 P_\theta(X = 1)$$
$$= \theta^2 (1 - \theta) + (1 - \theta)^2 \theta = \theta(1 - \theta).$$
The following graph shows the risk functions of the Bayes estimate (solid line) and the MLE (dashed line). The Bayes estimate has smaller risk than the MLE over most of the range of $\theta \in [0, 1]$, but neither estimator dominates the other.
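The two risk functions derived above are easy to recompute and plot; here is a minimal sketch using matplotlib that also locates the interval on which the Bayes estimate has the smaller risk:

```python
import numpy as np
import matplotlib.pyplot as plt

theta = np.linspace(0.0, 1.0, 1001)
risk_bayes = 1/9 - theta/3 + theta**2/3   # risk of the Bayes estimate (X + 1)/3
risk_mle = theta * (1.0 - theta)          # risk of the MLE X

plt.plot(theta, risk_bayes, '-', label='Bayes estimate')
plt.plot(theta, risk_mle, '--', label='MLE')
plt.xlabel('theta')
plt.ylabel('risk')
plt.legend()
plt.show()

better = theta[risk_bayes < risk_mle]
print(better.min(), better.max())  # Bayes risk is smaller for theta in about (.09, .91)
```

Setting the two risk expressions equal gives $12\theta^2 - 12\theta + 1 = 0$, so the curves cross at $\theta = (3 \pm \sqrt{6})/6 \approx .092$ and $.908$.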
Admissibility
Minimax estimators and Bayes estimators are good
estimators in the sense that their risk functions have
certain good properties; minimax estimators minimize the
worst case risk and Bayes estimators minimize a weighted
average of the risk. It is also useful to characterize bad
estimators.
Definition: An estimator $\delta$ is inadmissible if there is an estimator $\delta'$ that dominates $\delta$, meaning that
$$R(\theta, \delta') \le R(\theta, \delta) \text{ for all } \theta \text{ and}$$
$$R(\theta, \delta') < R(\theta, \delta) \text{ for at least one } \theta.$$
If there is no estimator $\delta'$ that dominates $\delta$, then $\delta$ is admissible.
Example: Let $X$ be a sample of size one from a $N(\theta, 1)$ distribution and consider estimating $\theta$ with squared error loss.
Let $\delta(X) = 2X$. Then
$$R(\theta, \delta) = E_\theta\left[ (2X - \theta)^2 \right] = Var(2X) + \left( E_\theta[2X] - \theta \right)^2 = 4 + \theta^2.$$
The risk function of the MLE $\delta_{MLE}(X) = X$ is
$$R(\theta, \delta_{MLE}) = E_\theta\left[ (X - \theta)^2 \right] = Var(X) + \left( E_\theta[X] - \theta \right)^2 = 1.$$
Since $1 < 4 + \theta^2$ for all $\theta$, $\delta_{MLE}(X) = X$ dominates $\delta(X) = 2X$, and hence $\delta(X) = 2X$ is inadmissible.
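A Monte Carlo check of these two risk functions (a sketch; the sample size and the particular $\theta$ values are arbitrary choices):

```python
import numpy as np

rng = np.random.default_rng(0)
for theta in [-2.0, 0.0, 3.0]:
    x = rng.normal(theta, 1.0, size=1_000_000)  # X ~ N(theta, 1)
    risk_2x = np.mean((2.0 * x - theta) ** 2)   # approx 4 + theta^2
    risk_mle = np.mean((x - theta) ** 2)        # approx 1
    print(theta, risk_2x, risk_mle)
```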
Now consider the constant estimator $\delta(X) = 3$, which ignores the data entirely; its risk is $R(\theta, \delta) = (3 - \theta)^2$, so that $R(3, \delta) = 0$. Consider another estimator $\delta'$ with smaller risk. In particular, it must satisfy $R(3, \delta') \le R(3, \delta) = 0$. Hence,
$$0 = R(3, \delta') = \int (\delta'(x) - 3)^2 \frac{1}{\sqrt{2\pi}} \exp\left( -(x - 3)^2 / 2 \right) dx.$$
Thus, $\delta'(x) = 3$ for almost all $x$, so $\delta' = \delta$, and no estimator dominates $\delta$: the estimator $\delta(X) = 3$ is admissible. Even though $\delta(X) = 3$ is admissible, it is clearly a poor estimator, since it has large risk whenever $\theta$ is far from 3.

The following theorem gives conditions under which a Bayes rule is admissible.

Theorem: Suppose that $d^*$ is a Bayes rule with respect to a prior distribution on $\Theta$ and that either
(1) $\Theta$ is discrete and the prior $\pi(\theta)$ is such that $\pi(\theta) > 0$ for all $\theta$, or
(2) $\Theta$ is an interval and $d^*$ is a Bayes rule with respect to a prior density function $g(\theta)$ such that $g(\theta) > 0$ for all $\theta$, and $R(\theta, d)$ is a continuous function of $\theta$ for all $d$.
Then $d^*$ is admissible.
Proof: We will prove the theorem under assumption (2). The proof is by contradiction. Suppose that $d^*$ is inadmissible. There is then another estimate, $d$, such that $R(\theta, d) \le R(\theta, d^*)$ for all $\theta$, with strict inequality for some $\theta$, say $\theta_0$. Since $R(\theta, d^*) - R(\theta, d)$ is a continuous function of $\theta$, there is an $\epsilon > 0$ and an interval $(\theta_0 - h, \theta_0 + h)$ such that
$$R(\theta, d^*) - R(\theta, d) \ge \epsilon \quad \text{for } \theta_0 - h \le \theta \le \theta_0 + h.$$
Then,
$$B(d^*) - B(d) = \int \left[ R(\theta, d^*) - R(\theta, d) \right] g(\theta)\, d\theta \ge \int_{\theta_0 - h}^{\theta_0 + h} \left[ R(\theta, d^*) - R(\theta, d) \right] g(\theta)\, d\theta \ge \epsilon \int_{\theta_0 - h}^{\theta_0 + h} g(\theta)\, d\theta > 0,$$
which contradicts the fact that $d^*$ minimizes the Bayes risk. The proof is complete.
(Note that the conditions of the theorem are sufficient but not necessary for admissibility; for example, they do not cover the admissible estimator $\delta(X) = 3$ for the normal distribution above.)