Single Parametric Models

The document discusses Bayes' Theorem and its application in Bayesian inference, particularly focusing on single-parameter models. It explains how to derive posterior distributions using prior distributions and likelihoods, with examples including normal and binomial distributions. Additionally, it covers concepts of point estimation, interval estimation, and the use of credible intervals in Bayesian analysis.

Single Parameter Models

Yi Yang
Department of Biostatistics and School of Data Science
City University of Hong Kong

Outline

Bayes' Theorem
Single Parameter Models
Bayesian Inference Based on Posterior
Prediction
Prior Elicitation

Bayes' Theorem

Let A denote an event, and A^c denote its complement. Thus, A ∪ A^c = S
and A ∩ A^c = ∅, where S is the sample space. We have

    P(A) + P(A^c) = P(S) ≡ 1

Let A and B be two non-empty events, and P(A|B) denote the probability
of A given that B has occurred. From basic probabilities, we have

    P(A|B) = P(A ∩ B) / P(B),

and thus, P(A ∩ B) = P(A|B)P(B).

Likewise, P(A ∩ B) = P(B|A)P(A) and P(A^c ∩ B) = P(B|A^c)P(A^c).

Observe that

    P(A|B) = P(A ∩ B) / P(B)
           = P(A ∩ B) / [P(A ∩ B) + P(A^c ∩ B)]
           = P(B|A)P(A) / [P(B|A)P(A) + P(B|A^c)P(A^c)]

– This is Bayes' Theorem.
Bayes' Theorem (cont'd)

Example: Suppose 5% of a given population is infected with the HIV virus,
and that a certain HIV test gives a positive result 98% of the time among
patients who have HIV and 4% of the time among patients who do not
have HIV. If a given person has tested positive, what is the probability
that he/she actually has the HIV virus?

A = event: a person has HIV
B = event: tested positive

    P(A|B) = P(B|A)P(A) / [P(B|A)P(A) + P(B|A^c)P(A^c)]
           = (0.98 × 0.05) / (0.98 × 0.05 + 0.04 × 0.95)
           = 0.563

General Bayes' theorem: Let A_1, ..., A_m be mutually exclusive and
exhaustive events. (Exhaustive means A_1 ∪ ··· ∪ A_m = S.) For any event
B with P(B) > 0,

    P(A_j|B) = P(B|A_j)P(A_j) / Σ_{i=1}^{m} P(B|A_i)P(A_i),   j = 1, ..., m.

Bayes' Theorem Applied to Statistical Models

Suppose we have observed data y, which has a probability
distribution f(y|θ) that depends upon an unknown vector of
parameters θ, and π(θ) is the prior distribution of θ that represents
the experimenter's opinion about θ.

Bayes' theorem applied to statistical models (the statistical version of Bayes' Theorem):

    Posterior → p(θ|y) = p(y, θ) / m(y)
                       = f(y|θ)π(θ) / ∫_Θ f(y|θ)π(θ) dθ   (likelihood × prior over the marginal distribution of y)

Θ is the parameter space, i.e., the set of all possible values for θ.

The marginal distribution of y is a function of y alone (nothing to
do with θ), and is often called the 'normalizing constant'.

    p(θ|y) ∝ f(y|θ)π(θ)
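As a quick numerical check of the HIV screening example above (not part of the original slides), the calculation can be reproduced in R:

# Bayes' theorem for the HIV screening example
p_A <- 0.05            # P(A): prevalence of HIV
p_B_given_A <- 0.98    # P(B|A): positive test given HIV
p_B_given_Ac <- 0.04   # P(B|A^c): positive test given no HIV
p_B_given_A * p_A / (p_B_given_A * p_A + p_B_given_Ac * (1 - p_A))  # about 0.563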

Single Parameter Model: Normal with Known Variance

Consider a single observation y from a normal distribution with
known variance.

Likelihood: y ∼ N(y|θ, σ²), σ > 0 is known.

Prior on θ: θ ∼ N(θ|µ, τ²), µ ∈ R and τ > 0 are known
hyperparameters.

Posterior distribution of θ:

    p(θ|y) = N( θ | σ²/(σ² + τ²) · µ + τ²/(σ² + τ²) · y,  σ²τ²/(σ² + τ²) ).

Write B = σ²/(σ² + τ²), and note that 0 < B < 1. Then:

E(θ|y) = Bµ + (1 − B)y, a weighted average of the prior mean and
the observed data value, with weights determined sensibly by the
variances.

Var(θ|y) = Bτ² ≡ (1 − B)σ², smaller than both τ² and σ².

Precision (which is like "information") is additive:
Var⁻¹(θ|y) = Var⁻¹(θ) + Var⁻¹(y|θ).
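A small R sketch of the updating formulas above (not from the slides); the values mu = 2, tau = sigma = 1, y = 6 match the example on the next slide:

# Normal likelihood, normal prior: closed-form posterior for a single y
mu <- 2; tau <- 1      # prior mean and standard deviation
sigma <- 1             # known sampling standard deviation
y <- 6                 # the single observation
B <- sigma^2 / (sigma^2 + tau^2)
c(post_mean = B * mu + (1 - B) * y,   # 4
  post_var  = B * tau^2)              # 0.5, equal to (1 - B) * sigma^2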
Example: µ = 2, ȳ = 6, τ = σ = 1, varying n

[Figure: prior density together with the posteriors for n = 1 and n = 10.]

When n = 1 the prior and likelihood receive equal weight, so the
posterior mean is 4 = (2 + 6)/2.

When n = 10 the data dominate the prior, resulting in a posterior
mean much closer to ȳ.

The posterior variance also shrinks as n gets larger; the posterior
collapses to a point mass on ȳ as n → ∞.

Example: µ = 2, ȳ = 6, n = 1, σ = 1, varying τ

[Figure: prior density together with the posteriors for τ = 1, 2, and 5.]

When τ = 1 the prior is as informative as the likelihood, so the
posterior mean is 4 = (2 + 6)/2.

When τ = 5 the prior is almost flat over the likelihood region, and
thus is dominated by the likelihood.

As τ increases, the prior becomes "flat" relative to the likelihood
function. Such prior distributions are called "noninformative" priors.
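A rough R sketch of how the varying-n comparison above might be redrawn (not from the slides); it uses the n-observation posterior formula given a few slides later, with the sampling variance of ȳ taken as σ²/n:

# Prior N(2, 1), sigma = 1, ybar = 6; posterior for n = 1 and n = 10
mu <- 2; tau <- 1; sigma <- 1; ybar <- 6
theta <- seq(-2, 8, by = 0.01)
post_dens <- function(n) {
  B <- (sigma^2 / n) / (sigma^2 / n + tau^2)   # shrinkage weight on the prior
  dnorm(theta, mean = B * mu + (1 - B) * ybar, sd = sqrt(B * tau^2))
}
plot(theta, dnorm(theta, mu, tau), type = "l", ylim = c(0, 1.4),
     xlab = expression(theta), ylab = "density")   # prior
lines(theta, post_dens(1), lty = 2)                # posterior, n = 1
lines(theta, post_dens(10), lty = 3)               # posterior, n = 10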

Deriving the Posterior

We can find the posterior distribution of the normal mean θ via
Bayes' Theorem:

    p(θ|y) = f(y|θ)π(θ) / m(y) = f(y|θ)π(θ) / ∫_Θ f(y|θ)π(θ) dθ.

Note that m(y) does NOT depend on θ, and thus is just a constant.
That is,

    p(θ|y) ∝ f(y|θ)π(θ).

The final posterior is A · f(y|θ)π(θ), such that

    ∫ A · f(y|θ)π(θ) dθ = 1.

Deriving the Posterior

Consider the previous example: a single observation y ∼ N(y|θ, σ²)
with known σ and prior θ ∼ N(θ|µ, τ²). Can you derive the posterior?

    p(θ|y) = N( θ | σ²/(σ² + τ²) · µ + τ²/(σ² + τ²) · y,  σ²τ²/(σ² + τ²) )

Question: Now, consider n independent observations
y = (y1, ..., yn) from the normal distribution f(yi|θ) = N(yi|θ, σ²),
and the same prior π(θ) = N(θ|µ, τ²). What is the posterior of θ
now?
Bayes and Sufficiency

Recall that T(y) is sufficient for θ if the likelihood can be factored as

    f(y|θ) = h(y) g(T(y)|θ).

Implication in Bayes:

    p(θ|y) ∝ f(y|θ)π(θ) ∝ g(T(y)|θ)π(θ)

Then p(θ|y) = p(θ|T(y)) ⇒ we may work with T(y) instead of the
entire dataset y.

Again, consider n ind. observations y = (y1, ..., yn) from the normal
distribution f(yi|θ) = N(yi|θ, σ²), and prior π(θ) = N(θ|µ, τ²).

Since T(y) = ȳ is sufficient for θ, we have that p(θ|y) = p(θ|ȳ).

We know that f(ȳ|θ) = N(θ, σ²/n), and this implies that

    p(θ|ȳ) = N( θ | σ²/(σ² + nτ²) · µ + nτ²/(σ² + nτ²) · ȳ,  σ²τ²/(σ² + nτ²) ).

Single Parameter Model: Binomial Data

Example: Estimating the probability of a female birth. The currently
accepted value of the proportion of female births in large European
populations is 0.485. Recent interest has focused on factors that
may influence the sex ratio.

We consider a potential factor, the maternal condition placenta
previa, an unusual condition of pregnancy in which the placenta is
implanted low in the uterus, obstructing the fetus from a normal
vaginal delivery.

Observation: An early study concerning the sex of placenta previa
births in Germany found that of a total of 980 births, 437 were
female.

Question: How much evidence does this provide for the claim that
the proportion of female births in the population of placenta previa
births is less than the proportion of female births in the general
population?

Example: Probability of a female birth given placenta previa

Likelihood: Let

    θ = prob. of a female birth given placenta previa
    Yi = 1 if a female birth, 0 otherwise

Let X = Σ_{i=1}^{980} Yi. Assuming independent births and constant θ, we
have X|θ ∼ Binomial(980, θ),

    f(x|θ) = (980 choose x) θ^x (1 − θ)^{980 − x}.

Consider a beta prior distribution for θ:

    π(θ) = [Γ(α + β) / (Γ(α)Γ(β))] θ^{α − 1} (1 − θ)^{β − 1}.

Example: Probability of a female birth given placenta previa

The posterior distribution can be obtained via

    p(θ|x) ∝ f(x|θ) π(θ)
           = (980 choose x) [Γ(α + β) / (Γ(α)Γ(β))] θ^{x + α − 1} (1 − θ)^{980 − x + β − 1}
           ∝ θ^{x + α − 1} (1 − θ)^{980 − x + β − 1}.

The only distribution function that is proportional to the above is
Beta(x + α, 980 − x + β)!

    θ|X ∼ Beta(x + α, 980 − x + β)

Beta distributions are conjugate priors for the Binomial likelihood.
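A minimal R sketch of this conjugate update for the placenta previa data (not from the slides); the uniform Beta(1, 1) prior is just one of the choices considered on later slides:

# Beta-Binomial conjugate update
x <- 437; n <- 980
a <- 1; b <- 1                          # Beta(a, b) prior hyperparameters
c(shape1 = x + a, shape2 = n - x + b)   # posterior is Beta(438, 544)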
Three different beta priors

[Figure: densities of the Beta(1, 1), Beta(1.485, 1.515), and Beta(5.85, 6.15)
priors over θ ∈ [0, 1], with θ = 0.485 marked on the axis.]

Bayesian Inference

Now that we know what the posterior is, we can use it to make
inference about θ.

The three classes of classical, or frequentist, inference are

1. Point estimation
2. Confidence interval (CI)
3. Hypothesis testing

Each of them has its analog in the Bayesian world.

Bayesian Inference: Point Estimation

Easy! Simply choose an appropriate distributional summary:
posterior mean, median, or mode.

Mode is often easiest to compute (no integration), but is often least
representative of the "middle", especially for one-tailed distributions.

Mean has the opposite property, tending to "chase" heavy tails (just
like the sample mean X̄).

Median is probably the best compromise overall, though it can be
awkward to compute, since it is the solution θ_median to

    ∫_{−∞}^{θ_median} p(θ|x) dθ = 1/2.

Posterior estimates

Prior distribution     Posterior mode   Posterior mean   Posterior median
Beta(1, 1)             0.44592          0.44603          0.44599
Beta(1.485, 1.515)     0.44596          0.44607          0.44603
Beta(5.85, 6.15)       0.44631          0.44642          0.44639

The classical point estimate is θ̂_MLE = 437/980 = 0.44592.

Remarks:

1. A Bayes point estimate is a weighted average of a common
frequentist estimate and a parameter estimate obtained only from
the prior distribution.

2. The Bayes point estimate "shrinks" the frequentist estimate toward
the prior estimate.

3. The weight on the frequentist estimate tends to 1 as n goes to
infinity.
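The entries in the table above can be checked with a short R sketch (not from the slides), using the closed-form mode and mean of a Beta(a, b) distribution and qbeta for the median:

# Posterior mode, mean, and median for the Beta(x + a, n - x + b) posteriors
post_summary <- function(a, b) {
  A <- 437 + a; B <- 980 - 437 + b
  c(mode = (A - 1) / (A + B - 2), mean = A / (A + B), median = qbeta(0.5, A, B))
}
round(post_summary(1, 1), 5)            # Beta(1, 1) prior
round(post_summary(1.485, 1.515), 5)    # Beta(1.485, 1.515) prior
round(post_summary(5.85, 6.15), 5)      # Beta(5.85, 6.15) prior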
Bayesian Inference: Interval Estimation

The Bayesian analogue of a frequentist CI is referred to as a credible
interval: a 100 × (1 − α)% credible interval for θ is a subset C of Θ
such that

    P(C|y) = ∫_C p(θ|y) dθ ≥ 1 − α.

Unlike the classical confidence interval, it has a proper probability
interpretation: "The probability that θ lies in C is (1 − α)."

Two principles used in constructing a credible interval C:

The volume of C should be as small as possible.

The posterior density should be greater for every θ ∈ C than it is for
any θ ∉ C.

The two criteria turn out to be equivalent.

HPD Credible Interval

Definition: The 100(1 − α)% highest posterior density (HPD) credible
interval for θ is a subset C of Θ such that

    C = {θ ∈ Θ : p(θ|y) ≥ k(α)},

where k(α) is the largest constant for which

    P(C|y) ≥ 1 − α.

[Figure: two posterior densities, each shown with its 95% HPD interval.]

An HPD credible interval has the smallest volume of all intervals of the
same α level.

Equal-tail Credible Interval

Simpler alternative: the equal-tail interval, or central posterior
interval, which takes the α/2- and (1 − α/2)-quantiles of p(θ|y).

Specifically, consider qL and qU, the α/2- and (1 − α/2)-quantiles of
p(θ|y):

    ∫_{−∞}^{qL} p(θ|y) dθ = α/2   and   ∫_{qU}^{∞} p(θ|y) dθ = α/2.

Clearly, P(qL < θ < qU |y) = 1 − α; our confidence that θ lies in
(qL, qU) is 100 × (1 − α)%. Thus, this interval is a 100 × (1 − α)%
credible interval for θ.

This interval is usually slightly wider than the HPD interval, but easier
to compute (just two quantiles), and also transformation invariant.

Equal-tail intervals do not work well for multimodal posteriors.

Interval Estimation: Example

Using a Gamma(2, 1) posterior distribution and k(α) = 0.1:

[Figure: the Gamma(2, 1) posterior density with the 87% HPD interval,
(0.12, 3.59), and the 87% equal-tail interval, (0.42, 4.39).]
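A sketch of how both 87% intervals in this example can be computed in R (not from the slides), taking the posterior as Gamma with shape 2 and rate 1:

# HPD endpoints are where the posterior density equals k(alpha) = 0.1
k <- 0.1
f <- function(x) dgamma(x, shape = 2, rate = 1) - k
hpd <- c(uniroot(f, c(0, 1))$root, uniroot(f, c(1, 10))$root)
hpd                                               # about (0.12, 3.59)
cover <- pgamma(hpd[2], 2, 1) - pgamma(hpd[1], 2, 1)
cover                                             # about 0.87
# Equal-tail interval with the same coverage
alpha <- 1 - cover
qgamma(c(alpha/2, 1 - alpha/2), shape = 2, rate = 1)   # about (0.42, 4.39)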
Example: probability of a female birth

f(X|θ) = Bin(980, θ), π(θ) = Beta(1, 1), x_obs = 437

[Figure: posterior density of θ with the posterior median and the 95%
equal-tail interval marked as vertical lines.]

Plot the posterior Beta(x_obs + 1, n − x_obs + 1) = Beta(438, 544) in R:

theta <- seq(from=0, to=1, by=0.01)
xobs <- 437; n <- 980
plot(theta, dbeta(theta, xobs+1, n-xobs+1), type="l", xlim=c(0.35,0.55))

Add the posterior median (solid line) and 95% equal-tail Bayesian CI (dotted vertical lines):

abline(v=qbeta(.5, xobs+1, n-xobs+1))
abline(v=qbeta(c(.025,.975), xobs+1, n-xobs+1), lty=2)

Bayesian Hypothesis Testing

To test a hypothesis of H0 versus H1:

The classical approach bases the accept/reject decision on

    p-value = P{ T(Y) more "extreme" than T(y_obs) | θ, H0 },

where "extremeness" is in the direction of HA.

Several problems with this approach:

hypotheses must be nested

the p-value can only offer evidence against the null

the p-value is not the "probability that H0 is true" (but is often
erroneously interpreted this way)

As a result of the dependence on "more extreme" T(Y) values, two
experiments with identical likelihoods could result in different
p-values, violating the Likelihood Principle.

Bayes Factor

Hypothesis testing in the Bayesian framework is often translated into a model
selection problem: Model M1 under H1 versus Model M0 under H0.

The quantity commonly used for Bayesian hypothesis testing and model
selection is the Bayes factor (BF):

    BF = [P(M1|y)/P(M0|y)] / [P(M1)/P(M0)]     (posterior odds ratio / prior odds ratio)
       = { [P(M1, y)/m(y)] / P(M1) } / { [P(M0, y)/m(y)] / P(M0) }
       = p(y|M1) / p(y|M0)
       = ∫_{Θ_M1} p(y|θ, M1) π(θ|M1) dθ / ∫_{Θ_M0} p(y|θ, M0) π(θ|M0) dθ

Bayes Factor vs Likelihood Ratio Test

Bayes factors can also be written in a form similar to the likelihood
ratio test:

    BF = p(y|M1) / p(y|M0)

We integrate over the parameter space instead of maximizing over it.

The Bayes factor reduces to a likelihood ratio test in the case of a
simple vs. simple hypothesis test, i.e., H0: θ = θ0 vs H1: θ = θ1.

Other advantages of the Bayes factor:

– The BF does NOT require nested models.
– The BF has a nice interpretation: large values of BF favor M1 (H1).
Interpretation of Bayes Factor

Possible interpretations:

BF           Strength of evidence
1 to 3       barely worth mentioning
3 to 20      positive
20 to 150    strong
> 150        very strong

These are subjective interpretations and not uniformly agreed upon.

Example: Probability of a female birth

Data: x = 437 out of n = 980 placenta previa births were female.

We test the hypothesis H0: θ ≥ 0.485 vs. H1: θ < 0.485.

Choose the uniform prior π(θ) = Beta(1, 1); the prior probability
of H1 is

    P(θ < 0.485) = 0.485.

The posterior is p(θ|x) = Beta(438, 544), and the posterior
probability of H1 is

    P(θ < 0.485 | x = 437) = 0.993.

The Bayes factor is

    BF = [0.993/(1 − 0.993)] / [0.485/(1 − 0.485)] = 150.6,

strong evidence in favor of H1, i.e., a substantially lower proportion of
female births in the population of placenta previa births than in the
general population.
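This Bayes factor can be reproduced with two calls to pbeta (a quick check, not from the slides):

# Posterior and prior probabilities of H1: theta < 0.485
prior_H1 <- pbeta(0.485, 1, 1)        # = 0.485 under the Beta(1, 1) prior
post_H1  <- pbeta(0.485, 438, 544)    # about 0.993
(post_H1 / (1 - post_H1)) / (prior_H1 / (1 - prior_H1))   # BF, roughly 150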

Limitations and Alternatives

Limitations:

NOT well-defined when the prior π(θ|H) is improper

may be sensitive to the choice of prior

Alternatives for model checking:

Conditional predictive distribution

    f(yi | y_(i)) = f(y) / f(y_(i)) = ∫ f(yi | θ, y_(i)) p(θ | y_(i)) dθ,

which will be proper if p(θ | y_(i)) is.

Penalized likelihood criteria: the Akaike information criterion (AIC),
Bayesian information criterion (BIC), or Deviance information
criterion (DIC).

Bayesian Prediction

We are often interested in predicting a future observation, y_{n+1},
given the observed data y = (y1, ..., yn). A necessary assumption is
exchangeability.

Exchangeability: Given a parametric model f(y|θ), observations
y1, ..., yn, y_{n+1} are conditionally independent, i.e., the joint
density f(y1, ..., y_{n+1}) is invariant to permutation of the indexes.

Under this assumption, we can predict a future observation, y_{n+1},
conditional on the observed data:

    p(y_{n+1} | y) = ∫ f(y_{n+1} | θ) p(θ | y) dθ,

where p(y_{n+1} | y) is known as the posterior predictive distribution.

The frequentist would use f(y_{n+1} | θ̂) here, which is asymptotically
equivalent to p(y_{n+1} | y) above (i.e., when p(θ|y) is a point mass at θ̂).
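One common way to evaluate this integral in practice is by Monte Carlo: draw θ from the posterior, then draw y_{n+1} from the likelihood. The R sketch below is not from the slides and uses illustrative posterior values from the earlier normal example (posterior mean 4, posterior variance 0.5, σ = 1):

# Monte Carlo approximation of the posterior predictive distribution
set.seed(1)
post_mean <- 4; post_sd <- sqrt(0.5); sigma <- 1
theta_draws <- rnorm(10000, post_mean, post_sd)   # theta ~ p(theta | y)
ynew_draws  <- rnorm(10000, theta_draws, sigma)   # y_{n+1} ~ f(y | theta)
c(mean(ynew_draws), var(ynew_draws))   # close to 4 and 0.5 + sigma^2 = 1.5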
Example: Predicting the sex of a future birth

Given a Beta(1, 1) prior, the posterior of θ is Beta(438, 544). The
posterior predictive distribution for the sex of a future birth y* is thus

    p(y*|y) = ∫_0^1 θ^{y*} (1 − θ)^{1 − y*} · [Γ(982) / (Γ(438)Γ(544))] θ^{437} (1 − θ)^{543} dθ

This is known as the beta-binomial distribution. The mean and variance
of the posterior predictive distribution can be obtained via

    E(y*|y) = E( E(y*|θ, y) | y ) = E(θ|y) = 0.446

    var(y*|y) = E( var(y*|θ, y) | y ) + var( E(y*|θ, y) | y )
              = E( θ(1 − θ) | y ) + var(θ|y)

Prior Elicitation

A Bayesian analysis can be subjective in that two different people
may observe the same data y and yet arrive at different conclusions
about θ when they have different prior opinions on θ.

– This is the main criticism from frequentists.

How should one specify a prior (to counter this subjectivity)?

Objective and informative: e.g., historical data, data from pilot
experiments. "Today's posterior is tomorrow's prior."

Noninformative: priors meant to express ignorance about the
unknown parameters.

Conjugate: posterior and prior belong to the same distribution family.
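The predictive mean above can be checked numerically in R (not from the slides), since P(y* = 1 | y) is just the posterior mean of θ:

# Predictive probability of a female birth under the Beta(438, 544) posterior
a <- 438; b <- 544
integrate(function(th) th * dbeta(th, a, b), 0, 1)$value   # about 0.446
a / (a + b)                                                # same value, in closed form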

Noninformative Prior

Meant to express ignorance about the unknown parameter or to have
minimal impact on the posterior distribution of θ.

Also referred to as a vague prior or flat prior.

Example 1: θ = true probability of success for a new surgical
procedure, 0 ≤ θ ≤ 1. A noninformative prior is π(θ) = Unif(0, 1).

Example 2: y1, ..., yn ∼ N(yi|θ, σ²), σ is known, θ ∈ R. A
noninformative prior is π(θ) = 1, −∞ ≤ θ ≤ ∞.

This is an improper prior: ∫_{−∞}^{∞} π(θ) dθ = ∞.

An improper prior may or may not lead to a proper posterior.

The posterior of θ in Example 2 is p(θ|y) = N(ȳ, σ²/n), which is
proper and is equivalent to the likelihood.

Jeffreys Prior

Another noninformative prior is the Jeffreys prior, given in the
univariate case by

    p(θ) = [I(θ)]^{1/2},

where I(θ) is the expected Fisher information in the model, namely

    I(θ) = −E_{x|θ}[ ∂²/∂θ² log f(x|θ) ].

The Jeffreys prior is improper for many models. It may be proper,
however, for certain models.

Unlike the uniform, the Jeffreys prior is invariant to 1-1
transformations.
Conjugate Priors

Defined as one that leads to a posterior distribution belonging to the
same distributional family as the prior:

– normal prior is conjugate for normal likelihood
– beta prior is conjugate for binomial likelihood

Conjugate priors are computationally convenient, but are often not
possible in complex settings.

– In high-dimensional parameter spaces, priors that are conditionally
conjugate are often available (and helpful).

We may guess the conjugate prior by looking at the likelihood as a
function of θ.

Another Example of Conjugate Prior

Suppose that X is distributed as Poisson(θ), so that

    f(x|θ) = e^{−θ} θ^x / x!,   x ∈ {0, 1, 2, ...}, θ > 0.

A reasonably flexible prior for θ is the Gamma(α, β) distribution,

    p(θ) = θ^{α − 1} e^{−θ/β} / (Γ(α) β^α),   θ > 0, α > 0, β > 0.

The posterior is then

    p(θ|x) ∝ f(x|θ) p(θ) ∝ θ^{x + α − 1} e^{−θ(1 + 1/β)}.

There is one and only one density proportional to this last function:
the Gamma(x + α, (1 + 1/β)^{−1}) density. The Gamma family is the
conjugate family for the Poisson likelihood.
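A minimal R sketch of this Poisson-Gamma update (not from the slides); the data value and hyperparameters below are illustrative, and β is treated as a scale parameter, matching the density above:

# Poisson-Gamma conjugate update
xobs <- 7                 # observed Poisson count (illustrative)
a0 <- 2; b0 <- 1          # Gamma(a0, b0) prior, b0 = scale (illustrative)
post_shape <- xobs + a0
post_scale <- 1 / (1 + 1 / b0)          # (1 + 1/b0)^(-1)
curve(dgamma(x, shape = post_shape, scale = post_scale),
      from = 0, to = 20, xlab = expression(theta), ylab = "posterior density")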

Common Conjugate Families

Likelihood                       Conjugate Prior
Binomial(N, θ)                   θ ∼ Beta(α, β)
Poisson(θ)                       θ ∼ Gamma(α0, β0)
N(θ, σ²), σ² is known            θ ∼ N(µ, τ²)
N(θ, σ²), θ is known             τ² = 1/σ² ∼ Gamma(α0, β0)
Exp(λ)                           λ ∼ Gamma(α0, β0)
MVN(θ, Σ), Σ is known            θ ∼ MVN(µ, V)
MVN(θ, Σ), θ is known            Σ ∼ Inv-Wishart(ν, V)
