
Statistical Inference III

(Introduction to Bayesian Inference)

Mohammad Samsul Alam


Assistant Professor of Applied Statistics
Institute of Statistical Research and Training (ISRT)
University of Dhaka

https://www.isrt.ac.bd/people/msalam



Introduction I

The three main paradigms of statistical inference are Frequentist, Fisherian and Bayesian.

The frequentist approach is based on the idea of repeated sampling (the sampling distribution), whereas the Fisherian approach is based on the likelihood function and the Bayesian approach is based on the Bayes theorem.

However, in the frequentist and Fisherian approaches the parameter of interest θ is assumed to be a fixed but unknown quantity, whereas in the Bayesian approach θ is treated as a random variable.

In Bayesian inference, the assumption that θ is random means that θ is a realization of a random variable which takes values in Θ, the parameter space, according to some probability mechanism.



Introduction II

In the Bayesian approach, any inferential problem is dealt with using the Bayes theorem, which relies on the concept of conditional probability.

Treating θ as a random variable, Bayesian inference tries to find a probability distribution for θ by exploiting the observed data.

This probability distribution is then used to draw conclusions about θ, in the form of estimation or hypothesis testing.



Conditional Probability I

If we know that one event has occurred, does that affect the probability that another event has occurred? To answer this, we need to look at conditional probability.

Suppose we are told that the event A has occurred. Everything outside of A is no longer possible, so we only have to consider outcomes inside the event A.

The given condition implies that the universe U reduces to A; that is, under the given condition, U = A.

Therefore, the only part of the event B that is now relevant is the part that is also in A, that is, B ∩ A.

Given that the event A has occurred, the total probability in the reduced universe must equal 1.



Conditional Probability II

The probability of B given A is the unconditional probability of that part of B that is also in A, multiplied by the scale factor 1/P(A).

This gives the conditional probability of event B given event A:

P(B|A) = P(A ∩ B) / P(A).

Thus the conditional probability P(B|A) is proportional to the joint probability P(A ∩ B), but has been rescaled so that the probability of the reduced universe equals 1.
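A quick numerical check of this definition, given here as a sketch in Python with events chosen purely for illustration (a fair die, A = "even outcome", B = "outcome greater than 3"):

from fractions import Fraction

U = {1, 2, 3, 4, 5, 6}        # universe of equally likely outcomes
A = {2, 4, 6}                 # conditioning event: even outcome
B = {4, 5, 6}                 # event of interest: outcome greater than 3

def P(E):
    # probability of an event under equally likely outcomes
    return Fraction(len(E), len(U))

print(P(A & B) / P(A))        # P(B|A) = (2/6) / (3/6) = 2/3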



Conditional Probability III

[Figure: Venn diagram of the reduced universe A, with the region A ∩ B shaded.]



Bayes Theorem I

Let B1, B2, ..., Bm be events that partition the sample space Ω (i.e. Ω = B1 ∪ B2 ∪ ... ∪ Bm and Bi ∩ Bj = ∅ when i ≠ j), and let A be an event on the space Ω for which P(A) > 0.

Moreover, the event A can be written as A = (B1 ∩ A) ∪ (B2 ∩ A) ∪ ... ∪ (Bm ∩ A).

In this setting, suppose the conditional probabilities P(A|B1), P(A|B2), ..., P(A|Bm) and the marginal probabilities P(Bi), for all i, are known to us.



Bayes Theorem II

Then, from the definition of conditional probability, we can write

P(Bi|A) = P(Bi ∩ A) / P(A) = P(A|Bi) P(Bi) / P(A).

From the definition of the event A we can write

P(A) = P(∪i (Bi ∩ A)) = Σ_{i=1}^{m} P(Bi ∩ A) = Σ_{i=1}^{m} P(A|Bi) P(Bi).



Bayes Theorem III

[Figure: the sample space Ω partitioned into the events B1, B2, ..., Bm.]

Replacing this quantity, we have

P(Bi|A) = P(A|Bi) P(Bi) / Σ_{j=1}^{m} P(A|Bj) P(Bj).    (1)

The result in equation (1) is known as Bayes theorem.
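As a small numerical sketch of equation (1) in Python (the events and all probabilities below are made up for illustration): three machines B1, B2, B3 partition production, and A is the event that a randomly chosen item is defective.

prior = {"B1": 0.5, "B2": 0.3, "B3": 0.2}     # P(Bi): share of production from each machine
lik = {"B1": 0.01, "B2": 0.02, "B3": 0.05}    # P(A|Bi): defect rate of each machine

p_a = sum(lik[b] * prior[b] for b in prior)   # P(A) by the law of total probability
posterior = {b: lik[b] * prior[b] / p_a for b in prior}   # P(Bi|A) from equation (1)
print(p_a, posterior)                         # e.g. P(B3|A) is about 0.476 even though P(B3) = 0.2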



Link Between Bayes Theorem and Bayesian Inference I

Suppose that, for our inferential problem, there is an observable quantity y and an unobservable quantity θ, where the probability model for y depends on θ.

Moreover, assume that the unobserved quantity θ can take values in Θ.

Further, we assume that the observed quantity is a realization of the variable of interest Y, whose probability model is defined as f(y|θ).

In addition, we have a belief regarding θ, which we express as a probability model P(θ). Note that this is our belief regarding θ prior to having any knowledge of the observable quantity Y.



Link Between Bayes Theorem and Bayesian Inference II
Now, after observing the quantity Y , we can update our prior
belief P (θ) regarding θ using the Bayes theorem stated in
equation (1) as

P(θ|y) = f(y|θ) P(θ) / ∫_Θ f(y|θ) P(θ) dθ,    (2)

where P(θ|y) is called our belief about θ after observing y.


Formally, in Bayesian inference, the quantities used in equation (2) have the following names:

f(y|θ) is the likelihood function,
P(θ) is the prior distribution,
P(θ|y) is the posterior distribution.

In short, Bayesian inference updates our prior belief through the data that we observe.
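A minimal numerical sketch of equation (2) in Python, for the case where θ can take only a few values so that the integral in the denominator becomes a sum (the candidate values, the prior weights and the Bernoulli data below are all made up for illustration):

candidates = [0.2, 0.5, 0.8]     # possible values of theta
prior = [1/3, 1/3, 1/3]          # prior belief P(theta)
y = [1, 1, 0, 1]                 # observed data, assumed Bernoulli(theta)

def f(data, theta):
    # sampling model f(y|theta): independent Bernoulli trials
    p = 1.0
    for yi in data:
        p *= theta**yi * (1 - theta)**(1 - yi)
    return p

numer = [f(y, t) * p for t, p in zip(candidates, prior)]
denom = sum(numer)                       # plays the role of the integral in (2)
posterior = [v / denom for v in numer]   # P(theta|y)
print(dict(zip(candidates, posterior)))  # most of the posterior mass moves to theta = 0.8 and 0.5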
Link Between Bayes Theorem and Bayesian Inference III

Finally, in Bayesian inference, any (probabilistic) conclusion regarding the parameter θ is based on the posterior distribution P(θ|y).

Therefore, Bayesian inference can be summarized as follows:

Assume a model f(y|θ) for the observable phenomenon (data) Y.
Specify the prior belief P(θ) regarding θ before observing Y.
Update the prior belief to the posterior distribution P(θ|y) using the Bayes theorem.



Conditional Independence I

Suppose Y1, ..., Yn are random variables and that θ is a parameter describing the conditions under which the random variables are generated. The variables Y1, ..., Yn are conditionally independent given θ if, for every collection of n sets {A1, ..., An},

P(Y1 ∈ A1, ..., Yn ∈ An | θ) = P(Y1 ∈ A1 | θ) × ... × P(Yn ∈ An | θ).

Conditional independence ensures that

P(Yi ∈ Ai | θ, Yj ∈ Aj) = P(Yi ∈ Ai | θ),

that is, Yj gives no additional information about Yi beyond that contained in knowing θ.



Conditional Independence II

In general, under conditional independence the joint density is given by

P(y1, ..., yn | θ) = P_{Y1}(y1|θ) × ... × P_{Yn}(yn|θ) = Π_{i=1}^{n} P_{Yi}(yi|θ).

However, if Y1, ..., Yn are generated in similar ways from a common process, the marginal densities are all equal to some common density, giving

P(y1, ..., yn | θ) = Π_{i=1}^{n} P(yi|θ).



Exchangeability I

Exchangeable
Let P(y1, y2, ..., yn) be the joint density of Y1, Y2, ..., Yn. If P(y1, y2, ..., yn) = P(yπ1, yπ2, ..., yπn) for all permutations π of {1, 2, ..., n}, then Y1, Y2, ..., Yn are exchangeable.

If θ ∼ P(θ) and Y1, Y2, ..., Yn are conditionally i.i.d. given θ, then marginally (unconditionally on θ), Y1, Y2, ..., Yn are exchangeable.



Exchangeability II

Suppose Y1, Y2, ..., Yn are conditionally i.i.d. given some unknown parameter θ. Then for any permutation π of {1, 2, ..., n} and any set of values (y1, y2, ..., yn) ∈ Y^n,

P(y1, y2, ..., yn) = ∫ P(y1, y2, ..., yn | θ) P(θ) dθ
                   = ∫ { Π_{i=1}^{n} P(yi|θ) } P(θ) dθ
                   = ∫ { Π_{i=1}^{n} P(yπi|θ) } P(θ) dθ
                   = P(yπ1, yπ2, ..., yπn).
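The following Python sketch checks this numerically for one particular model (a Beta(2, 3) prior with conditionally i.i.d. Bernoulli(θ) observations, both chosen only for illustration): the marginal joint probability is the same for every reordering of the data.

from math import lgamma, exp
from itertools import permutations

def log_beta(a, b):
    # log B(a, b) = log Gamma(a) + log Gamma(b) - log Gamma(a + b)
    return lgamma(a) + lgamma(b) - lgamma(a + b)

def marginal(y, a=2.0, b=3.0):
    # integral of prod_i theta^yi (1 - theta)^(1 - yi) times the Beta(a, b) density
    # = B(a + sum(y), b + n - sum(y)) / B(a, b), which depends on y only through sum(y)
    n, s = len(y), sum(y)
    return exp(log_beta(a + s, b + n - s) - log_beta(a, b))

y = (1, 0, 0, 1, 1)
print({marginal(p) for p in permutations(y)})   # a single value: the ordering does not matter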



Exchangeability III
de Finetti’s Theorem
Let Yi ∈ Y for all i ∈ {1, 2, ...}. Suppose that, for any n, our belief model for Y1, Y2, ..., Yn is exchangeable:

P(y1, y2, ..., yn) = P(yπ1, yπ2, ..., yπn)

for all permutations π of {1, 2, ..., n}. Then our model can be written as

P(y1, y2, ..., yn) = ∫ { Π_{i=1}^{n} P(yi|θ) } P(θ) dθ

for some parameter θ, some prior distribution on θ, and some sampling model P(y|θ). The prior and the sampling model depend on the form of the belief model P(y1, y2, ..., yn).



Bayesian Inference I

Let the model for the data be fY(y; θ) and let our prior belief regarding the parameter θ be P(θ).

Further assume that Y1, Y2, ..., Yn is a random sample whose elements are conditionally independent given θ. Then the model for the data, the likelihood function, can be written as

L(θ|y) = Π_{i=1}^{n} P(yi|θ),

where yi is the realized value of Yi.
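As a small sketch of evaluating this product in Python (a Poisson(θ) data model and made-up counts are assumed purely for illustration):

from math import exp, factorial

def poisson_pmf(y, theta):
    # P(y|theta) for the Poisson model
    return exp(-theta) * theta**y / factorial(y)

def likelihood(theta, data):
    # L(theta|y) = product over i of P(yi|theta), by conditional independence
    L = 1.0
    for yi in data:
        L *= poisson_pmf(yi, theta)
    return L

data = [3, 1, 4, 2, 2]
for theta in (1.0, 2.0, 3.0):
    print(theta, likelihood(theta, data))   # largest near theta = mean(data) = 2.4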



Bayesian Inference II
The posterior distribution of θ can then be computed using the
Bayes theorem as,

P(θ|y) = L(θ|y) P(θ) / P(y),    (3)

where P(y) is the marginal probability of observing the data y.

Note that, for an observed sample y = {y1, y2, ..., yn}, the quantity P(y) in the denominator of equation (3) is a constant. Therefore, we can write

P(θ|y) ∝ L(θ|y) P(θ)    (4)

Posterior ∝ Likelihood × Prior    (5)
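A sketch in Python of turning the kernel in (4) into the posterior in (3) by rescaling on a grid (a Normal(θ, 1) data model with a Normal(5, 2²) prior and made-up data are assumed only for illustration):

import numpy as np

y = np.array([4.8, 5.6, 5.1, 4.3, 5.9])                # observed data
theta = np.linspace(0.0, 10.0, 1001)                   # grid over the parameter space
prior = np.exp(-0.5 * (theta - 5.0)**2 / 4.0)          # Normal(5, 2^2) prior, up to a constant
loglik = -0.5 * ((y[None, :] - theta[:, None])**2).sum(axis=1)   # Normal(theta, 1) log-likelihood, up to a constant
kernel = np.exp(loglik - loglik.max()) * prior         # likelihood x prior: the posterior kernel
dtheta = theta[1] - theta[0]
posterior = kernel / (kernel.sum() * dtheta)           # rescaled so it integrates to 1
print(theta[np.argmax(posterior)])                     # posterior mode, close to the sample mean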



Bayesian Inference III
As a result, we can say that the product of the likelihood and the prior is sufficient to capture the kernel of the posterior distribution.

Therefore, in Bayesian inference, we need to assume a data model and specify a prior distribution in order to draw inference.

The prior can be either noninformative or informative.

A prior that assigns equal probability to each element of the parameter space is called a noninformative prior. With such a prior we give equal preference to every value in the parameter space, because we have no information that would favour some values over others.



Bayesian Inference IV
On the other hand, with an informative prior we have specific information regarding θ before the data collection, and therefore assign different probabilities to different values in the parameter space.

There is another class of priors called conjugate priors. Conjugate priors are defined with respect to a data model (likelihood): for a given data model, the prior for which the posterior distribution has the same form as the prior is called the conjugate prior for that data model.
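For instance (a standard textbook case, sketched here in Python with made-up numbers): for a Binomial data model with k successes in n trials, a Beta(a, b) prior is conjugate, and the posterior is again a Beta distribution.

a, b = 2.0, 2.0                        # Beta(a, b) prior for theta
n, k = 20, 13                          # data: k successes in n Bernoulli trials
a_post, b_post = a + k, b + n - k      # conjugacy: posterior is Beta(a + k, b + n - k)
print("posterior: Beta(%g, %g)" % (a_post, b_post))
print("posterior mean:", a_post / (a_post + b_post))   # (2 + 13) / (4 + 20) = 0.625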
Note that, although we cannot assume an improper distribution (a distribution whose total probability does not sum or integrate to 1) as the data model, for the prior both proper and improper distributions can be assumed.



Bayesian Inference V

Whatever the prior distribution is (proper or improper), the posterior distribution used for inference must be a proper distribution; when an improper prior is used, this has to be checked, since the posterior is proper only if the product of the likelihood and the prior integrates to a finite value.

There is another kind of prior known as the Jeffreys prior. This kind of prior is used so that the prior remains noninformative when a reparameterization is made.
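As a standard worked example (not derived in these slides): for the Bernoulli model, the Fisher information is I(θ) = E[(∂ log f(Y|θ)/∂θ)²] = 1/{θ(1 − θ)}, so the Jeffreys prior is P(θ) ∝ √I(θ) = θ^(−1/2) (1 − θ)^(−1/2), which is the Beta(1/2, 1/2) distribution; because it is defined through √I(θ), it transforms consistently under any reparameterization of θ.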



Summary of the Posterior Distribution I

The posterior probability distribution contains all the current information about the parameter θ.

One way to summarize the posterior distribution is a graphical presentation: a plot of the entire posterior density.

For many practical purposes, however, numerical summaries are useful. One such summary of the posterior distribution is the posterior mean, which is defined as

E(θ|y) = ∫_Θ θ P(θ|y) dθ.



Summary of the Posterior Distribution II
Another useful summary is the posterior variance, which is defined as

V(θ|y) = E[{θ − E(θ|y)}² | y]
       = ∫_Θ {θ − E(θ|y)}² P(θ|y) dθ
       = ∫_Θ θ² P(θ|y) dθ − 2 E(θ|y) ∫_Θ θ P(θ|y) dθ + {E(θ|y)}² ∫_Θ P(θ|y) dθ
       = E(θ²|y) − 2{E(θ|y)}² + {E(θ|y)}²
       = E(θ²|y) − {E(θ|y)}².

Note that, if the posterior distribution is discrete, the ∫ sign is replaced by Σ.
Summary of the Posterior Distribution III

Like the posterior mean and variance, posterior quantiles are also useful in summarizing the posterior distribution.
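A sketch in Python of computing these summaries numerically on a grid (the Beta(15, 9) posterior from the conjugate-prior sketch above is reused purely for illustration):

import numpy as np

theta = np.linspace(0.0, 1.0, 10001)
kernel = theta**14 * (1 - theta)**8        # Beta(15, 9) kernel: theta^(a-1) (1-theta)^(b-1)
d = theta[1] - theta[0]
post = kernel / (kernel.sum() * d)         # normalized posterior density on the grid

post_mean = np.sum(theta * post) * d                     # E(theta|y), about 15/24 = 0.625
post_var = np.sum(theta**2 * post) * d - post_mean**2    # E(theta^2|y) - {E(theta|y)}^2
cdf = np.cumsum(post) * d                                # posterior cdf on the grid
q_lo, q_med, q_hi = np.interp([0.025, 0.5, 0.975], cdf, theta)   # posterior quantiles
print(post_mean, post_var, (q_lo, q_med, q_hi))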

