
Bayesian Statistics

Introduction

Shaobo Jin
Department of Mathematics



Introduction / Frequentist Paradigm

Parametric Statistical Model

Suppose that the vector of observations x = (x_1, ..., x_n) is generated
from a probability distribution with density f(x | θ), where θ is the
vector of parameters.

For example, if we further assume the observations are iid, then

    f(x | θ) = ∏_{i=1}^n f(x_i | θ).

A parametric statistical model consists of the observation x of a
random variable X, distributed according to the density f(x | θ), where
the parameter θ belongs to a parameter space Θ of finite dimension.



Introduction / Frequentist Paradigm

Likelihood Function

Definition
For an observation x of a random variable X with density f(x | θ), the
likelihood function L(· | x) : Θ → [0, ∞) is defined by L(θ | x) = f(x | θ).

Example
If X = (X_1, ..., X_n)^T is a sample of independent random variables, then

    L(θ | x) = ∏_{i=1}^n f_i(x_i | θ),

as a function of θ conditional on x.



Introduction / Frequentist Paradigm

Likelihood Function: Example

1. If X_1, ..., X_n is a sample of iid random variables according to
   N(θ, σ²), then

       L(θ | x) = ∏_{i=1}^n (2πσ²)^{-1/2} exp{ -(x_i - θ)² / (2σ²) }.

2. If X_1, ..., X_n is a sample of iid random variables according to
   Binomial(k, θ), then

       L(θ | x) = ∏_{i=1}^n (k choose x_i) θ^{x_i} (1 - θ)^{k - x_i}.
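
In code, such likelihoods are almost always evaluated on the log scale to
avoid underflow. A minimal Python sketch for the normal case (the function
name and test values are ours, for illustration only):

    import numpy as np

    def normal_loglik(theta, x, sigma2):
        # Sum of log f(x_i | theta) for iid N(theta, sigma2) observations.
        x = np.asarray(x, dtype=float)
        return np.sum(-0.5 * np.log(2 * np.pi * sigma2)
                      - (x - theta) ** 2 / (2 * sigma2))

    print(normal_loglik(0.0, [0.5, -0.3, 1.2], sigma2=1.0))  # ≈ -3.6468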



Introduction / Frequentist Paradigm

Likelihood Function: Another Example

Consider the case where:

- For i ≠ j, (X_{i1}, ..., X_{ip}) and (X_{j1}, ..., X_{jp}) are
  independent and identically distributed.
- For each i, X_{i1}, ..., X_{ip} are not necessarily independent.

Then the likelihood is

    L(θ | x) = ∏_{i=1}^n f(x_{i1}, ..., x_{ip} | θ),

where f(x_{i1}, ..., x_{ip} | θ) is the joint density of (X_{i1}, ..., X_{ip}).



Introduction / Frequentist Paradigm

Inference Principle

In the frequentist context:

1. Likelihood principle: the information brought by observation x is
   entirely contained in the likelihood function L(θ | x).

2. Sufficiency principle: two observations x and y factorizing through
   the same value of a sufficient statistic T, so that T(x) = T(y), must
   lead to the same inference on θ.
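
For instance (a standard illustration, not on the slide): for iid
Bernoulli(θ) data, T(x) = Σ x_i is sufficient, so two binary sequences of
length n with the same number of ones must yield the same inference about
θ, regardless of how the zeros and ones are ordered.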



Introduction / Bayesian Paradigm

Bayes Formula

If A and E are two events, then

    P(A | E) = P(E | A) P(A) / P(E)
             = P(E | A) P(A) / [ P(E | A) P(A) + P(E | A^c) P(A^c) ].

If X and Y are two random variables, then

    f(y | x) = f(x | y) f(y) / f(x)
             = f(x | y) f(y) / ∫ f(x | y) f(y) dy.
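
A quick numerical illustration (made-up numbers, ours): if P(A) = 0.01,
P(E | A) = 0.95, and P(E | A^c) = 0.05, then

    P(A | E) = 0.95 · 0.01 / (0.95 · 0.01 + 0.05 · 0.99) ≈ 0.16,

so an event E that is 19 times more likely under A than under A^c still
leaves A fairly improbable when its prior probability is low.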



Introduction / Bayesian Paradigm

Prior and Posterior

A Bayes model consists of a distribution π(θ) on the parameters, and a
conditional probability distribution f(x | θ) on the observations.

- The distribution π(θ) is called the prior distribution.
- The unknown parameter θ is a random parameter.

By Bayes formula,

    π(θ | x) = f(x | θ) π(θ) / m(x) = f(x | θ) π(θ) / ∫ f(x | θ) π(θ) dθ,

where the conditional distribution π(θ | x) is the posterior distribution
and m(x) is the marginal distribution of x.



Introduction / Bayesian Paradigm

Update Our Knowledge of θ


The prior often summarizes the prior information about θ.

    From similar experiences, the average number of accidents at a
    crossing is 1 per 30 days. We assume

        π(θ) = 30 exp(-30θ)  [day⁻¹].

Our experiment resulted in an observation x.

    Three accidents have been recorded after monitoring the
    roundabout for one year. The likelihood is

        f(X = 3 | θ) = ((365θ)³ / 3!) exp(-365θ).

We use the information in x to update our knowledge of θ. By Bayes'
formula,

    π(θ | x) = f(X = 3 | θ) π(θ) / m(x).
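
Carrying the update through (our completion; the slide stops at the
formula):

    π(θ | x) ∝ θ³ exp(-365θ) · exp(-30θ) = θ³ exp(-395θ),

which is the kernel of a Gamma density, so θ | x ∼ Gamma(4, 395)
(shape 4, rate 395 day⁻¹), with posterior mean 4/395 ≈ 0.010 accidents
per day.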
Introduction / Bayesian Paradigm

Distributions

In a Bayesian model, we will have many distributions:

- prior distribution: π(θ).
- conditional distribution of X | θ (the likelihood): f(x | θ).
- joint distribution of (θ, X): f(x, θ) = f(x | θ) π(θ).
- posterior distribution: π(θ | x).
- marginal distribution of X: m(x) = ∫ f(x | θ) π(θ) dθ.

Most of the time we use π(·) and m(·) as generic symbols, but in several
cases they are tied to specific functions.
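
To see all five in one place (a small worked case, ours): if
X | θ ∼ Bernoulli(θ) with prior θ ∼ Uniform(0, 1), then the joint is
f(x, θ) = θ^x (1 - θ)^{1-x} for x ∈ {0, 1}, the marginal is

    m(x) = ∫₀¹ θ^x (1 - θ)^{1-x} dθ = 1/2   for x ∈ {0, 1},

and the posterior π(θ | x) = f(x, θ)/m(x) is Beta(2, 1) if x = 1 and
Beta(1, 2) if x = 0.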



Introduction / Bayesian Paradigm

Use Bayes Formula To Obtain Posterior

Example
Find the posterior distribution.

1. Suppose that we have an iid sample X_i | θ ∼ Bernoulli(θ),
   i = 1, ..., n. The prior is θ ∼ Beta(a_0, b_0).

2. Suppose that we have an iid sample X_i | µ ∼ N(µ, σ²), i = 1, ..., n,
   where σ² is known. The prior is µ ∼ N(µ_0, σ_0²).

3. Suppose that we have an iid sample X_i | µ, σ² ∼ N(µ, σ²),
   i = 1, ..., n. The priors are µ | σ² ∼ N(µ_0, σ²/λ_0) and
   σ² ∼ InvGamma(a_0, b_0), where

       π(σ²) = (b_0^{a_0} / Γ(a_0)) (σ²)^{-(a_0+1)} exp(-b_0 / σ²).
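
For reference, the standard conjugate answers (well-known results, quoted
here rather than derived on the slide; x̄ denotes the sample mean):

1. θ | x ∼ Beta(a_0 + Σ x_i, b_0 + n - Σ x_i).

2. µ | x ∼ N(µ_n, σ_n²), where 1/σ_n² = 1/σ_0² + n/σ² and
   µ_n = σ_n² (µ_0/σ_0² + Σ x_i / σ²).

3. µ | σ², x ∼ N((λ_0 µ_0 + n x̄)/(λ_0 + n), σ²/(λ_0 + n)) and
   σ² | x ∼ InvGamma(a_0 + n/2,
                     b_0 + Σ (x_i - x̄)²/2 + λ_0 n (x̄ - µ_0)² / (2(λ_0 + n))).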



Introduction / Bayesian Paradigm

Bayesian Inference Principle

Bayesian Inference Principle
Information on the underlying parameter θ is entirely contained in the
posterior distribution π(θ | x). That is, all statistical inference is
based on the posterior distribution π(θ | x).

Some examples are:

1. posterior mean: E[θ | x].

2. posterior mode (MAP): the θ that maximizes π(θ | x).

3. predictive distribution of a new observation y:

       f(y | x) = ∫ f(y | x, θ) π(θ | x) dθ.
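
To make these concrete, a minimal Python sketch for the Beta-Bernoulli
model of the previous slide (the function name and default priors are
ours; everything is available in closed form here):

    import numpy as np

    def posterior_summaries(x, a0=1.0, b0=1.0):
        # Beta(a0, b0) prior, iid Bernoulli data: posterior is Beta(a, b).
        x = np.asarray(x)
        a = a0 + x.sum()
        b = b0 + len(x) - x.sum()
        post_mean = a / (a + b)                                       # E[theta | x]
        post_mode = (a - 1) / (a + b - 2) if min(a, b) > 1 else None  # MAP
        pred_next = a / (a + b)   # predictive P(X_new = 1 | x)
        return post_mean, post_mode, pred_next

    print(posterior_summaries([1, 0, 1, 1, 0], a0=2.0, b0=2.0))
    # (0.5555..., 0.5714..., 0.5555...)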



Introduction / Multivariate Normal Distribution

From Univariate to Multivariate Normal

Let Z ∼ N(0, 1). Then X = σZ + µ ∼ N(µ, σ²), where E[X] = µ and
Var(X) = σ².

Let Z = (Z_1, Z_2, ..., Z_p)^T be a random vector, where each
Z_j ∼ N(0, 1) and Z_j is independent of Z_k for any j ≠ k. Then

    X = Σ^{1/2} Z + µ ∈ ℝ^p

follows a p-dimensional multivariate normal distribution, denoted by
X ∼ N_p(µ, Σ), where E[X] = µ and Var(X) = Σ.
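
This construction is how multivariate normal draws are generated in
practice. A small Python sketch (ours), using the Cholesky factor L with
L Lᵀ = Σ in place of the symmetric square root Σ^{1/2} (either choice
works, since Var(LZ) = L Lᵀ = Σ):

    import numpy as np

    rng = np.random.default_rng(0)
    mu = np.array([1.0, -2.0])
    Sigma = np.array([[2.0, 0.8],
                      [0.8, 1.0]])

    L = np.linalg.cholesky(Sigma)         # L @ L.T == Sigma
    Z = rng.standard_normal((2, 10_000))  # iid N(0, 1) entries
    X = (L @ Z).T + mu                    # rows are draws from N_2(mu, Sigma)

    print(X.mean(axis=0))  # close to mu
    print(np.cov(X.T))     # close to Sigma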



Introduction / Multivariate Normal Distribution

From Univariate to Multivariate Normal: Density

The density function of the random variable X ∼ N(µ, σ²) with σ > 0 can
be expressed as

    (2πσ²)^{-1/2} exp{ -(x - µ)² / (2σ²) }
        = (2πσ²)^{-1/2} exp{ -(1/2) (x - µ) (σ²)⁻¹ (x - µ) }.

A p-dimensional random variable X ∼ N_p(µ, Σ) with Σ > 0 has the density

    f(x) = (2π)^{-p/2} det(Σ)^{-1/2} exp{ -(1/2) (x - µ)^T Σ⁻¹ (x - µ) }.
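
Transcribing the p-dimensional density into code (a sketch, ours; on the
log scale, and solving a linear system rather than forming Σ⁻¹ explicitly):

    import numpy as np

    def mvn_logpdf(x, mu, Sigma):
        # log f(x) = -(p/2) log(2*pi) - (1/2) log det(Sigma)
        #            - (1/2) (x - mu)^T Sigma^{-1} (x - mu)
        p = len(mu)
        diff = x - mu
        _, logdet = np.linalg.slogdet(Sigma)        # Sigma > 0 assumed
        quad = diff @ np.linalg.solve(Sigma, diff)
        return -0.5 * (p * np.log(2 * np.pi) + logdet + quad)

    print(mvn_logpdf(np.zeros(2), np.zeros(2), np.eye(2)))  # -log(2*pi) ≈ -1.8379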



Introduction / Multivariate Normal Distribution

Some Useful Properties

1. A linear combination of normals remains normal: suppose that
   X ∼ N_p(µ, Σ). Then AX + d ∼ N_q(Aµ + d, A Σ A^T) for every q × p
   constant matrix A and every q × 1 constant vector d.

2. Marginal normality plus independence implies joint normality: if X_1
   and X_2 are independent and distributed N_p(µ_1, Σ_11) and
   N_q(µ_2, Σ_22), respectively, then

       (X_1, X_2)^T ∼ N_{p+q}( (µ_1, µ_2)^T, [[Σ_11, 0], [0, Σ_22]] ).

3. Conditional distribution: let

       (X_1, X_2)^T ∼ N_{p+q}( (µ_1, µ_2)^T, [[Σ_11, Σ_12], [Σ_21, Σ_22]] ).

   Then the conditional distribution of X_1 given X_2 = x_2 is

       X_1 | X_2 = x_2 ∼ N( µ_1 + Σ_12 Σ_22⁻¹ (x_2 - µ_2), Σ_11 - Σ_12 Σ_22⁻¹ Σ_21 ).
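
Property 3 in code, as a minimal sketch (ours), conditioning the first
block on the second:

    import numpy as np

    def condition_gaussian(mu1, mu2, S11, S12, S22, x2):
        # X1 | X2 = x2 ~ N(mu1 + S12 S22^{-1} (x2 - mu2),
        #                  S11 - S12 S22^{-1} S21), with S21 = S12^T.
        K = S12 @ np.linalg.inv(S22)
        return mu1 + K @ (x2 - mu2), S11 - K @ S12.T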



Introduction / Multivariate Normal Distribution

Multivariate Normal In Bayesian Statistics

Example
Suppose that X | θ ∼ N_p(Cθ, Σ), where the p × q matrix C and Σ > 0 are
known. The prior is θ ∼ N_q(µ_0, Λ_0⁻¹). Find the posterior of θ.

We can in fact use the property of the conditional distribution of a
multivariate normal distribution to simplify the steps.

Result
If we know X_1 | X_2 ∼ N_p(C X_2, Σ) and X_2 ∼ N_q(m, Ω), then

    (X_1, X_2)^T ∼ N_{p+q}( (Cm, m)^T, [[Σ + C Ω C^T, C Ω], [Ω C^T, Ω]] ).
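
Applying the conditional-distribution property from the useful-properties
slide to this joint, with m = µ_0 and Ω = Λ_0⁻¹ (our completion; the slide
leaves the final step to the reader), gives

    θ | X = x ∼ N_q( µ_0 + Λ_0⁻¹ C^T (Σ + C Λ_0⁻¹ C^T)⁻¹ (x - C µ_0),
                     Λ_0⁻¹ - Λ_0⁻¹ C^T (Σ + C Λ_0⁻¹ C^T)⁻¹ C Λ_0⁻¹ ),

which is equivalent to the precision form θ | X = x ∼ N_q(µ_n, Λ_n⁻¹) with
Λ_n = Λ_0 + C^T Σ⁻¹ C and µ_n = Λ_n⁻¹ (Λ_0 µ_0 + C^T Σ⁻¹ x).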

