
Lecture Notes 5

1 Statistical Models
A statistical model $\mathcal{P}$ is a collection of probability distributions (or a collection of densities). An example of a nonparametric model is
$$\mathcal{P} = \left\{ p : \int (p''(x))^2 \, dx < \infty \right\}.$$
A parametric model has the form
$$\mathcal{P} = \left\{ p(x; \theta) : \theta \in \Theta \right\}$$
where $\Theta \subset \mathbb{R}^d$. An example is the set of Normal densities $\left\{ p(x; \theta) = (2\pi)^{-1/2} e^{-(x-\theta)^2/2} \right\}$.
For now, we focus on parametric models. Later we consider nonparametric models.
2 Statistics
Let $X_1, \ldots, X_n \sim p(x; \theta)$. Let $X^n \equiv (X_1, \ldots, X_n)$. Any function $T = T(X_1, \ldots, X_n)$ is itself a random variable, which we will call a statistic.
Some examples are:
1. order statistics: $X_{(1)} \le X_{(2)} \le \cdots \le X_{(n)}$,
2. sample mean: $\bar{X} = \frac{1}{n} \sum_i X_i$,
3. sample variance: $S^2 = \frac{1}{n-1} \sum_i (X_i - \bar{X})^2$,
4. sample median: the middle value of the order statistics,
5. sample minimum: $X_{(1)}$,
6. sample maximum: $X_{(n)}$.
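To make these concrete, here is a short Python sketch (assuming numpy; the simulated sample is arbitrary) that computes each statistic above:

```python
import numpy as np

rng = np.random.default_rng(0)
x = rng.normal(loc=0.0, scale=1.0, size=20)   # arbitrary simulated sample

order_stats = np.sort(x)                 # X_(1) <= X_(2) <= ... <= X_(n)
sample_mean = x.mean()                   # (1/n) sum_i X_i
sample_var = x.var(ddof=1)               # (1/(n-1)) sum_i (X_i - Xbar)^2
sample_median = np.median(x)             # middle value of the order statistics
sample_min = order_stats[0]              # X_(1)
sample_max = order_stats[-1]             # X_(n)
```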
Often, we are interested in the distribution of T.
Example 1 If $X_1, \ldots, X_n \sim \Gamma(\alpha, \beta)$, then $\bar{X} \sim \Gamma(n\alpha, \beta/n)$.

Proof. The mgf is
$$M_{\bar{X}}(t) = E[e^{t\bar{X}}] = E[e^{\sum_i X_i t/n}] = \prod_i E[e^{X_i (t/n)}] = [M_X(t/n)]^n = \left[ \left( \frac{1}{1 - \beta t/n} \right)^{\alpha} \right]^n = \left( \frac{1}{1 - \beta t/n} \right)^{n\alpha}.$$
This is the mgf of $\Gamma(n\alpha, \beta/n)$.
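A quick Monte Carlo check of this fact (a minimal sketch; the values $\alpha = 2$, $\beta = 3$, $n = 5$ are arbitrary, and scipy's gamma is parameterized by shape and scale):

```python
import numpy as np
from scipy import stats

rng = np.random.default_rng(0)
alpha, beta, n = 2.0, 3.0, 5             # arbitrary illustrative values

# 100,000 replications of the sample mean of n Gamma(alpha, beta) draws
xbar = rng.gamma(alpha, beta, size=(100_000, n)).mean(axis=1)

# Empirical quantiles of Xbar should match Gamma(n*alpha, beta/n) quantiles
q = np.linspace(0.05, 0.95, 10)
print(np.quantile(xbar, q))
print(stats.gamma.ppf(q, a=n * alpha, scale=beta / n))
```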
Example 2 If $X_1, \ldots, X_n \sim N(\mu, \sigma^2)$ then $\bar{X} \sim N(\mu, \sigma^2/n)$.

Example 3 If $X_1, \ldots, X_n$ are iid Cauchy(0, 1),
$$p(x) = \frac{1}{\pi (1 + x^2)}$$
for $x \in \mathbb{R}$, then $\bar{X} \sim$ Cauchy(0, 1).
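These two examples behave very differently in simulation: the Normal sample mean concentrates around $\mu$ as $n$ grows, while the Cauchy sample mean has the same spread no matter how large $n$ is. A minimal sketch (the sample sizes are arbitrary):

```python
import numpy as np

rng = np.random.default_rng(0)
for n in (10, 1_000, 100_000):
    normal_mean = rng.normal(size=n).mean()           # ~ N(0, 1/n): shrinks
    cauchy_mean = rng.standard_cauchy(size=n).mean()  # ~ Cauchy(0,1): does not
    print(n, normal_mean, cauchy_mean)
```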
Example 4 If $X_1, \ldots, X_n \sim N(\mu, \sigma^2)$ then
$$\frac{(n-1) S^2}{\sigma^2} \sim \chi^2_{n-1}.$$
The proof is based on the mgf.
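This too can be checked by simulation (a sketch; $\mu = 1$, $\sigma = 2$, $n = 8$ are arbitrary choices):

```python
import numpy as np
from scipy import stats

rng = np.random.default_rng(0)
mu, sigma, n = 1.0, 2.0, 8               # arbitrary illustrative values

x = rng.normal(mu, sigma, size=(100_000, n))
t = (n - 1) * x.var(axis=1, ddof=1) / sigma**2    # (n-1) S^2 / sigma^2

# Empirical quantiles should match chi-square with n-1 degrees of freedom
q = np.linspace(0.05, 0.95, 10)
print(np.quantile(t, q))
print(stats.chi2.ppf(q, df=n - 1))
```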
Example 5 Let $X_{(1)}, X_{(2)}, \ldots, X_{(n)}$ be the order statistics, which means that the sample $X_1, X_2, \ldots, X_n$ has been ordered from smallest to largest:
$$X_{(1)} \le X_{(2)} \le \cdots \le X_{(n)}.$$
Now,
$$\begin{aligned}
F_{X_{(k)}}(x) &= P(X_{(k)} \le x) \\
&= P(\text{at least } k \text{ of the } X_1, \ldots, X_n \le x) \\
&= \sum_{j=k}^{n} P(\text{exactly } j \text{ of the } X_1, \ldots, X_n \le x) \\
&= \sum_{j=k}^{n} \binom{n}{j} [F_X(x)]^j [1 - F_X(x)]^{n-j}.
\end{aligned}$$
Differentiate to find the pdf (see CB p. 229):
$$p_{X_{(k)}}(x) = \frac{n!}{(k-1)! \, (n-k)!} \, [F_X(x)]^{k-1} \, p(x) \, [1 - F_X(x)]^{n-k}.$$
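The cdf formula is just the probability that a Binomial($n$, $F_X(x)$) count is at least $k$, which makes it easy to verify by simulation (a sketch for Uniform(0, 1) samples, where $F_X(x) = x$; the choices $n = 5$, $k = 2$, $x_0 = 0.3$ are arbitrary):

```python
import numpy as np
from scipy import stats

rng = np.random.default_rng(0)
n, k, x0 = 5, 2, 0.3                     # arbitrary illustrative values

# Empirical P(X_(k) <= x0) for Uniform(0,1) samples
xk = np.sort(rng.uniform(size=(100_000, n)), axis=1)[:, k - 1]
print((xk <= x0).mean())

# The formula: sum_{j=k}^{n} C(n,j) F(x0)^j (1 - F(x0))^(n-j), with F(x0) = x0
print(sum(stats.binom.pmf(j, n, x0) for j in range(k, n + 1)))
```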
3 Sufficiency
(Ch. 6 of Casella and Berger.) We continue with parametric inference. In this section we discuss data reduction as a formal concept.
3.1 Sufficient Statistics
Suppose that $X_1, \ldots, X_n \sim p(x; \theta)$. $T$ is sufficient for $\theta$ if the conditional distribution of $X_1, \ldots, X_n \mid T$ does not depend on $\theta$. Thus,
$$p(x_1, \ldots, x_n \mid t; \theta) = p(x_1, \ldots, x_n \mid t).$$
Intuitively, this means that you can replace $X_1, \ldots, X_n$ with $T(X_1, \ldots, X_n)$ without losing information. (This is not quite true, as we'll see later. But for now, you can think of it this way.)
Example 6 $X_1, \ldots, X_n \sim$ Poisson($\theta$). Let $T = \sum_{i=1}^n X_i$. Then,
$$p_{X^n \mid T}(x^n \mid t) = P(X^n = x^n \mid T(X^n) = t) = \frac{P(X^n = x^n \text{ and } T = t)}{P(T = t)}.$$
But
$$P(X^n = x^n \text{ and } T = t) = \begin{cases} 0 & T(x_1, \ldots, x_n) \neq t \\ P(X_1 = x_1, \ldots, X_n = x_n) & T(x_1, \ldots, x_n) = t. \end{cases}$$
Hence,
$$P(X^n = x^n) = \prod_{i=1}^{n} \frac{e^{-\theta} \theta^{x_i}}{x_i!} = \frac{e^{-n\theta} \theta^{\sum_i x_i}}{\prod_i (x_i!)} = \frac{e^{-n\theta} \theta^t}{\prod_i (x_i!)}.$$
Now, $T(x^n) = \sum_i x_i = t$ and so
$$P(T = t) = \frac{e^{-n\theta} (n\theta)^t}{t!}$$
since $T \sim$ Poisson($n\theta$).
Thus,
$$\frac{P(X^n = x^n)}{P(T = t)} = \frac{t!}{\left( \prod_i x_i! \right) n^t}$$
which does not depend on $\theta$. So $T = \sum_i X_i$ is a sufficient statistic for $\theta$. Other sufficient statistics are: $T = 3.7 \sum_i X_i$, $T = (\sum_i X_i, X_4)$, and $T(X_1, \ldots, X_n) = (X_1, \ldots, X_n)$.
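The final ratio is exactly the Multinomial($t$; $1/n, \ldots, 1/n$) pmf, so given $T = t$ the counts are spread evenly across the $n$ coordinates no matter what $\theta$ is. A minimal simulation sketch of this ($n = 3$, $t = 5$, and the two $\theta$ values are arbitrary):

```python
import numpy as np

rng = np.random.default_rng(0)
n, t = 3, 5                              # arbitrary illustrative values

def conditional_draws(theta, reps=200_000):
    """Draw X^n ~ iid Poisson(theta) and keep only draws with sum equal to t."""
    x = rng.poisson(theta, size=(reps, n))
    return x[x.sum(axis=1) == t]

# The conditional distribution of X^n given T = t should be the same for
# both values of theta: Multinomial(t, (1/n, ..., 1/n)).
for theta in (0.5, 4.0):
    kept = conditional_draws(theta)
    print(theta, kept.mean(axis=0))      # each coordinate mean is about t/n
```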
3.2 Sufficient Partitions
It is better to describe sufficiency in terms of partitions of the sample space.
Example 7 Let $X_1, X_2, X_3 \sim$ Bernoulli($\theta$). Let $T = \sum_i X_i$.

x^n          t       p(x|t)
(0, 0, 0)    t = 0   1
(0, 0, 1)    t = 1   1/3
(0, 1, 0)    t = 1   1/3
(1, 0, 0)    t = 1   1/3
(0, 1, 1)    t = 2   1/3
(1, 0, 1)    t = 2   1/3
(1, 1, 0)    t = 2   1/3
(1, 1, 1)    t = 3   1
8 elements   4 elements
1. A partition $B_1, \ldots, B_k$ is sufficient if $f(x \mid X \in B)$ does not depend on $\theta$.
2. A statistic $T$ induces a partition. For each $t$, $\{x : T(x) = t\}$ is one element of the partition. $T$ is sufficient if and only if the partition is sufficient.
3. Two statistics can generate the same partition: example: $\sum_i X_i$ and $3 \sum_i X_i$.
4. If we split any element $B_i$ of a sufficient partition into smaller pieces, we get another sufficient partition.
Example 8 Let $X_1, X_2, X_3 \sim$ Bernoulli($\theta$). Then $T = X_1$ is not sufficient. Look at its partition:

x^n          t       p(x|t)
(0, 0, 0)    t = 0   (1 − θ)²
(0, 0, 1)    t = 0   θ(1 − θ)
(0, 1, 0)    t = 0   θ(1 − θ)
(0, 1, 1)    t = 0   θ²
(1, 0, 0)    t = 1   (1 − θ)²
(1, 0, 1)    t = 1   θ(1 − θ)
(1, 1, 0)    t = 1   θ(1 − θ)
(1, 1, 1)    t = 1   θ²
8 elements   2 elements
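Both tables can be reproduced symbolically (a sketch assuming sympy; it enumerates $\{0,1\}^3$ and computes $p(x \mid t)$ for each statistic):

```python
from itertools import product
import sympy as sp

theta = sp.Symbol('theta')

def conditional_table(T):
    """p(x | T(x) = t) for X_1, X_2, X_3 ~ iid Bernoulli(theta)."""
    prob = {x: theta**sum(x) * (1 - theta)**(3 - sum(x))
            for x in product((0, 1), repeat=3)}
    table = {}
    for x, p in prob.items():
        t = T(x)
        denom = sum(q for y, q in prob.items() if T(y) == t)
        table[x] = sp.simplify(p / denom)
    return table

print(conditional_table(sum))              # Example 7: no theta left, sufficient
print(conditional_table(lambda x: x[0]))   # Example 8: theta remains, not sufficient
```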
3.3 The Factorization Theorem
Theorem 9 $T(X^n)$ is sufficient for $\theta$ if and only if the joint pdf/pmf of $X^n$ can be factored as
$$p(x^n; \theta) = h(x^n) \, g(t; \theta).$$
Example 10 Let $X_1, \ldots, X_n \sim$ Poisson($\theta$). Then
$$p(x^n; \theta) = \frac{e^{-n\theta} \theta^{\sum_i x_i}}{\prod_i (x_i!)} = \frac{1}{\prod_i (x_i!)} \; e^{-n\theta} \theta^{\sum_i x_i},$$
so the factorization holds with $h(x^n) = 1/\prod_i (x_i!)$ and $g(t; \theta) = e^{-n\theta} \theta^t$, and $T = \sum_i X_i$ is sufficient.
Example 11 $X_1, \ldots, X_n \sim N(\mu, \sigma^2)$. Then
$$p(x^n; \mu, \sigma^2) = \left( \frac{1}{2\pi\sigma^2} \right)^{n/2} \exp\left\{ -\frac{\sum_i (x_i - \bar{x})^2 + n(\bar{x} - \mu)^2}{2\sigma^2} \right\}.$$
(a) If $\sigma$ is known:
$$p(x^n; \mu) = \underbrace{\left( \frac{1}{2\pi\sigma^2} \right)^{n/2} \exp\left\{ -\frac{\sum_i (x_i - \bar{x})^2}{2\sigma^2} \right\}}_{h(x^n)} \; \underbrace{\exp\left\{ -\frac{n(\bar{x} - \mu)^2}{2\sigma^2} \right\}}_{g(T(x^n); \, \mu)}.$$
Thus, $\bar{X}$ is sufficient for $\mu$.
(b) If $(\mu, \sigma^2)$ is unknown then $T = (\bar{X}, S^2)$ is sufficient. So is $T = (\sum_i X_i, \sum_i X_i^2)$.
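The exponent above relies on the decomposition $\sum_i (x_i - \mu)^2 = \sum_i (x_i - \bar{x})^2 + n(\bar{x} - \mu)^2$, which is easy to confirm numerically (a sketch; the data and $\mu$ are arbitrary):

```python
import numpy as np

rng = np.random.default_rng(0)
x = rng.normal(size=10)                  # arbitrary data
mu = 0.7                                 # arbitrary parameter value
xbar = x.mean()

lhs = np.sum((x - mu) ** 2)
rhs = np.sum((x - xbar) ** 2) + len(x) * (xbar - mu) ** 2
print(np.isclose(lhs, rhs))              # True: the cross term vanishes
```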
3.4 Minimal Sufficient Statistics (MSS)
We want the greatest reduction in dimension.
Example 12 $X_1, \ldots, X_n \sim N(0, \sigma^2)$. Some sufficient statistics are:
$$T(X_1, \ldots, X_n) = (X_1, \ldots, X_n)$$
$$T(X_1, \ldots, X_n) = (X_1^2, \ldots, X_n^2)$$
$$T(X_1, \ldots, X_n) = \left( \sum_{i=1}^{m} X_i^2, \; \sum_{i=m+1}^{n} X_i^2 \right)$$
$$T(X_1, \ldots, X_n) = \sum_i X_i^2.$$
$T$ is a Minimal Sufficient Statistic if the following two statements are true:
1. $T$ is sufficient and
2. If $U$ is any other sufficient statistic then $T = g(U)$ for some function $g$.
In other words, $T$ generates the coarsest sufficient partition.
Suppose $U$ is sufficient. Suppose $T = H(U)$ is also sufficient. $T$ provides greater reduction than $U$ unless $H$ is a one-to-one transformation, in which case $T$ and $U$ are equivalent.
Example 13 $X \sim N(0, \sigma^2)$. $X$ is sufficient. $|X|$ is sufficient. $|X|$ is a MSS. So are $X^2$, $X^4$, and $e^{X^2}$.
Example 14 Let $X_1, X_2, X_3 \sim$ Bernoulli($\theta$). Let $T = \sum_i X_i$.

x^n          t       p(x|t)   u         p(x|u)
(0, 0, 0)    t = 0   1        u = 0     1
(0, 0, 1)    t = 1   1/3      u = 1     1/3
(0, 1, 0)    t = 1   1/3      u = 1     1/3
(1, 0, 0)    t = 1   1/3      u = 1     1/3
(0, 1, 1)    t = 2   1/3      u = 73    1/2
(1, 0, 1)    t = 2   1/3      u = 73    1/2
(1, 1, 0)    t = 2   1/3      u = 91    1
(1, 1, 1)    t = 3   1        u = 103   1
Note that $U$ and $T$ are both sufficient but $U$ is not minimal.
3.5 How to find a Minimal Sufficient Statistic
Theorem 15 Define
$$R(x^n, y^n; \theta) = \frac{p(y^n; \theta)}{p(x^n; \theta)}.$$
Suppose that $T$ has the following property: $R(x^n, y^n; \theta)$ does not depend on $\theta$ if and only if $T(y^n) = T(x^n)$. Then $T$ is a MSS.
Example 16 $Y_1, \ldots, Y_n$ iid Poisson($\theta$).
$$p(y^n; \theta) = \frac{e^{-n\theta} \theta^{\sum_i y_i}}{\prod_i y_i!}, \qquad \frac{p(y^n; \theta)}{p(x^n; \theta)} = \theta^{\sum_i y_i - \sum_i x_i} \, \frac{\prod_i x_i!}{\prod_i y_i!}$$
which is independent of $\theta$ if and only if $\sum_i y_i = \sum_i x_i$. This implies that $T(Y^n) = \sum_i Y_i$ is a minimal sufficient statistic for $\theta$.
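A symbolic sanity check of this ratio (a sketch assuming sympy, with two arbitrary samples that share the same sum and two that do not):

```python
import sympy as sp
from math import factorial, prod

theta = sp.Symbol('theta', positive=True)

def poisson_joint(xs):
    """Joint pmf of an iid Poisson(theta) sample at the observed values xs."""
    n = len(xs)
    return sp.exp(-n * theta) * theta**sum(xs) / prod(factorial(x) for x in xs)

# Same sum (T(y) = T(x)): the simplified ratio is a constant, free of theta.
print(sp.simplify(poisson_joint([1, 2, 3]) / poisson_joint([2, 2, 2])))

# Different sums: theta remains in the simplified ratio.
print(sp.simplify(poisson_joint([1, 2, 4]) / poisson_joint([2, 2, 2])))
```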
The minimal sufficient statistic is not unique. But the minimal sufficient partition is unique.
Example 17 Cauchy.
$$p(x; \theta) = \frac{1}{\pi (1 + (x - \theta)^2)}.$$
Then
$$\frac{p(y^n; \theta)}{p(x^n; \theta)} = \frac{\prod_{i=1}^{n} \{ 1 + (x_i - \theta)^2 \}}{\prod_{j=1}^{n} \{ 1 + (y_j - \theta)^2 \}}.$$
The ratio is a constant function of $\theta$ if the two samples have the same order statistics, that is, if $(y_{(1)}, \ldots, y_{(n)}) = (x_{(1)}, \ldots, x_{(n)})$; this suggests $T(Y^n) = (Y_{(1)}, \ldots, Y_{(n)})$. It is technically harder to show that the ratio is constant only if the order statistics agree, but it can be done using theorems about polynomials. Having shown this, one can conclude that the order statistics are minimal sufficient for $\theta$.
4 What Sufficiency Really Means
If $T$ is sufficient, then $T$ contains all the information you need from the data to compute the likelihood function. It does not contain all the information in the data. We will define the likelihood function in the next set of notes.
Note: Ignore the material on completeness and ancillary statistics.