3 - Principles of Data Reduction
A sufficient statistic for a parameter θ captures all the information about θ contained in the sample. Any additional information in the sample besides the value of the sufficient statistic does not contain any more information about θ.
Def. A statistic T(X) is a sufficient statistic for θ if the conditional distribution of the sample X given the value of T(X) does not depend on θ.
The sufficiency principle states that if T(X) is a sufficient statistic for θ, then any inference about θ should depend on the sample X only through the value T(X). That is, if x and y are two sample points such that T(x) = T(y), then the inference about θ should be the same whether X = x or X = y is observed.
Theorem. If p(x|θ) is the joint PDF or PMF of X and q(t|θ) is the PMF or PDF of T(X), then T(X) is a sufficient statistic for θ if, for every x in the sample space, the ratio p(x|θ)/q(T(x)|θ) is constant as a function of θ.
Example: Let X1, X2, . . . , Xn be a random sample from Be(θ). Is the sample sum a sufficient statistic for θ?
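A sketch of the verification using the preceding theorem (assuming Be(θ) denotes the Bernoulli distribution, so that T(X) = ∑ Xi has a Binomial(n, θ) distribution):
\[
\frac{p(\mathbf{x} \mid \theta)}{q(t \mid \theta)}
= \frac{\prod_{i=1}^{n} \theta^{x_i} (1-\theta)^{1-x_i}}{\binom{n}{t}\, \theta^{t} (1-\theta)^{n-t}}
= \frac{\theta^{t} (1-\theta)^{n-t}}{\binom{n}{t}\, \theta^{t} (1-\theta)^{n-t}}
= \frac{1}{\binom{n}{t}}, \qquad t = \sum_{i=1}^{n} x_i,
\]
which is constant as a function of θ. Hence the sample sum is sufficient for θ.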
Theorem. Let f(x|θ) denote the joint PDF or PMF of X. A statistic T(X) is a sufficient statistic for θ if and only if there exist functions g(t|θ) and h(x) such that, for all sample points x and all parameter points θ,
f(x|θ) = g(T(x)|θ)h(x)
Note: When the parameter is a vector (e.g. (θ1, θ2)), then the sufficient statistic is a vector and its components are viewed as jointly sufficient. One-to-one functions of (jointly) sufficient statistics are also (jointly) sufficient.
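For instance (a sketch of the standard argument, not part of the original notes), the Bernoulli example above can also be settled directly by factorization:
\[
f(\mathbf{x} \mid \theta) = \prod_{i=1}^{n} \theta^{x_i} (1-\theta)^{1-x_i}
= \underbrace{\theta^{t} (1-\theta)^{n-t}}_{g(t \mid \theta)} \cdot \underbrace{1}_{h(\mathbf{x})},
\qquad t = \sum_{i=1}^{n} x_i,
\]
so T(X) = ∑ Xi is sufficient, in agreement with the ratio computation.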
Def. A sufficient statistic T(X) is called a minimal sufficient statistic if, for any other sufficient statistic T′(X), T(X) is a function of T′(X).
Remarks:
1. A minimal sufficient statistic achieves the greatest possible data reduction for a sufficient statistic.
2. A minimal sufficient statistic is not unique. Any one-to-one function of a minimal sufficient statistic is also a minimal sufficient statistic.
Theorem. Let f(x|θ) be the joint PDF or PMF of a sample X. Suppose there exists a function T(x) such that, for every two sample points x and y, the ratio f(x|θ)/f(y|θ) is constant as a function of θ if and only if T(x) = T(y). Then T(X) is a minimal sufficient statistic for θ.
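Continuing the Bernoulli example (a sketch, not part of the original notes): for two sample points x and y,
\[
\frac{f(\mathbf{x} \mid \theta)}{f(\mathbf{y} \mid \theta)}
= \frac{\theta^{\sum_i x_i} (1-\theta)^{n - \sum_i x_i}}{\theta^{\sum_i y_i} (1-\theta)^{n - \sum_i y_i}}
= \left( \frac{\theta}{1-\theta} \right)^{\sum_i x_i - \sum_i y_i},
\]
which is constant in θ if and only if ∑ xi = ∑ yi. Hence T(X) = ∑ Xi is minimal sufficient for θ.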
Previously, we considered sufficient statistics, which contain all the information about θ that is available in the sample. Now, we introduce a different sort of statistic, which has a complementary purpose.
Def. A statistic S(X ) whose distribution does not depend on the parameter θ is called an
ancillary statistic.
Alone, an ancillary statistic contains no information about θ. However, when used in
conjunction with other statistics, it can contain valuable information for inferences about θ.
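A standard example (not in the original notes): if X1, X2, . . . , Xn is a random sample from a location family with PDF f(x − θ), then the range R = X(n) − X(1) is ancillary. Writing Xi = Zi + θ with Zi ∼ f,
\[
R = X_{(n)} - X_{(1)} = (Z_{(n)} + \theta) - (Z_{(1)} + \theta) = Z_{(n)} - Z_{(1)},
\]
whose distribution does not involve θ.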
Def. A statistic T(X) is complete if and only if the only unbiased estimator of zero that is a function of T is the statistic that is identically zero with probability 1 (i.e., a statistic that is degenerate at the point 0). Equivalently, if Eθ g(T) = 0 for all θ, then Pθ(g(T) = 0) = 1 for all θ. Also, note that completeness is (strictly speaking) a property of a family of distributions.
Theorem. Given a random sample from an exponential family, T(x) as previously defined also yields a complete sufficient statistic (or a set of jointly complete sufficient statistics). (NOTE: This theorem holds as long as a certain condition is also true. Due to the level of the course, we will only consider cases where that condition holds.)
Example: Let X1, X2, . . . , Xn be a random sample from Be(θ). Is the sample sum a complete statistic?
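A sketch of the standard argument (assuming Be(θ) is Bernoulli, so T = ∑ Xi ∼ Binomial(n, θ)): suppose Eθ g(T) = 0 for every θ ∈ (0, 1). Then
\[
0 = \sum_{t=0}^{n} g(t) \binom{n}{t} \theta^{t} (1-\theta)^{n-t}
= (1-\theta)^{n} \sum_{t=0}^{n} g(t) \binom{n}{t} \left( \frac{\theta}{1-\theta} \right)^{t}.
\]
The last sum is a polynomial in r = θ/(1 − θ) that vanishes for every r > 0, so each coefficient g(t)·C(n, t) must be zero, i.e. g(t) = 0 for t = 0, 1, . . . , n. Hence the sample sum is complete. (Alternatively, this follows from the exponential-family theorem above.)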
Basu's Theorem
Theorem. If T(X) is a complete (and minimal) sufficient statistic, then T(X) is independent of every ancillary statistic.
Remarks:
1. This theorem is useful since it allows us to deduce the independence of two statistics without ever finding their joint distribution.
2. The word "minimal" may be omitted from the statement of the theorem and it will remain true: if a minimal sufficient statistic exists, then any complete sufficient statistic is also a minimal sufficient statistic.
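A classic application (a sketch, not from the original notes): let X1, . . . , Xn be a random sample from N(μ, σ²) with σ² known. Then X̄ is a complete sufficient statistic for μ, while the sample variance S² is ancillary for μ (its distribution depends only on σ²). By Basu's theorem,
\[
\bar{X} \ \perp\ S^{2} = \frac{1}{n-1} \sum_{i=1}^{n} (X_i - \bar{X})^{2},
\]
a result obtained without computing the joint distribution of X̄ and S².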
Def. Let f(x|θ) denote the joint PDF or PMF of the sample X. Given that X = x is observed, the function of θ defined by L(θ|x) = f(x|θ) is called the likelihood function.
The likelihood principle states that if x and y are two sample points such that L(θ|x) is proportional to L(θ|y), i.e. there exists a constant C(x, y) such that L(θ|x) = C(x, y)L(θ|y) for all θ, then the conclusions drawn from x and y should be identical.
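A standard illustration (not part of the original notes): suppose x records 3 successes in 10 Bernoulli(θ) trials under binomial sampling, while y records that the 3rd success occurred on the 10th trial under negative binomial sampling. Then
\[
L(\theta \mid x) = \binom{10}{3} \theta^{3} (1-\theta)^{7}, \qquad
L(\theta \mid y) = \binom{9}{2} \theta^{3} (1-\theta)^{7},
\]
so L(θ|x) is proportional to L(θ|y) with C(x, y) = C(10, 3)/C(9, 2), and the likelihood principle says the two experiments should lead to identical conclusions about θ.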