Statistical Inference Notes Melon University
1 Statistical Inference
A central concern of statistics and machine learning is to estimate things about some underlying population on the basis of samples. Formally, given a sample
$$X_1, \ldots, X_n \sim F,$$
we would like to estimate the distribution $F$, or some aspect of it.
To make meaningful inferences about $F$ from samples we typically restrict $F$ in some natural way. A statistical model is a set of distributions $\mathcal{F}$. Broadly, there are two possibilities: parametric models, which can be indexed by a finite-dimensional parameter, and nonparametric models, which cannot. Some examples of parametric models:
(a) A Gaussian model: This is a simple two parameter model. Here we suppose that:
$$\mathcal{F} = \left\{ f(x; \mu, \sigma) = \frac{1}{\sigma \sqrt{2\pi}} \exp\left( -\frac{(x - \mu)^2}{2\sigma^2} \right) : \mu \in \mathbb{R}, \ \sigma > 0 \right\}.$$
(b) A Bernoulli model: Here we suppose that:
$$\mathcal{F} = \left\{ p_\theta(x) = \theta^x (1 - \theta)^{1 - x} : 0 \le \theta \le 1 \right\}.$$
Nonparametric models, in contrast, cannot be indexed by a finite-dimensional parameter. Some examples:
(a) Estimating the CDF: Here the model consists of any valid CDF, i.e. a function that is between 0 and 1, is monotonically increasing, right-continuous, and equal to 0 at $-\infty$ and 1 at $\infty$. We are given samples $X_1, \ldots, X_n \sim F$ and the goal is to estimate $F$ (a small sketch of a natural estimator appears after these examples).
(b) Density estimation: In density estimation, we are given samples $X_1, \ldots, X_n \sim f_X$, where $f_X$ is an unknown density that we would like to estimate. It turns out that
the class of all possible densities is too big for this problem to be well posed so
we need to assume some smoothness on the density. A typical assumption is that
the model is given by:
$$\mathcal{F} = \left\{ f : \int (f''(x))^2 \, dx < \infty, \ \int f(x) \, dx = 1, \ f(x) \ge 0 \right\}.$$
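As a concrete illustration of problem (a) above (a minimal sketch, not part of the original notes, assuming Python with NumPy and simulated standard normal data), the empirical CDF $\hat{F}_n(x) = \frac{1}{n}\sum_{i=1}^n \mathbf{1}(X_i \le x)$ is a natural estimator of $F$:

```python
import numpy as np

def empirical_cdf(samples, x):
    """Evaluate the empirical CDF F_n(x) = (1/n) * #{i : X_i <= x} at the points x."""
    samples = np.asarray(samples)
    x = np.atleast_1d(x)
    return np.mean(samples[:, None] <= x[None, :], axis=0)

# Illustrative setup: the true F is the standard normal CDF (an assumption for this sketch).
rng = np.random.default_rng(0)
X = rng.normal(size=500)
print(empirical_cdf(X, [-1.0, 0.0, 1.0]))  # roughly Phi(-1), Phi(0), Phi(1) = 0.16, 0.50, 0.84
```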
2 Point Estimation
Point estimation in statistics refers to calculating a single “best guess” of the value of an unknown quantity of interest. The quantity of interest could be a parameter or, for instance, a density function. Typically, we will use $\hat{\theta}$ or $\hat{\theta}_n$ to denote a point estimator. A point estimator is a function of the data $X_1, \ldots, X_n$:
$$\hat{\theta}_n = g(X_1, \ldots, X_n).$$
The bias of an estimator is defined as:
$$b(\hat{\theta}_n) = E_\theta(\hat{\theta}_n) - \theta,$$
and its variance as:
$$v(\hat{\theta}_n) = E_\theta(\hat{\theta}_n - \overline{\theta}_n)^2,$$
where $\overline{\theta}_n = E_\theta(\hat{\theta}_n)$. The standard error is defined to be $\mathrm{se} = \sqrt{v(\hat{\theta}_n)}$.
In the olden days, there was a lot of emphasis on unbiased estimators, and the goal was to find unbiased estimators with small (or minimal) variance. In modern statistics, we often use biased estimators because the reduction in variance often justifies the bias.
We call an estimator of a parameter consistent if the estimator converges to the true parameter in probability, i.e. for any $\epsilon > 0$:
$$P_\theta(|\hat{\theta}_n - \theta| \ge \epsilon) \to 0,$$
as $n \to \infty$. In other words, $\hat{\theta}_n \xrightarrow{P} \theta$, or $\hat{\theta}_n - \theta = o_P(1)$.
One way to measure the quality of an estimator is via its mean squared error:
$$\mathrm{MSE} = E_\theta(\theta - \hat{\theta}_n)^2.$$
The MSE can be decomposed as the sum of the squared bias and the variance, i.e.:
$$\mathrm{MSE} = E_\theta(\theta - \hat{\theta}_n)^2 = E_\theta(\theta - \overline{\theta}_n + \overline{\theta}_n - \hat{\theta}_n)^2 = b(\hat{\theta}_n)^2 + v(\hat{\theta}_n),$$
where the cross term vanishes because $\theta - \overline{\theta}_n$ is non-random and $E_\theta(\overline{\theta}_n - \hat{\theta}_n) = 0$.
A simple consequence of this decomposition is: if $b(\hat{\theta}_n) \to 0$ and $v(\hat{\theta}_n) \to 0$ then $\hat{\theta}_n \xrightarrow{qm} \theta$ and hence $\hat{\theta}_n \xrightarrow{P} \theta$.
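To see concretely both the decomposition above and the earlier remark that a biased estimator can have smaller mean squared error, here is a minimal simulation sketch (not part of the original notes; the Gaussian setup with $\mu = 0.2$, $\sigma = 1$, $n = 20$ and the shrinkage factor $0.5$ are illustrative assumptions):

```python
import numpy as np

rng = np.random.default_rng(1)
mu, sigma, n, trials = 0.2, 1.0, 20, 100_000   # assumed true mean, sd, sample size, repetitions

X_bar = rng.normal(mu, sigma, size=(trials, n)).mean(axis=1)   # unbiased sample mean
shrunk = 0.5 * X_bar                                           # biased: shrink toward zero

for name, est in [("sample mean", X_bar), ("0.5 * sample mean", shrunk)]:
    bias = est.mean() - mu
    var = est.var()
    mse = np.mean((est - mu) ** 2)
    # MSE should match bias^2 + variance, up to Monte Carlo error.
    print(f"{name:>17s}: bias^2 + var = {bias**2 + var:.5f}   MSE = {mse:.5f}")

# The shrunk estimator wins here only because mu is small relative to sigma / sqrt(n);
# the point is that trading bias for variance can reduce the MSE.
```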
3 An Example
Example: Suppose that $X_1, \ldots, X_n \sim \mathrm{Ber}(p)$ and consider the estimator $\hat{p}_n = \frac{1}{n} \sum_{i=1}^{n} X_i$. What is the bias of this estimator? What is its variance? Is the estimator consistent?
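As a quick empirical check of these questions (a minimal simulation sketch, not part of the original notes; the true value $p = 0.3$ and the tolerance $\epsilon = 0.05$ are illustrative assumptions):

```python
import numpy as np

rng = np.random.default_rng(2)
p, eps, trials = 0.3, 0.05, 5000   # assumed true parameter, tolerance, Monte Carlo repetitions

for n in [10, 100, 1000]:
    # Each row is one repetition of the experiment; each repetition yields one estimate p_hat.
    p_hat = rng.binomial(1, p, size=(trials, n)).mean(axis=1)
    bias = p_hat.mean() - p                    # close to 0: the estimator is unbiased
    var = p_hat.var()                          # close to p(1 - p) / n
    miss = np.mean(np.abs(p_hat - p) >= eps)   # P(|p_hat - p| >= eps) shrinks with n: consistency
    print(f"n={n:5d}  bias={bias:+.4f}  var={var:.5f}  p(1-p)/n={p * (1 - p) / n:.5f}  miss={miss:.3f}")
```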
4 Asymptotic Normality
Often estimators that we study will have an asymptotically normal distribution. This means that:
$$\frac{\hat{\theta}_n - \theta}{\mathrm{se}} \rightsquigarrow N(0, 1).$$
We will refer to this property as asymptotic normality.
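For instance (a minimal simulation sketch, not part of the original notes, reusing the Bernoulli example with an assumed $p = 0.3$ and $n = 500$), we can standardize $\hat{p}_n$ by its standard error and check that the result looks like $N(0, 1)$:

```python
import numpy as np

rng = np.random.default_rng(3)
p, n, trials = 0.3, 500, 10_000          # assumed true parameter, sample size, repetitions

p_hat = rng.binomial(1, p, size=(trials, n)).mean(axis=1)
se = np.sqrt(p * (1 - p) / n)            # standard error of p_hat
z = (p_hat - p) / se                     # should be approximately N(0, 1)

print("mean:", round(z.mean(), 3), " std:", round(z.std(), 3))
print("fraction in [-1.96, 1.96]:", np.mean(np.abs(z) <= 1.96))  # should be close to 0.95
```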
5 Confidence Sets
In general, for a parameter $\theta$ we define a $1 - \alpha$ confidence set $C_n$ to be any random set which has the property that:
$$P_\theta(\theta \in C_n) \ge 1 - \alpha$$
for all $\theta$. We refer to $P_\theta(\theta \in C_n)$ as the coverage of the confidence set $C_n$. The confidence set $C_n$ is a random set (and $\theta$ is a fixed parameter).
One can think about the coverage guarantee in the following way:
You repeat the experiment many times, each time constructing a confidence set $C_n$ (a different set each time, since it depends on the data). Then a fraction $1 - \alpha$ of these sets will contain the corresponding true parameter. Notice that the true parameter does not have to be fixed, so in some sense the experiment you conduct can be different each time.
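To make the repeated-experiment interpretation concrete, here is a minimal simulation sketch (not part of the original notes; the setting, a Gaussian mean with known standard deviation, and the values $\mu = 5$, $\sigma = 2$, $n = 50$, $\alpha = 0.05$ are illustrative assumptions):

```python
import numpy as np

rng = np.random.default_rng(4)
mu, sigma, n, alpha, trials = 5.0, 2.0, 50, 0.05, 10_000
z = 1.96                                       # Phi^{-1}(1 - alpha/2) for alpha = 0.05

covered = 0
for _ in range(trials):
    X = rng.normal(mu, sigma, size=n)          # one repetition of the experiment
    half_width = z * sigma / np.sqrt(n)        # known-variance interval for the mean
    covered += (X.mean() - half_width <= mu <= X.mean() + half_width)

print("empirical coverage:", covered / trials)  # should be close to 1 - alpha = 0.95
```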
We already saw a way to construct confidence intervals for a Bernoulli parameter using
Hoeffding’s inequality. More generally, we can always use concentration inequalities to con-
struct confidence intervals. These confidence intervals are often loose and we instead resort
to approximate (asymptotic) confidence intervals.
In many cases, the quantity $(\hat{\theta}_n - \theta)/\sqrt{v(\hat{\theta}_n)}$ is asymptotically $N(0, 1)$. In these cases we have that $\hat{\theta}_n \approx N(\theta, v(\hat{\theta}_n))$. Define $z_{\alpha/2} = \Phi^{-1}(1 - \alpha/2)$. Then we would construct the confidence interval:
$$C_n = \left( \hat{\theta}_n - z_{\alpha/2} \sqrt{v(\hat{\theta}_n)}, \ \hat{\theta}_n + z_{\alpha/2} \sqrt{v(\hat{\theta}_n)} \right).$$
This interval is asymptotically valid, i.e.:
$$P_\theta(\theta \in C_n) \to 1 - \alpha,$$
as $n \to \infty$.
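In code, the construction is direct (a minimal sketch, not part of the original notes; the function name and the example numbers are illustrative):

```python
from scipy.stats import norm

def asymptotic_ci(theta_hat, se_hat, alpha=0.05):
    """Normal-approximation interval: theta_hat +/- z_{alpha/2} * (estimated) standard error."""
    z = norm.ppf(1 - alpha / 2)
    return theta_hat - z * se_hat, theta_hat + z * se_hat

# Illustrative numbers: an estimate of 0.42 with estimated standard error 0.05.
print(asymptotic_ci(0.42, 0.05))
```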
Example: Let us return to the Bernoulli example, with $\hat{p}_n = \frac{1}{n} \sum_{i=1}^{n} X_i$.
1. We previously constructed confidence sets using Hoeffding’s inequality. They took the form:
$$C_n = \left( \hat{p}_n - \sqrt{\frac{\log(2/\alpha)}{2n}}, \ \hat{p}_n + \sqrt{\frac{\log(2/\alpha)}{2n}} \right).$$
2. If we instead use the normal approximation: we first note that the variance of our estimator is:
$$v(\hat{\theta}_n) = \frac{p(1 - p)}{n}.$$
However, this variance depends on the unknown parameter $p$, so we cannot use it directly to create our confidence set; we instead estimate the variance as:
$$\hat{v}(\hat{\theta}_n) = \frac{\hat{p}_n(1 - \hat{p}_n)}{n}.$$
With this we would use the confidence interval:
$$C_n = \left( \hat{p}_n - z_{\alpha/2} \sqrt{\hat{v}(\hat{\theta}_n)}, \ \hat{p}_n + z_{\alpha/2} \sqrt{\hat{v}(\hat{\theta}_n)} \right).$$
It is easy to verify that this interval is always shorter than the Hoeffding interval, but it is only asymptotically correct; the sketch below compares the two numerically.
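The following minimal sketch (not part of the original notes; the values $p = 0.3$, $n = 100$, $\alpha = 0.05$ are illustrative assumptions) computes both intervals on one simulated data set:

```python
import numpy as np
from scipy.stats import norm

rng = np.random.default_rng(5)
p, n, alpha = 0.3, 100, 0.05                    # assumed true parameter, sample size, level

X = rng.binomial(1, p, size=n)
p_hat = X.mean()
z = norm.ppf(1 - alpha / 2)                     # z_{alpha/2}

h_hoeff = np.sqrt(np.log(2 / alpha) / (2 * n))  # Hoeffding half-width: valid for every n
h_wald = z * np.sqrt(p_hat * (1 - p_hat) / n)   # normal-approximation half-width: asymptotic

print(f"p_hat = {p_hat:.3f}")
print(f"Hoeffding interval:     ({p_hat - h_hoeff:.3f}, {p_hat + h_hoeff:.3f}), half-width {h_hoeff:.3f}")
print(f"Normal-approx interval: ({p_hat - h_wald:.3f}, {p_hat + h_wald:.3f}), half-width {h_wald:.3f}")
```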
6 Hypothesis testing
Typically, the way that statistical hypothesis testing proceeds is by defining a so-called null
hypothesis. We then collect data, and typically the question we ask is whether the data
provides enough evidence to reject the null hypothesis.
Example: Suppose $X_1, \ldots, X_n \sim \mathrm{Ber}(p)$, and we want to test whether the coin is fair. In this case the null hypothesis would be:
$$H_0 : p = 1/2.$$
We typically also specify an alternative hypothesis. In this case, the alternative hypothesis is:
$$H_1 : p \ne 1/2.$$
Typically, hypothesis testing proceeds by defining a test statistic. In this case, a natural test statistic might be:
$$T = \left| \frac{1}{n} \sum_{i=1}^{n} X_i - p \right|,$$
where $p = 1/2$ is the value specified by the null hypothesis. It might make sense to reject the null hypothesis if $T$ is large. We will be more precise about this later on, particularly by defining the different types of errors, and how to set the threshold for $T$.
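As a preview of how such a threshold might be chosen, here is a minimal simulation sketch (not part of the original notes; the sample size $n = 100$ and the 95th-percentile cutoff are illustrative assumptions) of the distribution of $T$ under the null hypothesis $p = 1/2$:

```python
import numpy as np

rng = np.random.default_rng(6)
n, trials = 100, 20_000                     # assumed sample size and number of null simulations

# Simulate T = |mean(X) - 1/2| when the null hypothesis is true (p = 1/2).
T_null = np.abs(rng.binomial(1, 0.5, size=(trials, n)).mean(axis=1) - 0.5)

# One illustrative choice of threshold: the 95th percentile of T under the null.
# Observing a larger T would then be surprising if the coin really were fair.
print("95th percentile of T under H0:", np.quantile(T_null, 0.95))
```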