
STATISTICAL INFERENCE LECTURE

- Dripto Bakshi
Inferential Statistics

 Inferential statistics: the part of statistics that allows researchers to generalize their findings beyond the data collected.

 Statistical inference: a procedure for making inferences or generalizations about a larger population from a sample of that population.

Lecture Parts

 Weak Law of Large Numbers & Central Limit Theorem

 Estimation Theory

Weak Law of Large Numbers & Central Limit Theorem

Pre-Poll / Election Survey
Markov's Inequality

 Consider any positive random variable $X \in (0, \infty)$ with density $f_X$.

 $E(X) = \int_0^{+\infty} x \, f_X(x) \, dx \;\ge\; \int_a^{+\infty} x \, f_X(x) \, dx$, where $a > 0$

 $\int_a^{+\infty} x \, f_X(x) \, dx \;\ge\; \int_a^{+\infty} a \, f_X(x) \, dx = a \cdot P(X \ge a)$

 Hence $P(X \ge a) \le \dfrac{E(X)}{a}$
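
The bound is easy to check numerically. Below is a minimal Monte Carlo sketch, assuming NumPy is available; the exponential distribution and all names are illustrative:

```python
import numpy as np

# Monte Carlo check of P(X >= a) <= E(X)/a for a positive random
# variable (here: Exponential with mean 2; choice is illustrative).
rng = np.random.default_rng(0)
x = rng.exponential(scale=2.0, size=1_000_000)   # E(X) = 2

for a in [1.0, 2.0, 4.0, 8.0]:
    tail = (x >= a).mean()       # empirical P(X >= a)
    bound = x.mean() / a         # Markov bound E(X)/a
    print(f"a={a}: P(X >= a) = {tail:.4f} <= E(X)/a = {bound:.4f}")
```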
Chebyshev's Inequality

 Consider any random variable $X \sim F_X(\mu, \sigma^2)$.

 Define $Q = (X - \mu)^2$. Q is a nonnegative random variable, so Markov's inequality applies.

 By Markov's inequality, for any $\varepsilon > 0$:

$P\big((X - \mu)^2 \ge \varepsilon^2\big) \le \dfrac{E\big((X - \mu)^2\big)}{\varepsilon^2} = \dfrac{\sigma^2}{\varepsilon^2}$

 Hence $P(|X - \mu| \ge \varepsilon) \le \dfrac{\sigma^2}{\varepsilon^2}$
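
A similar hedged sketch, again assuming NumPy (the uniform distribution is just an example), comparing the actual tail probability with the Chebyshev bound:

```python
import numpy as np

# Monte Carlo check of P(|X - mu| >= eps) <= sigma^2 / eps^2
# for a Uniform(0, 1) variable: mu = 1/2, sigma^2 = 1/12.
rng = np.random.default_rng(1)
x = rng.uniform(0.0, 1.0, size=1_000_000)
mu, var = 0.5, 1.0 / 12.0

for eps in [0.2, 0.3, 0.4]:
    tail = (np.abs(x - mu) >= eps).mean()   # empirical deviation probability
    print(f"eps={eps}: P = {tail:.4f} <= bound = {var / eps**2:.4f}")
```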
Weak Law of Large Numbers

 $X_1, X_2, X_3, \ldots, X_n$ are i.i.d., where $X_j \sim F(\mu, \sigma^2) \;\forall\; j = 1, 2, \ldots, n$

 $M_n = \dfrac{X_1 + X_2 + X_3 + \cdots + X_n}{n}$

 What happens if n → ∞?

Convergence in Probability

 $E(M_n) = \mu$ & $V(M_n) = \dfrac{\sigma^2}{n}$

 According to Chebyshev's Inequality, for any finite $\varepsilon > 0$:

$P(|M_n - \mu| \ge \varepsilon) \le \dfrac{\sigma^2 / n}{\varepsilon^2} = \dfrac{\sigma^2}{n\varepsilon^2}$

 Hence as n → ∞, $P(|M_n - \mu| \ge \varepsilon)$ → 0 for any $\varepsilon > 0$:

$M_n$ converges in probability to $\mu$.
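
The statement that $P(|M_n - \mu| \ge \varepsilon)$ → 0 can be illustrated by simulation. A minimal sketch, assuming NumPy (the distribution and constants are illustrative):

```python
import numpy as np

# Estimate P(|M_n - mu| >= eps) for increasing n: it shrinks toward 0.
rng = np.random.default_rng(2)
mu, eps, trials = 2.0, 0.1, 2_000

for n in [10, 100, 1_000, 10_000]:
    samples = rng.exponential(scale=mu, size=(trials, n))  # E(X_j) = mu
    m_n = samples.mean(axis=1)                   # one M_n per trial
    p = (np.abs(m_n - mu) >= eps).mean()         # empirical deviation prob.
    print(f"n={n}: P(|M_n - mu| >= {eps}) ~ {p:.4f}")
```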
Pre-Poll / Election Survey
The Methodology

 f: the proportion of people responding "voting for Republicans"

 Define a random variable $X_j$ for any individual j, where

$X_j$ = 1 ; "vote for Republican"
      = 0 ; "vote for Democrat"

 $M_n = \dfrac{X_1 + X_2 + X_3 + \cdots + X_n}{n}$

 $M_n$ denotes the fraction of people responding "voting for Republicans"

 Goal: 95% confidence of ≤ 1% error, i.e.

$P(|M_n - f| \ge 0.01) \le 0.05$
 $X_j \sim \mathrm{Ber}(f)$

 $E(X_j) = f$ & $V(X_j) = f(1-f)$

 $M_n = \dfrac{X_1 + X_2 + X_3 + \cdots + X_n}{n}$

 $E(M_n) = f$ & $V(M_n) = \dfrac{f(1-f)}{n}$
 Using Chebyshev's Inequality:

$P(|M_n - f| \ge 0.01) \le \dfrac{V(M_n)}{(0.01)^2} = \dfrac{f(1-f)/n}{(0.01)^2}$

 We want $\dfrac{f(1-f)/n}{(0.01)^2} < 0.05$

 Choose the sample size n such that $\dfrac{f(1-f)/n}{(0.01)^2} < 0.05$

 But we don't know f.

 In fact, we are trying to find / estimate f.

 $f \in [0, 1]$ & $f(1-f) \le \tfrac{1}{4}$

 Therefore $\dfrac{f(1-f)/n}{(0.01)^2} \le \dfrac{1}{4n(0.01)^2}$

 Now $\dfrac{1}{4n(0.01)^2} \le 0.05$ if $n \ge 50000$

 So the minimum sample size is n = 50000 !!!!
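
As a sanity check, the Chebyshev-based sample size can be computed directly. A short sketch in plain Python (names are illustrative):

```python
import math

# Chebyshev-based sample size: need (1/4) / (n * err^2) <= alpha,
# using the worst case f(1 - f) <= 1/4.
err, alpha = 0.01, 0.05
n = math.ceil(1.0 / (4 * alpha * err ** 2))
print(n)                               # 50000
```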


Central Limit Theorem

 $X_1, X_2, X_3, \ldots, X_n$ are i.i.d., where $X_j \sim F(\mu, \sigma^2) \;\forall\; j = 1, 2, \ldots, n$

 $E(X_j) = \mu$ & $V(X_j) = \sigma^2 \;\forall\; j = 1, 2, \ldots, n$

 $S_n = X_1 + X_2 + X_3 + \cdots + X_n$

 $E(S_n) = n\mu$ & $V(S_n) = n\sigma^2$

 Standardize: $Z_n = \dfrac{S_n - E(S_n)}{\sqrt{V(S_n)}} = \dfrac{S_n - n\mu}{\sigma\sqrt{n}} = \dfrac{M_n - \mu}{\sigma/\sqrt{n}}$
 $E(Z_n) = 0$ & $V(Z_n) = 1$

 Let Z be a standard normal random variable, i.e. $Z \sim N(0, 1)$

 The Central Limit Theorem states that:

For every c, $P(Z_n \le c) \to P(Z \le c)$ as n becomes large enough. (Convergence in distribution)

$P(Z \le c)$ is the standard normal CDF, $\Phi(c)$, available from the normal tables.
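
A minimal simulation sketch of this convergence, assuming NumPy and SciPy are available (the exponential distribution is an arbitrary non-normal choice):

```python
import numpy as np
from scipy.stats import norm

# Compare the empirical CDF of Z_n with the standard normal CDF Phi(c).
rng = np.random.default_rng(3)
n, trials = 100, 100_000
mu, sigma = 1.0, 1.0                      # Exponential(1) has mu = sigma = 1

s_n = rng.exponential(scale=1.0, size=(trials, n)).sum(axis=1)
z_n = (s_n - n * mu) / (sigma * np.sqrt(n))   # standardized sums

for c in [-1.0, 0.0, 1.0, 2.0]:
    print(f"c={c}: P(Z_n<=c)={(z_n <= c).mean():.4f}, Phi(c)={norm.cdf(c):.4f}")
```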
Election problem – (with CLT)

 $E(M_n) = f$ & $V(M_n) = \dfrac{f(1-f)}{n}$

 Now we want $P(|M_n - f| \ge 0.01) \le 0.05$

 Equivalently, $P\left(\left|\dfrac{M_n - f}{\sqrt{f(1-f)/n}}\right| \ge \dfrac{0.01}{\sqrt{f(1-f)/n}}\right) \le 0.05$

 By the CLT the standardized $M_n$ is approximately standard normal, so we need $P\left(|Z| \ge \dfrac{0.01}{\sqrt{f(1-f)/n}}\right) \le 0.05$
 $f \in [0, 1]$ & $f(1-f) \le \tfrac{1}{4} \;\Rightarrow\; \sqrt{f(1-f)} \le \tfrac{1}{2}$

 $\dfrac{0.01}{\sqrt{f(1-f)/n}} \ge \dfrac{0.01\sqrt{n}}{1/2} = 0.02\sqrt{n}$

 $P\left(|Z| \ge \dfrac{0.01}{\sqrt{f(1-f)/n}}\right) \le P(|Z| \ge 0.02\sqrt{n})$
 If n = 10000, $P(|Z| \ge 0.02\sqrt{n}) = P(|Z| \ge 2) < 0.05$

 In fact $P(|Z| \ge 0.02\sqrt{n}) = 0.05$ implies $0.02\sqrt{n} = 1.96$, i.e. n = 9604.

 And 9604 << 50000 !!!!!
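
The same arithmetic as a sketch, assuming SciPy for the normal quantile:

```python
import math
from scipy.stats import norm

# CLT-based sample size: need 0.02 * sqrt(n) >= z_{alpha/2}, i.e.
# n >= (z_{alpha/2} / (2 * err))^2 in the worst case f(1-f) = 1/4.
err, alpha = 0.01, 0.05
z = norm.ppf(1 - alpha / 2)            # approximately 1.96
n = math.ceil((z / (2 * err)) ** 2)
print(n)                               # 9604
```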


ESTIMATION
Estimation - categories

 Point Estimation

 Interval estimation
Point Estimation

 The Statistic

 The desirable properties of an estimator:

i. Unbiasedness
ii. Consistency
Interval Estimation

• Interval Estimation: an inferential statistical procedure used to estimate population parameters from sample data through the building of confidence intervals

• Confidence Intervals: a range of values computed from sample data that has a known probability of capturing some population parameter of interest
The Problem Statement

 Find the average height of Indian males.

 But India has 72 crore males! It's impossible to measure everyone.

 Remedy: collect a sample, say $S = \{X_1, X_2, X_3, \ldots, X_n\}$

 Each sample observation is a realisation of a random variable. Why?

 Since the sample is drawn randomly from the population, the $X_i$'s are i.i.d.

The Assumption

 Let us assume that the heights of Indian males are normally distributed with mean $\mu$ and variance $\sigma^2$.

 Our aim is to find (estimate) $\mu$ and $\sigma^2$.

 These are called "population parameters".

Sample Statistics

 Sample mean $\bar{X} = \dfrac{1}{n} \sum_{i=1}^{n} X_i$

 Sample variance $s^2 = \dfrac{1}{n-1} \sum_{i=1}^{n} (X_i - \bar{X})^2$

 I would want to use the sample statistics to "estimate" the population parameters.

 Let's see how.

Point Estimation

 Just take the sample mean ($\bar{X}$) and sample variance ($s^2$) as proxies for the population mean ($\mu$) and population variance ($\sigma^2$), respectively.

 $\bar{X}$ and $s^2$ are called point estimators of $\mu$ and $\sigma^2$ respectively.

 But are they "good" proxies?

The "Good" Properties

The good / desirable properties of a point estimator are:

i. Unbiasedness:
 $E(\bar{X}) = \mu$
 $E(s^2) = \sigma^2$

ii. Consistency:
 $\lim_{n \to \infty} V(\bar{X}) = 0$
 $\lim_{n \to \infty} V(s^2) = 0$
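
The unbiasedness of $s^2$ in particular is worth seeing numerically: dividing by n − 1 rather than n is what makes $E(s^2) = \sigma^2$. A brief sketch, assuming NumPy (all constants are illustrative):

```python
import numpy as np

# Average many sample variances: ddof=1 (divide by n-1) centers on
# sigma^2 = 4.0, while ddof=0 (divide by n) is biased low by (n-1)/n.
rng = np.random.default_rng(4)
n, trials, sigma2 = 10, 200_000, 4.0

samples = rng.normal(loc=170.0, scale=np.sqrt(sigma2), size=(trials, n))
print("mean of s^2 (ddof=1):", samples.var(axis=1, ddof=1).mean())  # ~4.0
print("mean with   ddof=0 :", samples.var(axis=1, ddof=0).mean())   # ~3.6
```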
Interval estimation of mean

 Sample observations $S = \{X_1, X_2, X_3, \ldots, X_n\}$

 $X_i \sim N(\mu, \sigma^2)$

 $\bar{X} = \dfrac{1}{n} \sum_{i=1}^{n} X_i$

 $\bar{X} \sim N\left(\mu, \dfrac{\sigma^2}{n}\right)$

 $Z = \dfrac{\bar{X} - \mu}{\sigma/\sqrt{n}} \sim N(0, 1)$
The Standard Normal Distribution
The Interval

 $P\big(|Z| < Z_{\alpha/2}\big) = 1 - \alpha$

 $P\big(-Z_{\alpha/2} < Z < Z_{\alpha/2}\big) = 1 - \alpha$

 $P\left(-Z_{\alpha/2} < \dfrac{\bar{X} - \mu}{\sigma/\sqrt{n}} < Z_{\alpha/2}\right) = 1 - \alpha$

 $P\left(\mu \in \left(\bar{X} - \dfrac{\sigma}{\sqrt{n}}\, Z_{\alpha/2},\; \bar{X} + \dfrac{\sigma}{\sqrt{n}}\, Z_{\alpha/2}\right)\right) = 1 - \alpha$

 This is the interval within which $\mu$ lies with probability $1 - \alpha$.

 We can compute $\bar{X}$ from the sample; n is the sample size; $Z_{\alpha/2}$ can be computed from the standard normal table. So if we know $\sigma$, we are done!!
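
A sketch of the computation, assuming SciPy (the data values and the "known" σ are illustrative):

```python
import numpy as np
from scipy.stats import norm

# 95% confidence interval for mu when sigma is known.
heights = np.array([168.0, 172.5, 165.2, 170.1, 174.3, 169.8])  # toy sample
sigma, alpha = 6.0, 0.05                  # sigma assumed known here

xbar = heights.mean()
half = norm.ppf(1 - alpha / 2) * sigma / np.sqrt(len(heights))
print(f"mu lies in ({xbar - half:.2f}, {xbar + half:.2f}) with 95% confidence")
```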
We may not know σ !!
The t-statistic

 $\dfrac{(n-1)\,s^2}{\sigma^2} \sim \chi^2_{(n-1)}$

 Define a statistic $t = \dfrac{\bar{X} - \mu}{s/\sqrt{n}} = \dfrac{(\bar{X} - \mu)\,/\,(\sigma/\sqrt{n})}{s/\sigma} = \dfrac{Z}{\sqrt{\dfrac{(n-1)\,s^2}{\sigma^2} \cdot \dfrac{1}{n-1}}} = \dfrac{Z}{\sqrt{\chi^2_{(n-1)}/(n-1)}} \;\sim\; t_{n-1}$

 i.e. the Student's t-distribution with n − 1 degrees of freedom.

Student's t-distribution
The Interval

 $P\big(|t| < t_{n-1}^{(\alpha/2)}\big) = 1 - \alpha$

 $P\big(-t_{n-1}^{(\alpha/2)} < t < t_{n-1}^{(\alpha/2)}\big) = 1 - \alpha$

 $P\left(-t_{n-1}^{(\alpha/2)} < \dfrac{\bar{X} - \mu}{s/\sqrt{n}} < t_{n-1}^{(\alpha/2)}\right) = 1 - \alpha$

 $P\left(\mu \in \left(\bar{X} - \dfrac{s}{\sqrt{n}}\, t_{n-1}^{(\alpha/2)},\; \bar{X} + \dfrac{s}{\sqrt{n}}\, t_{n-1}^{(\alpha/2)}\right)\right) = 1 - \alpha$

 This is the interval within which $\mu$ lies with probability $1 - \alpha$.

 We can compute $\bar{X}$ and $s$ from the sample; n is the sample size; $t_{n-1}^{(\alpha/2)}$ can be computed from the t-distribution table.
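
The corresponding computation when σ is unknown, again a sketch assuming SciPy (same toy data):

```python
import numpy as np
from scipy.stats import t

# 95% confidence interval for mu when sigma is unknown (t-interval).
heights = np.array([168.0, 172.5, 165.2, 170.1, 174.3, 169.8])  # toy sample
alpha, n = 0.05, 6

xbar, s = heights.mean(), heights.std(ddof=1)   # sample mean and sample sd
half = t.ppf(1 - alpha / 2, df=n - 1) * s / np.sqrt(n)
print(f"mu lies in ({xbar - half:.2f}, {xbar + half:.2f}) with 95% confidence")
```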
Interval estimation of Variance

 $\dfrac{(n-1)\,s^2}{\sigma^2} \sim \chi^2_{(n-1)}$

 $P\left(\chi^2_{n-1,\,(1-\alpha/2)} < \dfrac{(n-1)\,s^2}{\sigma^2} < \chi^2_{n-1,\,(\alpha/2)}\right) = 1 - \alpha$

 $P\left(\sigma^2 \in \left(\dfrac{(n-1)\,s^2}{\chi^2_{n-1,\,(\alpha/2)}},\; \dfrac{(n-1)\,s^2}{\chi^2_{n-1,\,(1-\alpha/2)}}\right)\right) = 1 - \alpha$

 Given a sample we can compute $s^2$, and we know n (the sample size). We can compute $\chi^2_{n-1,\,(\alpha/2)}$ and $\chi^2_{n-1,\,(1-\alpha/2)}$ from the chi-square distribution table, where the second subscript denotes the upper-tail probability of the quantile.

 Thus we have found the interval such that $\sigma^2$ lies in that interval with probability $1 - \alpha$.
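
Finally, a sketch of the variance interval, assuming SciPy (toy data as before):

```python
import numpy as np
from scipy.stats import chi2

# 95% confidence interval for sigma^2 via the chi-square pivot.
heights = np.array([168.0, 172.5, 165.2, 170.1, 174.3, 169.8])  # toy sample
alpha, n = 0.05, 6

s2 = heights.var(ddof=1)
lo = (n - 1) * s2 / chi2.ppf(1 - alpha / 2, df=n - 1)  # divide by upper quantile
hi = (n - 1) * s2 / chi2.ppf(alpha / 2, df=n - 1)      # divide by lower quantile
print(f"sigma^2 lies in ({lo:.2f}, {hi:.2f}) with 95% confidence")
```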
