Week 12-1 Pre

The document outlines the concepts of confidence intervals and hypothesis testing in statistical theory. It covers the construction of confidence intervals for both known and unknown variances, as well as the elements and setup of hypothesis testing, including null and alternative hypotheses, test statistics, and types of errors. Additionally, it discusses significance levels and their implications in hypothesis testing.

STA255H1S: Statistical Theory

Week 12-1: Confidence Intervals and Hypothesis Testing

Yaoming Zhen

Department of Statistical Sciences


University of Toronto

March 25, 2025

1 / 30
Roadmap

1. Confidence Interval

2. Concepts of Hypothesis testing

2 / 30
Roadmap

1. Confidence Interval

3 / 30
Normal Confidence Intervals for the Mean

To begin with, we assume that we have normally distributed data: X1, X2, ..., Xn ∼ N(µ, σ²). Our first objective is to construct a confidence interval for the mean µ. We will have two different scenarios: when σ² is known, and when σ² is unknown.

4 / 30
Critical Values

To develop confidence intervals when the variance is known, we need quantiles of the standard normal distribution, called critical values.

5 / 30
Critical Values

The critical value zp of a N(0, 1) distribution is the number that has right tail probability p. It is defined by P(Z ≥ zp) = p, where zp is the (1 − p)-th quantile of the standard normal distribution.

For example, P(Z ≥ 1.96) = 0.025, so the critical value is z0.025 = 1.96.

By the symmetry of the standard normal density, P(Z ≤ −zp) = P(Z ≥ zp) = p, so P(Z ≥ −zp) = 1 − p and therefore z1−p = −zp. For example, z0.975 = −z0.025 = −1.96.

6 / 30
α-level

7 / 30
Confidence Intervals for µ: σ² known

Let X1, X2, ..., Xn ∼ N(µ, σ²) where σ² is known and we wish to construct a confidence interval for µ.

The estimator for the unknown parameter µ is X̄, and it has distribution N(µ, σ²/n).

Use (X̄ − µ)/(σ/√n) ∼ N(0, 1) to construct the 1 − α confidence interval

[X̄ − (σ/√n) zα/2 , X̄ + (σ/√n) zα/2 ]

8 / 30
Confidence Intervals for µ: σ² known

Example
Suppose we collect 100 data points from a N(µ, 32) distribution
and the sample mean is x̄ = 12. Give the 95% confidence
interval for µ.

9 / 30
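The interval from the example above can be checked numerically. A minimal Python sketch (not part of the slides), using only the standard library:

```python
from statistics import NormalDist

def normal_ci(xbar, sigma2, n, alpha):
    """1 - alpha confidence interval for mu when sigma^2 is known:
    xbar -/+ z_{alpha/2} * sigma / sqrt(n)."""
    z = NormalDist().inv_cdf(1 - alpha / 2)  # z_{alpha/2}; about 1.96 for alpha = 0.05
    half = z * sigma2 ** 0.5 / n ** 0.5      # half-width of the interval
    return (xbar - half, xbar + half)

# Slide example: 100 points from N(mu, 32), sample mean 12, 95% confidence.
lo, hi = normal_ci(xbar=12, sigma2=32, n=100, alpha=0.05)
print(lo, hi)  # roughly (10.89, 13.11)
```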
Confidence Intervals for µ: σ² unknown

When we do not know σ, we cannot use

(X̄ − µ)/(σ/√n).

Rather, we need an estimate of σ in order to construct the confidence interval for µ. In this situation we use the sample standard deviation S, and the fact that

(X̄ − µ)/(S/√n) ∼ T(n − 1)

10 / 30
t-distribution
Let X ∼ N(0, 1) and W ∼ χ²(m). If X is independent of W, then X/√(W/m) ∼ T(m).

11 / 30
Critical Values

We now need critical values for the t-distribution. Since it is symmetric, we use the same logic as we did for the normal critical values.

The critical value tm,p of a t-distribution with m degrees of freedom is the number that has right tail probability p. It is defined by P(T ≥ tm,p) = p, where tm,p is the (1 − p)-th quantile of the t-distribution with m degrees of freedom.

Because the t-distribution is symmetric, tm,1−p = −tm,p.

12 / 30
Confidence Intervals for µ: σ² unknown

Let X1, X2, ..., Xn ∼ N(µ, σ²), where σ² is unknown and we wish to construct a confidence interval for µ.

Use (X̄ − µ)/(S/√n) ∼ T(n − 1) to construct the 1 − α confidence interval

[x̄ − (s/√n) tn−1,α/2 , x̄ + (s/√n) tn−1,α/2 ]

13 / 30
Confidence Intervals for µ: σ² unknown

Example
Suppose the data 2.5, 5.5, 8.5, 11.5 were drawn from a N(µ, σ²)
distribution with µ and σ both unknown. Give interval estimates
for µ by finding the 95%, 80% and 50% confidence intervals.
(t3,0.025 = 3.1824, t3,0.05 = 2.3533, t3,0.1 = 1.6377,
t3,0.25 = 0.7649 )

14 / 30
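The example above can be computed directly. The sketch below (not part of the slides) takes the critical values from the table given on the slide, since the Python standard library has no t quantile function:

```python
def t_ci(data, t_crit):
    """1 - alpha confidence interval for mu when sigma^2 is unknown:
    xbar -/+ t_{n-1, alpha/2} * s / sqrt(n); t_crit comes from a t-table."""
    n = len(data)
    xbar = sum(data) / n
    s2 = sum((x - xbar) ** 2 for x in data) / (n - 1)  # sample variance S^2
    half = t_crit * s2 ** 0.5 / n ** 0.5
    return (xbar - half, xbar + half)

data = [2.5, 5.5, 8.5, 11.5]      # xbar = 7, s = sqrt(15)
ci95 = t_ci(data, t_crit=3.1824)  # t_{3, 0.025}
ci80 = t_ci(data, t_crit=1.6377)  # t_{3, 0.10}
ci50 = t_ci(data, t_crit=0.7649)  # t_{3, 0.25}
print(ci95)  # roughly (0.84, 13.16)
```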
Confidence Intervals for µ when σ² unknown: big n
By the central limit theorem (combined with the fact that S is a consistent estimator of σ), the standardized sample mean tends to the normal distribution as n → ∞. In this case,

(X̄ − µ)/(S/√n) →d Z ∼ N(0, 1)

That is, if n is large enough,

P(−zα/2 < (X̄ − µ)/(S/√n) < zα/2) ≈ 1 − α

So, the interval

[x̄ − (s/√n) zα/2 , x̄ + (s/√n) zα/2 ]

approximates a 100(1 − α)% confidence interval for µ.


15 / 30
Confidence Intervals for Non-normal distribution

Let X ∼ Bin(n, p). By the central limit theorem, for large n,

(X − np)/√(np(1 − p))

is approximately N(0, 1).

Therefore,

P(−zα/2 < (X − np)/√(np(1 − p)) < zα/2) ≈ 1 − α

16 / 30
Confidence Intervals for Binomial p
The confidence interval is characterized by

((X − np)/√(np(1 − p)))² < (zα/2)²

This is a quadratic inequality in p; solving it for p gives the endpoints of the interval.
17 / 30
Confidence Intervals for Binomial p

Example
Of a series of 100 i.i.d. chemical experiments, 70 were
concluded successfully. Construct an approximate 90% CI for
the success probability of this type of experiment. (z0.05 = 1.64)

18 / 30
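The quadratic characterization above can be solved directly for the two endpoints. A sketch (not part of the slides): expanding (X − np)² < z²np(1 − p) gives p²(n² + z²n) − p(2nX + z²n) + X² < 0, whose roots are the interval endpoints.

```python
def binom_ci(x, n, z):
    """Endpoints of the approximate CI for p, from the quadratic inequality
    (x - n p)^2 < z^2 * n * p * (1 - p)."""
    a = n ** 2 + z ** 2 * n        # coefficient of p^2
    b = -(2 * n * x + z ** 2 * n)  # coefficient of p
    c = x ** 2                     # constant term
    disc = (b ** 2 - 4 * a * c) ** 0.5
    return ((-b - disc) / (2 * a), (-b + disc) / (2 * a))

# Slide example: 70 successes in 100 trials, 90% CI with z_{0.05} = 1.64.
lo, hi = binom_ci(x=70, n=100, z=1.64)
print(lo, hi)  # roughly (0.62, 0.77)
```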
Sample Size

The width of the interval can be expressed as w = 2(σ/√n)zα/2. Given a target width w, we can find the smallest sample size that achieves it:

2zα/2 (σ/√n) ≤ w

and solve for n:

n ≥ (2zα/2 σ/w)²

This tells us how many samples we need to achieve a particular confidence interval width.

19 / 30
Sample Size

Example
What sample size do we need if we want to be 99% confident
that the sample mean age of undergraduate U of T students is
within 2 years of the population mean age? Suppose we are
given that the standard deviation in ages is 4 years.
(z0.005 = 2.58)

20 / 30
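The sample-size example can be checked with a short sketch (not part of the slides). Note "within 2 years" means a half-width of 2, i.e. a total width w = 4:

```python
from math import ceil

def sample_size(sigma, width, z):
    """Smallest n with 2 * z * sigma / sqrt(n) <= width,
    i.e. n >= (2 * z * sigma / width) ** 2, rounded up."""
    return ceil((2 * z * sigma / width) ** 2)

# Slide example: sigma = 4, half-width 2 (so width 4), 99% confidence, z_{0.005} = 2.58.
n = sample_size(sigma=4, width=4, z=2.58)
print(n)  # 27, since (2 * 2.58 * 4 / 4)^2 = 26.6256
```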
One Sided Intervals

For one-sided intervals, we have the following:

L(X) is a lower confidence bound when U(X) = ∞

U(X) is an upper confidence bound when L(X) = −∞

21 / 30
One-Sided CI: σ² unknown

If we are interested in a lower bound, for example, we would like to say that we are 95% confident that the mean exceeds a certain value. In that case we have:

P((X̄ − µ)/(S/√n) < tn−1,α) = 1 − α

which gives us:

(x̄ − tn−1,α (s/√n), ∞)

Note we do not divide α by 2, and the upper limit of the interval is ∞.

22 / 30
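A one-sided lower bound can be computed the same way as the two-sided interval, with the full α in one tail. As an illustration, the sketch below reuses the data from the earlier four-point example with t_{3, 0.05} = 2.3533 (an assumed application, not worked in the slides):

```python
def lower_conf_bound(data, t_crit):
    """One-sided 1 - alpha interval (xbar - t_{n-1, alpha} * s / sqrt(n), infinity);
    note t_crit uses the full alpha, not alpha / 2."""
    n = len(data)
    xbar = sum(data) / n
    s = (sum((x - xbar) ** 2 for x in data) / (n - 1)) ** 0.5
    return xbar - t_crit * s / n ** 0.5

lb = lower_conf_bound([2.5, 5.5, 8.5, 11.5], t_crit=2.3533)  # t_{3, 0.05}
print(lb)  # roughly 2.44, so the 95% interval is (2.44, infinity)
```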
Roadmap

2. Concepts of Hypothesis testing

23 / 30
Hypothesis Testing

We now turn to hypothesis testing, whereby we form a statement about a parameter θ and then perform a statistical test to assess whether the statement is consistent with the data.

Hypothesis testing is similar to the scientific method: a scientist formulates a theory and then tests this theory against observation.

In statistics we pose a theory concerning one or more population parameters (e.g. that they equal specified values), we then sample the population and compare our observations with our posed theory. If the observations disagree with the theory, then we reject it.

24 / 30
Null and Alternative Hypotheses
Elements of hypothesis testing:

H0 : the null hypothesis. This is the default assumption for the model generating the data.

HA or H1 : the alternative hypothesis, which complements the null hypothesis.

Test statistic: computed from the data.

Null distribution: the probability distribution of the test statistic assuming H0 .

Rejection region (critical region): if X is in the rejection region, we reject H0 in favor of HA .
25 / 30
Hypothesis Testing

A statistical hypothesis is a statement concerning an unknown parameter of the population distribution f(x|θ), x ∈ R and θ ∈ Θ.

The statistical hypothesis is a statement about θ, and the testing aims to assess its correctness.

A one-sided hypothesis test takes the form H0 : θ ≤ θ0 versus H1 : θ > θ0 , or H0 : θ ≥ θ0 versus H1 : θ < θ0 .

A two-sided hypothesis test takes the form H0 : |θ| ≤ θ0 versus H1 : |θ| > θ0 .

26 / 30
Test Statistic

Test Statistic
For a dataset modeled as a realization of random variables X1 , ..., Xn , a test statistic is any sample statistic T = h(X1 , ..., Xn ) whose numerical value is used to decide whether or not to reject H0 .

The distributions that go with these statistics are always conditioned on the null hypothesis. That is, we will compute likelihoods such as f(z|H0 ).

27 / 30
Hypothesis Test Setup

Example
Suppose a mayoral candidate of Toronto claims she will get
more than 50% of the votes in an election, and thereby be the
winner. We do not believe this claim, so we would like to test
the candidate’s claim as a hypothesis test. Set up the elements
of a statistical test (hypothesis test):
null hypothesis
alternative hypothesis
test statistic
rejection/critical region

28 / 30
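One possible way to fill in the elements of the test above, sketched in Python under assumed numbers (a poll of n = 1000 voters and α = 0.05, neither of which is given in the slides): take H0 : p = 0.5, H1 : p > 0.5, use the standardized sample proportion as the test statistic, and reject when it exceeds zα.

```python
from statistics import NormalDist

N, ALPHA = 1000, 0.05                      # assumed poll size and significance level
Z_ALPHA = NormalDist().inv_cdf(1 - ALPHA)  # z_0.05, about 1.645

def reject_h0(x):
    """Reject H0: p = 0.5 in favor of H1: p > 0.5 when
    Z = (p_hat - 0.5) / sqrt(0.25 / N) exceeds z_alpha (normal approximation)."""
    p_hat = x / N
    z = (p_hat - 0.5) / (0.25 / N) ** 0.5
    return z > Z_ALPHA

# 527 of 1000 in favor: z is about 1.71 > 1.645, so reject H0;
# 520 of 1000 in favor: z is about 1.26, so fail to reject H0.
```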
Type I and II Errors

Type I and Type II errors

A test of the null hypothesis H0 against an alternative hypothesis H1 leads to either a correct decision or one of the following errors:

Type I error = rejecting H0 when it is true

Type II error = failing to reject H0 when it is false

              Fail to reject H0    Reject H0
H0 is true    correct decision     type I error
H0 is false   type II error        correct decision

29 / 30
Significance

Significance Level:

α = P(reject H0 |H0 ) = P(type I error)

In testing a statistical hypothesis, a significance level α with 0 ≤ α ≤ 1 is the largest acceptable probability of committing a type I error.

Usually the significance level is chosen to be α ≤ 0.05, but a more conservative significance level is α ≤ 0.01.

30 / 30
