0% found this document useful (0 votes)

2 views

MIT18_05S14_class18slides

Uploaded by

mail2vinaykk

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

2 views

MIT18_05S14_class18slides

Uploaded by

mail2vinaykk

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 22

Null Hypothesis Signiﬁcance Testing

p-values, signiﬁcance level, power, t-tests

18.05 Spring 2014

January 1, 2017 1 /22

Understand this ﬁgure
f (x|H0 )

x
reject H0 don’t reject H0 reject H0

x = test statistic

f (x|H0 ) = pdf of null distribution = green curve

Rejection region is a portion of the x-axis.

Signiﬁcance = probability over the rejection region = red area.

January 1, 2017 2 /22
Simple and composite hypotheses

Simple hypothesis: the sampling distribution is fully speciﬁed.

Usually the parameter of interest has a speciﬁc value.

Composite hypotheses: the sampling distribution is not fully

speciﬁed. Usually the parameter of interest has a range of values.

Example. A coin has probability θ of heads. Toss it 30 times and let

x be the number of heads.
(i) H: θ = 0.4 is simple. x ∼ binomial(30, 0.4).
(ii) H: θ > 0.4 is composite. x ∼ binomial(30, θ) depends on which
value of θ is chosen.

January 1, 2017 3 /22

Extreme data and p-values
Hypotheses: H0 , HA .

Test statistic: value: x, random variable X .

Null distribution: f (x|H0 ) (assumes the null hypothesis is true)

Sides: HA determines if the rejection region is one or two-sided.

Rejection region/Signiﬁcance: P(x in rejection region | H0 ) = α.

The p-value is a computational tool to check if the test statistic is in

the rejection region. It is also a measure of the evidence for rejecting
H0 .
p-value: P(data at least as extreme as x | H0 )

Data at least as extreme: Determined by the sided-ness of the

rejection region.
January 1, 2017 4 /22
Extreme data and p-values
Example. Suppose we have the right-sided rejection region shown
below. Also suppose we see data with test statistic x = 4.2. Should
we reject H0 ?
f (x|H0 )

x
cα 4.2
don’t reject H0 reject H0

answer: The test statistic is in the rejection region, so reject H0 .

Alternatively: blue area < red area

Signiﬁcance: α = P(x in rejection region | H0 ) = red area.
p-value: p = P(data at least as extreme as x | H0 ) = blue area.
Since, p < α we reject H0 .
January 1, 2017 5 /22
Extreme data and p-values
Example. Now suppose x = 2.1 as shown. Should we reject H0 ?
f (x|H0 )

x
2.1 cα
don’t reject H0 reject H0

answer: The test statistic is not in the rejection region, so don’t

reject H0 .

Alternatively: blue area > red area

Signiﬁcance: α = P(x in rejection region | H0 ) = red area.
p-value: p = P(data at least as extreme as x | H0 ) = blue area.
Since, p > α we don’t reject H0 .

January 1, 2017 6 /22

Critical values

Critical values:

The boundary of the rejection region are called critical values.

Critical values are labeled by the probability to their right.

They are complementary to quantiles: c0.1 = q0.9

Example: for a standard normal c0.025 = 1.96 and c0.975 = −1.96.

In R, for a standard normal c0.025 = qnorm(0.975).

January 1, 2017 7 /22

Two-sided p-values
These are trickier: what does ‘at least as extreme’ mean in this case?
Remember the p-value is a trick for deciding if the test statistic is in
the region.
If the signiﬁcance (rejection) probability is split evenly between the
left and right tails then

p = 2min(left tail prob. of x, right tail prob. of x)

f (x|H0 )

x
c1−α/2 x cα/2
reject H0 don’t reject H0 reject H0

x is outside the rejection region, so p > α: do not reject H0

January 1, 2017 8 /22
Concept question
1. You collect data from an experiment and do a left-sided z-test
with signiﬁcance 0.1. You ﬁnd the z-value is 1.8
(i) Which of the following computes the critical value for the
rejection region.
(a) pnorm(0.1, 0, 1) (b) pnorm(0.9, 0, 1)
(c) pnorm(0.95, 0, 1) (d) pnorm(1.8, 0, 1)
(e) 1 - pnorm(1.8, 0, 1) (f) qnorm(0.05, 0, 1)
(g) qnorm(0.1, 0, 1) (h) qnorm(0.9, 0, 1)
(i) qnorm(0.95, 0, 1)

(ii) Which of the above computes the p-value for this experiment.

(iii) Should you reject the null hypothesis.

(a) Yes (b) No

January 1, 2017 9 /22

Error, signiﬁcance level and power
True state of nature
H0 HA
Our Reject H0 Type I error correct decision
decision Don’t reject H0 correct decision Type II error

Signiﬁcance level = P(type I error)

= probability we incorrectly reject H0
= P(test statistic in rejection region | H0 )
= P(false positive)
Power = probability we correctly reject H0
= P(test statistic in rejection region | HA )
= 1 − P(type II error)
= P(true positive)
• HA determines the power of the test.
• Significance and power are both probabilities of the rejection region.
• Want significance level near 0 and power near 1.
January 1, 2017 10 /22
Table question: significance level and power

The rejection region is boxed in red. The corresponding probabilities

for diﬀerent hypotheses are shaded below it.
x 0 1 2 3 4 5 6 7 8 9 10
H0 : p(x|θ = 0.5) .001 .010 .044 .117 .205 .246 .205 .117 .044 .010 .001
HA : p(x|θ = 0.6) .000 .002 .011 .042 .111 .201 .251 .215 .121 .040 .006
HA : p(x|θ = 0.7) .000 .0001 .001 .009 .037 .103 .200 .267 .233 .121 .028

1. Find the signiﬁcance level of the test.

2. Find the power of the test for each of the two alternative
hypotheses.

January 1, 2017 11 /22

Concept question

1. The power of the test in the graph is given by the area of

f (x|HA ) f (x|H0 )
R3
R2
R1 R4

x
reject H0 region . non-reject H0 region

(a) R1 (b) R2 (c) R1 + R2 (d) R1 + R2 + R3

January 1, 2017 12 /22

Concept question
2. Which test has higher power?

f (x|HA ) f (x|H0 )

x
reject H0 region . do not reject H0 region

f (x|HA ) f (x|H0 )

x
reject H0 region . do not reject H0 region

(a) Top graph (b) Bottom graph

January 1, 2017 13 /22
Discussion question

The null distribution for test statistic x is N(4, 82 ). The rejection

region is {x ≥ 20}.

What is the signiﬁcance level and power of this test?

January 1, 2017 14 /22

One-sample t-test
Data: we assume normal data with both µ and σ unknown:
x1 , x2 , . . . , xn ∼ N(µ, σ 2 ).
Null hypothesis: µ = µ0 for some speciﬁc value µ0 .
Test statistic:
x − µ0
t= √
s/ n
where n
2 1 n
s = (xi − x)2 .
n − 1 i=1
Here t is the Studentized mean and s 2 is the sample variance.
Null distribution: f (t | H0 ) is the pdf of T ∼ t(n − 1),
the t distribution with n − 1 degrees of freedom.
Two-sided p-value: p = P(|T | > |t|).
R command: pt(x,n-1) is the cdf of t(n − 1).
http://mathlets.org/mathlets/t-distribution/
January 1, 2017 15 /22
Board question: z and one-sample t-test

For both problems use signiﬁcance level α = 0.05.

Assume the data 2, 4, 4, 10 is drawn from a N(µ, σ 2 ).

Suppose H0 : µ = 0; HA : µ = 0.

1. Is the test one or two-sided? If one-sided, which side?

2. Assume σ 2 = 16 is known and test H0 against HA .

3. Now assume σ 2 is unknown and test H0 against HA .

January 1, 2017 16 /22

Two-sample t-test: equal variances
Data: we assume normal data with µx , µy and (same) σ unknown:
x1 , . . . , xn ∼ N(µx , σ 2 ), y1 , . . . , ym ∼ N(µy , σ 2 )

Null hypothesis H0 : µx = µy .
(n − 1)sx2 + (m − 1)sy2 1 1
Pooled variance: sp2 = + .
n+m−2 n m
x̄ − ȳ
Test statistic: t=
sp
Null distribution: f (t | H0 ) is the pdf of T ∼ t(n + m − 2)

In general (so we can compute power) we have

(x̄ − ȳ ) − (µx − µy )
∼ t(n + m − 2)
sp

Note: there are more general formulas for unequal variances.

January 1, 2017 17 /22
Board question: two-sample t-test

Real data from 1408 women admitted to a maternity hospital for (i)
medical reasons or through (ii) unbooked emergency admission. The
duration of pregnancy is measured in complete weeks from the
beginning of the last menstrual period.
Medical: 775 obs. with x̄ = 39.08 and s 2 = 7.77.
Emergency: 633 obs. with x̄ = 39.60 and s 2 = 4.95

1. Set up and run a two-sample t-test to investigate whether the

duration diﬀers for the two groups.
2. What assumptions did you make?

January 1, 2017 18 /22

Table discussion: Type I errors Q1

1. Suppose a journal will only publish results that are statistically

signiﬁcant at the 0.05 level. What percentage of the papers it
publishes contain type I errors?

answer: With the information given we can’t know this. The

percentage could be anywhere from 0 to 100! –See the next
two questions.

January 1, 2017 19 /22

Table discussion: Type I errors Q2

2. Jerry desperately wants to cure diseases but he is terrible at

designing effective treatments. He is however a careful scientist and
statistician, so he randomly divides his patients into control and
treatment groups. The control group gets a placebo and the
treatment group gets the experimental treatment. His null hypothesis
H0 is that the treatment is no better than the placebo. He uses a
significance level of α = 0.05. If his p-value is less than α he publishes
a paper claiming the treatment is significantly better than a placebo.
(a) Since his treatments are never, in fact, effective what percentage
of his experiments result in published papers?
(b) What percentage of his published papers contain type I errors,
i.e. describe treatments that are no better than placebo?

January 1, 2017 20 /22

Table discussions: Type I errors: Q3

3. Efrat is a genius at designing treatments, so all of her proposed

treatments are eﬀective. She’s also a careful scientist and statistician
so she too runs double-blind, placebo controlled, randomized studies.
Her null hypothesis is always that the new treatment is no better than
the placebo. She also uses a signiﬁcance level of α = 0.05 and
publishes a paper if p < α.
(a) How could you determine what percentage of her experiments
result in publications?
(b) What percentage of her published papers contain type I errors,
i.e. describe treatments that are no better than placebo?

January 1, 2017 21 /22

MIT OpenCourseWare
https://ocw.mit.edu

18.05 Introduction to Probability and Statistics

Spring 2014

For information about citing these materials or our Terms of Use, visit: https://ocw.mit.edu/terms .

Reference Card (PDQ Sheet) : Week 1: Statistical Inference For One and Two Popula-Tion Variances
No ratings yet
Reference Card (PDQ Sheet) : Week 1: Statistical Inference For One and Two Popula-Tion Variances
50 pages
MIT18_05S14_class18_slides
No ratings yet
MIT18_05S14_class18_slides
27 pages
MIT18 05S14 Class17 Slides
No ratings yet
MIT18 05S14 Class17 Slides
25 pages
Hypothesis Test
No ratings yet
Hypothesis Test
49 pages
Chapter 9 Worksheet
No ratings yet
Chapter 9 Worksheet
18 pages
I3 TD3 (Tests of Hypotheses Based On A Single Sample)
No ratings yet
I3 TD3 (Tests of Hypotheses Based On A Single Sample)
8 pages
6 Hypothesis Testing
No ratings yet
6 Hypothesis Testing
22 pages
Bab 5 Fundamentals of Hypothesis
No ratings yet
Bab 5 Fundamentals of Hypothesis
55 pages
Hypothesis Testing
No ratings yet
Hypothesis Testing
33 pages
9 Testing of Hypothesis
No ratings yet
9 Testing of Hypothesis
12 pages
G. Hypothesis Testing-1
No ratings yet
G. Hypothesis Testing-1
21 pages
CH-8 Hypothesis Testing
No ratings yet
CH-8 Hypothesis Testing
37 pages
IM1017 Topic 06 Hypthesis testing
No ratings yet
IM1017 Topic 06 Hypthesis testing
34 pages
EC-512EC512 LecNotes Pt2
No ratings yet
EC-512EC512 LecNotes Pt2
29 pages
Hypothesis Testing
100% (1)
Hypothesis Testing
36 pages
Basics of Hypothesis Testing: October 19
No ratings yet
Basics of Hypothesis Testing: October 19
36 pages
Mit6 041F10 L25 PDF
No ratings yet
Mit6 041F10 L25 PDF
3 pages
Hypothesis Testing
No ratings yet
Hypothesis Testing
51 pages
Week - 10
No ratings yet
Week - 10
34 pages
One-Sample Hypothesis Tests
No ratings yet
One-Sample Hypothesis Tests
47 pages
Fundamentals of Hypothesis Testing: Dr. K. M. Salah Uddin
No ratings yet
Fundamentals of Hypothesis Testing: Dr. K. M. Salah Uddin
59 pages
The P - Value Method of Hypothesis Testing
No ratings yet
The P - Value Method of Hypothesis Testing
3 pages
Hypothesis - Testing-1
No ratings yet
Hypothesis - Testing-1
51 pages
Chapter 7: Hypothesis Testing With One Sample
No ratings yet
Chapter 7: Hypothesis Testing With One Sample
6 pages
CVE 303 - 6. Hypothesis Test
No ratings yet
CVE 303 - 6. Hypothesis Test
44 pages
Mat 326 Chapter 10 Fall 2024
No ratings yet
Mat 326 Chapter 10 Fall 2024
10 pages
Basics of Hypothesis Testing
No ratings yet
Basics of Hypothesis Testing
36 pages
7_Hypothesis testing-
No ratings yet
7_Hypothesis testing-
49 pages
(8 One-And Two-Sample Test of Hypothesis) : 324 Stat Lecture Notes
No ratings yet
(8 One-And Two-Sample Test of Hypothesis) : 324 Stat Lecture Notes
29 pages
Basics of Hypothesis Testing
No ratings yet
Basics of Hypothesis Testing
36 pages
Hypothesis Testing
No ratings yet
Hypothesis Testing
63 pages
Lecture 12
No ratings yet
Lecture 12
15 pages
22 Hypothesis 2
No ratings yet
22 Hypothesis 2
36 pages
Hypothesis Testing
No ratings yet
Hypothesis Testing
8 pages
Hypothesis Testing
No ratings yet
Hypothesis Testing
36 pages
Elements of A Test of Hypothesis
No ratings yet
Elements of A Test of Hypothesis
5 pages
Hypothesis Testing
No ratings yet
Hypothesis Testing
4 pages
Gerstman PP09
No ratings yet
Gerstman PP09
36 pages
Lecture
No ratings yet
Lecture
3 pages
statistics-cheat-sheet-formulas-and-steps
No ratings yet
statistics-cheat-sheet-formulas-and-steps
19 pages
Lecture 3 of Computational Statistics
No ratings yet
Lecture 3 of Computational Statistics
32 pages
Basics of Hypothesis Testing
No ratings yet
Basics of Hypothesis Testing
42 pages
C 5 A
No ratings yet
C 5 A
127 pages
Session 12 - Hypothesis Testing-Single Sample Tests
No ratings yet
Session 12 - Hypothesis Testing-Single Sample Tests
56 pages
Comparing Classical and Bayesian Approaches To Hypothesis Testing
No ratings yet
Comparing Classical and Bayesian Approaches To Hypothesis Testing
27 pages
Week 11
No ratings yet
Week 11
2 pages
MR Hypothesis Testing 1
No ratings yet
MR Hypothesis Testing 1
20 pages
DAO Cheatsheeet
No ratings yet
DAO Cheatsheeet
3 pages
Hypothesis Tests
0% (1)
Hypothesis Tests
11 pages
Flowchart 2
No ratings yet
Flowchart 2
7 pages
Lecture7-Hypothesis Testing and applications -Slides-annotated
No ratings yet
Lecture7-Hypothesis Testing and applications -Slides-annotated
28 pages
Hypothesis Testing For One Population
No ratings yet
Hypothesis Testing For One Population
57 pages
webMATH236_Lecture6
No ratings yet
webMATH236_Lecture6
60 pages
Test of Hypothesis: One-Sample Tests
No ratings yet
Test of Hypothesis: One-Sample Tests
51 pages
Statistics: Fundamentals of Hypothesis Testing: One-Sample Tests
No ratings yet
Statistics: Fundamentals of Hypothesis Testing: One-Sample Tests
35 pages
6 RM - Basics of Testing of Hypothesis
No ratings yet
6 RM - Basics of Testing of Hypothesis
33 pages
HW 3
No ratings yet
HW 3
1 page
Hypothesis testing 112
No ratings yet
Hypothesis testing 112
46 pages
CH05 Cheat Sheet
No ratings yet
CH05 Cheat Sheet
5 pages
A-level Maths Revision: Cheeky Revision Shortcuts
From Everand
A-level Maths Revision: Cheeky Revision Shortcuts
Scool Revision
3.5/5 (8)
new_watermark_1_1__10d6a7cff52151b75a73d371e8f669c7 (1)
No ratings yet
new_watermark_1_1__10d6a7cff52151b75a73d371e8f669c7 (1)
41 pages
Eapp Examples
100% (1)
Eapp Examples
2 pages
Northern Provincial Council Elections: Pre-Election Survey Results
No ratings yet
Northern Provincial Council Elections: Pre-Election Survey Results
3 pages
Inference and Confidence Intervals
No ratings yet
Inference and Confidence Intervals
35 pages
Activity: Parameter Quantitative Research Qualitative Research
No ratings yet
Activity: Parameter Quantitative Research Qualitative Research
2 pages
บทความภาษาอังกฤษ
No ratings yet
บทความภาษาอังกฤษ
9 pages
Common Statistical Tests Are Linear Models
No ratings yet
Common Statistical Tests Are Linear Models
1 page
EMEn_Chapter0_2425 (2)
No ratings yet
EMEn_Chapter0_2425 (2)
10 pages
Hypothesis
No ratings yet
Hypothesis
71 pages
The Gmat Exam, Now With Integrated Reasoning
No ratings yet
The Gmat Exam, Now With Integrated Reasoning
2 pages
Math-105 d1 Voting Rights Kinds of Voting Methods
No ratings yet
Math-105 d1 Voting Rights Kinds of Voting Methods
3 pages
04 Activity Chi Square
No ratings yet
04 Activity Chi Square
2 pages
UM04CBBA04 - 09 - Statistics For Management II
No ratings yet
UM04CBBA04 - 09 - Statistics For Management II
2 pages
A1 - Answer Sheet Toefl Test (T-Iii)
No ratings yet
A1 - Answer Sheet Toefl Test (T-Iii)
3 pages
Jee Answer Key Session 2
No ratings yet
Jee Answer Key Session 2
38 pages
Chapter 1
No ratings yet
Chapter 1
3 pages
Quantum XLExample
No ratings yet
Quantum XLExample
83 pages
CB2200 Course Outline
No ratings yet
CB2200 Course Outline
6 pages
The Scientific Method: Adrian Bil R. Palacio
No ratings yet
The Scientific Method: Adrian Bil R. Palacio
16 pages
Qualitative Research Design
No ratings yet
Qualitative Research Design
20 pages
Jabnoun 2002
No ratings yet
Jabnoun 2002
9 pages
Nonparametric Test
No ratings yet
Nonparametric Test
75 pages
STAT&PROB
No ratings yet
STAT&PROB
5 pages
QNT 561 Final Exam - QNT 561 Week 1 Practice Quiz 45 Questions - UOP Students
No ratings yet
QNT 561 Final Exam - QNT 561 Week 1 Practice Quiz 45 Questions - UOP Students
36 pages
Real Statistics Examples Part 1A
No ratings yet
Real Statistics Examples Part 1A
853 pages
Downloadfile 41 PDF
No ratings yet
Downloadfile 41 PDF
17 pages
Hypothesis Testing
No ratings yet
Hypothesis Testing
40 pages
Laporan Analisis Cluster - Tessa Putri Denia - 2010212054
No ratings yet
Laporan Analisis Cluster - Tessa Putri Denia - 2010212054
5 pages
Characteristics of A Good Research Intrument According To The Teachers College, Columbia University
No ratings yet
Characteristics of A Good Research Intrument According To The Teachers College, Columbia University
9 pages
CAIA
No ratings yet
CAIA
1 page