Module 33. Hypothesis Testing Basics
Module 33. Hypothesis Testing Basics
UNCLASSIFIED / FOUO
National Guard
Black Belt Training
Module 33
UNCLASSIFIED / FOUO
UNCLASSIFIED / FOUO
UNCLASSIFIED / FOUO
ACTIVITIES TOOLS
• Value Stream Analysis
• Identify Potential Root Causes • Process Constraint ID
• Reduce List of Potential Root • Takt Time Analysis
Causes • Cause and Effect Analysis
• Brainstorming
• Confirm Root Cause to Output
• 5 Whys
Relationship
• Affinity Diagram
• Estimate Impact of Root Causes • Pareto
on Key Outputs • Cause and Effect Matrix
• FMEA
• Prioritize Root Causes
• Hypothesis Tests
• Complete Analyze Tollgate • ANOVA
• Chi Square
• Simple and Multiple
Regression
Note: Activities and tools vary by project. Lists provided here are not necessarily all-inclusive. UNCLASSIFIED / FOUO
UNCLASSIFIED / FOUO
Learning Objectives
Review the terms “Parameters” and “Statistics” as
they relate to Populations and Samples.
Introduce Confidence Intervals for expressing the
uncertainty when predicting a population parameter
using a sample statistic, and how to calculate CI’s
for some common situations for different sample
sizes.
Show how the Central Limit Theorem and the
Standard Error of the Mean applies to the use of
Confidence Intervals and Tests
Application Examples
Transactional – A Black Belt has just finished a pilot of a new
process for handling blanket Purchase Orders and wants to
know if it has a statistically significant: a) shorter cycle time
and b) increased accuracy over the old process.
Administrative – The manager of an AAFES1 order entry
department wants to compare two order entry procedures to
see if one is faster than the other.
Service – Medical diagnostic imaging services are provided
from two different medical treatment facilities to a central
hospital which wants to know if there are differences in the
quality of service, particularly: a) the number of lost records
and re-takes, and b) average waiting time for MRIs and X-rays.
1AAFES, Army and Air Force Exchange System
Hypothesis Testing - Basic UNCLASSIFIED / FOUO 5
UNCLASSIFIED / FOUO
Population Sample
All U.S. registered voters 10,000 people are asked
who they will vote for
President
All sufferers of a certain 3,000 people are given a
disease that might be new treatment in a clinical
given the new treatment study
All appraisals completed 25 appraisals chosen at
that month random from a given
month
Population = Sample =
Term
Parameter Statistic
Count of items N n
m
x
Mean
~ ~
Median m x
Standard Dev. s S
m x
Estimators =
s s
x2 , s2
x3 , s3
m, s x4 , s4
Population Parameters; Sample Statistics; Mean, x-bar,
Mean, m (mu), and Standard Deviation, s (sigma) and Standard Deviation, s
Then:
The distribution of x X X 1 X 2 x
3 X n
n
has mean and standard
deviation given by: s Standard Error
mX m and sX of the Mean
n
Variability of Means
Sample statistics estimate population parameters by inference:
s, n ), we can estimate population
For a given sample ( x,
parameters of m s by inference.
As the sample size increases we are more confident that our
sample statistic is a more valid estimator of the population
parameter.
n=5
ssxx
n=3
sx n
n=1
UNCLASSIFIED / FOUO
National Guard
Black Belt Training
Confidence Intervals
UNCLASSIFIED / FOUO
UNCLASSIFIED / FOUO
UNCLASSIFIED / FOUO
Exercise
Let’s look at a population that has a normal distribution with:
known mean value = 65
standard deviation = 4
(This has been generated in dataset Confidence.mtw)
Each member in the class will randomly sample 25 data points from
this population. (In Minitab, use Calc>Random Data>Sample from
Columns.)
Sample 25 rows of data from C1 and store the results in C2.
Use graphical descriptive statistics to calculate the 95% confidence
interval for the mean and sigma based on your sample of 25 data
points. Do they include the mean, 65, and the sigma, 4?
Based on a class size of 25, we would expect 1 confidence interval to
not contain 65 for the mean, and 1 that does not include 4 for sigma.
.025 .025
Standard Error
m1 - 1.96(3.0) m1 m1+ 1.96(3.0) of the Mean
We are 95% confident that the long-run mean, m , lies in this interval.
Hypothesis Testing - Basic UNCLASSIFIED / FOUO 20
Confidence Interval for the Mean (m) with
UNCLASSIFIED / FOUO
An Example of a t-Distribution
0.4
t-distribution
0.3 (n = 5)
frequency
0.2
Area =
0.025
0.1
0.0
-3 -2 -1 0 1 2 2.78 3
t
* For a 95% CI, = .05. Therefore, for a two tail distribution: /2= .05/2= .025
s
Luckily, we don’t
x t α/2, n1
have to worry about
n
the details of how to
calculate the t-value. 1.008
54.860 2.09
Minitab takes care of 20
that for us.
54.389, 55.331
4. Click on OK
54 55 56 57
3rd Q uartile
Maximum
55.850
57.100
95% Confidence Interval
95% C onfidence Interv al for Mean for Mean (m:
54.388 55.332
54.388 55.332
95% C onfidence Interv al for Median
54.100 55.582
95% Confidence Intervals 95% C onfidence Interv al for StDev 95% Confidence Interval
Mean
0.767 1.472
for Standard Deviation (s:
0.767 1.472
Median
UNCLASSIFIED / FOUO
National Guard
Black Belt Training
Hypothesis Testing
UNCLASSIFIED / FOUO
UNCLASSIFIED / FOUO
UNCLASSIFIED / FOUO
Distribution of gas
consumption for this
car
12.8 %
s = 3.46
Hypothesis Testing
Hypothesis Testing:
Allows us to determine statistically whether or not a
value is cause for alarm (or is simply due to random
variation)
Tells us whether or not two sets of data are different
Tells us whether or not a statistical parameter
(mean, standard deviation, etc.) is statistically
different from a test value of interest
Allows us to assess the “strength” of our conclusion
(our probability of being correct or wrong)
Hypothesis Testing - Basic UNCLASSIFIED / FOUO 40
UNCLASSIFIED / FOUO
Sample D
m
Population
Mean
Hypothesis Testing - Basic UNCLASSIFIED / FOUO 42
UNCLASSIFIED / FOUO
m
Likely value of x-bar
x with a large sample
size
x
Hypothesis Testing - Basic UNCLASSIFIED / FOUO 44
UNCLASSIFIED / FOUO
Standard Deviation
What effect would a lot of variation in the population
have on our estimate of the population mean from a
sample?
How would this affect our ability to draw conclusions
about the mean?
What if there is very little variation in the population?
Population
If we want to have
high confidence in our
conclusions, we must
relax the range in
which we say the true
m mean lies
As we tighten our
estimate of the mean,
our risk of being wrong
x
increases. Thus, our
confidence decreases. x
Hypothesis Testing - Basic UNCLASSIFIED / FOUO 48
UNCLASSIFIED / FOUO
(n)
Variation Difference
Hypothesis Testing - Basic UNCLASSIFIED / FOUO 49
UNCLASSIFIED / FOUO
An Example
A unit has several quick response forces, QRF. Some forces have over
700 members, with at least 300 on the site at any time.
By regulation, all forces must have a quick response plan, the critical
first phase of which is required to be completed in 10 minutes (600
seconds) or less.
There are two teams that are vying for “most responsive.” They have
taken somewhat different approaches to implementing their quick
response plans and management wants to know which approach is
better: Team 1 or Team 2
Each one has 100 data points for actual responses and drills (Minitab
file Response.mtw)
Median
Median
Example
The average cycle time for Team 1 is 599.55
seconds.
The average cycle time for Team 2 is 600.23
seconds.
The target cycle time for Phase 1 response is 600
seconds.
Is the difference between the two average cycle times
statistically significant?
Example (Cont.)
The unit wants to determine if the true averages of
the two teams are really different.
The unit thinks that the 600.23 average of team 2 is
little too high, so there is a need to determine if the
data indicates that the true average is really not equal
to the target of 600 seconds.
The unit will use hypothesis testing to answer these
questions.
Example
The first hypothesis test to be performed is to determine
whether there is a statistically significant difference between
the means of the two teams. This is called a 2-Sample t
Test.
The real question is whether or not the means are different
enough to indicate that the approaches taken by the two
teams really are centered differently, or are they close
enough that the difference could simply be a result of random
variation?
After that, hypothesis testing can tell us if there is evidence
indicating whether or not each team’s average is different
from the target of 600 seconds.
First, we need to introduce some terminology.
Hypothesis Testing - Basic UNCLASSIFIED / FOUO 59
UNCLASSIFIED / FOUO
H a : m1 m 2 0
H o : m1 600
H a : m1 600
604
603
602
601
Data
600
599
598
597
596
Team 1 Team 2
604
603
602
601
Data
600
599
598
597
596
Team 1 Team 2
We will cover
p-values in more
detail a little later
Assumptions
The Hypothesis Tests we have discussed make certain
assumptions:
Independence between and within samples
Random samples
Normally distributed data
Unknown Variance
In our example, we did not assume equal variances.
This is the safe choice. However, if we had reason to
believe equal variances, then we could have checked
the “Assume equal variances” box in the dialogue box.
Conclusion Drawn
Accept Ho Reject Ho
Type I
Ho True Correct Error
The -Risk)
True
State Type II Error
Correct
Ho False -Risk)
Exercise
Draw the Type I & II error matrix for airport security.
Do you think the security system at most airports
places more emphasis on the alpha or beta risk?
The p-Value
If we reject the null hypothesis, the p-value is the
probability of being wrong.
In other words, if we reject the null hypothesis, the p-
value is the probability of making a Type I error.
It is the critical alpha value at which the null
hypothesis is rejected.
If we don’t want alpha to be more than 0.05, then we
simply reject the null hypothesis when the p-value is
0.05 or less.
As we will learn later, it isn’t always this simple.
Hypothesis Testing - Basic UNCLASSIFIED / FOUO 81
UNCLASSIFIED / FOUO
UNCLASSIFIED / FOUO
National Guard
Black Belt Training
Power, Delta and Sample
Size
UNCLASSIFIED / FOUO
UNCLASSIFIED / FOUO
UNCLASSIFIED / FOUO
Power 1
Minitab will calculate beta for us for a given sample size, but
first let’s show it graphically….
Alpha Risk
m1
Here is our first population and its corresponding alpha risk.
Hypothesis Testing - Basic UNCLASSIFIED / FOUO 84
UNCLASSIFIED / FOUO
m1 m2
D
We want to compare these two populations. Do you think that
we will easily be able to determine if they are different?
Hypothesis Testing - Basic UNCLASSIFIED / FOUO 85
UNCLASSIFIED / FOUO
Beta Risk
m1 m2
D
If our sample from population 2 is in this grey area, we will
not be able to see the difference. This is called Beta Risk.
m1 m2
D
Hypothesis Testing - Basic UNCLASSIFIED / FOUO 87
UNCLASSIFIED / FOUO
Beta Risk
m1 m2
D
Hypothesis Testing - Basic UNCLASSIFIED / FOUO 88
UNCLASSIFIED / FOUO
Beta Risk
95% Confidence Limit
(alpha = .05) for mean, m1
(critical value)
m1 m2
D
Hypothesis Testing - Basic UNCLASSIFIED / FOUO 89
UNCLASSIFIED / FOUO
Example:
Power
Let’s use Minitab to determine the beta risk of the hypothesis
test we performed on the two teams.
First, we’ll have to make some assumptions.
We don’t know the TRUE difference in the means, so we’ll assume that
it’s 0.682, the differences in the sample averages.
A variance hypothesis test shows that the variances are not equal.
We will average the variances from Minitab to determine the combined
variance using the following formula:
2 2
s1 s 2
s
2
Example:
Power (Cont.)
Select; Stat>
Power and Sample Size>
2-Sample t...
Example:
Power (Cont.)
To calculate Power, we need three things;
1. Sample Size
2. The Difference between the two Means
3. The Average Standard Deviation of the two samples
We can get all this information from our 2-Sample t-Test conducted earlier:
Example:
Power (Cont.)
To Calculate Average Standard Deviation
Example:
Power (Cont.)
3. Type in Average
Standard Deviation
of 1.393 here
4. Click on OK
Example:
Power (Cont.)
Example:
Power (Cont.)
In practice, we evaluate the power of a test to
determine its ability to detect a difference of a given
magnitude that we deem important, or practically
significant.
For example, we could calculate the power of a
hypothesis test to see if we could measure a one
minute difference in responsiveness between the two
teams.
Example:
Power (Cont.)
Let’s say that if the two support organizations’ cycle
times differ by as little as 0.4 seconds, then we need
to analyze the reasons for the differences.
What is the power of our test to detect this
difference?
What is the probability of making a type II error
(concluding that there is no difference when one
exists)?
Use Minitab to individually answer these questions.
Exercise:
Sample Size
Now that we understand the relationship between
Beta, Power, Delta, and Sample Size, we can use this
information to calculate the sample size necessary to
give us the information we want.
We simply use the same function in Minitab to solve
for sample size rather than power.
This is a very useful and common extension of
Hypothesis Testing.
Exercise:
Sample Size (Cont.)
Here we enter the
Difference (delta) we wish
to detect, and the minimum
Power value that we are
willing to live with.
We leave Sample sizes
blank.
Exercise:
Sample Size
Let’s extend our response team cycle time example
Determine what sample size we would need to detect
a difference of 0.4 seconds at a power of 0.90.
What about at a power of 0.95?
What about at a power of 0.95 and an alpha of
0.025?
Hint: Click the Options button in the Power and
Sample Size dialogue box.
H o : m1 600
H a : m1 600
Select Histogram
of data and
Boxplot of data
10
0 _
X
-5 Ho
598.0 598.5 599.0 599.5 600.0 600.5 601.0
Team 1
_
X
Ho
12.5
10.0
Frequency
7.5
5.0
2.5
0.0 _
X
Ho
Boxplot of Team 2
(with Ho and 95% t-confidence interval for the mean)
_
X
Ho
596 597 598 599 600 601 602 603 604 605
Team 2
UNCLASSIFIED / FOUO
UNCLASSIFIED / FOUO
0
Part/No Part
Part
No Part
Analysis of Variance for Net Hour Because the p-value <=
Source DF SS MS F P
Part/No 1 7421 7421 8.65 0.004 0.05, we can be confident
Error 69 59194 858 that calls requiring parts
Total 70 66615 do have an impact on the
Individual 95% CI's For Mean
Level N Mean StDev --+---------+---------+---------+----
ticket cycle time.
No Part 27 21.99 19.95 (--------*---------)
Part 44 43.05 33.70 (------*------)
--+---------+---------+---------+----
Pooled StDev = 29.29 12 24 36 48
45
40
- Example -
35
10 15 20 25 30 35
Deliveries
Takeaways
Since it is not always practical or possible to measure
every item in the population, you take a random
sample.
A basic understanding of the terms: Population,
Sample, Population Parameter, Sample Statistic,
Sample Mean, and Sample Standard Deviation
How to calculate a confidence interval with the
population standard deviation known
How to calculate a confidence interval with the
population standard deviation unknown
Takeaways (Cont.)
How Hypothesis tests help us handle uncertainty
The role of sample size, variation, and confidence
level
The null and alternative hypotheses
Type I and Type II errors
Hypothesis tests in Minitab
Stat Guide
p-value
Takeaways (Cont.)
How to conduct a 1-way and 2-way t-test
How to conduct a Variance test (see Appendix)
How to conduct a Paired t-test (see Appendix)
Understanding of 1-way and 2-way test of proportions
(see Appendix)
Understanding the relationship between Power and
sample size and detectable difference (delta)
UNCLASSIFIED / FOUO
UNCLASSIFIED / FOUO
References
Hildebrand and Ott, Statistical Thinking for Managers,
4th Edition
Kiemele, Schmidt, and Berdine, Basic Statistics,
4th Edition
UNCLASSIFIED / FOUO
National Guard
Black Belt Training
APPENDIX
UNCLASSIFIED / FOUO
UNCLASSIFIED / FOUO
UNCLASSIFIED / FOUO
UNCLASSIFIED / FOUO
UNCLASSIFIED / FOUO
Variable Testing equality of population MEANS (averages) from more than two populations 3
Testing equality of population VARIANCES (standard deviation) from more than two
Variable
populations 4
Attribute - Binomial
"Go/No-Go" Testing equality of population PROPORTIONS (binomial data; e.g., pass/fail, go/no
"Pass/Fail" or go, is/is not, etc.) from one or more populations 5
"Defective" Data
Attribute - Poisson
Testing equality of population PROPORTIONS (Poisson data; i.e., frequency of
"Count" or
occurence in time or space) from two or more populations 6
"Defects" data
UNCLASSIFIED / FOUO
UNCLASSIFIED / FOUO
Decision Tree # 1
Testing Equality of Population Mean
Application:
to a Specific Value
Type of Data: Variable (Continuous)
Has the average button diameter from the welder
Example:
changed from its historical value?
Start
Is
Yes
1-Sample Z-test
n > 30? Stat > Basic Statistics > 1-Sample Z
No
Is
population 1-Sample Wilcoxon test
normally random sample from a continuous,
No
distributed? symmetric population
(Anderson- Stat > Nonparametrics > 1-Sample Wilcoxon
Darling)
Yes
1-Sample t-test
(reasonably robust
against normality assumption)
Stat > Basic Statistics > 1-Sample t
UNCLASSIFIED / FOUO
UNCLASSIFIED / FOUO
Decision Tree # 2
Testing Equality of Means
Application:
from Two Populations
Type of Data: Variable (Continuous)
Is the average button diameter from Welder A
Example:
different from that of Welder B?
Start
No
Do
n1 and n2 Yes
2-Sample Z-te st
both exceed Stat > Basic Statistics > 2-Sample t
30?
No
2-Sample M ann Whitne y
(independent, random variables from two
Are both populations with same shape, same variance)
populations Stat > Nonparametrics > Mann-Whitney
normally No
distributed?
(Anderson- Note: If the two populations have different shapes
Darling) or different standard deviations, then use:
2-Sample t-test
without pooling variances
Yes
Equal
2-Sample t-te st
Variances? No without pooling variances
(F-test) Stat > Basic Statistics > 2-Sample t
(Do not assume equal variances)
Yes
2-Sample t-te st
w ith pooled variances
(reasonably robust against normality
assumption)
Stat > Basic Statistics > 2-Sample t
Assume equal variances
Example2.vsd 6-1-00
UNCLASSIFIED / FOUO
UNCLASSIFIED / FOUO
Decision Tree # 3
Testing Equality of Means from
Application:
More than Two Populations
Type of Data: Variable (Continuous)
Do the average button diameters from
Example:
Welders A, B and C differ from one another?
Start
Tukey's test
Yes to conduct pairwise comparisons
Stat > ANOVA > One-way
Comparisons: Tukeys
Do samples
Mood's Median test
contain outliers? Yes (independant, random samples f rom continuous
(Box Plot) distributions hav ing same shape)
Stat > Nonparametrics > Mood's Median test
No
Example3.v sd 6-1-00
UNCLASSIFIED / FOUO
UNCLASSIFIED / FOUO
Decision Tree # 4
Application: Testing Equality of Variances
Start
Are the
populations
How many
populations are 2
normally
No
Levene's test
being compared? distributed? Stat > Basic Statistics > 2 Variances
(Anderson-
Darling)
Yes
More than 2
F-test
Stat > Basic Statistics >2 Variances
Are the
populations
normally
No
Levene's test
distributed? Stat > ANOVA > Test for Equal Variances
(Anderson-
Darling)
Yes
UNCLASSIFIED / FOUO
Example4.v sd 6-1-00
UNCLASSIFIED / FOUO
Decision Tree # 5
Testing Equality of Population
Application:
Proportions
Type of Data: Attribute (Discrete) - Binomial Distribution
Case 2:
Case 1:
Te sting Equality of
Te sting Population Proportion
Proportions from Two
Against a Spe cific Value
Populations
Example: Has the % defective rate
on Line 1 changed Example: Are Lines 1 and 2
from its historical value? running at the same
% defective rate?
Case 3:
Te sting Equality of
Proportions from M ore than
Two Populations
Example: Are Lines 1, 2 and 3
running at the same
% defective rate?
Decision Tree # 6
Testing Equality of Population
Application:
Defect Rates
Type of Data: Attribute (Discrete) - Poisson Distribution
Stat >Use
Basic2Stats
Sample t-test
> 2-sample Poisson Rate Use One-Way Analysis of Variance
Stat > Basic Stat > 2 Sample t Stat > ANOVA > One-way
Caution
No Extreme Outliers
UNCLASSIFIED / FOUO
UNCLASSIFIED / FOUO
Decision Tree # 7
Chi-square test
Minitab:
Stat > Tables > Chi-square test
UNCLASSIFIED / FOUO
UNCLASSIFIED / FOUO
Group 1:
Hypothesis Tests for Variation
Use the Minitab electronic docs, stat guide, and help
to learn about performing hypothesis tests for equality
of variance among two populations. You may also use
your textbooks if you wish.
Prepare a 10-15 minute teachback on hypothesis tests
for variation.
Be sure to work an example in your teachback.
Hint: Using Minitab to conduct a hypothesis test to
determine if there is a difference in the amount of
variation exhibited by each support organization would
be a good example to use.
Group 2:
Paired t-Test
Use the Minitab electronic docs, stat guide, and help
to learn about performing paired t-tests. You may
also use your textbooks if you wish.
Prepare a 10-15 minute teachback on paired t-tests.
Be sure to illustrate the difference between a
standard 2-way t-test and a paired t-test.
Be sure to work an example in your teachback.
Go through a sample size calculation in your example.
Group 3:
Hypothesis Tests with Proportions
Use the Minitab electronic docs, stat guide, and help to
learn about performing hypothesis tests with proportions.
You may also use your text books if you wish.
Prepare a 10-15 minute teachback on hypothesis tests with
proportions.
Be sure to illustrate the main difference between
hypothesis tests of proportions and the other hypothesis
tests we have talked about.
Include both 1-way and 2-way proportion hypothesis tests.
Be sure to work an example in your teachback.
Go through a sample size calculation in your example.
σ s
x Z α/2 x t α/2,n1
n n
Standard Deviation
n1 n1
s σs
α/2,
2
n1 12α/2, n1
Proportions (Approximate)
pˆ1 pˆ pˆ1 pˆ
pˆ Z / 2 p pˆ Z / 2
n n
Hypothesis Testing - Basic UNCLASSIFIED / FOUO 135
UNCLASSIFIED / FOUO
0.60 0.2257 0.2291 0.2324 0.2357 0.2389 0.2422 0.2454 0.2486 0.2517 0.2549
0.70 0.2580 0.2611 0.2642 0.2673 0.2704 0.2734 0.2764 0.2794 0.2823 0.2852
0.80 0.2881 0.2910 0.2939 0.2967 0.2995 0.3023 0.3051 0.3078 0.3106 0.3133
0.60 0.3159 0.3186 0.3212 0.3238 0.3264 0.3289 0.3315 0.3340 0.3365 0.3389
0 z 1.00 0.3413 0.3438 0.3461 0.3485 0.3508 0.3531 0.3554 0.3577 0.3599 0.3621
1.10 0.3643 0.3665 0.3686 0.3708 0.3729 0.3749 0.3770 0.3790 0.3810 0.3830
1.20 0.3849 0.3869 0.3888 0.3907 0.3925 0.3944 0.3962 0.3980 0.3997 0.4015
1.30 0.4032 0.4049 0.4066 0.4082 0.4099 0.4115 0.4131 0.4147 0.4162 0.4177
1.40 0.4192 0.4207 0.4222 0.4236 0.4251 0.4265 0.4279 0.4292 0.4306 0.4319
1.50 0.4332 0.4345 0.4357 0.4370 0.4382 0.4394 0.4406 0.4418 0.4429 0.4441
1.60 0.4452 0.4463 0.4474 0.4484 0.4495 0.4505 0.4515 0.4525 0.4535 0.4545
1.70 0.4554 0.4564 0.4573 0.4582 0.4591 0.4599 0.4608 0.4616 0.4625 0.4633
1.80 0.4641 0.4649 0.4656 0.4664 0.4671 0.4678 0.4686 0.4693 0.4699 0.4706
1.90 0.4713 0.4719 0.4726 0.4732 0.4738 0.4744 0.4750 0.4756 0.4761 0.4767
2.00 0.4772 0.4778 0.4783 0.4788 0.4793 0.4798 0.4803 0.4808 0.4812 0.4817
2.10 0.4821 0.4826 0.4830 0.4834 0.4838 0.4842 0.4846 0.4850 0.4854 0.4857
2.20 0.4861 0.4864 0.4868 0.4871 0.4875 0.4878 0.4881 0.4884 0.4887 0.4890
2.30 0.4893 0.4896 0.4898 0.4901 0.4904 0.4906 0.4909 0.4911 0.4913 0.4916
2.40 0.4918 0.4920 0.4922 0.4925 0.4927 0.4929 0.4931 0.4932 0.4934 0.4936
2.50 0.4938 0.4940 0.4941 0.4943 0.4945 0.4946 0.4948 0.4949 0.4951 0.4952
2.60 0.4953 0.4955 0.4956 0.4957 0.4959 0.4960 0.4961 0.4962 0.4963 0.4964 z area
2.70 0.4965 0.4966 0.4967 0.4968 0.4969 0.4970 0.4971 0.4972 0.4973 0.4974 3.5 0.49976737
2.80 0.4974 0.4975 0.4976 0.4977 0.4977 0.4978 0.4979 0.4979 0.4980 0.4981 4.0 0.49996833
2.90 0.4981 0.4982 0.4982 0.4983 0.4981 0.4984 0.4985 0.4985 0.4986 0.4986 4.5 0.49999660
3.00 0.4987 0.4987 0.4987 0.4988 0.4988 0.4989 0.4989 0.4989 0.4990 0.4990 5.0 0.49999971
Source: Statistical Thinking for Managers, Hildebrand and Ott, 4th Edition, page 800.
Hypothesis Testing - Basic UNCLASSIFIED / FOUO 136
UNCLASSIFIED / FOUO
1 1
sX sp
1 X 2
n1 n2
where
2 2
(n1 1)s 1 (n 2 1)s 2
sp
n1 n 2 2