Unit 09 - Testing of Hypothesis in Case of Large and Small Samples
Unit 09 - Testing of Hypothesis in Case of Large and Small Samples
9.1 Introduction
In the previous unit, estimation, we have studied about the estimation of the
parameter from the samples and the methods of estimation. In this unit,
Testing of hypothesis, we will study about hypothesis and the testing of
hypothesis. Estimation is about estimating the parameters and finding out
9.1.2 Assumptions
Although hypothesis testing sounds like some formal statistical term and
completely unrelated to business decision making, in fact, managers
propose and test hypothesis all the time. For example, “if we drop the price
of this car model by Rs.1,500, we will sell 50,000 cars this year” is a
hypothesis. To test this hypothesis, total car sales till the end of the year
have to be counted.
Managerial hypothesis are based on intuition; the marketplace decides
whether the manager’s intuitions were correct. Hypothesis testing is about
making inferences about a population from only a small sample. The bottom
line in hypothesis testing is when we ask ourselves (and then decide)
whether a population, like this one, would be likely to produce a sample like
the one we are looking at.
Example 1
We want to test the hypothesis, that the population mean is equal to 500.
We would symbolise it as follows and read it as,
The null hypothesis is that the population mean = 500 which is written as,
0 : 500
Example 3
If we want to test if the attribute of educational qualification has any
influence on the income of an individual, we make null hypothesis as:
0 : Educational qualification has no influence on the income of an
individual
and alternative hypothesis is
1 : Educational qualification has an influence on the income of the
individual
Type I error
The combinations are:
If null hypothesis is true, and the test result make us to accept it, then
we have made a right decision.
If null hypothesis is true, and the test result make us to reject it, then we
have made a wrong decision (Type I error). It is also known as
consumer’s risk, denoted by .
If hypothesis is false, and the test results make us to accept it, then we
have made a wrong decision (Type II error). It is known as producer’s
risk, denoted by .1 – is called power of the test.
If hypothesis is false, and the test result make us to reject it – we have
made a right decision.
Manipal University Jaipur Page No. 364
Statistics for Management Unit 9
Table 9.2: Conditions for Using the Normal and ‘t’ Distributions in
Testing Hypothesis about Means
When the Population When the Population
Standard Deviation is Standard Deviation is
known not known
Sample size ‘n’ is larger Normal distribution, Normal distribution,
than 30. z–table z–table
Sample size ‘n’ is 30 or Normal distribution, ‘t’ distribution, ‘t’ table
less and we assume the z–table
population is normal or
approximately so.
One more rule has to be kept in mind, when testing the hypothesised values
of a mean. As in estimation, use the finite population multiplier whenever the
population is finite in size, sampling is done without replacement, and the
sample is more than five percent of the population.
A left-tailed test is one of two kinds of one-tailed tests. The other kind of
one-tailed test is a right-tailed test (or an upper-tailed test). An upper-tailed
test is used when the hypothesis is H1: > o. Only values of the sample
mean that are significantly above the hypothesised population mean will
cause us to reject the null hypothesis in favour of the alternative hypothesis.
Figure 9.3 depicts an upper-tailed test where the rejection region is in the
upper tail of the distribution of the sample mean.
Tests for proportion and other parameters are similarly discussed; rejection
regions are similarly identified with reference to the given level of
significance and appropriate distribution.
In each example of hypothesis testing, when we accept a null hypothesis on
the basis of sample information, we are really saying that there is no
statistical evidence to reject it. We are not saying that the null hypothesis is
true. The only way to prove a null hypothesis is to know the exact value of
the population parameter or the population distribution and that is not
possible with just sampling. Thus, we accept the null hypothesis and behave
as if it is true simply because we can find no evidence to reject it.
Example 4
The hypothesis to be tested is Ho: = 100, against the alternative
hypothesis H1: 100, with sample size n = 20, population standard
deviation σ =2.5. Here sample size is smaller than 30 but population
standard deviation is given; hence to test the hypothesis the probability
distribution used is ‘normal distribution’.
Example 5
The hypothesis to be tested is Ho: = 10 against the alternative
hypothesis H1: > 10, with sample size n = 20, population standard
deviation is not known. Here sample size is smaller than 20 but
population standard deviation is not given, hence to test the hypothesis
the probability distribution used is ‘t- distribution’.
difference in
p1 = first sample proportion
( p1 p 2 )
proportions of Z
4 Test between
proportion –
p1 = first sample proportion
when
populations
p 2 = second sample
are similar (p1 p 2 ) proportion
Z n1 = first sample size
with respect to
1 1
a given p 0 q 0 n2 = second sample size
attribute n1 n 2
n p n 2 p2
p0 1 1
n1 n 2
and q0 = 1- p0
Table 9.3b: Statistics for Testing the Hypothesis on Mean; Large Sample
Case
Test Description
Test Statistics Notes
No. of Test
5 Test for = Population mean
specified
mean –
X = Sample mean
infinite (X ) p = Population S.D
Z
population, p In case p is not known, we
n>30 and use s in its place calculating
n
population
variance(s) ( X X )2
s i
known n 1
6 Test for = Population mean
specified
mean –
X = Sample mean
( X ) p = Population S.D
finite Z
p In case p is not known, we
population,
Nn
n>30 and use s in its place calculating
population
n N 1
variance(s) ( X X )2
s i
known n 1
7 Test for X 1 and X 2 are sample mean
difference in (X1 X 2 )
Z for the first and second
means–
p1 2 p 2 2 samples respectively.
different
n1 n2 n1= first sample size
population,
n>30 and n2 = second sample size
( X X )2
1i 1
s1 = n 1
1
(X X )2
s2 = 2i 2
n 1
2
d1 = X1 X12
d2 = X 2 X12
N1 X1 N 2 X 2
X 12
N1 N 2
Solved problem 1
XYZ press hypothesis is that the average life of its latest web-offset press is
14,500 hours. They know the standard deviation of the press life is 2,100
hours. From a sample of 25 presses, the company finds a sample mean of
13,000 hours. At a significance level of 0.01, should the company conclude
that the average life of the presses is less than the hypothesised 14,500
hours?
Solution
The procedure is described here:
1. Null hypothesis H0: = 14,500
Alternate hypothesis H1: < 14,500 (one-tailed test)
2. Level of significance = 0.01 Ztab = - 2.33 and R:z< -2.33
(X )
3. Test statistics Z
p
n
p 2100 2100
420
n 25 5
13000 14500
Z cal 3.57
420
5. Conclusion: Since Zcal (-3.57) < Ztab (-2.33) and is in the rejection region,
H0 is rejected. In other words, we accept that the average life of the
press is significantly lesser than 14,500 hrs at 1% level of significance.
Solved problem 2
Theatre owners in India know that a hit movie ran for an average of 84 days,
with a standard deviation of 10 days in each city the movie was screened. A
particular movie distributor was interested in comparing the popularity of the
movie in his/her region with that of the population. The distributor randomly
chose 75 theatres in the region and found a popular movie ran for 81.5
days.
1) State appropriate hypothesis for testing whether there was a significant
difference between theatres in the distributor’s region and the
population.
2) At 1% significance level, test this hypothesis.
Solution
The procedure is explained in the form of steps:
1. Null hypothesis Ho: = 84
Alternate hypothesis H1: 84 (two-tailed test)
2. Level of significance = 0.01 Ztab = 2.58 and R:|z| > 2.58
3. Test statistics
(X )
Z
p
n
84 81.5
Z cal 2.165
1.1547
5. Conclusion: Since Zcal (2.165) < Ztab (2.58), and not in the rejection
region, H0 is accepted at 1% level of significance.
Solved Problem 3
A ketchup manufacturer is in the process of deciding whether to produce a
new extra spicy brand of ketchup. In a survey of 6000 households, the
company’s market research team found that, 355 households would buy the
extra spicy brand. A more extensive study carried out 2 years ago showed
that 5% of the households would buy the brand then. At 2% level of
significance, should the company conclude that there is an increased
interest in the extra spicy flavour?
Solution
The procedure is explained in the following steps:
1. Null hypothesis Ho: p = 0.05
Alternate hypothesis H1: p > 0.05 (one-tailed test)
2. Level of significance = 0.01 Ztab = 2.05 and R:z > 2.05
3. Test statistics
p p
Z
pq
n
4. Given p = 0.05, p = 355/6000 = 0.0592, n = 6000, q = 1 – p =1- 0.05=
0.95
( 0.0592 0.05 )
Z cal 3.29
0.05 0.95
.
6000
5. Conclusion: Since Zcal (3.29) > Ztab (2.05), and found in the rejection
region, Ho is rejected and it is accepted that there is an increase of the
proportion of population having an interest in the new flavour.
Solved Problem 4
Microsoft estimated that out of 10,000 potential software buyers, 35% wait
to purchase the new OS Windows Vista, until an upgrade has been
released. After an advertising campaign to reassure the public was
released, Microsoft surveyed 3000 buyers and found 950 who are still
skeptical. At 5% level of significance, can the company conclude that the
population of skeptical people had decreased?
Solution
The procedure is explained in the following steps:
1. Null hypothesis Ho: p = .35
Alternate hypothesis H1: p < 0.35
2. Level of significance = 0.05 Ztab = - 1.645 and R: z < -1.645
3. Test statistics
( p p)
Z
pq Nn
n N 1
4. Given p = 950/3000 = 19/60 = 0.317, p = 0.35, q = 1-p = 1- 0.35 = 0.65,
N=10,000, n = 3000
Solved problem 5
A machine is designed to pack 200ml of a medicine with a standard
deviation of 5ml. A sample of 100 bottles when measured had a mean
content of 201.3ml. Test whether the machine is functioning properly (use
5% level of significance).
Solution
The procedure is explained in the following steps:
1. Null hypothesis Ho: = 200
Alternate hypothesis H1: 200 (two-tailed test)
2. Level of significance = 0.05 Ztab = 1.96 and R: |Z| > 1.96.
3. Test statistics
(X )
Z
p
n
p 5
0.5
n 100
201 .3 200
Z cal 13 / 5 2.60
0 .5
5. Conclusion: Since Zcal (2.60) > Ztab (1.96) and Zcal is in the rejection
region, Ho is rejected. Hence at 5% level of significance, we reject null
hypothesis and conclude that the machine is not functioning properly.
It should be noted that the methods and theory of small samples are
applicable to large samples, but the reverse is not true.
v 1/ 2
t2
f ( t ) C 1
v
where,
C = Constant required to make the area under the curve equal to unity.
= n – 1, Degree of Freedom.
4. The value of ‘t’ ranges from - to +
5. “” is called the parameter of the distribution
6. It is symmetrical about mean
7. Its mean is zero
8. Variance of the distribution is greater than one
9. It has larger areas at the tails compared to normal distribution and
lower height at the mean.
10. It tends to a normal distribution as n ∞.
9.7.1 Uses of ‘t’ test
The ‘t’ test is used:
To test a specified value
Manipal University Jaipur Page No. 379
Statistics for Management Unit 9
Test Description
Test Statistics Notes
No. of Test
1 Test for X is the sample
specified mean
value – infinite
= Hypothised
population ( X )
t value of
with s population mean
d.f. = n -1, n
Population ( X X )2
variance s i
n 1
unknown
2 Test for ( X )
specified t
value – finite s N n
N = Population
population n N 1 size
with d.f.= n-1,
Population
variance
unknown
3 Test between X1 X 2 X1 = first sample
values – t
mean
independent 2
(X X ) (X X )
2 1 1
samples with 1i 1 2i 2 X 2 = second
n n 2 n n
d.f= n1 + n2 – 1 2 1 2 sample mean
2, X1 X 2 n1 and n 2 are
t
Population sizes of first and
( n 1) 2s ( n 1) 2s 1 1
variances not 1 1 2 2 second sample
known but n n 2 n n respectively.
1 2 1 2
assumed to
be equal
4 Paired “t – D = Mean of
D
test t
(dependent diff , where difference
n = sample size
samples) with n
d.f= n -1
diff
2
D 2i D .n
n 1
Solved problem 6
A random sample of 10 bags of fertilisers is found to have the following
weight (kg):
45, 49, 50, 49, 44, 52, 48, 45, 46, 45
Test at 5% level of significance whether the average packing weight can be
taken as 50 kg.
Solution: Table 9.6 depicts the frequency table for solved
problem 6.
Table 9.6: Frequency Table
Xi
X i X = X i X =
2
X i =473
∑ X i X =64.1
2
X i
473
X i 1
47.3
n 10
The sample variance is given by,
n
(X i X) 2
64.1
s 2 i 1
7.12
n 1 10 1
s 2.6687
( X )
t
s N n
n N 1
s
Given n = 10, N = 1000, X 47.3 , = 0.8439
n
( X ) ( 47.3 50)
t cal
3.2138
s N n 1000 10
0.8439
n N 1 1000 1
4. Conclusion: Since |tcal (-3.2138)| > |ttab (2.262)|, tcal is in the rejection
region and thus Ho is rejected.
Solved Problem 8
Average tensile strength of nine samples of paper is found to be 15.8 units
and variance is 10.3. Can we say at 1% level of significance that it is a
random sample drawn from a population whose mean tensile strength is
17.5?
Solution
The steps are described as follows:
1. Null hypothesis H0: 17 .5
Alternate hypothesis H1: 17 .5
2. Level of significance 1% and degrees of freedom (d.f.) =n-1= 9-1=8
ttab = 3.36 and R:|t| > 3.36
3. Test statistics
( X )
t
s
n
X 15 .8 , = 17.5, s = 10.3 ,
2
Given n=9
s 3.2084
1.0698
n 9
15.8 17.5
t cal 1.5891
1.0698
4. Conclusion: Since Itcal (-1.5891)| < Ittab (3.36)I, Ho is accepted
It can be considered as a random sample at 1% level of
significance.
Solved Problem 9
A sales manager wants to know whether a special promotional campaign is
a success. Table 9.7 depicts the data. Test at 5% level of significance,
whether it is a success?
Table 9.7: Sales Data Before and After the Campaign
Retail Outlets 1 2 3 4 5 6
Sales before campaign 50 48 31 42 28 53
Sales after campaign 56 55 30 45 29 58
Solution
Table 9.7a depicts the frequency table calculated for the sales data before
and after the campaign.
Table 9.7a: Frequency Table for the Sales Data Before and After the Campaign
Before (Xi) After (Yi) D = After – Before Di 2
Di = Yi - Xi
Campaign
50 56 6 36
48 55 7 49
31 30 -1 1
42 45 3 9
28 29 1 1
53 58 5 25
∑ Di = 21 ∑ Di 2 =121
D i 21
Mean of Differences or D 3.5
n 6
diff
2
D 2i D .n
n 1
121 2 .6
3.5
diff =3.08
6 1
Activity:
1. A random sample of 200 tins of vanaspathi has a mean weight 4.97
kgs and a standard deviation of 0.2kgs. Test at 1% level of
significance, that the tins have 5 kgs. vanaspathi
2. A random sample of 100 rods drawn from a lot of rods has a mean
length 32.7cms. and a standard deviation of 1.3cms. Can it be
concluded that the lot has a mean of 32 cms?
Solution
1. H0 : µ = 5kg
H1: µ ≠ 5kg
Level of significance = 0.01 Ztab = 2.58 and R: |Z| > 2.58
Test statistics
( X )
Z
s
n
Given = 5, X = 4.97, s = 0.2, n = 200
4.97 5
Z cal 2.12
0 .2
200
Conclusion: Since IZcalI < Ztab, we accept H0 at 1% level of
significance and conclude that the tins have 5 kgs of vanaspathi.
2. H0 : µ = 32
H1: µ ≠ 32
Level of significance = 0.05 Ztab = 1.96 and R: |Z| > 1.96
Test statistics
( X )
Z
s
n
Given = 32, X = 32.7, s = 1.3, n = 100
32.7 32
Z cal 5.38
1.3
100
Conclusion: Since Zcal > Ztab, we reject H0 at 5% level of significance
and conclude that the lot does not have a mean of 32 cms.
Manipal University Jaipur Page No. 386
Statistics for Management Unit 9
9.8 Summary
Let us recapitulate the important concepts discussed in this unit:
Hypothesis testing is the opinion about the population parameter that
may or may not be in the confidence interval derived from the sample.
In hypothesis testing, we must state the assumed or hypothesised value
of the population parameter before we begin sampling. The assumption
we wish to test is called the null hypothesis and is symbolised by ’H0’.
If our sample results fail to support the null hypothesis, we must
conclude that something else is true. Whenever we reject the
hypothesis, the conclusion we do accept is called the alternative
hypothesis and is symbolised as ‘H1.
If null hypothesis is true, and the test result make us to accept it, then
we have made a right decision.
If null hypothesis is true, and the test result make us to reject it, then we
have made a wrong decision (Type I error). It is also known as
consumer’s risk, denoted by .
If hypothesis is false, and the test results make us to accept it, then we
have made a wrong decision (Type II error). It is known as producer’s
risk, denoted by .1 – is called power of the test.
If hypothesis is false, and the test result make us to reject it , we have
made a right decision.
‘t’ tests can be used for sample size (n 30) and samples whose
population standard deviations are not known.
9.9 Glossary
Level of significance: The smallest probability at which the null hypothesis
would be rejected (Type I error). Usually, if the significance level is less than
a number such as 0.05 (5%), the null hypothesis would be rejected in favour
of the alternative; the chance of getting a sample like the one being
analysed if the null hypothesis were true. A small significance level would
imply that getting such a sample was highly unlikely, suggesting that the null
hypothesis is probably not true; also called the P-value of the test.
Null distribution: The distribution of the test statistic assuming the null
hypothesis is true.
One-tailed test: A test in which the alternative hypothesis specifies that the
population parameter is strictly greater, or strictly lesser, than a specified
value. A test in which the alternative hypothesis specifies that the parameter
is on "one side" of the null hypothesis value; a test in which H1 contains >
or <.
P-value: The value that indicates how unusual a computed test statistic
compared with what would be expected under the null hypothesis. A small
value indicates that the null hypothesis should be rejected at any
significance level above the calculated value. For example, if the P value
equals 0.0246, we would reject the null hypothesis at the 5% significance
level, but would not reject it at the 1% significance level.
Two-tailed test: The rejection region in a two-tailed test is split between the
two tails of the distribution.
Type I error: Rejecting a true null hypothesis. The probability of a type I
error is indicated by alpha (α).
Type II error: Not rejecting a false null hypothesis. The probability of a type
II error is indicated by beta (β).
Z test for a population mean: Tests a hypothesis pertaining to the
population mean by using a z-test statistic to evaluate the magnitude of
difference between the sample mean.
Z test for a population proportion: Tests a hypothesis pertaining to the
population proportion by using a z-test statistic to evaluate the magnitude of
the difference between sample proportion and hypothesised population
proportion.
9. The table 9.10 depicts the results related to the memory capacity of 10
students before and after training. Test at 5% level of significance
whether training is effective.
Roll No 1 2 3 4 5 6 7 8 9 1
Before Training 1 14 11 8 7 1 3 0 5 6
After Training 1 16 10 7 5 1 10 2 3 8
9.11 Answers
Terminal Questions
1. Zcal = 1.9457, H0 accepted
2. Zcal = 0.71, H0 accepted
3. Zcal = 0.50, H0 accepted
4. Zcal = 1.54, H0 accepted
5. Zcal = 18.75, H0 rejected
6. Zcal = 1.30, H0 accepted
7. tcal = 2.397, H0 is rejected
8. tcal = 2.21, H0 is rejected
9. tcal = 1.365, H0 is rejected
References:
Bevington, P.R. & Robinson, D.K. Data Reduction and Error Analysis for
the Physical Sciences (3rd Edition), (Paperback).
Cowan, G. Statistical Data Analysis (Oxford Science Publications),
(Paperback).
Manipal University Jaipur Page No. 391
Statistics for Management Unit 9
Devore, J.L. Probability and Statistics for Engineering and the Sciences,
Enhanced Review Edition. (Hardcover - Jan. 29, 2008).
James, F. Statistical Methods in Experimental Physics (2nd Edition).
(Hardcover - Nov. 29, 2006).
Levin, R.I. & Rubin, D.S. (2008) Statistics for Management, Seventh
Edition, PHI Learning Private Limited.
Lyons, l. Statistics for Nuclear and Particle Physicists. (Paperback,
1989).
Mandel, J. The Statistical Analysis of Experimental Data, (Paperback).
Meyer, S.L. Data Analysis for Scientists and Engineers, (Paperback).
Morris, H., Schervish, M.J. & Degroot, Probability and Statistics
[PROBABILITY & STATISTICS 3 -OS] (Paperback - Jan. 31, 2002).
Press, W.H., Teukolsky, S.A., Vetterling, W.T. & Flannery, B.P.
Numerical Recipes: The Art of Scientific Computing, 3rd Edition.