9 Testing of Hypothesis
9 Testing of Hypothesis
Level of significance is the probability of rejecting the null hypothesis H0 when it is assumed to be true. i.e. the probability
of making a type I error. Level of significance is the risk of rejecting the null hypothesis H 0, when it is true. This probability is
equal to , the size of the test. The common choices of are 0.05 and 0.01. If the cost of making a Type I error is high,
small values of α are preferred. If the cost of making a Type I error is not too high, larger values of α are typically used.
A significance test is a statistical test lying down the procedure for deciding whether to accept or reject a statistical
hypothesis. A value is said to be significant when the probability of its occurrence under H 0 is equal to or less than
significance level . i.e. the value falls outside the acceptance region.
A statistic on which the decision whether to accept or reject a null hypothesis is based is called a test statistic. e.g. the
student’s t-test, z-test or F distributions.
A test for which the region of rejection is only located at one end, either on the left end or on the right end of the sampling
distribution of the test statistic is called a one tailed test or one sided test. When the region of rejection comprises both
ends or tails of the sampling distribution of the test statistic. The test is called two tailed test or two sided test.
A two tailed test is appropriate when null hypothesis is H0 : = 0 and the alternate hypothesis is H1 : 0. A left tailed
(lowered tailed) test is used if the hypotheses are H 0 :=0 and H1 : < 0.
9 Testing the Hypothesis about the Population Parameter (2)
(vi) Conclusion. Since the calculated value z = 2.40 falls in the region of rejection, we therefore reject our null
hypothesis and accept H1 : 25
Example 4
The breaking strength of cables produces by a manufacturer have mean 1800 lb and standard deviation 100 lb. By a new
technique in the manufacturing process it is claimed that the breaking strength can be increased. To test this claim a sample of 50
cables is tested and it is found that the mean breaking strength is 1850 lb. can we support the claim at a 0.01 level of significance?
Solution
H0: = 1800 lb, and there is really no change in breaking strength.
H1: > 1800 lb, and there is a change in breaking strength.
a one tailed test should be used here
Level of significance: = 0.01
.
x
Test of statistic to be used: z =
/ n
Critical point: Z = 2.33
1850 1800
Calculation: Z = = 3.55
100/ 50
Decision: As Z = 3.55 greater than 2.33, so the claim should be supported.
i.e. we reject H0.
Hence alternative is accepted.
i.e. . By a new technique in the manufacturing process, the breaking strength can be increased.
Example 5
A company manufactures rope whose breaking strength has a mean of 300 lbs and a standard deviation of 24 lbs. It is
believed that by a newly developed process the mean breaking strength can be increased.
(a) Design a decision rule for rejecting the old process at the 0.01 significance level if it is agreed to test 64 ropes.
(Schaum 10.13)
(b) Under the decision rule adopted in part (a), what is the probability of accepting the old process when in fact
the new process has increased the mean breaking strength to 310 lbs? Assume that the standard deviation is
still 24lbs.
Solution (a)
(i) Null Hypothesis: H0 : = 300 lbs
Alternative Hypothesis: H1 : > 300 lbs (right tailed test)
with n = 64 and = 24 lbs
(ii) Level of significance: = 0.01
(iii) Critical Points: z = 2.33
.
X
(iv) Test statistic to be used: z =
/ n
(v) Calculation: X = + Z / n = 300 + (2.33) (24)/ 64 = 307
i.e. observed sample mean
(vi) Conclusion: If the observed sample mean is greater than 307, we
reject H0 and we accept H1.
Solution (b) (The probability of making a type II error)
Null Hypothesis: H0 : = 300 lbs
Alternative Hypothesis:
H1 : = 310 lbs
with = 24 lbs
9 Testing the Hypothesis about the Population Parameter (5)
.
X
(iii) Test Statistic to be used: t =
/ n
(iv) Critical Point: t()(n-1) = 2.33
(iv) Calculation: X = + Z / n
2100
= 14500 + (-2.33) = 13521.4 hours
25
Conclusion:
Observed sample mean X = 13000 is less than 13521.4, we should reject H0 and accept the alternative hypothesis i.e. the
average life of press is less than hypothesized value.
Example 8
American theatres know that a certain hit movie ran an average of 84 days in each city and the corresponding standard
deviation was 10 days. The manager of the Southeast district was interested in comparing the movie’s popularity in his region with that
in all of American’s other theatres. He randomly chose 75 theatres in his region and found that they ran the movie an average of 81.5
days. State appropriate hypothesis for testing whether there was a significant difference in the length of the movie’s run between
theatres in the Southeast district and all of American’s other theatres. At 1% level of significance, test these hypothesis.
Solution
Null Hypothesis: H0 : = 84 days
Alternative Hypothesis: H1 : 84
with = 10, n = 75 and X = 81.5
Level of significance: = 0.01
Critical Points: X = 2.58
.
X
Test Statistic to be used: z =
/ n
Calculation: X = + Z/2 / n = 81.02 and 86.98
Conclusion: Because the observed sample mean X = 81.5 is in the acceptance region, so we accept H0.
Test of Hypothesis for When n < 30
State the null hypothesis H0 with an alternate hypothesis H1
Level of significance
.
X
State the test statistic to be used. i.e. t =
s/ n
where s the standard deviation of the sampling distribution.
Select the region of rejection or critical region
Compute the value of test statistic from the sample values
Decision or conclusion
Example 9
A test of the breaking strength of 6 ropes manufactured by a company showed a mean breaking strength of 7750 lb and a
standard deviation of 145 lb, whereas the manufacturer claim a mean breaking strength of 8000 lb. Can we support the manufacturer’s
claim at a level of significance of (a) 0.05 (b) 0.01?
Solution (a)
The null hypothesis H0 : = 8000 lb, i.e. manufacturer’s claim
and alternate hypothesis H1 : < 8000 lb (one tail test)
9 Testing the Hypothesis about the Population Parameter (7)
Exercise (2)
The masses in grams of 12 ball bearings taken at random from a lot are 31.4, 33.1, 35.9, 34.7, 33.4, 34.5, 35.0, 32.5, 36.9, 36.4,
35.8, 33.2. Test whether these values are consistent with a population mean of 34. Assume that the weights are normally distributed.
Example 11
Accept the hypothesis if the number of heads in a single sample of 100 tosses is between 40 and 60 inclusive. Reject the
hypothesis otherwise.
a) Find the probability of rejecting the hypothesis when it is actually correct. (Type I Error : the value of ).
b) What is the probability of accepting the hypothesis that the coin is fair when the actual probability of heads is p = 0.7.
(Type II Error: the value of )
Solution
We use normal approximation to binomial distribution if np > 5 and nq >5.
Now the mean and standard deviation are:
= np = 100 ( ½ ) = 50 and = npq = 100( ½ ) ( ½ ) = 5
Standardized values of 40 and 60 are – 2.0 and 2.0.
Area between 40 and 60 = 0.9544
H0 : = 50, H1 : 50
Therefore probability of rejecting H0 is the probability
P[not getting heads between 40 and 60] = 1 – 0.9544 = 0.0456
Probability of accepting the hypothesis that the coin is fair when the actual probability of heads is p = 0.7 is highlighted in the
diagram.
To compute this area, we observe the distribution
under p = 0.7 which has
= np = 70 and = 100( 0.7 ) ( 0.3 ) = 4.58
corresponding standardized value of 60 is – 2.18
with corresponding area = 0.4854 (remember this area
is from 70 to 60 with = 70 )
Hence Required Area = = 0.5 0.4854 = 0.0146
Example 12
Consider the null hypothesis that the average weight of mail students in a certain college is 68 kg. A random sample of size 36
is drawn from a normal population with 3.6. Calculate the probabilities of committing type I and type II errors when testing the null
hypothesis against the alternative that 68 kilograms.
Type I Error
The probability of committing a type I error, or the level of significance of our test, is equal to the sum of the areas that have
been shaded in each tail of the distribution in Figure.
Therefore = P(X < 67 when = 68) + P(X > 69 when = 68)
z-values corresponding to x = 67 and x = 69
9 Testing the Hypothesis about the Population Parameter (9)
67 68 69 68
z1 = = 1.67 and z2 = = 1.67
3.6 6 3.6 6
Therefore = P(Z < 1.67) + P(Z > 1.67) = 2 P(Z < 1.67) = 0.950
Thus 95% of all samples of size 36 would lead us to reject = 68 when in fact, it is true.
Type II Error
Now we evaluate for various alternative hypotheses. If it is important to reject 0 when the true mean is some value
70 or 66, then the probability of committing a type II error should be computed and examined for the alternatives = 66 and = 70.
Because of symmetry, it is only necessary to consider the probability of not rejecting the null hypothesis that = 68 when the
alternative = 70 is true. A type II error will result when the sample mean x falls between 67 and 69 when H1 is true. Therefore,
referring to Figure, we find that
= P( x 69 when = 70).
69 70
z-values corresponding to x = 69 when = 70 is z = 0.45 = 2.22
Example 14
Consider above Example 13, Suppose that the true burning rate is 49 centimeters per second. What is β for the two-sided test
with α = 0.05, σ =2, and n = 25?
Solution
9 Testing the Hypothesis about the Population Parameter (11)