2 Hypothesis-Testing
2 Hypothesis-Testing
H1 : 0
Procedure For Hypothesis Testing
Step I : setup the null Hypothesis e.g. H 0 : 0
Step II : setup the Alternative Hypothesis e.g. H1 : 0
Step III: Decide level of significance e.g. α = 5%
Step IV: Compute test – statistic under the validity of
null hypothesis as
x E( x )
Z ~ N (0,1)
S .E ( x )
where E ( x ) , Z can be defined as a
Standard Normal Variate ( S .N .V ) for any statistic
Step V: Conclusion: We compare the computed value of Z in
step (iv) with the significant value Zα (tabulated value)
at given level of significance ‘α’
if |Z| < Zα, then Z is not significant i.e. difference b/w statistic and the
difference is just due to sampling fluctuations
H0 can not be rejected & Vice – Versa.
Step VI: Making Decision
Types of Error in Hypothesis Testing
Type I Error: Reject H0 when it is true P (Type I Error)= α
α: Level of significance
1- α = P (Rejecting H0 when it is false)
Accept H0 Reject H0
x x
z
x
n
Sampling Distribution of Mean
2
as
n
Acceptance and Rejection Region
Accept null hypothesis if test statistic falls in the acceptance region
0.95 of area
0.025 Acceptance Region
0.025
1.96 1.96
P-value (Observed Level of Significance)
• P-value – If p-value is more than , Null hypothesis
accepted. Will be rejected if p-value is less than
Large Sample Test
Sample is large if n > 30
S.N.V to a statistic:
t E (t )
Z ~ N (0,1) as n
S .E.(t )
Test For Proportion Test For Mean
pP x
Z ~ N (0,1) Z ~ N (0,1)
PQ / n / n
Limits for P at α Limits for μ at α
Level of significance: Level of significance:
PQ
P p z x z
n n
PQ 2
P ~ N P, x ~ N ,
n n
Area under standard normal distribution between the mean and
successive value of Z
(i) Testing the significance of mean (x )
x E(x) x
Z
S .E ( x ) 2 /n
H0 : 4
H1 : 4
0.05
x 4.724 4
Z 2.471
2 /n 0.293
z0.05 (one- tailed)=1.645
|Z| > z0.05 H1 accepted
On a typing test, a random sample of 36
graduates of a secretarial school averaged 73.6
word with a standard deviation of 8.10 words per
minute. Test an employer claim that the school’s
graduates average is less than 75 words per
minute using the 5 percent level of significance.
From past studies the monthly average household
expenditure on food items in a locality is 2700
with a standard deviation of Rs 160. An
economist took a random sample of 25
households from the locality and found that their
monthly household expenditure on food items to
be Rs 2790. At 0.01 level of significance, can we
conclude that the average household expenditure
on the food items is greater than Rs 2700?
(Critical Value for 0.01 is 2.33)
Prices of share of a company on the different
days in a month were found to be 66, 65, 69, 70,
69, 71, 70, 63, 64 and 68. Examine whether the
mean price of shares in the month is different
from 65. You may use 10 percent level of
significance. (Critical value for 10 percent level
of significance is 1.833)
Tests for difference between two
population means
Case of large sample
A study is carried out to examine whether the mean hourly wages of the
unskilled workers in the two cities are same. The random sample of hourly
earnings in both the cities is taken and the results are presented in the given
Table. Using 5 percent level of significance, test the hypothesis of no
difference in the average wages of unskilled workers in the two cities.
(ii) Test of significance for difference of Means
H 0 : 1 2
H1 : 1 2
x1 x2
Z ~ N (0,1)
1 2
2 2
n1 n2
Sampling Distribution of Difference in Means
Y1 Y 2
Z ~ N (0,1)
2
2
1
2
n1 n2
• 12 and 22 are unknown and estimated by s12 and s22
Past records indicate that a golfer has averaged
7.9 on certain course. With new set of clubs, he
averages 8.2 over five rounds with a standard
deviation of 2.65. Can we conclude that at 0.025
level of significance, the new club has an
adverse effect on the performance? (alpha value
is 2.776)
Two types of medicines are meant to provide
relief to arthritis sufferers were produced in two
different laboratories. The first medicine was
administered to a group of 12 patients and
produced an average of 8.5 hours of relief with a
standard deviation of 1.8 hours. The second
medicine was tested on a sample of 8 patients
and produced an average of 7.9 hours of relief
with a standard deviation of 2.1 hours. Test the
hypothesis that the first medicine drug differs
from second medicine in providing a
significantly higher period of relief. Use 5
percent level of significance.
There were two medicines of ( 1 and 2) that were tried on some
patients for reducing weight. There were 8 adults who were
subjected to medicine 1 and 7 adults who were administered
medicine 2. The decrease in weight (in pounds) is given below:
Medicine 1 10 8 12 14 7 15 13 11
Medicine 2 12 10 7 6 12 11 12
H 0 : New Std
H A : New Std
• Experimental (Sample) data:
y New y Std
s New sStd
nNew nStd
Example:
In order to make a survey of the buying habits, two markets A
& B are chosen at two different parts of a city.
Market A Market B
n1 400 n2 400
x1 250 x2 220
s1 40 s2 55
H 0 : 1 2
H1 : 1 2
H 0 : 1 2
H1 : 1 2
54
Z ~ N (0,1)
2.9 2.6
40 40
Z 1.90
H 0 : P .65 H 0 : P .80
H1 : P .65 H1 : P .80
Small Sample Tests
If the sample size n is small (less than or equal to 30), then
the sampling distribution of the statistics is far from
normality. The normal test can’t be applied. Exact Sample
Tests are applied in such cases.
Assumptions:
H 0 : 1 2
H 1 : 1 2
x1 x2
t ~ t n1 n2 2
1 1
S (
2
)
n1 n2
S
2 1
n1 n2 2
(x 1 x1 ) 2
2 2
( x x ) 2
Example:
A group of 5 patients treated with medicine ‘A’ weigh 42,
39, 48, 60 and 41 Kgs. Second group of 7 patients treated
with Medicine ‘B’ weigh 38, 42, 56, 64, 68, 69, 62 Kgs. Do
you agree with the claim that the medicine ‘B’ increases
the weight significantly ?
Note:
Two samples in this test are independent
Difference of Means (Independent Samples)
Product rated on 7- point interval scale
S.No Delhi Bombay
1 2 3
2 3 4
3 3 5
4 4 6
5 5 5
6 4 5
7 4 5
8 5 4
9 3 3
10 4 3
11 5 5
12 4 6
13 3 6
14 3 6 (Separate set
15 4 5 of consumers)
H0 : Mean ratings are same i.e. consumer rate the product in
two cities in same way.
H 0 : 1 2
H 1 : 1 2
5%
| t | 2.75
t at 28 d . f . 2.05
| t | t H 0 rejected
Significant Values Of t-Distribution
(Two-Tail Areas) P(|t| > tα) = α
Rejection Rejection
region (α/2) Acceptance region (α/2)
region (1-α)
n 1
( d d ) 2
Example: An IQ test was administered to 5 persons before and after they
were trained . The results are given below
Candidates: I II III IV V
IQ B.T 110 120 123 132 125
IQ A.T 120 118 125 136 121
Example:
Attitude measurement towards a brand before and after the ad campaign.
rating done on 10 point scale .
10: Highly disliked 1: Highly liked
SN Before Test After Test
1 3 5
2 4 6
3 2 6
4 5 7
5 3 8
6 4 4
7 5 6
8 3 7
9 4 5
10 2 4
11 2 6
12 4 7
13 1 4
14 3 6
15 6 8
16 3 4
17 2 5
18 3 6
H o : 1 2
H 1 : 1 2
5%
t 8.19
t 2.11
t t
Hence Ho rejected hence Consumers rate the product differently in two
situations.