Chi-Square Questions - Biostatistics
Chi-Square Questions - Biostatistics
11
Attributes- Contingency table – 2x2 contingency table – Test for independence of
attributes – test for goodness of fit of mendalian ratio
Test based on -distribution
In case of attributes we can not employ the parametric tests such as F and t.
Instead we have to apply test. When we want to test whether a set of observed values
are in agreement with those expected on the basis of some theories or hypothesis. The
statistic provides a measure of agreement between such observed and expected
frequencies.
Applications
A very powerful test for testing the significance of the discrepancy between theory
and experiment was given by Prof. Karl Pearson in 1900 and is known as “chi-square test
of goodness of fit “.
It follows a distribution with n-1 d.f. In case of only one tailed test is used.
1
Example
In plant genetics, our interest may be to test whether the observed segregation
ratios deviate significantly from the mendelian ratios. In such situations we want to test
the agreement between the observed and theoretical frequency, such test is called as test
of goodness of fit.
3. N, the total frequency should be reasonably large, say greater then (>) 50.
4. No theoretical cell frequency should be less than (<)5. If any theoretical cell frequency
is <5, then for the application of - test, it is pooled with the preceding or scecceeding
frequency so that the pooled frequency is more than 5 and finally adjust for degree’s of
freedom lost in pooling.
Example1
The number of yiest cells counted in a haemocytometer is compared to the theoretical
value is given below. Does the experimental result support the theory?
No. of Yeast cells Obseved Frequency Expected Frequency
in the square
0 103 106
1 143 141
2 98 93
3 42 41
4 8 14
5 6 5
2
Solution
H0: the experimental results support the theory
H1: the esperimental results does not support the theory.
Level of significance=5%
Test Statistic:
Table value
(6-1=5 at 5 % l.os)= 11.070
Inference
< tab
We accept the null hypothesis.
(i.e) there is a good correspondence between theory and experiment.
computed as
where
Oij – observed frequencies
Eij – Expected frequencies
n= number of rows
m= number of columns
It can be verified that
This is distributed as with (n-1) (m-1) d.f.
When the number of rows and numberof columns are equal to 2 it is termed as 2 x
2 contingency table .It will be in the following form
B1 B2 Row Total
A1 a b a+b r1
A2 c d c+d r2
Column a+c b+d a+b+c+d
Total =n
c1 c2
4
Where a, b, c and d are cell frequancies c1 and c2 are column totals, r1 and r2 are row
totals and n is the total number of observations.
In case of 2 x 2 contigency table can be directly found using the short cut formula,
B1 B2 Row Total
A1 a b a+b=r1
c d c+d =r2
A2
Column a+c=c1 b+d=c2 n = a+b+c+d
Total
Exapmle 2
The severity of a disease and blood group were studied in a research projest. The
findings sre given in the following table, knowmn as the m xn contingency table. Can this
severity of the condition and blood group are associated.
Severity of a disease classified by blood group in 1500 patients.
Blood Groups
Condition Total
O A B AB
5
Severe 51 40 10 9 110
Moderate 105 103 25 17 250
Mild 384 527 125 104 1140
Total 540 670 160 130 1500
Solution
H0: The severity of the disease is not associated with blood group.
H1: The severity of the disease is associated with blood group.
Calculation of Expected frequencies
Blood Groups
Condition Total
O A B AB
Severe 39.6 49.1 11.7 9.5 110
Moderate 90.0 111.7 26.7 21.7 250
Mild 410.4 509.2 121.6 98.8 1140
Total 540 670 160 130 1500
Test statistic:
6
104 98.8 5.2 27.04 0.2737
Total 12.2347
∴ =12.2347
Example 3
In order to determine the possible effect of a chemical treatment on the rate of
germination of cotton seeds a pot culture experiment was conducted. The results are
given below
Chemical treatment and germination of cotton seeds
Germinated Not germinated Total
Chemically Treated 118 22 140
Untreated 120 40 160
Total 238 62 300
Does the chemical treatrment improve the germination rate of cotton seeds?
Solution
H0:The chemical treatment does not improve the germination rate of cotton seeds.
H1: The chemical treatment improves the germination rate of cotton seeds.
Level of significance = 1%
Test statistic
7
Table value
Inference
< tab
Example 4
In an experiment on the effect of a growth regulator on fruit setting in muskmelon
the following results were obtained. Test whether the fruit setting in muskmelon and the
application of growth regulator are independent at 1% level.
Fruit set Fruit not set Total
Treated 16 9 25
Control 4 21 25
Total 20 30 50
Solution
H0:Fruit setting in muskmelon does not depend on the application of growth regulator.
H1: Fruit setting in muskmelon depend on the application of growth regulator.
Level of significance = 1%
8
Tet statistic
Table value
Inference
> tab
Questions
Ans: True
4. The distribution of the χ depends on the degrees of freedom.
2
Ans: True
5. The greater the discrepancy between the observed and expected Frequency lesser the
value of χ .2
Ans: False
9
6. When observed and expected frequencies completely coincide χ will be zero.
2
Ans: True
7. What is a contignecy table?
8. When and how to apply Yates correction?
9. Explain the χ test of goodness of fit?
2
10