
Chi Square Test

Steps to solve the blood-type problem using a chi-square test:
1. State the null (H0) and alternative (H1) hypotheses. H0: blood types A, AB and B occur in the ratio 1:2:1. H1: they do not occur in that ratio.
2. Calculate the expected frequencies for the 300 children from the 1:2:1 ratio of the genetic theory. Expected A: 75, expected AB: 150, expected B: 75.
3. Calculate the chi-square statistic: χ² = Σ (Observed − Expected)² / Expected = (90 − 75)²/75 + (135 − 150)²/150 + (75 − 75)²/75 = 3 + 1.5 + 0 = 4.5

Uploaded by

Aneesha Panda

Chi-Square Test

Dr. Deepak Mehra


B.E., M.TECH & PhD (IIT ROORKEE)
Assistant Professor
Department of Fashion Technology
NIFT BHUBANESWAR
8058111512
Chi-Square Test
• Chi-square, written symbolically as χ² (pronounced "kai-square"), is a statistical test.
• It is used to compare a sample variance with a theoretical variance.
• As a non-parametric test, it "can be used to determine if categorical
data shows dependency or the two classifications are independent."
• It can also be "used to make comparisons between theoretical populations and
actual data when categories are used."
• Thus, the chi-square test is applicable to a large number of problems.
With this technique a researcher can:
• (i) test the goodness of fit;
• (ii) test the significance of association between two attributes; and
• (iii) test the homogeneity or the significance of population variance.
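Formulas
Chi-square test formula

• The chi-square test is written symbolically as χ², and the formula of chi-square for comparing variance is given as:

$$\chi^2 = \frac{\sigma_s^2}{\sigma_p^2}\,(n - 1)$$

• where σs² is the variance of the sample,
σp² is the variance of the population, and
(n − 1) is the number of degrees of freedom, n being the number of items in the sample.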
Conditions for the chi-square test
The following conditions must be satisfied before the chi-square test is
applied:
1. The observations are recorded and collected on a random basis.
2. All the items in the sample are independent.
3. No group contains a frequency of less than 10. Where a frequency is
below 10, the items are regrouped by combining the frequencies of
groups.
4. The total number of individual items in the sample is reasonably
large, about 50 or more.
5. The constraints on the frequencies are linear and do not contain
squares or higher powers of the frequencies.
Chi-square test of independence
• When the chi-square test is used as a test of independence, it allows the
researcher to test whether two attributes are associated or not.
1. A null and an alternative hypothesis are formulated: the null hypothesis is
that the two attributes are not associated, and the alternative hypothesis is
that they are associated.
2. The expected frequencies are calculated from the given data, followed by
the calculation of the chi-square value.
3. The calculated value of chi-square is compared with the table value at the
given level of significance and degrees of freedom to decide between the two
hypotheses.
4. If the calculated value of chi-square is less than the table value, the null
hypothesis is accepted, indicating that there is no relationship between the
two attributes.
5. If the calculated value of chi-square is higher than the table value, the
null hypothesis is rejected in favour of the alternative, indicating that there
is a relationship between the two attributes.
6. The chi-square test only establishes the existence of a relationship, not the
degree of the relationship or its form.
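
The procedure for the test of independence can be sketched in a few lines of Python. This is only an illustration: the 2×2 table below is hypothetical (the numbers are not from the slides), the function name `chi_square_independence` is our own, and the 5 per cent table value for 1 degree of freedom (3.841) is hard-coded rather than looked up.

```python
# Hypothetical 2x2 contingency table (illustrative numbers only):
# rows    = treatment (new medicine, no medicine)
# columns = outcome   (fever, no fever)
observed = [
    [20, 80],
    [40, 60],
]


def chi_square_independence(table):
    """Return (chi2, dof) for a contingency table given as a list of rows."""
    row_totals = [sum(row) for row in table]
    col_totals = [sum(col) for col in zip(*table)]
    grand_total = sum(row_totals)

    chi2 = 0.0
    for i, row in enumerate(table):
        for j, o in enumerate(row):
            # Expected frequency under independence: row total * column total / N
            e = row_totals[i] * col_totals[j] / grand_total
            chi2 += (o - e) ** 2 / e

    # Degrees of freedom: (rows - 1) * (columns - 1)
    dof = (len(table) - 1) * (len(table[0]) - 1)
    return chi2, dof


chi2, dof = chi_square_independence(observed)

# Table value of chi-square at the 5% level for 1 degree of freedom.
CRITICAL_5PCT_DF1 = 3.841
attributes_associated = chi2 > CRITICAL_5PCT_DF1

print(f"chi2 = {chi2:.3f}, d.f. = {dof}, associated: {attributes_associated}")
```

With these made-up counts the calculated value exceeds the table value, so the null hypothesis of no association would be rejected. As noted in step 6, this says nothing about how strong the association is.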
Chi-square test of goodness of fit
• Used as a test of goodness of fit, the chi-square test helps
the researcher to compare the theoretical distribution with the
observed distribution.

• When the calculated value of chi-square is less than the table value
at a certain level of significance, the fit is considered to be good.

• A good fit indicates that the variation between the observed and
expected frequencies is due to fluctuations of sampling.

• However, if the calculated value of chi-square is greater than the
table value, the fit is not considered to be good.
Chi-square test examples
• Testing whether or not a new medication is effective against fever is an
example of the chi-square test used as a test of independence: it determines
whether there is a relationship between the medicine and fever.

• Another example is the testing of a genetic theory which claims that
children having one parent of blood type A and the other of blood type B
will always be of one of three types, A, AB or B, and that the proportions
of the three types will on average be 1 : 2 : 1. On the basis of the
expected and observed outcomes, the goodness of fit of the hypothesis can
be determined.

• Similarly, the test is used in bioinformatics to study the distribution
of different genes, such as disease genes and other important genes.

• Researchers in many different fields use the chi-square test to test
their hypotheses.
Problem 1
• The weights of 10 students are as follows:

Can we say that the variance of the distribution of weights of all the students from which the above
sample of 10 students was drawn is equal to 20 kg²? Test this at the 5 per cent and 1 per cent levels
of significance.
Problem 2
• A sample of 10 is drawn randomly from a certain population. The sum
of the squared deviations from the mean of the given sample is 50.
Test the hypothesis that the variance of the population is 5 at the 5 per
cent level of significance.
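
A short Python sketch of how this problem might be worked. It assumes the sample variance is computed with the (n − 1) divisor, so that χ² = (σs²/σp²)(n − 1) reduces to the sum of squared deviations divided by the hypothesised population variance. The function name `variance_chi_square` is our own, and the 5 per cent table value for 9 degrees of freedom (16.919) is hard-coded.

```python
def variance_chi_square(sum_sq_dev, pop_variance, n):
    """Chi-square statistic for testing a hypothesised population variance.

    Assumes sample variance = sum_sq_dev / (n - 1), so that
    chi2 = (sample variance / pop_variance) * (n - 1) = sum_sq_dev / pop_variance.
    """
    chi2 = sum_sq_dev / pop_variance
    dof = n - 1
    return chi2, dof


# Figures stated in the problem: n = 10, sum of squared deviations = 50,
# hypothesised population variance = 5.
chi2, dof = variance_chi_square(sum_sq_dev=50, pop_variance=5, n=10)

# Table value of chi-square at the 5% level for 9 degrees of freedom.
CRITICAL_5PCT_DF9 = 16.919
accept_h0 = chi2 < CRITICAL_5PCT_DF9

print(f"chi2 = {chi2}, d.f. = {dof}, accept H0: {accept_h0}")
```

Under these assumptions the calculated value is below the table value, so the hypothesis that the population variance is 5 would be accepted at the 5 per cent level.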
CHI-SQUARE AS A NON-PARAMETRIC TEST
• Chi-square is an important non-parametric test and, as such, no rigid
assumptions are necessary in respect of the type of population.
• We require only the degrees of freedom (implicitly, of course, the size
of the sample) for using this test.
• As a non-parametric test, chi-square can be used (i) as a test of
goodness of fit and (ii) as a test of independence.
1. As a test of goodness of fit, χ² enables us to see how well an assumed
theoretical distribution (such as the Binomial, Poisson or Normal
distribution) fits the observed data.
2. As a test of independence, χ² enables us to explain whether or not two
attributes are associated. For instance, we may be interested in knowing
whether a new medicine is effective in controlling fever or not; the χ²
test will help us in deciding this issue.
When chi-square is used as a non-parametric test, whether for testing the
goodness of fit or for testing independence, the following formula is used:

$$\chi^2 = \sum \frac{(O_{ij} - E_{ij})^2}{E_{ij}}$$

• where Oij is the observed frequency of the cell in the ith row
and jth column, and
Eij is the expected frequency of the cell in the ith row
and jth column.
STEPS INVOLVED IN APPLYING CHI-SQUARE TEST
Problem 1
• A die is thrown 132 times with the following results:

• Is the die unbiased?
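
A sketch of how the test for an unbiased die might be set up in Python. The observed counts below are hypothetical placeholders chosen only to sum to 132; they are not the figures from the problem and should be replaced with the actual results. The 5 per cent table value for 5 degrees of freedom (11.070) is hard-coded, and the choice of the 5 per cent level is itself an assumption, since the problem does not state one.

```python
THROWS = 132
FACES = 6

# HYPOTHETICAL counts for faces 1..6 (placeholders, not the problem's data).
observed = [20, 24, 22, 18, 26, 22]
assert sum(observed) == THROWS

# Under H0 (the die is unbiased) every face is equally likely,
# so each expected frequency is 132 / 6 = 22.
expected = THROWS / FACES

chi2 = sum((o - expected) ** 2 / expected for o in observed)
dof = FACES - 1  # one constraint: the frequencies must total 132

# Table value of chi-square at the 5% level for 5 degrees of freedom.
CRITICAL_5PCT_DF5 = 11.070
die_looks_unbiased = chi2 < CRITICAL_5PCT_DF5

print(f"expected per face = {expected}, chi2 = {chi2:.3f}, d.f. = {dof}")
```

The point of the sketch is the structure: equal expected frequencies of 22 per face, 5 degrees of freedom, and a comparison of the calculated value with the table value.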


Problem 2
• Genetic theory states that children having one parent of blood type A
and the other of blood type B will always be of one of three types, A,
AB or B, and that the proportions of the three types will on average be
1 : 2 : 1. A report states that out of 300 children having one A parent
and one B parent, 30 per cent were found to be type A, 45 per cent
type AB and the remainder type B. Test the hypothesis by the χ² test.
• The observed frequencies of types A, AB and B given in the question
are 90, 135 and 75 respectively. The expected frequencies of types A,
AB and B (as per the genetic theory) should have been 75, 150 and 75
respectively. We now calculate the value of χ² as follows:

$$\chi^2 = \frac{(90-75)^2}{75} + \frac{(135-150)^2}{150} + \frac{(75-75)^2}{75} = 3 + 1.5 + 0 = 4.5$$
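
The blood-type calculation can be checked with a short Python sketch using the frequencies stated in the problem. Two things here are assumptions rather than part of the problem statement: the test is carried out at the 5 per cent level, and the corresponding table value for 2 degrees of freedom (5.991) is hard-coded.

```python
# Observed frequencies from the report: 30%, 45% and 25% of 300 children.
observed = {"A": 90, "AB": 135, "B": 75}

# Ratio predicted by the genetic theory.
ratio = {"A": 1, "AB": 2, "B": 1}

total = sum(observed.values())
ratio_sum = sum(ratio.values())

# Expected frequencies: 300 split in the ratio 1 : 2 : 1 -> 75, 150, 75.
expected = {k: total * r / ratio_sum for k, r in ratio.items()}

chi2 = sum((observed[k] - expected[k]) ** 2 / expected[k] for k in observed)
dof = len(observed) - 1  # 3 categories, 1 constraint on the total

# Table value of chi-square at the 5% level for 2 degrees of freedom
# (the 5% level is an assumption; the problem does not state one).
CRITICAL_5PCT_DF2 = 5.991
theory_supported = chi2 < CRITICAL_5PCT_DF2

print(f"expected = {expected}, chi2 = {chi2}, d.f. = {dof}")
```

The calculated value, 3 + 1.5 + 0 = 4.5, is below the table value, so at the assumed 5 per cent level the data do not contradict the 1 : 2 : 1 theory.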
Problem 3

• Show that the sampling technique of at least one research worker is
defective.
