11 Sample Problems On Chi-Square Tests (Chapter 11) - ANSWER KEY

The document provides sample problems and exercises related to Chi-Square Tests, focusing on the distribution of M&M colors and car colors in Oro Valley. It outlines hypotheses, expected values, conditions for validity, calculations for test statistics, and interpretations of P-values. Additionally, it discusses follow-up analyses and statistical software usage for conducting Chi-Square tests for goodness of fit.

Uploaded by

rrk6257

Name: _______________ Period: _

Sample problems on Chi-Square Tests

(Chapter 11)

Which color M&M is the most common?

The company that makes milk chocolate M&Ms claims the following distribution:
13% red, 20% orange, 16% green, 13% brown, 14% yellow, and 24% blue. Is this true?

1. Suppose that we opened a large bag of M&M’s. Think of this bag as being a random
sample of the entire population of M&Ms. You will count the number of candies that you
have and record the counts of each color. Here is the observed frequency:

Red Orange Green Brown Yellow Blue

66 115 85 48 70 107

Total number of M&Ms: 491

2. As a class, write down hypotheses for a significance test.

H0: ρred = 0.13, ρorange = 0.20, ρgreen = 0.16, ρbrown = 0.13, ρyellow = 0.14, ρblue = 0.24

Ha: At least two of the proportions specified in the null hypothesis are not correct.

3. Let’s suppose that the company’s claimed distribution is correct. If it is, how many of
each color would we expect to get in our sample?

Expected values: expected count = n*pi for each category i.

Red: 491*0.13 = 63.83 Orange: 98.20 Green: 78.56

Brown: 63.83 Yellow: 68.74 Blue: 117.84

You must show your work for one of the expected values.
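The expected counts above can be checked with a short R sketch. The variable names (`claimed`, `n`, `expected`) are chosen here for illustration; the proportions and sample size are the ones given in the problem.

```r
# Claimed color proportions from the problem (they should sum to 1)
claimed <- c(red = 0.13, orange = 0.20, green = 0.16,
             brown = 0.13, yellow = 0.14, blue = 0.24)
n <- 491                      # total number of M&Ms in the sample

# Expected count for each color = n * p_i (do not round)
expected <- n * claimed
print(expected)
```

Because the claimed proportions sum to 1, the expected counts add up to the sample size of 491.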

4. Check conditions.

• Random? A bag of M&Ms purchased locally is not an SRS of all M&Ms produced, so
the random condition is not strictly met. However, we believe the bag purchased
locally is representative of all bags. So, we will proceed with caution.
• Independent (10% condition)? It is reasonable to assume that at least
491*10 = 4,910 M&Ms are produced.
• Large counts. Are all the expected counts at least 5? What is the lowest
expected count? All expected counts are at least 5. The smallest is 63.83 (red and brown).

5. Calculations. For this test, the degrees of freedom equal the number of categories
(colors) minus 1. (This is not the sample size n = 491.)

df = number of categories – 1 = 6 – 1 = 5
Use the table to calculate the test statistic.

Color    Observed   Expected   (Observed - Expected)   (Observed - Expected)²   (Observed - Expected)²/Expected
Red      66         63.83      66 - 63.83 = 2.17       (2.17)² = 4.71           0.074
Orange   115        98.20      16.80                   282.24                   2.874
Green    85         78.56      6.44                    41.47                    0.528
Brown    48         63.83      -15.83                  250.59                   3.926
Yellow   70         68.74      1.26                    1.59                     0.023
Blue     107        117.84     -10.84                  117.51                   0.997

Sometimes the observed is greater than expected and sometimes it is less. We square the
difference (O-E) so that all of our values are positive.

Add up all the numbers in the last column. This is our test statistic:
χ² = Σ(O - E)²/E = 8.42
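The table calculation can be reproduced by hand in R, as a sketch. The observed counts and claimed proportions come from the problem; the variable names are illustrative only.

```r
# Observed counts and claimed proportions from the M&M problem
observed <- c(red = 66, orange = 115, green = 85,
              brown = 48, yellow = 70, blue = 107)
claimed  <- c(0.13, 0.20, 0.16, 0.13, 0.14, 0.24)

expected <- sum(observed) * claimed              # n * p_i
contrib  <- (observed - expected)^2 / expected   # one term of the sum per color
chi_sq   <- sum(contrib)                         # test statistic

print(round(contrib, 3))
print(round(chi_sq, 2))
```

Each entry of `contrib` corresponds to one row of the last column of the table, so the largest entry identifies the color that contributes most to the statistic.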

6. What value would we get for the test statistic if our sample was very close to what is
expected? Explain.

It would be a positive value close to zero because the values for (Observed
– Expected)² would be small.

7. What value would we get for the test statistic if our sample was very far from what is
expected? Explain.

It would be a large positive number because the values for (Observed –
Expected)² would be large. A large value is strong evidence against H0.
8. Use Table C or the calculator to find the P-value.

P(χ² > 8.42) = 0.134
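As a cross-check on Table C, the upper-tail area can be computed in R with `pchisq()`. This is a minimal sketch that uses the rounded statistic 8.42 and df = 5 from the steps above.

```r
chi_sq <- 8.42          # test statistic from the table
df     <- 6 - 1         # number of categories minus 1

# P(chi-square > 8.42): the area to the right of the statistic
p_value <- pchisq(chi_sq, df = df, lower.tail = FALSE)
print(round(p_value, 3))
```

Setting `lower.tail = FALSE` returns the right-tail area directly, which is the P-value for a chi-square test.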

Casio calculator syntax for chi-square test for goodness of fit: From the main menu (MENU),
enter the Statistics module (2). Input Observed counts in List1. Separately, calculate Expected
counts and enter them in List2. TEST (F3) → CHI (F3) → GOF (F1)
Observed: List1
Expected: List2
df: 5
CNTRB: List3
Hit the EXE button, which returns
χ² = 8.42 chi-square statistic
p = 0.13446252 P-value
df = 5 degrees of freedom
CNTRB List3 List showing the contributions to the test statistic

9. Make a conclusion. Do the data provide significant evidence that the company was lying
about the distribution of colors of M&Ms? Use α = 0.05.

Because the P-value (0.134) is greater than the significance level (α = 0.05),
we fail to reject H0. This is NOT convincing evidence that the
company was lying about the distribution of colors of M&Ms.

10. Interpret the P-value.

Assuming the company’s claimed color distribution of M&Ms is true, there
is a 0.134 probability of getting a calculated χ² value of 8.42 or greater
purely by chance.

Equivalently, assuming the claimed color distribution is true, there is a
0.134 probability of getting an observed distribution at least this different
from the expected distribution by random chance alone.

11. Follow-up analysis. If you rejected the null hypothesis, which color M&M had an observed
value farthest from the expected value?

Color ________ had the largest contribution (___) to χ². That color had
___ more/fewer observed (___) than expected (___).

R statistical software for chi-square test

MM <- c(66, 115, 85, 48, 70, 107) # Observed counts, in the order red, orange, green, brown, yellow, blue
res <- chisq.test(MM, p = c(0.13, 0.20, 0.16, 0.13, 0.14, 0.24)) # GOF test
res
res$observed # Observed counts
round(res$expected, 3) # Expected counts
Chi-Square Test: Goodness of Fit
Hypotheses:

H0: ρred = 0.13, ρorange = 0.20, ρgreen = 0.16, ρbrown = 0.13, ρyellow = 0.14, ρblue = 0.24.
Ha: At least two of these proportions are not as specified in the null hypothesis.
OR
H0: The claimed distribution (in context) is true.
Ha: The claimed distribution (in context) is not true.

Conditions:

✓ Random. The data come from a random sample or randomized experiment.

✓ 10%: When sampling without replacement, check that n ≤ N/10.

✓ Large counts. The chi-square approximation used in the goodness-of-fit test
becomes more accurate as the counts get larger. A conservative check
for large counts is that all expected counts (rather than observed counts) must be at
least 5. Show the lowest expected count.

The expected count for each category is obtained by multiplying the hypothesized
proportion for that category by the sample size. That is, expected count = n*pi for each
category i. Don’t round the expected counts!

In the formula, you must use counts, not proportions.

In the goodness-of-fit test, degrees of freedom (df) = number of categories - 1.
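The Large Counts check can be sketched in R. The helper `check_large_counts()` is hypothetical (it is not part of base R or of this worksheet); the inputs below are the M&M sample size and claimed proportions.

```r
# Hypothetical helper: computes expected counts and checks that all are at least 5
check_large_counts <- function(n, p, minimum = 5) {
  expected <- n * p
  list(expected = expected,
       smallest = min(expected),
       ok       = all(expected >= minimum))
}

result <- check_large_counts(491, c(0.13, 0.20, 0.16, 0.13, 0.14, 0.24))
print(result$smallest)   # lowest expected count, to report in the answer
print(result$ok)         # TRUE when the condition is met
```

Note that the check uses expected counts, not observed counts, as the summary above states.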

Check Your Understanding (fair six-sided die?)
Carrie made a 6-sided die in her ceramics class and rolled it 90 times to test if each side
was equally likely to show up. The table summarizes the outcomes of her 90 rolls.

Outcome   1    2    3    4    5    6
Count     12   28   12   13   10   15

(a) State the hypotheses that Carrie should test.

H0: The probability of getting each outcome is 1/6, i.e., the die is fair.

Ha: The probability of getting one or more of the outcomes is not 1/6, i.e., the die is not fair.

(b) Calculate the expected count for each of the possible outcomes.

The expected count for each outcome is np = 90(1/6) = 15.

(d) Which degrees of freedom should you use?

df = 6 – 1 = 5
(e) Use Table C or your calculator to find the P-value. What conclusion would you make?
P(χ² > 14.4) = 0.0133

Because the P-value (0.0133) is less than the significance level (α = 0.05),
we reject H0. This is convincing evidence that the die is not fair.

Table: Using df = 5, P-value < 0.05 because the calculated χ² statistic
(14.4) is greater than the critical value (11.07) in the df = 5 row.
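The Table C comparison can be sketched in R with `qchisq()`, which returns the critical value for a given tail area. The counts are the ones entered in the R code for this problem; the other names are illustrative.

```r
die      <- c(12, 28, 12, 13, 10, 15)     # observed counts for sides 1 to 6
expected <- rep(sum(die) / 6, 6)          # 90 * (1/6) = 15 for each side

chi_sq   <- sum((die - expected)^2 / expected)   # test statistic
critical <- qchisq(1 - 0.05, df = 5)             # cutoff with 0.05 in the upper tail

print(chi_sq)
print(round(critical, 2))
print(chi_sq > critical)   # TRUE means the P-value is below 0.05
```

A statistic beyond the critical value and a P-value below α are two ways of stating the same decision.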

(f) Which side of the die had an observed value the farthest from the expected?

The side with 2 dots had 13 (=28-15) more observed than expected.

If χ² is statistically significant, be prepared to discuss which outcomes were the largest
contributors to χ². To see which outcome had the biggest contribution to χ², go to List3
on your calculator. Find the largest contributions, then calculate (O – E) and
discuss the difference between observed and expected.

Casio calculator syntax for chi-square test for goodness of fit: From the main menu
(MENU), enter the Statistics module (2). Input Observed counts in List1. Separately,
calculate Expected counts and enter them in List2. TEST (F3) → CHI (F3) → GOF (F1)
Observed: List1
Expected: List2
df: 5 (df = number of categories - 1)
CNTRB: List3
Hit the EXE button, which returns
χ² = 14.4 chi-square statistic
p = 0.01325859 P-value
df = 5 degrees of freedom
CNTRB List3 List showing the contributions to the test statistic

R statistical software code

die <- c(12, 28, 12, 13, 10, 15)
res <- chisq.test(die, p = c(1/6, 1/6, 1/6, 1/6, 1/6, 1/6))
res
res$observed # Observed counts
round(res$expected,2) # Expected counts
residuals = res$residuals # Pearson residuals r = (O-E)/sqrt(E)
contrib = (res$residuals)^2 # Contributions to chi-square: (O-E)^2/E
contrib

Chi-Square Test for Goodness of Fit

The sampling distribution of the chi-square statistic is not a Normal
distribution. It is a right-skewed distribution that takes only nonnegative values
because χ² can never be negative.

Within the family of χ² density curves, the skew becomes less
pronounced as the degrees of freedom increase.
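The claim about skew can be illustrated with a short R sketch. It relies on standard facts not stated in the worksheet: a chi-square distribution with k degrees of freedom has mean k and skewness sqrt(8/k). The chosen df values are arbitrary examples.

```r
df <- c(1, 2, 5, 10, 30)          # example degrees of freedom

# Theoretical skewness of a chi-square distribution: sqrt(8 / df)
skewness <- sqrt(8 / df)

print(data.frame(df = df, mean = df, skewness = round(skewness, 2)))
```

The skewness column shrinks toward 0 as df grows, which matches the less pronounced right skew of the density curves.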

Check Your Understanding (choice of car color)
Does the warm, sunny weather in Arizona affect a driver’s choice of car color? Cass thinks
that Arizona drivers might opt for a lighter color with the hope that it will reflect some of the
heat from the sun. To see whether the distribution of car colors in Oro Valley, near Tucson,
is different from the distribution of car colors across North America, she selected a random
sample of 300 cars in Oro Valley. The table shows the distribution of car color for Cass’s
sample in Oro Valley and the distribution of car color in North America, according to
www.ppg.com.

1. Do these data provide convincing evidence that the distribution of car color in Oro Valley
differs from the North American distribution? Use the four-step (ICCI) process.

Identify: We want to perform a test of the following hypotheses using α = 0.05:

H0: The distribution of car colors in Oro Valley is the same as the distribution of car
colors in North America.

Ha: The distribution of car colors in Oro Valley is not the same as the distribution of
car colors in North America.

Conditions: If conditions are met, we will perform a chi-square test for goodness of fit
(“GOF”).
✓ Random: The data came from a random sample of 300 cars in Oro Valley.
✓ Independent (10% condition): Because we are sampling without replacement,
there must be at least 10(300) = 3000 cars in Oro Valley. This is reasonable to
assume.
✓ Large Counts: The expected counts are 300(0.23) = 69, 300(0.18) = 54,
300(0.16) = 48, 300(0.15) = 45, 300(0.10) = 30, 300(0.09) = 27, 300(0.02) = 6,
300(0.07) = 21. All expected counts are at least 5. (The smallest is 6 for green.)
Calculate:
$$\chi^2 = \frac{(84 - 69)^2}{69} + \frac{(38 - 54)^2}{54} + \cdots = 29.92$$
Degrees of freedom = df = 8 − 1 = 7
• Using technology: Using df = 7, χ²cdf(lower: 29.92, upper: 1000, df: 7) reveals
P(χ² ≥ 29.92) = 0.0000982 ≈ 0.
• OR Using the calculator’s χ² test with df = 7, P(χ² ≥ 29.92) = 0.0000982 ≈ 0.

Conclude: Because the P-value of approximately 0 is less than α = 0.05, we reject H0.
We have convincing evidence that the distribution of car colors in Oro Valley is not
the same as the distribution of car colors in North America.

2. If there is convincing evidence of a difference in the distribution of car color, perform a
follow-up analysis.

The largest contributions to χ² came from

• “other”, which had 18 (= 39 − 21) more cars than expected, and
• “gray”, which had 17 (= 48 − 31) fewer cars than expected.

Casio calculator: Open the TEST soft menu (F3) → CHI (F3) → choose the goodness-of-fit test GOF (F1)
Observed: List1
Expected: List2
df: 7
CNTRB: List3
Hit the EXE button, which returns
χ² = 29.9213854
p = 9.8165 × 10⁻⁵
df = 7
CNTRB: List3

R statistical software code

Oro <- c(84, 38, 31, 46, 27, 29, 6, 39)
out <- chisq.test(Oro, p = c(0.23, 0.18, 0.16, 0.15, 0.10, 0.09, 0.02, 0.07)) # GOF test
out
round(out$expected,2) # Expected counts
residuals = out$residuals # Pearson residuals r = (O-E)/sqrt(E)
contrib = (out$residuals)^2 # Contributions to chi-square: (O-E)^2/E
round(contrib, 2)

Does gummy bear brand matter?

Is the distribution of gummy bear colors the same for Haribo gummy bears and Great
Value (Walmart brand) gummy bears? We’ll collect data as a class and determine if we
have convincing evidence of a difference.

Suppose that we open a large bag of each brand of gummy bears and observe the
following distribution of colors. Fill in the table with the totals.

                Brand
Color     Haribo   Great Value   Total
Red       181      79            260
Green     91       59            150
Yellow    105      55            160
Orange    123      57            180
Clear     100      50            150
Total     600      300           900

1. How many samples do we have? What population are they from? Explain.
We have two samples: one sample from all Haribo brand gummy
bears and one sample from all Great Value brand gummy bears.

This question may look like the M&M goodness-of-fit question from the previous lesson,
but there is a very important distinction. With the M&M question, we were comparing data
from one sample to a claimed distribution of color. With this gummy bear question, we are
comparing data from one sample to data from another sample. This is analogous to the
difference between a one-proportion z-test and a two-proportion z-test.

2. How many variables are we examining? Explain.

We are examining one variable, color, in both populations.
3. As a class, write down hypotheses for a significance test.

H0: The color distribution is the same for both the Haribo and Great Value brands.

Ha: The color distribution is not the same for the Haribo and Great Value brands.

4. Now we will use a chi-square test to test whether there is a difference between the
two populations. We first need to find the expected values. Complete the table below
by writing down the value of the expected count in the space provided in each cell.
Show your calculations for one expected count.
The expected count in a particular cell of a two-way table of categorical data can be
calculated using the formula:
Expected count = (row total)(column total)/(table total)
For example, the expected count for Red Haribo = (260)(600)/900 = 173.33.

Observed counts, with expected counts in parentheses:

                Brand
Color     Haribo          Great Value    Total
Red       181 (173.33)    79 (86.67)     260
Green     91 (100.00)     59 (50.00)     150
Yellow    105 (106.67)    55 (53.33)     160
Orange    123 (120.00)    57 (60.00)     180
Clear     100 (100.00)    50 (50.00)     150
Total     600             300            900
Always write down the values of the expected count. Show your work for one of
the expected counts. Don’t round expected counts!
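The expected-count formula can be applied to every cell at once in R. This is a sketch using the gummy bear counts from the table; `outer()` multiplies each row total by each column total, and the variable names are illustrative.

```r
# Observed counts (R fills a matrix column by column: Haribo first, then Great Value)
observed <- matrix(c(181, 91, 105, 123, 100,
                     79, 59, 55, 57, 50),
                   nrow = 5,
                   dimnames = list(Color = c("Red", "Green", "Yellow", "Orange", "Clear"),
                                   Brand = c("Haribo", "Great Value")))

# Expected count = (row total)(column total) / (table total), for every cell
expected <- outer(rowSums(observed), colSums(observed)) / sum(observed)
print(round(expected, 2))
```

The rounding is applied only when printing, so later calculations can still use the unrounded expected counts.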

5. Use your work above to complete a 4-step (ICCI) significance test. α = 0.05

Identify: We want to perform a test of the following hypotheses at the α = 0.05 level:

H0: The color distribution is the same for both the Haribo and Great Value brands.

Ha: The color distribution is not the same for the Haribo and Great Value brands.

Conditions: If conditions are met, we will perform a chi-square test for homogeneity.
✓ Random: The data came from two independent random samples.
✓ 10%: 600 is less than 10% of all Haribo gummy bears and 300 is less
than 10% of all Great Value gummy bears. So, n ≤ N/10 for
both.
✓ Large Counts: The expected counts (in the table above) are all at
least 5. The smallest is 50 for Great Value Green and Clear.

Calculate: Test statistic:
$$\chi^2 = \frac{(181 - 173.33)^2}{173.33} + \frac{(91 - 100)^2}{100} + \cdots = 3.75$$
df = (# of rows – 1)∙(# of columns – 1) = (5 – 1)(2 – 1) = 4
Using technology: χ²cdf(lower: 3.75, upper: 1000, df: 4) gives
P-value = P(χ² ≥ 3.75) = 0.441.
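The calculator trick of using a large upper bound such as 1000 can be compared with the exact upper tail in R. This sketch uses the rounded statistic 3.75; the variable names are illustrative.

```r
n_rows <- 5
n_cols <- 2
df <- (n_rows - 1) * (n_cols - 1)     # degrees of freedom for a two-way table

# Exact upper-tail area
p_upper <- pchisq(3.75, df = df, lower.tail = FALSE)

# Calculator-style area between 3.75 and a large upper bound
p_bounded <- pchisq(1000, df = df) - pchisq(3.75, df = df)

print(round(c(p_upper, p_bounded), 3))
```

The two values agree because essentially no area lies beyond 1000 under a chi-square curve with 4 degrees of freedom.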

Conclude: Because the P-value of 0.441 is greater than α = 0.05, we fail to reject H0.
We do not have convincing evidence that there is a difference in the true distributions of
color for Haribo gummy bears and Great Value gummy bears.

Casio calculator syntax for chi-square test for homogeneity/independence: In the
STATS module, TEST (F3) → CHI (F3) → 2WAY (F2) → ►MAT (F2).
✓ Navigate to the matrix you want to use, e.g., Observed count: MAT A and hit
EXE.
✓ Specify the matrix dimensions: m is for rows, n is for columns.
Here, m = 5 and n = 2.
✓ Enter the data.
✓ Return to the test page by hitting EXIT twice.
✓ Optionally, do the same thing for Expected count: MAT B. If you don’t, the
calculator will generate Expected counts.
✓ Hit the EXE button, which returns
χ² = 3.75043269
p = 0.44083332
df = 4

R statistical software
data = matrix(c(181, 91, 105, 123, 100, 79, 59, 55, 57, 50), nrow=5)
# By default, R fills a matrix by column, so enter the first column, then the second column, etc.
colnames(data)=c("Haribo", "Great Value") # Column names
rownames(data)=c("Red", "Green", "Yellow", "Orange", "Clear") # Row names
data
test <- chisq.test(data)
test
marginals = addmargins(data) # Add row and column totals
marginals
round(test$expected, 2) # Expected counts under null
residuals = test$residuals # Pearson residuals r = (O-E)/sqrt(E)
contrib = (test$residuals)^2 # Contributions
round(contrib, 2)

6. Explain how this test is different from a chi-square test for goodness of fit.
Here, we have two samples from two populations. We are comparing data from one
sample to data from another sample.

With the χ² test for goodness of fit, we had one sample from one population. In that test,
we compared data from one sample to a claimed distribution of color.

Chi-Square Test for Homogeneity
Hypotheses for chi-square test for homogeneity:

H0 : There is no difference in the (categorical variable) distribution for (population1)

and (population2).

Ha: There is a difference in the (categorical variable) distribution for (population1)

and (population2).

The expected count in a particular cell of a two-way table of categorical data can be
calculated using the formula:

Expected count = (row total)*(column total)/(table total)

Always write down the values of the Expected counts. Don’t round. Show your
calculations for at least one term.

Conditions
✓ Random. Data should be collected using a stratified random sample or a
randomized experiment.
✓ 10%: When sampling without replacement, check that n ≤ N/10 for both
samples.
✓ Large counts. A conservative check for large counts is that all expected counts must
be at least 5.

df = (# of rows – 1)*(# of columns – 1)

Difference between χ² test for goodness of fit and χ² test of homogeneity

• χ² test of GOF: 1 sample, 1 variable
• χ² test of homogeneity: 2 samples, 1 variable

If you reject H0 and are asked to do a follow-up (contribution) analysis, look to see
which cells have the largest contribution to the χ² statistic and discuss the difference
between observed counts and expected counts.

Check Your Understanding (gender of interviewer)
For a class project, Abby and Mia wanted to know if the gender of an interviewer could
affect the responses to a survey question. The subjects in their experiment were 100
males from their school. Half of the males were randomly assigned to be asked, “Would
you vote for a female president?” by a female interviewer. The other half of the males
were asked the same question by a male interviewer. The table shows the results.

(a) State the appropriate null and alternative hypotheses.

H0: There is no difference in the distribution of response to the question when
asked by a male or female.

Ha: There is a difference in the distributions.

(b) Show the calculation for the expected count in the Male/Yes cell. Then provide a
complete table of expected counts.
Expected count for Male/Yes cell = (50)(69)/100 = 34.5

         M      F
Yes      34.5   34.5
No       5.5    5.5
Maybe    10     10

(c) Show that the conditions have been met.

✓ Random: The two treatments (male or female interviewer) were assigned at random.
✓ 10%: Because this is an experiment and we are not sampling without replacement
from a population, we do not need to check the 10% condition.
✓ Large Counts: The expected counts (listed above) are all at least 5. The smallest
is 5.5 for No.

(d) Calculate the value of the chi-square test statistic.

Test statistic:
$$\chi^2 = \frac{(30 - 34.5)^2}{34.5} + \frac{(39 - 34.5)^2}{34.5} + \cdots = 4.25$$
With df = (3-1)*(2-1) = 2, χ²cdf(lower: 4.25, upper: 1000, df: 2) reveals
P-value = P(χ² ≥ 4.25) = 0.1196.

Are Taco Tongue and Evil Eyebrow independent?

Is there an association between the Taco Tongue and the Evil Eyebrow? Below is the data
for a random sample of 600 senior students. Do we have convincing evidence that the
ability to do the Taco Tongue and Evil Eyebrow are associated?

1. Describe what it means for two events to be independent. (Chapter 5)

Two events are independent if the occurrence of one does not affect the probability of the
other, i.e., P(A) = P(A | B) = P(A | Bᶜ), where Bᶜ is the complement of B.

In chapter 5, we learned about making a claim about independence within a sample. That
method did not make a claim about the population. The chi-square technique that we learn
about in this chapter allows us to make a claim about the population.
2. Calculate the expected counts. For example, the expected count for the
Taco Tongue Yes / Evil Eyebrow Yes cell is (480)(200)/600 = 160.

Observed:
                     Evil Eyebrow
                     Yes    No     Total
Taco Tongue   Yes    180    300    480
              No     20     100    120
Total                200    400    600

Expected:
                     Evil Eyebrow
                     Yes    No     Total
Taco Tongue   Yes    160    320    480
              No     40     80     120
Total                200    400    600
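The Chapter 5 idea of independence can be checked within the sample using the observed table. This R sketch compares P(Taco Tongue) with the two conditional proportions; the variable names are illustrative.

```r
# Observed counts: rows = Taco Tongue (Yes, No), columns = Evil Eyebrow (Yes, No)
observed <- matrix(c(180, 20, 300, 100), nrow = 2,
                   dimnames = list(TacoTongue  = c("Yes", "No"),
                                   EvilEyebrow = c("Yes", "No")))

p_taco               <- sum(observed["Yes", ]) / sum(observed)              # 480/600
p_taco_given_evil    <- observed["Yes", "Yes"] / sum(observed[, "Yes"])     # 180/200
p_taco_given_no_evil <- observed["Yes", "No"]  / sum(observed[, "No"])      # 300/400

print(c(p_taco, p_taco_given_evil, p_taco_given_no_evil))
```

The three proportions differ within the sample; the chi-square test that follows addresses whether such a difference is convincing evidence of an association in the population.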
3. Do the data provide significant evidence that there is an association between the ability to
Taco Tongue and Evil Eyebrow? Use α = 0.05
Identify: We want to perform a test of the following hypotheses using α = 0.05:
H0: There is no association between being able to make an Evil Eyebrow and being
able to make a Taco Tongue in the population of seniors.
Ha: There is an association between being able to make an Evil Eyebrow and being
able to make a Taco Tongue in the population of seniors.
Conditions: If conditions are met, we will perform a chi-square test for independence.
✓ Random: Random sample of 600 seniors.
✓ 10%: The sample of 600 seniors is less than 10% of all seniors.
✓ Large Counts: The expected counts (see table above) are all at least 5.
(The smallest is 40 > 5)

Calculate: Test statistic
$$\chi^2 = \frac{(180 - 160)^2}{160} + \cdots + \frac{(100 - 80)^2}{80} = 18.75$$

df = (2-1)*(2-1) = 1

Using technology: χ²cdf(lower: 18.75, upper: ∞, df: 1) reveals P-value =
P(χ² ≥ 18.75) ≈ 0.000015

Conclude: Because the P-value (0.000015) is less than α = 0.05, we reject H0. The
data provide convincing evidence to conclude that there is an association between
being able to make a Taco Tongue and being able to make an Evil Eyebrow in the
population of seniors.

Casio calculator syntax for chi-square test for homogeneity/independence:

In the STATS module, TEST (F3) → CHI (F3) → 2WAY (F2) → ►MAT (F2).
✓ Navigate to the matrix you want to use, e.g., Observed count: MAT A
and hit EXE.
✓ Specify the matrix dimensions: m is for rows, n is for columns.
Here, m = 2 and n = 2.
✓ Enter the data.
✓ Return to the test page by hitting EXIT twice.
✓ Optionally, do the same thing for Expected count: MAT B. If you
don’t, the calculator will generate Expected counts.

R statistical software
data = matrix(c(180, 20, 300, 100), nrow=2)
# By default, R fills a matrix by column, so enter the first column, then the second column, etc.
colnames(data)=c("Yes Evil Eyebrow","No Evil Eyebrow") # Column names
rownames(data)=c("Yes Taco Tongue","No Taco Tongue") # Row names
marginals = addmargins(data) # Add row and column totals
marginals
test <- chisq.test(data, correct = FALSE) # Chi-square test for two-way table without continuity correction
test
round(test$expected, 2) # Expected counts
residuals = test$residuals # Pearson residuals r = (O-E)/sqrt(E)
contrib = (test$residuals)^2 # Contributions
contrib

Chi-Square Test for Independence
Hypotheses for chi-square test for independence:

H0 : There is no association between variable A (e.g., gender identity) and variable B

(e.g., voting preferences) in the population, i.e., the variables A & B are
independent.

Ha: There is an association between variable A (gender identity) and variable B

(voting preferences) in the population, i.e., the variables A & B are dependent.

The difference between chi-square tests for homogeneity and tests for
association/independence rests on how you get the data.
• For homogeneity of populations: One categorical variable is observed in two or
more populations (groups) from a stratified random sample or randomized
experiment, e.g., remember the context of gummy bears. Data from a randomized
experiment are always analyzed with a test of homogeneity.
• For association/independence: Two categorical variables are observed in a
single population, e.g., remember the context of Evil Eyebrow.

For chi-square tests for goodness of fit, we have one variable and one population, e.g.,
remember the context of M&Ms.

The calculator mechanics for the chi-square tests for homogeneity and independence
are the same.

Are gender and favorite class independent?

Is there an association between gender and preference for English or math class? Below is
the data for a random sample of senior students. Do we have convincing evidence that
gender and favorite class are associated?

1. Calculate the expected counts.

Observed: Expected:
English Math Total English Math Total
Female 43 22 65 Female 36.49 28.51 65
Male 21 28 49 Male 27.51 21.49 49
Total 64 50 114 Total 64 50 114

2. Do the data provide significant evidence that there is an association between gender and
preference for English or math class? Use α = 0.05

Identify: We want to perform a test of the following hypotheses using α = 0.05:

H0: There is no association between Gender and favorite class (English or Math) in the
population of senior students.
Ha: There is an association between Gender and favorite class in the population of
senior students.
Conditions: If conditions are met, we will perform a chi-square test for independence.
✓ Random: Random sample of 114 seniors.
✓ 10%: The sample of 114 seniors is less than 10% of all seniors.
✓ Large Counts: The expected counts (see table above) are all at least 5.
(The smallest is 21.49 > 5)

Calculate: Test statistic (computed with unrounded expected counts)
$$\chi^2 = \frac{(43 - 36.49)^2}{36.49} + \cdots + \frac{(28 - 21.49)^2}{21.49} = 6.1582$$
df = (2-1)*(2-1) = 1
P-value = P(χ² ≥ 6.1582) ≈ 0.01308
χ²cdf(lower: 6.1582, upper: ∞, df: 1)

Table: Using df = 1, P-value < 0.05 because the calculated χ² statistic (6.1582) is
greater than the critical value (3.84) in the df = 1 row.

Conclude: Because the P-value (0.0131) is less than α = 0.05, we reject H0. The data
provide convincing evidence to conclude that there is an association between Gender
and preference for English or Math among senior students.

R statistical software code

data = matrix(c(43, 21, 22, 28), nrow=2)
# By default, R fills a matrix by column, so enter the first column, then the second column, etc.
colnames(data)=c("English", "Math") # Column names
rownames(data)=c("Female", "Male") # Row names
data
test <- chisq.test(data, correct = FALSE) # Chi-square test for two-way table
test
marginals = addmargins(data) # Add row and column totals
marginals
round(test$expected,2) # Expected counts under null
residuals = test$residuals # Pearson residuals r = (O-E)/sqrt(E)
contrib = (test$residuals)^2 # Contributions
contrib

Check Your Understanding (pick the test)
For each of the following situations decide what type of chi square test is
appropriate. Explain.

1. Shopping at secondhand stores is becoming more popular and has even
attracted the attention of business schools. A study of customers’ attitudes
toward secondhand stores interviewed separate random samples of shoppers at
two secondhand stores of the same chain in different cities. The two-way table
shows the breakdown of respondents by gender.

χ² test for homogeneity.

Two separate random samples.
One variable (gender).
The study wants to see whether the different
stores have similar gender distributions (homogeneous).

2. The General Social Survey (GSS) asked a random sample of adults their opinion
about whether astrology is very scientific, sort of scientific, or not at all scientific.
Here is a two-way table of counts for people in the sample who had three levels
of higher education:

χ² test for independence.

One random sample.
Two variables (Degree & Opinion)
We want to know whether the
variables are independent.

3. Casinos are required to verify that their games operate as advertised. American
roulette wheels have 38 slots—18 red, 18 black, and 2 green. In one casino,
managers record data from a random sample of 200 spins of one of their
American roulette wheels. The table displays the results.

χ² test for goodness of fit.

One sample and one variable.

Check your understanding (Ibuprofen or acetaminophen?)
In a study reported by the Annals of Emergency Medicine (March 2009), researchers
conducted a randomized, double-blind clinical trial to compare the effects of ibuprofen
and acetaminophen plus codeine as a pain reliever for children recovering from arm
fractures. There were many response variables recorded, including the presence of any
adverse effect, such as nausea, dizziness, and drowsiness. Here are the results:

Ibuprofen Acetaminophen plus codeine Total

Adverse effects 36 57 93
No adverse effects 86 55 141
Total 122 112 234

(a) Which type of chi-square test is appropriate here? Explain.

Chi-square test for homogeneity. The data come from a randomized experiment, and data
from experiments are analyzed with a test of homogeneity. Here, one group took ibuprofen,
and another took acetaminophen plus codeine.

(b) Do these data provide convincing evidence at the α = 0.05 level that there is a
difference in the proportion of subjects who had adverse effects across treatments?

Identify: We want to perform a test of the following hypotheses using α = 0.05:

H0: There is no difference in the proportions of patients like these who suffer adverse
effects when taking ibuprofen or acetaminophen plus codeine.
Ha: There is a difference in the proportions of patients like these who suffer adverse
effects when taking ibuprofen or acetaminophen plus codeine.

Conditions: If conditions are met, we will perform a chi-square test for homogeneity.
✓ Random: The treatments were assigned at random.
✓ Independent: Knowing if one subject had an adverse effect shouldn’t give
any additional information about the responses of other subjects, so the
observations can be considered independent. (Remember, do not check
the 10% condition for experiments.)
✓ Large Sample Size: The expected counts (listed below) are all at least 5.

Expected counts       Ibuprofen   Acetaminophen plus codeine   Total
Adverse effects       48.5        44.5                         93
No adverse effects    73.5        67.5                         141
Total                 122         112                          234

Calculate: Test statistic:
$$\chi^2 = \frac{(36 - 48.5)^2}{48.5} + \cdots = 11.15$$
df = (2 – 1)∙(2 – 1) = 1.
Using technology: χ²cdf(lower = 11.15, upper = 1000, df = 1) reveals
P-value = P(χ² ≥ 11.15) = 0.00084
Conclude: Because the P-value (0.00084) is less than α = 0.05, we reject H0. We have
convincing evidence that there is a difference in the proportions of patients like these
who suffer adverse effects when taking ibuprofen or acetaminophen plus codeine.

The chi-square test for homogeneity based on a 2x2 two-way table is equivalent to the
two-sample z test for ρ1 = ρ2 with a two-sided alternative hypothesis.

(c) Show that a two-sample z test for a difference in proportions gives
the same P-value.
Identify: Since we are comparing the proportion of subjects with adverse effects for just
two treatments, we can use a two-sample z test to test the following hypotheses:

H0 : ρI − ρA = 0

Ha : ρI − ρA ≠ 0

where
ρI = the true proportion of adverse effects for Ibup. users, and
ρA = the true proportion of adverse effects for Acet. users.
Conditions: Two-proportion z-test.
ü Random: Same as for the chi-square test of homogeneity.
ü Independent: Same as for the chi-square test of homogeneity.
ü Large count: Success and failures are all greater than 10 {36, 86, 57, 55}.
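Calculate: When the conditions are met, the two-sample z test for a difference in
proportions p1 − p2 uses the test statistic

z = (p̂1 − p̂2) / √( p̂C(1 − p̂C)(1/n1 + 1/n2) ) ≈ −3.339

where p̂1 = 36/122, p̂2 = 57/112, and p̂C is the pooled proportion.

Pooled proportion calculation:

p̂C = (X1 + X2)/(n1 + n2) = (36 + 57)/(122 + 112) = 93/234 and q̂C = 1 − p̂C = 141/234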

normalcdf(lower= -∞, upper= -3.33924, σ=1, µ=0) reveals P(Z < -3.33924) = 0.00042
P-value = P(Z ≤ -3.33924 or Z ≥ 3.33924) = 2*(0.00042) = 0.00084
Conclude: Same conclusion as for the chi-square test of homogeneity above.
Note that the P-value from the two-sample z test is the same as the P-value from the
chi-square test.
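
The two-sample z calculation can be sketched in Python as a check on the calculator work. This is an illustrative sketch using only the standard library, with variable names chosen here for readability; the two-sided P-value is computed as P(|Z| ≥ |z|) = erfc(|z|/√2).

```python
import math

x1, n1 = 36, 122   # adverse effects and group size, ibuprofen
x2, n2 = 57, 112   # adverse effects and group size, acetaminophen plus codeine

p1_hat = x1 / n1
p2_hat = x2 / n2

# Pooled proportion, used because H0 says the two proportions are equal.
pooled = (x1 + x2) / (n1 + n2)

standard_error = math.sqrt(pooled * (1 - pooled) * (1 / n1 + 1 / n2))
z = (p1_hat - p2_hat) / standard_error

# Two-sided P-value: area in both tails beyond |z|.
p_value = math.erfc(abs(z) / math.sqrt(2))

print("pooled proportion =", round(pooled, 4))
print("z =", round(z, 3))
print("two-sided P-value =", round(p_value, 5))
```

The z statistic should come out near −3.339 and the two-sided P-value near 0.00084, matching the chi-square P-value.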
(d) Show that the square of the calculated z-statistic from the two-sample z test is equal
to the calculated chi-square statistic from the test of homogeneity.

z² = (−3.339)² = 11.15 = χ²
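
The identity z² = χ² can also be confirmed numerically. The sketch below is a hedged illustration, not part of the original key: the helper names `chi_square_2x2` and `two_proportion_z` are hypothetical, and the functions simply apply the two formulas used in this problem to the same data.

```python
import math

def chi_square_2x2(observed):
    """Chi-square statistic for a two-way table of observed counts."""
    row_totals = [sum(row) for row in observed]
    col_totals = [sum(col) for col in zip(*observed)]
    total = sum(row_totals)
    statistic = 0.0
    for i, row in enumerate(observed):
        for j, obs in enumerate(row):
            exp = row_totals[i] * col_totals[j] / total
            statistic += (obs - exp) ** 2 / exp
    return statistic

def two_proportion_z(x1, n1, x2, n2):
    """Pooled two-sample z statistic for H0: p1 = p2."""
    pooled = (x1 + x2) / (n1 + n2)
    standard_error = math.sqrt(pooled * (1 - pooled) * (1 / n1 + 1 / n2))
    return (x1 / n1 - x2 / n2) / standard_error

z = two_proportion_z(36, 122, 57, 112)
chi_square = chi_square_2x2([[36, 57], [86, 55]])

print("z squared  =", round(z ** 2, 2))
print("chi-square =", round(chi_square, 2))
print("equal up to rounding:", math.isclose(z ** 2, chi_square, rel_tol=1e-9))
```

Because the equality is algebraic, the same check should pass for any 2x2 table with positive row and column totals, not only for this data set.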
Casio calculator syntax for two-sample z test: Go to module 2 (Statistics) → TEST (F3)
→ Z (F1) → 2-PROP (F4). Calculator input:
p1 ≠ p2
x1 = 36 x2 = 57
n1 = 122 n2 = 112
Press EXE, which returns
z = -3.33924
p = 0.00084

When should you use a chi-square test and when should you use a two-sample z test?

Here are some things to keep in mind:

• The chi-square test is always two-sided. That is, it only tests for a difference in
the two proportions. If you want to test whether one proportion is larger than the
other (a one-sided test), use the two-sample z test.
• If you want to estimate the difference between two proportions, use a two-sample
z interval. There are no confidence intervals that correspond to chi-square tests.
• If you are comparing more than two treatments or the response variable has
more than two categories, you must use a chi-square test.
• You can also use a chi-square goodness-of-fit test in place of a one-sample z
test for a proportion if the alternative hypothesis is two-sided. The chi-square test
will use two categories (success and failure) and have df = 2 – 1 = 1.
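
The last point can be illustrated with a small Python sketch. The data here are hypothetical and chosen only for easy arithmetic (60 successes in 100 trials, with a null value of 0.5); they do not come from this assignment. The sketch compares the one-sample z statistic with the two-category goodness-of-fit statistic.

```python
import math

# Hypothetical data, for illustration only.
n = 100            # sample size
successes = 60     # observed successes
p0 = 0.5           # proportion claimed by H0

# One-sample z test for a proportion.
p_hat = successes / n
z = (p_hat - p0) / math.sqrt(p0 * (1 - p0) / n)

# Chi-square goodness-of-fit test with two categories: success and failure.
observed = [successes, n - successes]
expected = [n * p0, n * (1 - p0)]
chi_square = sum((o - e) ** 2 / e for o, e in zip(observed, expected))
df = len(observed) - 1

# Two-sided z P-value and df = 1 chi-square P-value use the same tail area.
p_value_z = math.erfc(abs(z) / math.sqrt(2))
p_value_chi = math.erfc(math.sqrt(chi_square / 2))

print("z squared  =", round(z ** 2, 4))
print("chi-square =", round(chi_square, 4))
print("P-values   =", round(p_value_z, 4), round(p_value_chi, 4))
```

With these made-up counts, z is about 2 and the chi-square statistic is 4, so the two tests give the same two-sided P-value.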

Chi-square test of homogeneity

• Two or more samples and one variable
• All experiments
Example: One SRS of male voters and another SRS of female voters.
Hypotheses:
H0: There is no difference in the distribution of a categorical variable (e.g.,
political affiliation) across several populations (e.g., gender) or treatments.
Ha: There is a difference in the distribution of a categorical variable
(political affiliation) across several populations (gender) or treatments.

Chi-square test of independence

• One sample from one population. Then sort by two variables.
Example: Collect one sample of CHS students and then sort by gender and
political affiliation.
Hypotheses:
H0: There is no association between variable A (e.g., gender) and variable B
(e.g., political affiliation) in the population (of CHS students), i.e., the
variables A & B are independent.
Ha: There is an association between variable A (gender) and variable B
(political affiliation) in the population, i.e., the variables A & B are dependent.

Many of the problems in this assignment are based on problems from
https://www.statsmedic.com/.

