0% found this document useful (0 votes)

36 views16 pages

Lecture3 - Contingency Analysis

Course

Uploaded by

audengweha

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

36 views16 pages

Lecture3 - Contingency Analysis

Course

Uploaded by

audengweha

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 16

1.

Contingency Analysis ("crosstab“)/ Measures of

Association

Objectives
After studying this lesson and answering the questions in the exercises, a student will be able
to do the following:

Concepts:
 Goodness of fit test
 Two-Way Tables
 Expected Counts in Two-Way Tables
 The Chi-Square Test Statistic
 Cell Counts Required for the Chi-Square Test
 Uses of the Chi-Square Test
 The Chi-Square Distributions

Objectives:
 Perform a chi-square goodness of fit test.
 Construct and interpret two-way tables.
 Calculate expected counts in two-way tables.
 Describe the chi-square test statistic.
 Describe the cell counts required for the chi-square test.
 Describe uses of the chi-square test.
 Describe the chi-square distributions.

References:
Moore, D. S., Notz, W. I, & Flinger, M. A. (2013). The basic practice of statistics (6th ed.).
New York, NY: W. H. Freeman and Company.

1.1. Brief History

1
By "𝑋 2 tests" it's not usually meant the preceding test concerning variance but a group of
tests based on the Pearson-approximation and contingency tables. Karl (Carl) Pearson (1857-
1936), is believed to be the "father" of statistics.

1.2. The chi-square distribution

The chi-square distributions are a family of distributions that take only positive values and
are skewed to the right. A particular chi-square distribution is specified by giving its degrees
of freedom.

The Chi-Square distribution has only one parameter: df = degrees of freedom. The degrees of
freedom depend on the application, as we will see later. Here are a few facts about the Chi-
Square distribution.
The degree of freedom depends on the application, as we will see later. Here are a few fact
about the Chi square distribution. If 𝑋 2 ~𝑋 2 𝑑𝑓 the following are true of𝑋 2 :
 𝑋 2 is a continuous random variable
 𝑋 2 = 𝑍 2 + 𝑍 2 + 𝑍 2 + ⋯ + 𝑍 2 ; 𝑋 2 is the sum of df independent squared standard
normal random variable
 Data values cannot be negative, x∈ ⦋0, ∞)
 μ = df (the mean of the Chi square distribution is the degrees of freedom)
 δ = √2 ∗ 𝑑𝑓, V(X) = 2*df
 when df >90, 𝑋 2 is approximately normal
 Probability distributions that are continuous, have one mode, and are skewed to the
right or positively skewed.
 The critical value of a test statistic in a chi-square distribution is determined by
specifying a significance level and the degrees of freedom.
The chi-square distributions are a family of distributions that take only positive values and
are skewed to the right. A particular chi-square distribution is specified by giving its degrees
of freedom.(Fig 13)

2
Figure 13: Chi squared distribution shapes

The chi-square test for a two-way table with r rows and c columns uses critical values from
the chi-square distribution with (r – 1)(c – 1) degrees of freedom. The P-value is the area
under the density curve of this chi-square distribution to the right of the value of the test
statistic.
 The image above shows that the distribution of the chi-square statistic starts at zero
and can only have positive values.
 The shape of the distribution is much different than the t or z statistic and is skewed to
the right.
 The shape of the distribution changes as the degrees of freedom increases.

1.3. Uses of the Chi-Square Test

Use the chi-square test to test the null hypothesis
H0: there is no relationship between two categorical variables when there is a two-way table
from one of these situations:
 Independent random samples from two or more populations, with each individual
classified according to one categorical variable.
 A single random sample, with each individual classified according to both of two
categorical variables.

1.4. The main types of Chi Square Test

Three main types of tests will be covered here:

1. Goodness of Fit Test:

This test is for assessing if a particular discrete model is a good fitting model for a discrete
characteristic, based on a random sample from the population. For example, has the model

3
for the method of transportation (drive, bike, walk, other) used by students to get the Class
changed from that for 5 years ago?

2. Test of Homogeneity:
This test is for assessing if two or more populations are Homogeneous (alike) with respect to
the distribution of some discrete (categorical) variable. For example, is the distribution of
opinion on legal gambling the same for adult males versus adult females?

3. Test of Independence:
This test helps us to assess if two discrete (categorical) variables are independent for a
population, or if there is an association between the two variables. For example, is there an
association between satisfaction with the quality of public schools (not satisfied, somewhat
satisfied, very satisfied) and religious affiliation (Catholic, Protestants, Muslims, etc.)

The first test is the one-sample test for count data. The other two tests (homogeneity and
independence) are actually the same test. Although the hypotheses are stated differently and
the underlying assumptions about how the data is gathered are different, the steps for doing
the two tests are exactly the same.

All three tests are based on an X2 test statistic that, if the corresponding H0 is true and the
assumptions hold, follows a chi-‐square distribution with some degrees of freedom, written
χ 2 (df ) .

1.5. Goodness of Fit Test

Hypotheses
 We use the goodness of fit test to test if a discrete categorical random variable
matches a predetermined “expected" distribution. The hypotheses in a goodness of fit
test are
H0: the actual distribution fits the expected distribution
HA: the actual distribution does not fit the expected distribution
REQUIREMENT: In order for chi-square goodness of fit test to be appropriate, the expected
value in each category must be at least 5. It may be possible to combine categories to meet
this requirement.
4
Our goal is to see if the observed values are close enough to the expected values that the
differences could be due to random variation or, alternatively, if the differences great enough
that we can conclude that the distribution is not as expected. Therefore, our sample statistic
(which is also the test statistic in this case) should provide a measure of how far the from
“expected" frequencies the \observed" frequencies are, as a group.

1.5.1.Steps in Testing the Hypotheses

Step 1: State the Hypotheses

H0: Variable A is independent of variable B
H1: Variable A is not independent of variable B
Step 2: Calculate the expected frequency counts. The expected frequency counts at each
level of the categorical variable are equal to:
Ei = npi
where Ei is the expected frequency count for the ith level of the categorical variable, n
is the total sample size, and pi is the hypothesized proportion of observations in
level i.
Step 3: Calculate the Chi square statistic:
(𝑂𝑖 −𝐸𝑖 )2
𝑋2 = ∑
𝐸𝑖
Step 4: Calculate the critical value:
𝑋 2 𝑐 = 𝑋 2 (𝑑𝑓, ∝)
Where
df = (r–1)(c–1)
Step 5: Decision rule
Reject H0 if 𝑋 2 𝑐 < 𝑋 2

Example 1

A certain Toy Company prints baseball cards. The company claims that 30% of the cards are
rookies, 60% veterans but not All-Stars, and 10% are veteran All-Stars. Suppose a random
sample of 100 cards has 50 rookies, 45 veterans, and 5 All-Stars. Is this consistent with the
company's claim? Use a 0.05 level of significance.

Solution

5
1. State the hypotheses.
 Null hypothesis: The proportion of rookies, veterans, and All-Stars is 30%,
60% and 10%, respectively.

 Alternative hypothesis: At least one of the proportions in the null hypothesis is

false.
2. Calculate the expected frequency counts and the Chi square statistic (note that
steps 2 &3 are combined in this case):
Card Number of Percent Ei = ni*pi (𝑂𝑖 −𝐸𝑖 )2
𝑋2 = ∑
samples 𝐸𝑖

Rookies 50 30 100 * 0.30 = 30 13.33

Veterans 45 60 100 * 0.60 = 60 3.75
All-Stars 5 10 100 * 0.10 = 10 2.50
Total 100 100 100 19.58
3. Analyze sample data. Based on the chi-square statistic and the degrees of freedom,
we determine the critical value, X 2 c :
df = k - 1 = 3 - 1 = 2
Where
df = the degrees of freedom,
k =the number of levels of the categorical variable,
Hence
X 2 c = X 2 (2, 0.05) = 5.99 (see the chi square distribution table)

Percentage points of the Chi-squared Distribution

6
4. Decision rule/Conclusion

Since the Χ2 > X 2 c , we cannot accept the null hypothesis.

Example 2:
A University conducted a survey of its recent graduates to collect demographic and health
information for future planning purposes as well as to assess students' satisfaction with their
undergraduate experiences. The survey revealed that a substantial proportion of students
were not engaging in regular exercise, many felt their nutrition was poor and a substantial
number were smoking. In response to a question on regular exercise, 60% of all graduates
reported getting no regular exercise, 25% reported exercising sporadically and 15% reported
exercising regularly as undergraduates. The next year the University launched a health
promotion campaign on campus in an attempt to increase health behaviors among
undergraduates. The program included modules on exercise, nutrition and smoking cessation.
To evaluate the impact of the program, the University again surveyed graduates and asked
the same questions. The survey was completed by 470 graduates and the following data were
collected on the exercise question:

No Regular Exercise Sporadic Exercise Regular Exercise Total

Observed # 255 125 90 470

Based on the data, is there evidence of a shift in the distribution of responses to the exercise
question following the implementation of the health promotion campaign on campus?

Solution
In this example, we have one sample and a discrete (ordinal) outcome variable (with three
response options). We specifically want to compare the distribution of responses in the
sample to the distribution reported the previous year (i.e., 60%, 25%, 15% reporting no,

7
sporadic and regular exercise, respectively). We now run the test using the five-step
approach:

Step 1: We set up the hypotheses and determine level of significance.

The null hypothesis again represents the "no change" or "no difference" situation. If the
health promotion campaign has no impact then we expect the distribution of responses to the
exercise question to be the same as that measured prior to the implementation of the
program:
H0: p1=0.60, p2=0.25, p3=0.15, or equivalently
H0: The distribution of responses is 0.60, 0.25, 0.15
H1: H0 is false or
H1: The distribution of responses is different from 0.60, 0.25, 0.15 α =0.05
NB:
 The research hypothesis (H1) as stated captures any difference in the distribution of
responses from that specified in the null hypothesis.
 We do not specify a specific alternative distribution, instead we are testing whether
the sample data "fit" the distribution in H0 or not. With the χ2 goodness-of-fit test
there is no upper or lower tailed version of the test.
Step 2: Calculate the number of students expected in each exercise category!.
No Exercise Sporadic Exercise Regular Exercise Total
# Observed 255 125 90 470
# Expected E1 = 490*0.6 = E2 = 490*0.25 = E3 = 490*0.15 = 470
282 117.5 70.5
χ2 = (O- 2.59 0.48 5.39 8.46
E)2/E

NB:

Since there are three categories, the degrees of freedom = df = k-1 = 3-1 = 2).
Now, go to the chi-squared table. There you will find that the critical value,
X 2 2,0.05 = 5.99
Conclusion:
X 2 2,0.05 = 5.99 < X 2 = 8.46

8
We reject the null hypothesis, and conclude that the distribution of exercise has changed; it is
no longer 60%, 25%, 15%.

1.6. Chi-Square Test for Independence/Homogeneity

Contingency Tables

o A contingency table is a cross-tabulation of n paired observations into

categories
o Each cell shows the count of observations that fall into the category defined
by its row (r) and column (c) heading.
[Please note that the Kolmogorov-Smirnoff test is another test for the goodness of fit. The
Kolmogorov-Smirnov test has a higher power, but can only be applied to continuous-level
variables.]

Secondly, it tests whether or not a statistically significant relationship exists between a

dependent and an independent variable. When used as test of independence, the Chi-Square
Test is applied to a contingency table, or cross tabulation (sometimes called crosstabs for
short).

9
Test of Independence helps us to assess if two discrete (categorical) variables are
independent for a population, or if there is an association between the two variables

5.6.1. Chi-squared of independence Step-by-Step

1) Formulate Hypotheses
2) Calculate row and column totals
3) Calculate row and column proportions
4) Calculate expected frequencies (Ei)
5) Calculate χ2 statistic
6) Calculate degrees of freedom
7) Obtain Critical Value from table
8) Make decision regarding the Null-hypothesis
The chi-squared test of independence also uses the chi-squared statistic and chi-squared
distribution, but it is used to test whether there is a difference in frequency among two or
more groups. The outcome is categorical (2 or more levels) or ordinal. Therefore, there can
be multiple rows or columns in our contingency table, and the degress of freedom are

where r= the number of rows in the contingency table, and c= the number of columns.
For example, in the following contingency table, df=(r-1)*(c-1)= (3-1)*(3-1)=4:

Good Fair Poor

High Exposure

Medium Exposure

Low Exposure
There are 3 exposure categories and 3 outcome categories, so df= (3-1) * (3-1) = 2*2 = 4
The research question can be phrased as either:
 Is there a difference in outcome between two or more groups?
 Is there an association between two variables?
Therefore,
 H0: The distribution of the outcome is independent of the groups
 H1: H0 is false

10
Example:
We have one population of interest -‐ say factory workers.

Question:
Is there a relationship between smoking habits and whether or not a factory worker
experiences hypertension?
Data:
1 random sample of 180 factory workers, we measure the two variables:
Y = hypertension status (yes or no)
X = smoking habit (non, moderate, heavy)
The table below summarizes the data in terms of the observed counts.

Observed Counts:
X ( Smoking habits)
Non Moderate Heavy Total
Y Yes 21 36 30 87
(Hypertension No 48 26 19 93
status) Total 69 62 49 180

The null hypothesis:

H0: There is no association between smoking habit and hypertension status for the
population of factory workers. (or The two factors, smoking habit and hypertension status,
are independent for the population.)
Mathematically, this can be stated as:
H0: P(X = I and Y = j) = P(X = i)P(Y = j)
Ha: There is an association between smoking habit and hypertension status for the
population of factory workers. (or The two factors, smoking habit and hypertension status,
are dependent for the population.); α =0.01

11
The two-way table provides the OBSERVED counts. Our next step is to compute the
EXPECTED counts, under the assumption that H0 is true. The expected counts are obtained
using the cross tabulation rule:
(𝑟𝑜𝑤 𝑡𝑜𝑡𝑎𝑙)(𝑐𝑜𝑙𝑢𝑚𝑛 𝑡𝑜𝑡𝑎𝑙)
𝐸𝑥𝑝𝑒𝑐𝑡𝑒𝑑 𝑐𝑜𝑢𝑛𝑡 =
𝐺𝑟𝑎𝑛𝑑 𝑡𝑜𝑡𝑎𝑙, 𝑁

X (Smoking habits)
Non Moderate Heavy Total
Y Yes 21 (33.35) 36(29.97) 30(23.68) 87
(Hypertension status) No 48(35.65) 26(32.03) 19(25.32) 93
Total 69 62 49 180

Calculation of the Chi squared statistic

(21 − 33.35)2 (36 − 29.97)2 (48 − 35.65)2

𝑋2 = + + ⋯+
33.35 29.97 35.65
= 4.57 + 1.21 +… + 0.86
= 14.46
Next, we calculate the critical value or test statistic:
df = (r-1)(c-1)
= (3-1)(2-1)
=2
Therefore,
X 2 (2,0.01) = 9.21

Decision rule:
𝑋 2 > X 2 (2,0.01) , so we reject Ho
Report:
A 2 by 3 Chi-Square test of independence indicated a non-significant difference between
Hypertension status and Smoking habits,(X2(180) =14.46; p>0.01). Therefore, the null

12
hypothesis was rejected and we conclude that the two factors, smoking habit and
hypertension status, are dependent for the population
In sum,

1.6.2. Assumptions
Assumption #1: Your two variables should be measured at an ordinal or nominal level (i.e.,
categorical data). You can learn more about ordinal and nominal variables in our article:.
Assumption #2: Your two variables should consist of two or more categorical, independent
groups. Example independent variables that meet this criterion include gender (2 groups:
Males and Females), ethnicity (e.g., 3 groups: Caucasian, African American and Hispanic),
physical activity level (e.g., 4 groups: sedentary, low, moderate and high), profession (e.g., 5
groups: surgeon, doctor, nurse, dentist, therapist), and so forth.

Assumption #3: Chi squared tests are only valid when you have reasonable sample size
For 2 by 2 tables (i.e. only two categories in each variable):
 If the total sample size is greater than 40, chi squared can be used
 If the total sample size is between 20 and40, and the smallest expected frequency is
atleast 5, chi squared can be used ( see note at the bottom of SPSS output to see if this
is a problem)

13
 Otherwise Fisher’s exact test must be used.
For other tables:
 Chi squared can be used if no more than 20% of the expected frequencies are less
than 5 ( see note at the bottom of SPSS output to see if this is a problem)

1.7. Measuring associations between variables

Exercises
1. In this exercise, we look at the relationship between reported diabetes and high blood
pressure. This is a crosstabulation:

Diabetes High blood pressure Total

No Yes
No 172 20 192
Yes 7 3 10
Total 179 23 202

a) What kinds of variables are diabetes and high blood pressure?

b) Which cell of the table would have the smallest expected frequency and, roughly,
what would this be?
c) What statistical method should be used to test the null hypothesis that diabetes and
high blood pressure are unrelated in this population, and why?
d) The test gives P = 0.09. What can we conclude about high blood pressure and
diabetes, given that the test was conducted at 5% significance level?

2. The table below is taken from a study investigating the cause of diarrhoea in patients with
gastroenteritis and shows the relationship between foreign travel and a positive result for
the organism Providencia alcalifaciens (Haynes and Hawkey 1989).

14
P. alcalifaciens
Recent travel positive (no.) Negative (no.) Total
abroad?
Yes 25 229 254
No 5 368 373
Total 28 597 627
Chi Squared = 23.98, P<0.001
a) What is meant by ‘chi-squared = 23.98, P<0.001?’
b) What conditions do the data have to meet for the test to be valid?
c) What conclusions can be drawn from these data?
d) What other information would be useful in deciding whether P. alcalifaciens was a likely
cause of gastroenteritis in travelers?
3. Conduct a hypothesis test to determine if the actual majors of graduating females fit the
expected distribution of their majors. The observed data were collected from 5,000
graduating females. Complete a hypothesis test at the 𝛼= 0:05 significance level to test if
the actual distribution of female students to majors matches the expected distribution.

a. Find the expected frequencies and complete the table.

b. Are the requirements for a chi-square goodness of _t test satisfied? Explain
and adjust the categories if needed.
c. Write the null and alternative hypotheses.
d. What is the distribution?
e. Find the test statistic.
f. Find the p-value.
4. Treating stress fractures. With respect to stress fractures in a foot bone, does the success
rate of the treatment depend on the treatment method, or do all methods of treatment have
15
basically the same success rate? Use the following data and a significance level of ∝ =
0:01 to complete a test of independence.

a. State the null and alternative hypotheses for this test of independence.
b. Complete the table of expected values assuming the success rate is
independent of the treatment method. Use two decimal places of accuracy.

c. Is the requirement for a test of independence satisfied?

d. Find the distribution of the test statistic, including the degrees of freedom.
e. Calculate the test statistic value using your preferred method.
f. Sketch the density curve, marking and labeling the test statistic and p-value.
g. What is the outcome of the test for independence? (Can we conclude that the
success rate depends on the method of treatment or not?)
5. A one year follow-up study was conducted to examine the effect of an experimental drug
on mortality in 296 cases of advanced non-Hodgkin's lymphoma. Controls received
standard treatment. The data are provided below.

a) Provide the null and alternative hypothesis and an interpretation of the results
b) Calculate the expected counts for the cells in the table above.
c) Test to see if the association between mortality outcome and treatment status is
statistically significant.

Chi Square Test
No ratings yet
Chi Square Test
13 pages
Chi Square Test
100% (2)
Chi Square Test
75 pages
Mini Project Statistics)
100% (1)
Mini Project Statistics)
22 pages
Test of Association
No ratings yet
Test of Association
27 pages
Chi-Square As A Test For Comparing Variance
No ratings yet
Chi-Square As A Test For Comparing Variance
9 pages
Module 6 Chi-Square T Z Test
100% (1)
Module 6 Chi-Square T Z Test
72 pages
CH 10
No ratings yet
CH 10
64 pages
Chi Square (KI Square) Test
No ratings yet
Chi Square (KI Square) Test
30 pages
Module 5 Quiz Rev
No ratings yet
Module 5 Quiz Rev
118 pages
1 STAT511 U4-1
No ratings yet
1 STAT511 U4-1
45 pages
7 Chi-Square and F
No ratings yet
7 Chi-Square and F
68 pages
Nonparametric Methods: Chi-Square Applications
No ratings yet
Nonparametric Methods: Chi-Square Applications
21 pages
Block-3
No ratings yet
Block-3
68 pages
Abisola
No ratings yet
Abisola
12 pages
10measures of Association
No ratings yet
10measures of Association
249 pages
T Test,ANOVA,Chi Square Test
No ratings yet
T Test,ANOVA,Chi Square Test
26 pages
RM Unit 4 - Part 2
No ratings yet
RM Unit 4 - Part 2
35 pages
AP Stats Ch25
No ratings yet
AP Stats Ch25
105 pages
Chapter 6. Chi-Square Test
No ratings yet
Chapter 6. Chi-Square Test
25 pages
ChiSquare Examples
No ratings yet
ChiSquare Examples
22 pages
BS IMI U8 Oct23
No ratings yet
BS IMI U8 Oct23
100 pages
Non-Parametric
No ratings yet
Non-Parametric
37 pages
CHAPTER FOUR (1)
No ratings yet
CHAPTER FOUR (1)
26 pages
Chapter12_X2 - Student(1)
No ratings yet
Chapter12_X2 - Student(1)
31 pages
Chi Square Method
No ratings yet
Chi Square Method
34 pages
Module 5a Chi Square - Introduction - Goodness of Fit Test
No ratings yet
Module 5a Chi Square - Introduction - Goodness of Fit Test
39 pages
Chisquare Gonzales
No ratings yet
Chisquare Gonzales
32 pages
Chi-Square by MPH
No ratings yet
Chi-Square by MPH
55 pages
8_1_categorical_data_ninell
No ratings yet
8_1_categorical_data_ninell
26 pages
X Test PDF
No ratings yet
X Test PDF
38 pages
Engineering Mathematics 2
No ratings yet
Engineering Mathematics 2
29 pages
Chi-Square Distribution
No ratings yet
Chi-Square Distribution
28 pages
Chi Square Test
No ratings yet
Chi Square Test
22 pages
Chi Square (Χ) : Yetty Dwi Lestari Department of Management, FEB Airlangga University
No ratings yet
Chi Square (Χ) : Yetty Dwi Lestari Department of Management, FEB Airlangga University
71 pages
Stat-213-Chapter-7-2
No ratings yet
Stat-213-Chapter-7-2
18 pages
Ermi Stat LL CH 4
No ratings yet
Ermi Stat LL CH 4
32 pages
QM Lecture 10 - Chi Square Tests (1)
No ratings yet
QM Lecture 10 - Chi Square Tests (1)
48 pages
chisquaretest
No ratings yet
chisquaretest
16 pages
1 - CA51018 - Chi Square - Introduction - Goodness of Fit Test - 2
No ratings yet
1 - CA51018 - Chi Square - Introduction - Goodness of Fit Test - 2
36 pages
Univariate Statistics: Statistical Inference: Testing Hypothesis
No ratings yet
Univariate Statistics: Statistical Inference: Testing Hypothesis
28 pages
Chi Square Test 2
No ratings yet
Chi Square Test 2
27 pages
PSAI Unit 5
No ratings yet
PSAI Unit 5
25 pages
Statistical Theory Lecture 5-2025
No ratings yet
Statistical Theory Lecture 5-2025
13 pages
Chapter Four
No ratings yet
Chapter Four
12 pages
Dueling Loops
100% (1)
Dueling Loops
209 pages
Measurement 6th Sem (H) DSE4 Lec 4 05 05 2020
No ratings yet
Measurement 6th Sem (H) DSE4 Lec 4 05 05 2020
19 pages
X2 Test (Chi Squared Test)
No ratings yet
X2 Test (Chi Squared Test)
5 pages
Chapter 6
No ratings yet
Chapter 6
10 pages
Ba h Political Science 1st Merit
No ratings yet
Ba h Political Science 1st Merit
42 pages
0064ED90-5D9C-4A27-93B4-DBC9A22B0382
No ratings yet
0064ED90-5D9C-4A27-93B4-DBC9A22B0382
37 pages
Maths report (2)
No ratings yet
Maths report (2)
15 pages
Chapter 9 - Chi-Square Test
No ratings yet
Chapter 9 - Chi-Square Test
3 pages
Chisquare
No ratings yet
Chisquare
10 pages
Chapter 6
No ratings yet
Chapter 6
13 pages
Chi-Square Test: by Dr. M.Supriya Moderator:Dr.B.Aruna, M.D. (H)
No ratings yet
Chi-Square Test: by Dr. M.Supriya Moderator:Dr.B.Aruna, M.D. (H)
75 pages
Statistics Unit 9 Notes
No ratings yet
Statistics Unit 9 Notes
10 pages
Countries and Nationalities and The Verb To Be PDF
No ratings yet
Countries and Nationalities and The Verb To Be PDF
2 pages
All in The Federal Strategic Plan To Prevent and End Homelessness
No ratings yet
All in The Federal Strategic Plan To Prevent and End Homelessness
104 pages
Manual GTD Eng
100% (1)
Manual GTD Eng
25 pages
4 - Regression Analysis
No ratings yet
4 - Regression Analysis
27 pages
Derecho de Contratos en Rusia PDF
No ratings yet
Derecho de Contratos en Rusia PDF
26 pages
CH 14
No ratings yet
CH 14
13 pages
Behavioral Interview For Product Managers 1694899273
No ratings yet
Behavioral Interview For Product Managers 1694899273
23 pages
R v Vickers (John Willson) (1)
No ratings yet
R v Vickers (John Willson) (1)
5 pages
Chapter - Six The Chi-Square Distribution Objectives
No ratings yet
Chapter - Six The Chi-Square Distribution Objectives
16 pages
Foundations of Service Level Management PDF
No ratings yet
Foundations of Service Level Management PDF
146 pages
Formula Grammar PPT B2 U5
No ratings yet
Formula Grammar PPT B2 U5
7 pages
Ijsa 00062
No ratings yet
Ijsa 00062
14 pages
Timed Trial Review Lecture
No ratings yet
Timed Trial Review Lecture
59 pages
Using Conversation Analysis in The Second Language Classroom
No ratings yet
Using Conversation Analysis in The Second Language Classroom
22 pages
Seminar 1 Pulp
100% (2)
Seminar 1 Pulp
128 pages
"Xi Jinping: A Modern Emperor" What Part Xi's Early Socialization and Family Background Played in The Rise of His Leadership?
No ratings yet
"Xi Jinping: A Modern Emperor" What Part Xi's Early Socialization and Family Background Played in The Rise of His Leadership?
5 pages
BA Hons Syllabus English NEP 2020
No ratings yet
BA Hons Syllabus English NEP 2020
84 pages
Datasheet
No ratings yet
Datasheet
7 pages
The Sacred Heart Church in Elk Rapidsjnjpm PDF
No ratings yet
The Sacred Heart Church in Elk Rapidsjnjpm PDF
2 pages
Tangent Galvanometer: Physics Investigatory Project
No ratings yet
Tangent Galvanometer: Physics Investigatory Project
17 pages
Rajiv Malhotra On Anant Rambachan 'S Neo Hinduism Reply and Response
No ratings yet
Rajiv Malhotra On Anant Rambachan 'S Neo Hinduism Reply and Response
3 pages
Eim 11 PR
No ratings yet
Eim 11 PR
11 pages
Unit 3 - Tenses - Review
No ratings yet
Unit 3 - Tenses - Review
28 pages
Chapter-4 Antenna Arrays: Antenna Arrays Are Groups of Similar Antennas Arranged in Various Configurations
No ratings yet
Chapter-4 Antenna Arrays: Antenna Arrays Are Groups of Similar Antennas Arranged in Various Configurations
16 pages
ATI Med Template Vitamin D
No ratings yet
ATI Med Template Vitamin D
1 page
Teaching Kids Using A Story Miko The Monkey Worksheet
No ratings yet
Teaching Kids Using A Story Miko The Monkey Worksheet
2 pages
Diploma Project 1st Presentation PPT Template
No ratings yet
Diploma Project 1st Presentation PPT Template
13 pages
Shakespeare
No ratings yet
Shakespeare
2 pages
The Growing Industry of Internet Based Business in MSU - Marawi City
No ratings yet
The Growing Industry of Internet Based Business in MSU - Marawi City
7 pages
Module 1
0% (1)
Module 1
5 pages
Chi Squared for Beginners
From Everand
Chi Squared for Beginners
Stephanie Glen
No ratings yet
Digital Signal Processing (DSP) with Python Programming
From Everand
Digital Signal Processing (DSP) with Python Programming
Maurice Charbit
No ratings yet
Hypothesis Testing: Six Sigma Thinking, #6
From Everand
Hypothesis Testing: Six Sigma Thinking, #6
Sumeet Savant
No ratings yet
Learn Statistics Fast: A Simplified Detailed Version for Students
From Everand
Learn Statistics Fast: A Simplified Detailed Version for Students
Hesbon R.M
No ratings yet

Lecture3 - Contingency Analysis

Uploaded by

Lecture3 - Contingency Analysis

Uploaded by

1.

Contingency Analysis ("crosstab“)/ Measures of

1.1. Brief History

1.2. The chi-square distribution

1.3. Uses of the Chi-Square Test

1.4. The main types of Chi Square Test

1. Goodness of Fit Test:

1.5. Goodness of Fit Test

1.5.1.Steps in Testing the Hypotheses

Step 1: State the Hypotheses

 Alternative hypothesis: At least one of the proportions in the null hypothesis is

Rookies 50 30 100 * 0.30 = 30 13.33

Percentage points of the Chi-squared Distribution

Since the Χ2 > X 2 c , we cannot accept the null hypothesis.

No Regular Exercise Sporadic Exercise Regular Exercise Total

Observed # 255 125 90 470

Step 1: We set up the hypotheses and determine level of significance.

1.6. Chi-Square Test for Independence/Homogeneity

o A contingency table is a cross-tabulation of n paired observations into

Secondly, it tests whether or not a statistically significant relationship exists between a

5.6.1. Chi-squared of independence Step-by-Step

Good Fair Poor

The null hypothesis:

Calculation of the Chi squared statistic

(21 − 33.35)2 (36 − 29.97)2 (48 − 35.65)2

1.7. Measuring associations between variables

Diabetes High blood pressure Total

a) What kinds of variables are diabetes and high blood pressure?

a. Find the expected frequencies and complete the table.

c. Is the requirement for a test of independence satisfied?

You might also like