0% found this document useful (0 votes)

220 views4 pages

Continuity Correction of Pearson's Chi-Square Test in 2x2 Contingency Tables: A Mini-Review On Recent Development

1) The Pearson's chi-square test is commonly used but introduces errors for 2x2 contingency tables when approximating discrete probabilities with continuous distributions. 2) Several authors have proposed continuity corrections to the Pearson's chi-square statistic to reduce this error, including Yates' correction from 1934 and more recent corrections. 3) This document reviews recent developments in continuity corrections for Pearson's chi-square test in 2x2 tables, summarizing Serra's minimized Pearson's chi-square statistic from 2008 and Kajita Matchita et al.'s correction from an unknown date that aims to better control type 1 errors.

Uploaded by

Melina FERNANDEZCARAFFINI

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

220 views4 pages

Continuity Correction of Pearson's Chi-Square Test in 2x2 Contingency Tables: A Mini-Review On Recent Development

Uploaded by

Melina FERNANDEZCARAFFINI

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 4

ORIGINAL ARTICLES Epidemiology Biostatistics and Public Health - 2019, Volume 16, Number 2

Continuity correction of Pearson’s chi-square

test in 2x2 Contingency Tables:
A mini-review on recent development
Nicola Serra (1), Teresa Rea (1), Paola Di Carlo (2), Consolato Sergi (3)

(1) Department of Public Health, University Federico II of Naples, Italy

(2) Department of Sciences for Health Promotion, Mother & Child Care, Univ. of Palermo, Italy
(3) Department of Lab. Medicine and Pathology, Univ. of Alberta, Edmonton, AB, Canada, Stollery Children’s Hospital, Univ. of Alberta, Edmonton, AB,
Canada

CORRESPONDING AUTHOR: Nicola Serra Ph.D., Department of Public Health, School of Medicine and Surgery, University Federico II of Naples, Italy.
E-mail: [email protected]

DOI: 10.2427/13059
Accepted on April 16, 2019

ABSTRACT

The Pearson’s chi-square test represents a nonparametric test more used in Biomedicine and Social Sciences, but
it introduces an error for 2x2 contingency tables, when a discrete probability distribution is approximated with a
continuous distribution. The first author to introduce the continuity correction of Pearson’s chi-square test has been
Yates F. (1934). Unfortunately, Yates’s correction may tend to overcorrect of p-value, this can implicate an overly
conservative result. Therefore many authors have introduced variants Pearson’s chi-square statistic, as alternative
continuity correction to Yates’s correction. The goal of this paper is to describe the most recent continuity corrections,
proposed for Pearson’s chi-square test.

Key words: Pearson’s x2 statistic; continuity correction; 2x2 contingency table; Yates’s continuity correction, Serra’s
continuity correction

INTRODUCTION variables. For two dichotomous variables, it is possible

to define a 2x2 contingency table, with the frequencies
Pearson’s chi-square test or c 2 test is the of occurrence of all combinations of their levels,
nonparametric test commonly used by researchers considering a sample size equal to N, as it is shown
in Biology, Medicine and Social Sciences. This test in Table 1
is based on the calculation of Pearson’s c2 statistic, In a 2x2 contingency table, Pearson’s c2 statistic
introduced by Pearson K. [1], considering a sample of is used to test the association between dichotomous
a population characterized by two o more dichotomous variables, for example to individualize a possible

Mini-review on continuity correction of Pearson’s x2 e13059-1

Epidemiology Biostatistics and Public Health - 2019, Volume 16, Number 2 ORIGINAL ARTICLES

association between variables such as sex (Male/Female) METHODS

and smoke (Yes/No). For this scope Pearson introduce
the chi-square statistic to evaluate the discrepancy In this section we introduce the most recent study
between observed (Oi,j ) and expected frequencies (Ei,j about continuity correction of Pearson’s c2 statistic in 2x2
), where the observed frequencies are a, b, c and d of contingency tables.
Tables 1. Instead the expected frequencies are defined
for every cell such as:
Serra’s continuity correction
ri c j
Ei , j = , i, j = 1, 2
N Recently Serra N. [8] introduces a significant
where i and j indicate the row and column index minimized of Pearson’s c2 statistic as a continuity correction
respectively. The formula to compute Pearson’s c2 statistic of Pearson’s c2 test, for small samples (sample size ≤
is described by Pearson K. (1900): 25). This approach is based on the observation that
the denominator r1 r2 c1 c2 of (1), can be interpreted
[1] as a geometric mean. The formula to compute minimize
Pearson’s c2 statistic in a 2x2 contingency table is:
where r1, r2, c1 and c2 i.e. the totals across rows and
columns are generally called marginal totals. [3]
Using the c2 distribution to interpret Pearson’s
c statistic requires one to assume that the discrete
2
Serra N., showed with a statistical approach, that for
probability of observed binomial frequencies of 2x2 small samples (≤25), the minimized Pearson’s c2 statistic in
contingency table, can be approximated by the 2x2 contingency tables, represents a continuty correction
continuous c2 distribution. This assumption is not entirely for Pearson’s c2 statistic more effective in comparison to
correct and introduces some error. To reduce the error Yates’continuity correction. Particularly in this study the author
in approximation, many authors introduced a continuity verify that, the Fisher’s exact test [9,10], actually considered
correction or variants of Pearson’s c2 test. the “gold test” used when c2 test is not appropriate, i.e.
To reduce the error introduced by Pearson’s when the sample size is small and the expected values in
c2 statistic, Yates F. [2] suggested a correction for any of the cells of a 2x2 contingency table are below 5,
continuity that adjusts the formula for Pearson’s c2 by had performance statistically equal to c2 Serra test.
subtracting the value 0.5, from the difference between
each observed value and its expected value for 2x2
contingency table. This correction reduces the c2 Kajita Matchita et al.’s continuity correction
value obtained and consequently increases its p-value.
The formula to compute Yates’s c2 statistic in a 2x2 Kajita Matchita et al. [11] proposed a continuity
contingency table is: correction to maintain a continuity value to be used when
small expected cell frequencies on Pearson’s c2 test for
[2] independence exist in the research data. This correction
method is used to control the type I error and obtained
using a developed correction in more condition. For this
Unfortunately, Yates’s correction may tend to scope the authors used a simulation study. The simulations
overcorrect of p-value; this can implicate an overly were performed with Monte Carlo method, to evaluate
conservative result, as reported by several authors [3-7]. the performance of their method in comparison to other
The goal of this study is with literature review, to continuity corrections such as Yates’s correction and
describe the most recent development about the continuity Williams’s correction [12]. It shows an outperformed
corrections by variants of Pearson’s c2 test defined for control of type I error, considering a pattern of data set at a
2x2 contingency tables. significant level of 0.05 and 0.01, simulated contingency
tables between 2x2 and 4x4 (2x2, 2x3, 2x4, 3x3, 3x4

TABLE 1. 2x2 contingency table form.

Column variable (X)

Row variable (Y) State 1 State 2 Row totals
State 1 a b a + b = r1
State 2 c d c + d = r2
Column totals a + c = c1 b + d = c2 N=a+b+c+d

e13059-2 Mini-review on continuity correction of Pearson’s x2

ORIGINAL ARTICLES Epidemiology Biostatistics and Public Health - 2019, Volume 16, Number 2

and 4x4), a number of small expected cell frequencies up funding agencies in the public, commercial, or not for
to 30% of the total cell used, a sample size between 5 profit sectors.
and 10 times that total cell, and using 10,000 data set
simulated by Monte Carlo method for each pattern. The
type I error (number rejection of null hypothesis divided Competing interests statement
by 10,000) was evaluated by Pearson’s c2 test, i.e. by
classical c2 test without continuity correction. There are no competing interests for this study.
In the case of 2x2 contingency tables, where the
type I error is greater than the significant level, the c2 test
equation to be used is as follows: References
1. Pearson K. (1900), On the criterion that a given system of deviations
[4] from the probable in the case of a correlated system of variables is
such that it can be reasonably supposed to have arisen from random
instead, where the type I error is less than the sampling. Philosophical Magazine Series 5; 50(302):157–175.
significant level, the c2 test equation is 2. Yates, F. (1934). Contingency tables involving small numbers and
the c2 test. Supplement to the Journal of the Royal Statistical Society,
[5] 1(2), 217-235.
3. Camilli, G., & Hopkins, K. D. (1978). Applicability of chi-square
where Oi,j and Ei,j represent the observed and to 2×2 contingency tables with small expected cell frequencies.
expected frequencies respectively, instead C is the Psychological Bulletin, 85(1), 163.
developed correction value. It was computed in two cases 4. Campbell, I. (2007). Chi-squared and Fisher–Irwin tests of two-
as follows, if the type I error is higher than the significant by-two tables with small sample recommendations. Statistics in
level, the authors try to replace the value C into equation Medicine, 26(19), 3661-3675.
(4) start from 0.01, 0.02, 0.03, ..., . If the type I error 5. Haber, M. (1982). The continuity correction and statistical testing.
is less than the significant level, they try to replace the International Statistical Review/Revue Internationale de Statistique,
value C into equation (5) start from 0.01, 0.02 , 0.03 135-144.
..., . After they replaced value C and computed type I 6. Richardson, J. T. (1990). Variants of chi-square for 2×2 contingency
error then to compared with significant level. Developed tables. British Journal of Mathematical and Statistical Psychology,
correction value (C) is the value which gets very similar 43(2), 309-326.
values between type I error and significant level. 7. Richardson, J. T. (2011). The analysis of 2×2 contingency tables—
Yet again. Statistics in Medicine, 30(8), 890-890.
8. Serra, N. (2018). A significant minimization of Pearson’s c2
CONCLUSION statistics in 2x2 contingency tables: preliminary results for small
samples. Epidemiology, Biostatistics and Public Health, 15(3).
In this paper we described the most recent studies 9. Agresti, A. (2001). Exact inference for categorical data: recent
of continuity correction of Pearson’s c2 test. Since the first advances and continuing controversies. Statistics in medicine,
continuity correction proposed by Yates (1934), produced an 20(17-18), 2709-2722.
overcorrection of the p-value, many authors are discouraging 10. Fisher, R.A. (1934), Statistical Methods for Research Workers.
its use. Instead other authors [13-18], have followed Yates Chapter 12. 5th Ed., Oliver & Boyd.
(1934) in claiming that the use of Pearson’s c2 in the case 11. Matchima, K., Vongprasert, J., & Chutiman, N. (2018). The
of 2x2 contingency tables tends to generate too many type Development of a Correction Method for Ensuring a Continuity
I errors, especially with small samples, therefore they defined Value of The Chi-square Test with a Small Expected Cell Frequency.
different continuity corrections of Pearson’s c2 statistic, to Naresuan University Journal: Science and Technology (NUJST),
reduce the type I error, and simultaneously to reduce the type 26(1), 98-105.
II error that Yates’s correction introduces 12. Mcdonald, J.H. (2014), Handbook of Biological Statistics.
Unfortunately, the study of continuity correction of Maryland: Sparky House Publishing.
Pearson’s c2 statistic is very limited in the recent statistical 13. Cochran WG. (1954), Some methods for strengthening the
literature, only two recent studies are dedicated at this common c2 tests. Biometrics; 10(4):417–451.
problem (Serra N., 2018 and Kajita Matchita et al., 14. Cox, D.R. (1970). The continuity correction. Biometrika 57: 217-
2018), showing of the variants of c2 statistic as continuity 219.
correction of Pearson’s c2 test. 15. Feller, W. (1968). An Introduction to Probability Theory and Its
Applications. Volume I, 3rd ed. John Wiley & Sons, Inc. New York.
16. Mantel, N., & Haenszel, W. (1959). Statistical aspects of the
Funding statement analysis of data from retrospective studies of disease. Journal of the
National Cancer Institute, 22(4), 719-748.
This research did not receive any specific grant from 17. Maxwell, E. A. (1976). Analysis of contingency tables and further

Mini-review on continuity correction of Pearson’s x2 e13059-3

Epidemiology Biostatistics and Public Health - 2019, Volume 16, Number 2 ORIGINAL ARTICLES

reasons for not using Yates correction in 2× 2 tables. The Canadian 2 comparative trial. Journal of the Royal Statistical Society. Series A
Journal of Statistics/La Revue Canadienne de Statistique, 277-290. (General), 86-105.
18. Upton, G. J. G. (1982). A comparison of alternative tests for the 2×

e13059-4 Mini-review on continuity correction of Pearson’s x2

Ships Electrical System - Rene Borstlap, Hans Ten Katen - V1
86% (21)
Ships Electrical System - Rene Borstlap, Hans Ten Katen - V1
224 pages
Analysis of Categorical Data
No ratings yet
Analysis of Categorical Data
75 pages
Loan Approval Letter: Reliance Finance Group
100% (1)
Loan Approval Letter: Reliance Finance Group
4 pages
Reverse Speech
100% (2)
Reverse Speech
55 pages
Why Study History
No ratings yet
Why Study History
4 pages
Samadhi - The Natural State
100% (3)
Samadhi - The Natural State
22 pages
Yates PDF
No ratings yet
Yates PDF
8 pages
ChiSquare Test 6.1
No ratings yet
ChiSquare Test 6.1
21 pages
Week 16 - Testing For Independene - Pearson Chi-Square Test
No ratings yet
Week 16 - Testing For Independene - Pearson Chi-Square Test
18 pages
Yate's Correction
No ratings yet
Yate's Correction
15 pages
Chi-Square (X2) Distribution
No ratings yet
Chi-Square (X2) Distribution
35 pages
Syl-2. Chi-Square Test
No ratings yet
Syl-2. Chi-Square Test
28 pages
Chapter#8 Association
No ratings yet
Chapter#8 Association
59 pages
Chapter 12 (Technical English For Statistics)
No ratings yet
Chapter 12 (Technical English For Statistics)
6 pages
Yates 1934a
No ratings yet
Yates 1934a
20 pages
1527223070H16RM37 Qi
No ratings yet
1527223070H16RM37 Qi
10 pages
21 - Contingency Tables
No ratings yet
21 - Contingency Tables
33 pages
6.3 Chi-Square
No ratings yet
6.3 Chi-Square
35 pages
The Chi-Square Test: Statistics and Research Design
No ratings yet
The Chi-Square Test: Statistics and Research Design
2 pages
11 Chi Square
No ratings yet
11 Chi Square
4 pages
Module-III Chi Square
No ratings yet
Module-III Chi Square
4 pages
Statistical Method of Categorical Variable
No ratings yet
Statistical Method of Categorical Variable
68 pages
Chapter 4
No ratings yet
Chapter 4
48 pages
Contingency Tables: Example: The Following Table Shows Results of HIV Testing. What Is The Probability That
No ratings yet
Contingency Tables: Example: The Following Table Shows Results of HIV Testing. What Is The Probability That
2 pages
Fisher Exact Test Paper
No ratings yet
Fisher Exact Test Paper
9 pages
Tablas Criterios
No ratings yet
Tablas Criterios
9 pages
Chi-Square Test
No ratings yet
Chi-Square Test
20 pages
Chi Square Test 2
No ratings yet
Chi Square Test 2
27 pages
Pearson Stable
No ratings yet
Pearson Stable
3 pages
Lesson 6 - Chi-Square Test For Independence
No ratings yet
Lesson 6 - Chi-Square Test For Independence
4 pages
Better To Be in Agreement Than in Bad Company: A Critical Analysis of Many Kappa-Like Tests
No ratings yet
Better To Be in Agreement Than in Bad Company: A Critical Analysis of Many Kappa-Like Tests
22 pages
Chi Sqaure &anova
No ratings yet
Chi Sqaure &anova
14 pages
Rangkuman Rumus & Tabel Statistika
No ratings yet
Rangkuman Rumus & Tabel Statistika
12 pages
Activity 3 - Cuyugan, Ma. Lourdes J. EDUC 307
No ratings yet
Activity 3 - Cuyugan, Ma. Lourdes J. EDUC 307
2 pages
02 Permutation Test
No ratings yet
02 Permutation Test
15 pages
Prueba de Fisher. R. A Fisher
No ratings yet
Prueba de Fisher. R. A Fisher
9 pages
Spearman P Value
No ratings yet
Spearman P Value
9 pages
Contingency Tables Involving Small Numbers and the χ2 Test
No ratings yet
Contingency Tables Involving Small Numbers and the χ2 Test
20 pages
Chi-Square Questions - Biostatistics
No ratings yet
Chi-Square Questions - Biostatistics
10 pages
Chi Square
No ratings yet
Chi Square
41 pages
Chi-Square Test: Advance Statistics
No ratings yet
Chi-Square Test: Advance Statistics
26 pages
Chi-Square Test Fishers Exact Test Mcnemars Ttest Odds Ration
No ratings yet
Chi-Square Test Fishers Exact Test Mcnemars Ttest Odds Ration
10 pages
Chi Square Test
No ratings yet
Chi Square Test
14 pages
Mcnemar Significance Test of Change
No ratings yet
Mcnemar Significance Test of Change
5 pages
STAT1371 Topic11 PDF
No ratings yet
STAT1371 Topic11 PDF
41 pages
SRT 605 - Topic (8) Chi Square
No ratings yet
SRT 605 - Topic (8) Chi Square
22 pages
Chi-Square: Heibatollah Baghi, and Mastee Badii
No ratings yet
Chi-Square: Heibatollah Baghi, and Mastee Badii
37 pages
Lesson 17
No ratings yet
Lesson 17
3 pages
3 One-Way Table
No ratings yet
3 One-Way Table
30 pages
Application of Coefficient of Contingency Among Classification
No ratings yet
Application of Coefficient of Contingency Among Classification
12 pages
Chi Square
No ratings yet
Chi Square
37 pages
Chi-Square Test: Milan A Joshi
No ratings yet
Chi-Square Test: Milan A Joshi
39 pages
Chi Square Test
No ratings yet
Chi Square Test
11 pages
Uji Kecocokan Model 2021
No ratings yet
Uji Kecocokan Model 2021
79 pages
Yellow Modern Minimal Proposal Cover Page
No ratings yet
Yellow Modern Minimal Proposal Cover Page
5 pages
Basic Concept Hypothesis Testing
No ratings yet
Basic Concept Hypothesis Testing
7 pages
Chi Square Notes
No ratings yet
Chi Square Notes
12 pages
Chi Square Test
No ratings yet
Chi Square Test
38 pages
C31-Chi Square Test
No ratings yet
C31-Chi Square Test
7 pages
Chi Square (Χ) : Yetty Dwi Lestari Department of Management, FEB Airlangga University
No ratings yet
Chi Square (Χ) : Yetty Dwi Lestari Department of Management, FEB Airlangga University
71 pages
ExtendedAbstract AnaVilaFernandes
No ratings yet
ExtendedAbstract AnaVilaFernandes
10 pages
Lecture 14 - Categorical Data Analysis For Survey Data PDF
No ratings yet
Lecture 14 - Categorical Data Analysis For Survey Data PDF
46 pages
Chi - Square Test
No ratings yet
Chi - Square Test
12 pages
Digital Signal Processing (DSP) with Python Programming
From Everand
Digital Signal Processing (DSP) with Python Programming
Maurice Charbit
No ratings yet
Mathematical Statistics
From Everand
Mathematical Statistics
S. Wilks
5/5 (2)
The Chemical Senses - Taste and Smell
100% (1)
The Chemical Senses - Taste and Smell
28 pages
VBM
No ratings yet
VBM
24 pages
Snakes, Goddesses, and Anthills (PDFDrive)
No ratings yet
Snakes, Goddesses, and Anthills (PDFDrive)
603 pages
TD 02 Solution
No ratings yet
TD 02 Solution
3 pages
LP Ix LS 3
No ratings yet
LP Ix LS 3
4 pages
Jeffries Resume
No ratings yet
Jeffries Resume
2 pages
Tutorial Forecasting
No ratings yet
Tutorial Forecasting
3 pages
Values: Redirecting Research in The Workplace
No ratings yet
Values: Redirecting Research in The Workplace
9 pages
Ass II (Phys 602)
No ratings yet
Ass II (Phys 602)
18 pages
Bank Soal Bing Xi.4 Tugas
No ratings yet
Bank Soal Bing Xi.4 Tugas
5 pages
Kovacs 2011
No ratings yet
Kovacs 2011
32 pages
Inventory Dycoco
No ratings yet
Inventory Dycoco
3 pages
Entrepreneurship Development Programme (EDP) For Micro Entrepreneurs
No ratings yet
Entrepreneurship Development Programme (EDP) For Micro Entrepreneurs
10 pages
NCM 109 (OB) - 4.1 Nursing Care of The Postpartum Client
No ratings yet
NCM 109 (OB) - 4.1 Nursing Care of The Postpartum Client
9 pages
Business Culture in Spain
No ratings yet
Business Culture in Spain
2 pages
Mcgraw Hill/Irwin
No ratings yet
Mcgraw Hill/Irwin
12 pages
Information Design
No ratings yet
Information Design
26 pages
Invention PDF
No ratings yet
Invention PDF
28 pages
2024-2025 File (1) of Result in Himachal College
No ratings yet
2024-2025 File (1) of Result in Himachal College
9 pages
Pachnickesfiberoptictransmissionnetworksefficientdesig
No ratings yet
Pachnickesfiberoptictransmissionnetworksefficientdesig
164 pages
Matiene - Wikipedia
No ratings yet
Matiene - Wikipedia
2 pages
Opera PMS Front Desk Check-In and Check-Out With The Following Front Desk Menu Options
No ratings yet
Opera PMS Front Desk Check-In and Check-Out With The Following Front Desk Menu Options
9 pages
Unit 1 Review Trig. Assignment
No ratings yet
Unit 1 Review Trig. Assignment
16 pages
The Angel in The House and Fallen Women - Sarah Kuhl
No ratings yet
The Angel in The House and Fallen Women - Sarah Kuhl
8 pages
Bioaccumulation and Human Health Risk Assessment of DDT
No ratings yet
Bioaccumulation and Human Health Risk Assessment of DDT
12 pages

Continuity Correction of Pearson's Chi-Square Test in 2x2 Contingency Tables: A Mini-Review On Recent Development

Uploaded by

Continuity Correction of Pearson's Chi-Square Test in 2x2 Contingency Tables: A Mini-Review On Recent Development

Uploaded by

ORIGINAL ARTICLES Epidemiology Biostatistics and Public Health - 2019, Volume 16, Number 2

Continuity correction of Pearson’s chi-square

(1) Department of Public Health, University Federico II of Naples, Italy

INTRODUCTION variables. For two dichotomous variables, it is possible

Mini-review on continuity correction of Pearson’s x2 e13059-1

association between variables such as sex (Male/Female) METHODS

TABLE 1. 2x2 contingency table form.

Column variable (X)

e13059-2 Mini-review on continuity correction of Pearson’s x2

Mini-review on continuity correction of Pearson’s x2 e13059-3

e13059-4 Mini-review on continuity correction of Pearson’s x2

You might also like