0% found this document useful (0 votes)

13 views7 pages

The Margin of Error' For Differences in Polls

The ‘Margin of Error’ for Differences in Polls

Uploaded by

taixs

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

13 views7 pages

The Margin of Error' For Differences in Polls

The ‘Margin of Error’ for Differences in Polls

Uploaded by

taixs

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 7

The ‘Margin of Error’ for Differences in Polls

Charles H. Franklin
University of Wisconsin, Madison

October 27, 2002

(Revised, February 9, 2007)

The “margin of error” for a poll is routinely reported.1 But frequently we want to
know about the difference between two proportions (or percentages). Often the question
concerns differences between two responses to the same question within a single poll.
For example, what is the lead of one candidate over another in an election poll. The
second common question is whether a proportion has changed from one poll to the
next. For example, has presidential approval increased from one poll to the next. The
margin of error for these differences is not the same as the margin of error for the poll,
which is what virtually all polls routinely report. The margin of error for the poll is for a
single proportion, not differences. This leads to considerable confusion among reporters
and interpreters of polls.
This note explains the correct way to calculate the margin of error (and hence the
“significance”) for differences of proportions in polls. There is also a “quick reference”
section at the end that provides the formulas in a single spot.

1 The Margin of Error for a Single Proportion

The usual “margin of error” for a poll is the 95% confidence interval for an individual
proportion.2
The formula for the variance of a proportion, p, is
pq
Var(p) =
n−1
where q = 1 − p.3 We usually use just n in the denominator since n is rather large for
1
In fact this is a misnomer. Each item in a poll has a margin of error which depends on the sample
design and the distribution of responses to the item. We characterize the “margin of error for the poll”
by arbitrarily picking one type of item, a dichotomous item equally split, as a conservative estimate of the
margin of error for all items. This practice is a convenient rule of thumb but may overstate (or in some
cases understate) the actual margin for a particular item.
2
At least those not too close to 0% or 100% where the confidence interval becomes asymmetric. We usually
ignore such technical issues.
3
This assumes “simple random sampling”. Some polls use stratification of the sample to increase the
precision of the sample, for example stratifying by region. Others may cluster the interviews, which de-
creases the precision. Clustering is rarely done for telephone polls but is universal practice for face-to-face
interviews. The typical telephone poll may use some stratification, but the details of the sampling design
are rarely made public. The simple random sampling assumption is used for almost all reported margin of
error calculations.

1
virtually all survey samples so the difference in n and n − 1 is trivial. The standard error
for the proportion is therefore
pq
r
se(p) =
n
The 95% confidence interval (usually called the “margin of error” of the poll) is ±1.96 ×
se(p), using the normal distribution approximation for large samples.
The standard error depends on the proportion, p, and is at a maximum for p = .5, so
a quick approximation of the widest confidence interval for a single proportion is
s
.5 × .5
CI(p) = ±1.96 ×
n−1
.5
≈ ±2 × √
n
1
≈ ±√
n
This a poll. For example, if n = 400,
√ is usually what is reported as the margin of error for √
1/ 400 = .050,√ a MOE of ±5%. For n = 625, the MOE is 1/ 625 = .040 and for n = 1111,
the MOE is 1/ 1111 = .030.
For proportions different from .5, thep MOE is somewhat smaller. For example, if
p = .6, then the MOE is approximately 2 × .6 × .4/625 = .039, a trivial difference. As the
p become more skewed the MOE can be noticeably smaller, as for example if p =
responses
.2, 2 × .2 × .8/625 = .032, or about a 3 point MOE compared to a 4 point margin for the
p = .5 case. Still, unless we are looking at a highly skewed variable, these differences in
the MOE are usually small enough to be ignored. Calculations for different distributions,
such as these, are almost never reported in media accounts of polls.

2 Differences within a single poll question

The problem is that in political horse-race polls there is usually less interest in the pro-
portion supporting a single candidate than in the lead one candidate has over another,
the difference between the two proportions. When reporters (and political scientists) try
to explain this they usually run into trouble because they don’t use the correct formula
for the confidence interval of the difference of proportions.
A voter preference survey question will always have at least three categories and often
more. There will be response categories for the Republican and Democratic candidates
plus at least an undecided category. There may also be categories for candidates for
third parties and perhaps an “other” category as well. In presidential or other primary
contests, there will usually be a number of candidates, often considerably greater than
two. When responses fall into more than two discrete categories the proportions have a
multinomial distribution.
In horse-race polls, we want to know the difference in proportions supporting the top
two candidates (and perhaps between other pairs) and we need the confidence interval
for this difference to tell if the lead is statistically significant (or “outside the margin
of error”). Often pollsters, journalists and political scientists calculate this as twice the
reported margin of error of the poll. This is done following the logic that if one candidate

2
is at 55% and the other at 45% and the poll has a ±5% margin of error, then the first
candidate could be as low as 55−5 = 50% and the second could be as high as 45+5 = 50%.
In this case reporters would say the race was a statistical dead heat because the gap
between the candidates (55 − 45 = 10%) is not more than two times the margin of error
of the poll (5%). While this is the correct conclusion when there are only two possible
survey responses, it is not correct when there are more than two possible responses,
which is in fact virtually always the case. How much difference this makes depends on
how many responses are outside the two categories of interest.
The correct formula for the variance of the difference of two multinomial proportions
for candidates 1 and 2, p1 and p2 , is (adapted from Kish, Survey Sampling, 1965, p. 498-
501)
(p1 + p2 ) − (p1 − p2 )2
Var(p1 − p2 ) =
n−1
The 95% confidence interval (“margin of error”) for the difference of proportions is there-
fore
s
(p1 + p2 ) − (p1 − p2 )2
CI(p1 − p2 ) = 1.96
n−1
s
(p1 + p2 ) − (p1 − p2 )2
≈ 2
n

Note that it doesn’t matter what candidates 3 . . . k have. We only need the proportions
for the pair of candidates we care about in the formula. If there is considerable support
for these other candidates then p1 + p2 will be a good deal less than 1.0, and this will
shrink the standard error for the difference between p1 and p2 , as we’ll see below.
Whenever we compare proportions of candidate support within a single survey, this
is the formula we should use. For low amounts of undecided or third party support the
results will be close to the “twice the margin of error” formula, but the correct margin of
error will be less than this as the proportion of “other” responses increases.
To see this effect Figure 1 plots the margin of error against the proportion of cases in
the two categories of interest. For the example, I’ve picked a difference between p1 and
p2 of eight points but let the total in p1 + p2 vary to illustrate how much the size of other
categories matters for the MOE of the difference. √
If n = 600, the margin of error for the sample is 1/ 600 = 0.0408. Using the “twice
the MOE of the sample” rule, we would find that the margin of error of the difference
would be 0.0816. (If we were a bit pickier we would multiply by 1.96 instead of 2.0, and
get .07997, and round that to .08, which is what is shown in the plot.) This is the mar-
gin of error for the difference if every respondent chose candidate 1 or candidate 2 and
there were no “other” responses. In such a case, our hypothetical difference of 8 points
between candidates 1 and 2 would be just barely statistically significant. But as the num-
ber of “other” responses grows (because of a single additional option, or because there
are 11 other presidential candidates in the field) the margin of error declines (reading
right to left in the figure.)

3
Margin of Error for an 8 Point Difference
0.12

n=300
0.10

n=400

n=600
0.08
Margin of Error

n=1000
0.06
0.04
0.02
0.00

0.2 0.4 0.6 0.8 1.0

p1 + p2

Figure 1: Margin of error depends on p1 + p2 . The curved lines plot the margin of
error as the proportion of the sample in p1 + p2 varies from 20% to 100% of the sample,
for various sample sizes. The horizontal lines show the margin of error when the two
categories are 100% of the sample, the “twice the margin of error of the poll” rule of
thumb. The purple lines converge to this value as the proportion of the sample in the
two categories of interest rises to 1.0. When p1 + p2 is less than 1.0, the correct margin
of error is less, and sometimes substantially less, than twice the MOE of the sample. The
precise margin of error also depends on the difference, p1 − p2 , in the sample. Here the
results are illustrated for an eight point difference.

4
3 Difference between two polls
A different issue is posed by the difference of proportions between two independent
polls. Given two polls we want to know if opinion changed by a statistically significant
amount from one poll to the next. Unlike the case within a single poll, here the percent
support for a candidate in one poll is independent of that support in the other poll.
(Because we draw independent random samples for each poll, this will be the case.)
Now the difference of interest is p2 − p1 where the subscripts 1 and 2 indicate polls 1
and 2, and we are measuring support for the same candidate in both polls. The variance
of the difference with independent samples is

Var(p2 − p1 ) = Var(p1 ) + Var(p2 )

p1 q1 p2 q2
= +
n1 n2

where q1 = 1 − p1 and q2 = 1 − p2 , so the margin of error becomes

s
p1 q1 p2 q2

CI(p2 − p1 ) = 1.96 +
n1 n2

This is not twice the margin of error for either poll (and the MOE for each poll may
differ.) If the margin of error is the same for both polls, then the MOE for their difference
is 1.41 times the MOE for the polls, not 2 times it.
Why the difference? Within a single poll, if support for the Democrat goes down,
support for the Republican must go up if there are no undecided or third party voters.
(If there are third alternatives then the proportions are likely to still be correlated, just
not perfectly.) The formula for the difference of multinomial proportions given above
takes this nonindependence between the proportions into account. In contrast when
we compare across two independent samples, the proportion support estimated for the
candidate in each poll is statistically independent because the samples are drawn inde-
pendently. Hence a different formula for the independent samples case.

4 Examples
A Time magazine poll conducted January 22-23, 2007, surveyed 441 registered voters
who are Republicans or lean Republican. The margin of error for the poll would be
s
.5 × .5
MOE = 1.96 ×
441 − 1
= .0467
≈ .05 or 5%

The Time poll found support for Senator John McCain at 30%. Former Mayor Rudy
Giuliani received 26% support. The McCain lead over Giuliani is 4 points, but what is the
margin of error? Using the “twice the MOE of the poll” rule, this would fall clearly short
of 2 × .0467 = .0934 or just over 9 point MOE for the difference.

5
Using the correct formula for the MOE of a difference, we get
s
p1 + p2 − (p1 − p2 )2
MOE(McCain − Giuliani) = 1.96 ×
441 − 1
s
.30 + .26 − (.30 − .26)2
= 1.96 ×
440
= .0698
≈ .07 or 7%

So the margin of error of the difference is 7 points and the difference of 4 points is
not outside the MOE, and we conclude that the McCain lead is not statistically significant
in this case.
An example where this distinction does matter is provided by a Public Policy Polling
(PPP) survey of North Carolina Republican primary voters, taken February 5-6, 2007.4
This poll of 735 respondents found support for Giuliani at 31% with former Speaker of
the House Newt Gingrich at 25%. The margin of error for this poll is .036, using the
standard calculation for a single question. If we use the “twice the MOE” rule, the MOE
for the difference between Giuliani and Gingrich would be 2 × .036 = .072 or 7.2%. In that
case, the six point Giuliani lead would not reach statistical significance. If we apply the
correct formula for differences of multinomial proportions, using p1 = .31 and p2 = .25,
then the MOE is .054, or 5.4%, in which case the Giuliani lead is outside the margin of
error and we would be justified in calling this a statistically significant lead.
Now consider the difference between two independent polls. In two Associated Press/Ipsos
polls, approval of President George W. Bush’s handing of his job was at 36% in a January
16-18, 2007 poll of 1002 adults, and at 32% in a February 5-7, 2007 poll of 1000 adults.
Is this difference statistically significant?
The MOE for the polls is .031, or 3.1% for both polls. Since the polls were conducted
independently, the MOE for the difference in approval is
s
.32 × .68
MOE(p2 − p1 ) = 1.96 × f r ac.36 × .641002 − 1 +
1000 − 1
= .0415 or 4.15%

The change of 4 percentage points is slightly less than this margin of error, so the change
in approval is not quite statistically significant.

5 Conclusions
The margin of error for a poll is not a simple guide to the margin of error for differences
either within the poll or across independent polls. The multiple uses of the phrase
“margin of error” compounds the confusion. It is not easy to accurately convey statistical
issues in reports intended for a mass audience (or even many political professionals and
4
This poll was conducted using “Interactive Voice Response” (IVR) technology in which a recorded voice
asks the questions and respondents push the phone keypad to register their responses. I ignore the issue
of IVR technology compared to standard polls with live interviewers here.

6
journalists) without becoming arcane. Nonetheless, the frequent confusion over when
a difference is “significant” and when it is not needs to be addressed. The most direct
way would be for authors of reports to use the correct calculation and report the result
without bothering the reader with the details. For audiences accustomed to footnotes,
perhaps the details could go there.
And for wordsmiths and others unaccustomed to making statistical calculations, a
quick consultation with those who have a calculator ready at hand would solve a number
of confusions.

6 Quick Reference
All percentages should be expressed as proportions, so 8% = .08, 30% = .30 and so on.
The terms p1 and p2 are the proportions supporting candidate 1 and 2, or the propor-
tions for a candidate in polls 1 and 2 and n, n1 and n2 are the number of respondents
to the poll or to polls 1 and 2.

6.1 Margin of error of the poll

s
p × (1 − p)
MOE(p) = 1.96 ×
n−1

6.2 Margin of error for difference in a single question

s
(p1 + p2 ) − (p1 − p2 )2
MOE(p1 − p2 ) = 1.96 ×
n−1

6.3 Margin of error for difference between two polls

s
p1 q1 p2 q2

MOE(p2 − p1 ) = 1.96 × +
n1 n2

7 References

Kish, Leslie. 1965. Survey Sampling New York: John Wiley & Sons.
Scott, Alastair J. and George A. F. Seber. 1983. “Difference of Proportions from the Same
Survey.” The American Statistician 37:319-320.

Inferential Statistics
No ratings yet
Inferential Statistics
111 pages
Probability and Statistics
100% (1)
Probability and Statistics
26 pages
Confidence Intervals For Proportions Take Home Test
100% (1)
Confidence Intervals For Proportions Take Home Test
5 pages
Powerpoint
No ratings yet
Powerpoint
66 pages
Statistics
No ratings yet
Statistics
46 pages
Unit 6 Pt. 1 Student Packet 22-23 KEY
No ratings yet
Unit 6 Pt. 1 Student Packet 22-23 KEY
46 pages
Math 227 Chapter 9
No ratings yet
Math 227 Chapter 9
16 pages
Confidence Interval Review
No ratings yet
Confidence Interval Review
13 pages
Political Polls
No ratings yet
Political Polls
13 pages
Estimation and CI
No ratings yet
Estimation and CI
87 pages
Chapter 8 Confidence Intervals
No ratings yet
Chapter 8 Confidence Intervals
34 pages
SIM Statistics
No ratings yet
SIM Statistics
15 pages
Chapter 07 Estimation
No ratings yet
Chapter 07 Estimation
31 pages
Statistics Notes
No ratings yet
Statistics Notes
23 pages
4.1. Estimation
No ratings yet
4.1. Estimation
30 pages
Survey Sample Sizes and Margin of Error
No ratings yet
Survey Sample Sizes and Margin of Error
8 pages
9.4 A Confidence Interval For A Proportion: Estimation and Confidence Intervals
No ratings yet
9.4 A Confidence Interval For A Proportion: Estimation and Confidence Intervals
11 pages
MIT18 05S14 Class23slides PDF
No ratings yet
MIT18 05S14 Class23slides PDF
16 pages
What Is A Margin of Error?
No ratings yet
What Is A Margin of Error?
8 pages
2 2020 12 21!07 45 30 PM
No ratings yet
2 2020 12 21!07 45 30 PM
40 pages
Q4-M1 Stat
No ratings yet
Q4-M1 Stat
18 pages
A Confidence Interval For A Population Proportion
No ratings yet
A Confidence Interval For A Population Proportion
4 pages
Hypothesis Testing and Confidence Interval
No ratings yet
Hypothesis Testing and Confidence Interval
14 pages
Answer 4 MGT 2 3rd Year
No ratings yet
Answer 4 MGT 2 3rd Year
8 pages
Statistics Book Problem Answer
No ratings yet
Statistics Book Problem Answer
37 pages
Confidence Interval
100% (1)
Confidence Interval
19 pages
9.1 Classroom Notes Key
No ratings yet
9.1 Classroom Notes Key
8 pages
Mit18 05 s22 Class23-Prep-B
No ratings yet
Mit18 05 s22 Class23-Prep-B
6 pages
Stat - Prob Q3 Module 8
No ratings yet
Stat - Prob Q3 Module 8
18 pages
Estimating Proportions With Confidence: Some Definitions: Details For Proportions
No ratings yet
Estimating Proportions With Confidence: Some Definitions: Details For Proportions
6 pages
14622inferenceforsingleproportions 160909005557
No ratings yet
14622inferenceforsingleproportions 160909005557
19 pages
8.2 Exercise
No ratings yet
8.2 Exercise
3 pages
Managerial Statistics Final Exam Practise
No ratings yet
Managerial Statistics Final Exam Practise
18 pages
Word Format LAS Statistics and Prob 3
No ratings yet
Word Format LAS Statistics and Prob 3
6 pages
Ch. 9 Multiple Choice Review Questions: 1.96 B) 1.645 C) 1.699 D) 0.90 E) 1.311
100% (1)
Ch. 9 Multiple Choice Review Questions: 1.96 B) 1.645 C) 1.699 D) 0.90 E) 1.311
5 pages
Estimation of Parameters 1
No ratings yet
Estimation of Parameters 1
34 pages
Worksheet 4 - Solutions
No ratings yet
Worksheet 4 - Solutions
3 pages
8 3+Quiz+KEY
No ratings yet
8 3+Quiz+KEY
3 pages
WS1 Exemplar+Confidence+Intervals+for+Proportions
No ratings yet
WS1 Exemplar+Confidence+Intervals+for+Proportions
2 pages
Point and Interval Estimates
No ratings yet
Point and Interval Estimates
17 pages
Confidence and Significance
No ratings yet
Confidence and Significance
4 pages
AP Stat 7.1 Notes
No ratings yet
AP Stat 7.1 Notes
6 pages
05 - Estimating A Proportion
No ratings yet
05 - Estimating A Proportion
5 pages
S&P Computing For The Lengths of The Confidence Interval
No ratings yet
S&P Computing For The Lengths of The Confidence Interval
27 pages
Error Analysis - Statistics: - Accuracy and Precision - Individual Measurement Uncertainty
No ratings yet
Error Analysis - Statistics: - Accuracy and Precision - Individual Measurement Uncertainty
33 pages
Confidence Interval 2
No ratings yet
Confidence Interval 2
10 pages
Parameter
No ratings yet
Parameter
1 page
Wooldridge 7e AppC SSM
No ratings yet
Wooldridge 7e AppC SSM
2 pages
Chapter 8 Test Review
No ratings yet
Chapter 8 Test Review
7 pages
Standard Error
No ratings yet
Standard Error
2 pages
Important Formulas: Chapter 5: Discrete Probability Distributions
No ratings yet
Important Formulas: Chapter 5: Discrete Probability Distributions
2 pages
SG - Interval Estimate P p1p2 U U1u2
No ratings yet
SG - Interval Estimate P p1p2 U U1u2
7 pages
Estimating With Confidence Practice
No ratings yet
Estimating With Confidence Practice
1 page
Stat Handout 2
No ratings yet
Stat Handout 2
11 pages
Confidence Intervals
No ratings yet
Confidence Intervals
16 pages
Ch. 10 Review
No ratings yet
Ch. 10 Review
8 pages
Feat of Strength #5: Confidence Intervals For A Population Proportion by Carlos Vazquez
No ratings yet
Feat of Strength #5: Confidence Intervals For A Population Proportion by Carlos Vazquez
6 pages
Statistical Estimation
No ratings yet
Statistical Estimation
31 pages
Length of Confidence
No ratings yet
Length of Confidence
3 pages
Discussion1 Solution
No ratings yet
Discussion1 Solution
5 pages
Stat 2 MCQ
No ratings yet
Stat 2 MCQ
115 pages
Lecture 24 PDF
100% (2)
Lecture 24 PDF
12 pages
MathforRealLife PDF
No ratings yet
MathforRealLife PDF
213 pages
Slovin's Formula
100% (1)
Slovin's Formula
7 pages
Determining Sample Size
100% (1)
Determining Sample Size
2 pages
Your Answer Score Explanation
0% (1)
Your Answer Score Explanation
18 pages
Udacity Statistics Notes
No ratings yet
Udacity Statistics Notes
37 pages
CH 10
No ratings yet
CH 10
18 pages
Statistics For Data Science PDF - Statistics-for-Data-Science PDF
No ratings yet
Statistics For Data Science PDF - Statistics-for-Data-Science PDF
14 pages
Length of Confidence Interval: John Daniel G. Ibon Siloe T. Tolentino Steven Ivanrey A. Medrano Kirk Jan Harvey F. Leynes
No ratings yet
Length of Confidence Interval: John Daniel G. Ibon Siloe T. Tolentino Steven Ivanrey A. Medrano Kirk Jan Harvey F. Leynes
17 pages
Trig Exam Example 20-2
No ratings yet
Trig Exam Example 20-2
10 pages
Questionners and Data Analysis
No ratings yet
Questionners and Data Analysis
98 pages
Fashion - Sustainability - France
No ratings yet
Fashion - Sustainability - France
110 pages
Statistical Thinking - Noba
No ratings yet
Statistical Thinking - Noba
28 pages
CHAPTER IV Estimation of Parameters
No ratings yet
CHAPTER IV Estimation of Parameters
93 pages
Sampling Technique Written REport
No ratings yet
Sampling Technique Written REport
4 pages
Estimation
No ratings yet
Estimation
35 pages
Stat - Prob-Q3-Module-8
No ratings yet
Stat - Prob-Q3-Module-8
21 pages
The Information Systems Capability of Brigadier General Benito N Ebuen Air Base Units
No ratings yet
The Information Systems Capability of Brigadier General Benito N Ebuen Air Base Units
122 pages
Estimation
No ratings yet
Estimation
32 pages
MMW 7-11
No ratings yet
MMW 7-11
11 pages
Statistics Probability Q3 Mod8 Length of Confidence Interval Statistics Probability Q3 Mod8 Length of Confidence Interval
No ratings yet
Statistics Probability Q3 Mod8 Length of Confidence Interval Statistics Probability Q3 Mod8 Length of Confidence Interval
25 pages
The Role of Job Rotation in Enhancing Employee Performance in The Egyptian Travel Agents: The Mediating Role of Organizational Behavior
No ratings yet
The Role of Job Rotation in Enhancing Employee Performance in The Egyptian Travel Agents: The Mediating Role of Organizational Behavior
18 pages
AAPOR Guidance Nonprob Precision 042216
No ratings yet
AAPOR Guidance Nonprob Precision 042216
5 pages
Comm291 Practice Midterm
No ratings yet
Comm291 Practice Midterm
70 pages
The U.S.-Global Leadership Project: A Partnership Between The Meridian International Center and Gallup
No ratings yet
The U.S.-Global Leadership Project: A Partnership Between The Meridian International Center and Gallup
24 pages
7B.2 IvyLearn - Last Homework Polls and Margin of Error - 2HH-Fall 2023-Quantitative Reasoning
No ratings yet
7B.2 IvyLearn - Last Homework Polls and Margin of Error - 2HH-Fall 2023-Quantitative Reasoning
1 page
De-Mystifying Math and Stats for Machine Learning: Mastering the Fundamentals of Mathematics and Statistics for Machine Learning
From Everand
De-Mystifying Math and Stats for Machine Learning: Mastering the Fundamentals of Mathematics and Statistics for Machine Learning
Seaport AI Madhavan
No ratings yet
Sample Size for Analytical Surveys, Using a Pretest-Posttest-Comparison-Group Design
From Everand
Sample Size for Analytical Surveys, Using a Pretest-Posttest-Comparison-Group Design
Joseph George Caldwell
No ratings yet

The Margin of Error' For Differences in Polls

Uploaded by

The Margin of Error' For Differences in Polls

Uploaded by

The ‘Margin of Error’ for Differences in Polls

October 27, 2002

1 The Margin of Error for a Single Proportion

2 Differences within a single poll question

0.2 0.4 0.6 0.8 1.0

Var(p2 − p1 ) = Var(p1 ) + Var(p2 )

where q1 = 1 − p1 and q2 = 1 − p2 , so the margin of error becomes

6.1 Margin of error of the poll

6.2 Margin of error for difference in a single question

6.3 Margin of error for difference between two polls

You might also like