0% found this document useful (0 votes)

6 views31 pages

Biostats Lecture 9 Difference of Two Proportions v2

The document covers statistical methods for comparing two proportions, including hypothesis testing and confidence intervals. It discusses the importance of pooled estimates and variance calculations, as well as the chi-square test for goodness of fit and testing independence in contingency tables. Practical examples are provided to illustrate the application of these statistical techniques in analyzing categorical data.

Uploaded by

Cesar Calderon

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPTX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

6 views31 pages

Biostats Lecture 9 Difference of Two Proportions v2

Uploaded by

Cesar Calderon

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPTX, PDF, TXT or read online on Scribd

You are on page 1/ 31

IPHS 405

Inference for Categorical Data

Comparing Difference of Two Proportions
Biostats Lecture 9

Hua Yun Chen

Lester Arguelles

1
Biostats Lecture 9 (Diez Chapter 6.2)
Difference of Two Proportions (6.2)
Sampling Distribution of the Difference of Two Proportions (6.2.1)
Confidence Intervals for p1-p2 (6.2.2)
Hypothesis Tests for the Difference of Two Proportions (6.2.3)
More on 2-Proportions Hypothesis Tests (6.2.4)
Examining the Standard Error Formula (6.2.5)

2
Hypothesis testing about two
proportions
Baby Sex Proportion with Smoker Mother and
Nonsmoker Mother

Consider variables: sex_baby and smoke.

Test of equality of two population
proportions
1. The probability of baby is a boy when
mother does not smoke.

2. The probability of baby is a girl when mother

smokes.

3. The null hypothesis is: the two probabilities

are equal.
Estimate of the difference of two proportions
from the sample

Mother Female Male Total

smoke
Nonsmoker 49 51 100
Smoker 19 31 50
Total 68 82 150
The Pooled Estimate Of A Proportion
Is Needed For HT
In the case of comparing two proportions where H0: p1 = p2, there
isn't a given null value we can use to calculate the expected
number of successes and failures in each sample.
Therefore, we first need to find a common (pooled) proportion
for the two groups, and use that in our analysis.
This simply means finding the proportion of total successes
among the total number of observations.
Pooled estimate of a proportion

7
Variance estimates under the null
hypothesis
1. The variance of under the null hypothesis

2. The variance of

3. The variance of
Steps for testing the hypothesis
1. Find the point estimate for

2. Find the variance estimate and standard error

for ,

3. Find test statistic,

Steps for testing the hypothesis
(continuing)
4. Find the p-value (2-sided)

5. Compare p-value with type I error

6. Make a decision.
Fail to reject (accept) the hypothesis.
Confidence interval comparing
two proportions
Comparing two population
proportions
1. When two proportions may not be equal, we
are interested in finding the difference:

2. The first step is to obtain the corresponding

quantity from the sample (a point estimator),

3. The next step is to find variance of .

Variances Estimate
a). The variance of

b). The variance of

c). The variance of

Comparing two population
proportions (continuing)
4. Find the standard error for ,

5. Determine the margin of error. For 95%

confidence interval,

6. Find the confidence interval,

Comparing two population
proportions (continuing)
6. Find the confidence interval,

7. Interpretation of the confidence interval: With

95% confidence that is within to .

Practice: What is the 99% confidence interval

for ?
Comparing two population
proportions (continuing)
6. Find the confidence interval,

7. Interpretation of the confidence interval: With

95% confidence that is within to .

Practice: What is the 99% confidence interval

for ?
Conditions Necessary for Normal
Approximation
Independence between groups The subjects in the birth weight
study are sampled independently.

Success-failure At least 10 observed successes and 10 observed

failures in each group

17
Chi-square test of goodness of
fit.
Goodness of fit test for one-way
table.
1. The test for a proportion is for a binary
distribution.
2. For a categorical variable of more than two
categories, test of goodness of fit can be
done.
3. The test of goodness of fit is to examine if
the sample data follows a give distribution.
4. Such a test statistic is a chi-square
distributed.
Example on the mother age of birth
distribution

1. Mother’s age at birth is categorized into

three intervals less than 25, between 25-35,
and 35+.
2. The frequency table
Age range <25 years >=25 years, <35 >=35 years Total
years
Counts 64 65 21 150

Frequency 0.427 0.433 0.140 1.0

Hypothetical 0.4 0.4 0.2 1.0

Population freq.
Expected 150*0.4=60 150*0.4=60 150*0.2=30 150
frequency
Chi-square test of goodness of fit

1. Test statistic

2. For the example

3. P-value=chisq.dist(3.383,2,TRUE)=0.184.
Fails to reject the null hypothesis.
Testing independence (No
association) in contingency table
Test of independence
(no association)
Question: Is baby sex associated with whether
mother smokes? No association implies

Mother smoke BS= BS= Total

Female Male
MS=Nonsmoker 49(45.33) 51(54.67) 100(0.6667)
MS=Smoker 19(22.66) 31(27.33) 50(0.3333)
Total 68(0.4533) 82(0.5467) 150(1)
Steps for Testing of independence

Step 1. Find the margin distribution estimates

Mother smoke BS= BS= Total
Female Male
MS=Nonsmoker 100(0.6667)
MS=Smoker 50(0.3333)
Total 68(0.4533) 82(0.5467) 150(1)
Steps for Testing of independence

Step 2. Find the joint cell probability under

Mother smoke BS= BS= Total
Female Male
MS=Nonsmoker 0.3022 0.3645 0.6667
MS=Smoker 0.1511 0.1822 0.3333
Total 0.4533 0.5467 1

Do the same for the rest of cells.

Steps for Testing of independence

Step 3. Find the expected cell counts under

Mother smoke BS= BS= Total
Female Male
MS=Nonsmoker 45.33 54.67 100
MS=Smoker 22.67 27.33 50
Total 68 82 150

Do the same for the rest of cells.

Steps for Testing of independence

Step 4. Compare the expected cell counts with

the observed cell counts under
Mother smoke BS= BS= Total
Female Male
MS=Nonsmoker 49(45.33)[0.2971] 51(54.67)[0.2467] 100
MS=Smoker 19(22.67)[0.5941] 31(27.33)[0.4982] 50
Total 68 82 150

Calculate for each cell.

Steps for Testing of independence

Step 5. Compute the chi-square statistics.

Mother smoke BS= BS= Total
Female Male
MS=Nonsmoker 0.2971 0.2467 0.5438
MS=Smoker 0.5941 0.4982 1.0923
Total 0.8912 0.7449 1.6361

Calculate over all cells.

Steps for Testing of independence

Step 6. Determine the p-value.

This can be done in Excel as follows.

CHISQ.DIST(statistic, degree of freedom, cumulative).

Degree of freedom is determined by
Compare with test
1. Result of the test of independence (no
association) is usually similar to that of test
of two proportions in a 2x2 table.

2. The advantage of test of independence is

that it can be directly applied to 2x3 table,
3x2 table, 3x3 table, and any JxK table.

3. For JxK table, the degree of freedom for chi-

square distribution is .
Practice
The following table gives results of a study investigating the
low birth weight and the race of the mother.
Race Low birth Normal birth
weight weight
Black 13 18
White 25 80
Other 27 43

Test the hypothesis that there is no association between

race and birth weight.

Cheat Sheet PDF
No ratings yet
Cheat Sheet PDF
4 pages
Biostats Lecture 8 Foundations For Inference
No ratings yet
Biostats Lecture 8 Foundations For Inference
36 pages
Independent T Test & Paired T Test 2023
No ratings yet
Independent T Test & Paired T Test 2023
58 pages
Chi-Square Testing
No ratings yet
Chi-Square Testing
38 pages
Chapter 2-2
No ratings yet
Chapter 2-2
53 pages
Topic 6
No ratings yet
Topic 6
39 pages
ANP 802 Lecture 2verynew
No ratings yet
ANP 802 Lecture 2verynew
50 pages
Biostatistics 521 Lecture 14 Inference For Numerical Data II
No ratings yet
Biostatistics 521 Lecture 14 Inference For Numerical Data II
79 pages
11paired T
No ratings yet
11paired T
49 pages
Chapter 3
No ratings yet
Chapter 3
20 pages
HA 2 Solved
No ratings yet
HA 2 Solved
13 pages
Session 09
No ratings yet
Session 09
6 pages
7A. Comparing 2 Pop. Means
No ratings yet
7A. Comparing 2 Pop. Means
16 pages
Lecture 6 T Test Edited
No ratings yet
Lecture 6 T Test Edited
46 pages
Wa0102.
No ratings yet
Wa0102.
52 pages
Week 4 - Statistical Hypothesis Testing
No ratings yet
Week 4 - Statistical Hypothesis Testing
22 pages
Inference For One and Two Proportions
No ratings yet
Inference For One and Two Proportions
34 pages
Notes
No ratings yet
Notes
3 pages
Biostats Lecture 10 Inference For Means
No ratings yet
Biostats Lecture 10 Inference For Means
43 pages
Statistical Inferences
No ratings yet
Statistical Inferences
46 pages
Lecture Notes Stats Ich 9
No ratings yet
Lecture Notes Stats Ich 9
28 pages
Minitab Workbook
No ratings yet
Minitab Workbook
28 pages
5 & 6 - BIOSTATISTICS V & VI Inferential Statistics I & II
No ratings yet
5 & 6 - BIOSTATISTICS V & VI Inferential Statistics I & II
68 pages
MATH& 146 Lesson 30: Difference of Two Means
No ratings yet
MATH& 146 Lesson 30: Difference of Two Means
28 pages
Phân Tích Dữ Liệu Và Xác Định Phép Kiểm Thống Kê
No ratings yet
Phân Tích Dữ Liệu Và Xác Định Phép Kiểm Thống Kê
50 pages
Two Sample T-Test 29-12-14
No ratings yet
Two Sample T-Test 29-12-14
38 pages
(Ebook PDF) Qualitative Data Analysis: A Methods Sourcebook 4th Editioninstant Download
100% (3)
(Ebook PDF) Qualitative Data Analysis: A Methods Sourcebook 4th Editioninstant Download
57 pages
Biostat Estimation
100% (1)
Biostat Estimation
48 pages
25-Tests For Population Means (One Sample and Two Samples) - 30!09!2023
No ratings yet
25-Tests For Population Means (One Sample and Two Samples) - 30!09!2023
12 pages
Lecture 26 Compact
No ratings yet
Lecture 26 Compact
5 pages
ANOVA (Analysis of Variance)
No ratings yet
ANOVA (Analysis of Variance)
45 pages
Chapter 2
No ratings yet
Chapter 2
62 pages
Isds361b Notes
No ratings yet
Isds361b Notes
103 pages
Chap8.+T Test
No ratings yet
Chap8.+T Test
45 pages
47 independentTTest
No ratings yet
47 independentTTest
4 pages
PHPS30020 Week1 (5) - 29nov2023 (Test Decisions & Assumptions, Hypothesis, Compare 2 Groups)
No ratings yet
PHPS30020 Week1 (5) - 29nov2023 (Test Decisions & Assumptions, Hypothesis, Compare 2 Groups)
16 pages
Introduction To Hypothesis Testing24
No ratings yet
Introduction To Hypothesis Testing24
54 pages
Two Sample Inference: By: Girma M
No ratings yet
Two Sample Inference: By: Girma M
33 pages
Cube - 3x3x3 - OLL-PLL - 4-Look Version Updated
No ratings yet
Cube - 3x3x3 - OLL-PLL - 4-Look Version Updated
1 page
Class 12 Maths Mid-Term Paper
No ratings yet
Class 12 Maths Mid-Term Paper
7 pages
Bios O6s A4
No ratings yet
Bios O6s A4
23 pages
Z-Test For Single Mean
No ratings yet
Z-Test For Single Mean
32 pages
Class 19 Z Test T Test Copy 25
No ratings yet
Class 19 Z Test T Test Copy 25
10 pages
Mann-Whitney - Reading Material
No ratings yet
Mann-Whitney - Reading Material
5 pages
Ttest
No ratings yet
Ttest
14 pages
Lesson 4 TEST OF DIFFERENCE
No ratings yet
Lesson 4 TEST OF DIFFERENCE
26 pages
Hypothesis Testing-2 PDF
No ratings yet
Hypothesis Testing-2 PDF
16 pages
Non-Parametric Tests
100% (1)
Non-Parametric Tests
55 pages
Biostatistics L11+12 2021
No ratings yet
Biostatistics L11+12 2021
9 pages
Introduction To Inferential Statistics & Important Statistical Tests
100% (1)
Introduction To Inferential Statistics & Important Statistical Tests
55 pages
Mini
No ratings yet
Mini
28 pages
Modul 05
No ratings yet
Modul 05
13 pages
Statistical Technique Summary Table
No ratings yet
Statistical Technique Summary Table
4 pages
Inferential Statistics Powerpoint
No ratings yet
Inferential Statistics Powerpoint
65 pages
Summary Table For Statistical Techniques
No ratings yet
Summary Table For Statistical Techniques
4 pages
Biostat Handouts Lesson 12 PDF
No ratings yet
Biostat Handouts Lesson 12 PDF
41 pages
Bec403 (CS)
No ratings yet
Bec403 (CS)
7 pages
Mann Whitney U Test
No ratings yet
Mann Whitney U Test
9 pages
The IMA Volumes in Mathematics and Its Applications: Avner Friedman Willard Miller, JR
No ratings yet
The IMA Volumes in Mathematics and Its Applications: Avner Friedman Willard Miller, JR
172 pages
Design of Rural Water Supply System Using Loop 4.0
No ratings yet
Design of Rural Water Supply System Using Loop 4.0
9 pages
Thermal Deformation Analysis of Automotive Disc Brake Squeal
No ratings yet
Thermal Deformation Analysis of Automotive Disc Brake Squeal
26 pages
Unit-1 PCT
No ratings yet
Unit-1 PCT
14 pages
2022-2023 ASVAB Arithmetic Reasoning and Mathematics
No ratings yet
2022-2023 ASVAB Arithmetic Reasoning and Mathematics
4 pages
Stress Concentration Problems
No ratings yet
Stress Concentration Problems
15 pages
Curriculum Physics Program - Assignment 1 - Ryan Hamilton 91641872
No ratings yet
Curriculum Physics Program - Assignment 1 - Ryan Hamilton 91641872
32 pages
Principles of Artificial Intelligence
No ratings yet
Principles of Artificial Intelligence
15 pages
Principles of Discrete Time Mechanics Jaroszkiewicz G. PDF Download
100% (1)
Principles of Discrete Time Mechanics Jaroszkiewicz G. PDF Download
45 pages
Lec 8
No ratings yet
Lec 8
8 pages
Software Midterm
No ratings yet
Software Midterm
10 pages
Interpretation and Report Writing: Bm-Aryan Panchal
No ratings yet
Interpretation and Report Writing: Bm-Aryan Panchal
13 pages
Core Lap
No ratings yet
Core Lap
1 page
One Dimensional Array in Java - Tutorial & Example
No ratings yet
One Dimensional Array in Java - Tutorial & Example
4 pages
NFA To DFA Conversion: Rabin and Scott (1959)
No ratings yet
NFA To DFA Conversion: Rabin and Scott (1959)
14 pages
1.1 Functions and Theis Representations
No ratings yet
1.1 Functions and Theis Representations
17 pages
Vedic Mathematics 1 VAC PYQ
No ratings yet
Vedic Mathematics 1 VAC PYQ
8 pages
Presentation 2nd
No ratings yet
Presentation 2nd
26 pages
Wang-2024-Deep Reinforcement Learning For Dema
No ratings yet
Wang-2024-Deep Reinforcement Learning For Dema
13 pages
7TH Semester Syllabus
No ratings yet
7TH Semester Syllabus
9 pages
Control Proporcional
No ratings yet
Control Proporcional
5 pages
Error Checking in Java
No ratings yet
Error Checking in Java
18 pages
History of Exponents
No ratings yet
History of Exponents
2 pages
Generalized Extended Tanh-Function Method and Its Application
No ratings yet
Generalized Extended Tanh-Function Method and Its Application
10 pages
Bearing Capacityof Embedded Strip Footing Placed Adjacentto Sandy Soil Slopes
No ratings yet
Bearing Capacityof Embedded Strip Footing Placed Adjacentto Sandy Soil Slopes
8 pages
Efficient MIMO Detection With Imperfect Channel Knowledge - A Deep Learning Approach
No ratings yet
Efficient MIMO Detection With Imperfect Channel Knowledge - A Deep Learning Approach
6 pages
Tell Me The Odds: A 15 Page Introduction To Bayes Theorem
From Everand
Tell Me The Odds: A 15 Page Introduction To Bayes Theorem
Scott Hartshorn
4.5/5 (9)
Primordial Prescription: The Most Plaguing Problem of Life Origin Science
From Everand
Primordial Prescription: The Most Plaguing Problem of Life Origin Science
David L. Abel
No ratings yet
Neuroscientific based therapy of dysfunctional cognitive overgeneralizations caused by stimulus overload with an "emotionSync" method
From Everand
Neuroscientific based therapy of dysfunctional cognitive overgeneralizations caused by stimulus overload with an "emotionSync" method
Christian Hanisch
No ratings yet
Chi Squared for Beginners
From Everand
Chi Squared for Beginners
Stephanie Glen
No ratings yet
Self Muscle Testing: Two Reasons and 33 Beneficial Side-effects
From Everand
Self Muscle Testing: Two Reasons and 33 Beneficial Side-effects
Bruce Dickson
2/5 (1)

Biostats Lecture 9 Difference of Two Proportions v2

Uploaded by

Biostats Lecture 9 Difference of Two Proportions v2

Uploaded by

IPHS 405

Inference for Categorical Data

Hua Yun Chen

Consider variables: sex_baby and smoke.

2. The probability of baby is a girl when mother

3. The null hypothesis is: the two probabilities

Mother Female Male Total

2. Find the variance estimate and standard error

3. Find test statistic,

5. Compare p-value with type I error

2. The first step is to obtain the corresponding

3. The next step is to find variance of .

b). The variance of

c). The variance of

5. Determine the margin of error. For 95%

6. Find the confidence interval,

7. Interpretation of the confidence interval: With

Practice: What is the 99% confidence interval

7. Interpretation of the confidence interval: With

Practice: What is the 99% confidence interval

Success-failure At least 10 observed successes and 10 observed

1. Mother’s age at birth is categorized into

Frequency 0.427 0.433 0.140 1.0

Hypothetical 0.4 0.4 0.2 1.0

2. For the example

Mother smoke BS= BS= Total

Step 1. Find the margin distribution estimates

Step 2. Find the joint cell probability under

Do the same for the rest of cells.

Step 3. Find the expected cell counts under

Do the same for the rest of cells.

Step 4. Compare the expected cell counts with

Calculate for each cell.

Step 5. Compute the chi-square statistics.

Calculate over all cells.

Step 6. Determine the p-value.

This can be done in Excel as follows.

CHISQ.DIST(statistic, degree of freedom, cumulative).

2. The advantage of test of independence is

3. For JxK table, the degree of freedom for chi-

Test the hypothesis that there is no association between

You might also like