0% found this document useful (0 votes)

8 views32 pages

Module 3b - Random Sampling and Sampling Error

Public Health module, UNSW

Uploaded by

dewinrswr

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

8 views32 pages

Module 3b - Random Sampling and Sampling Error

Public Health module, UNSW

Uploaded by

dewinrswr

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 32

Module 3b – Random sampling

and sampling error

Katrina Blazek
PHCM9794: Foundations of Epidemiology
Learning outcomes
• State and distinguish between the main sources of bias in
epidemiological studies and distinguish between random and
systematic error
• Understand how to interpret statistical significance and confidence
intervals
Overview
Random sampling
Random sampling error
Confidence intervals and P values
Precision
Type I and Type II errors
A population of people
Let’s select 20 people randomly
Let’s select 20 people randomly
This is our sample
We can describe the sample

Mean SBP
= 126 mmHg
Random selection
Selection of participants for a study on the basis of chance
• Each participant in source population has same chance
(probability) of being included
• E.g. using a random number generator

Requires a sampling frame – a list of people in the source

population, to which the random selection process is applied

Random selection produces a representative sample

What if we select a different sample?
What if we select a different sample?
Here’s our second sample
The two samples are different
Sample 1 Sample 2 Random sampling error
• Sample means differ
from each other, just by
chance
• May also differ from
(unobserved)
population mean
Mean SBP Mean SBP
= 126 mmHg = 121 mmHg • This is OK. Handled
with confidence
intervals and p values
Confidence intervals
Range of values within which we are reasonably confident the
true (unobserved) mean lies

Confidence intervals can be calculated for many types of

statistics, e.g. proportion, risk ratio, odds ratio etc.

Most common are 95% confidence intervals

• 99% (wider) and 90% (narrower) sometimes used
95% Confidence Intervals (CI)
Sample 1 Sample 2

Mean SBP Mean SBP

= 126 mmHg = 121 mmHg
(95% CI 117 to 135 (95% CI 112 to 130
mmHg) mmHg)
95% Confidence Interval
If we repeat the study a number of times, the confidence intervals
would contain the true (unobserved) population mean 95% of
those times

Can be interpreted as level of confidence (95%) we have that the

true value lies within the given range

NOT: 95% probability that the true mean lies in this interval
Objective: To evaluate longer term symptoms and health outcomes
associated with post-covid-19 condition within a cohort of individuals
with a SARS-CoV-2 infection.

Results: 22.9% (95% confidence interval 20.4% to 25.6%) of individuals

infected with SARS-CoV-2 did not fully recover by six months.

We are 95% confident that in the population the true proportion of those who
don’t recover from SARS-CoV-2 within 6 months is between 20.4% and 25.6%

BMJ 2023;381:e074425 | doi: 10.1136/bmj-2022-074425

What if we collected a rd
3 , smaller sample?
Sample 1 Sample 2 Sample 3

The
interval
is wider
Mean SBP Mean SBP Mean SBP
= 126 mmHg = 121 mmHg = 115 mmHg
(95% CI 117 to 135 (95% CI 112 to 130 (95% CI 98 to 132
mmHg) mmHg) mmHg)
Precision
The larger the sample, the more precise the estimate
• Increasing sample size decreases confidence interval width
• Decreasing sample size increases confidence interval width
Hypothesis testing
Confidence interval gives a likely range within which we are
reasonably confident that the true value lies

Does not give a quantitative assessment of evidence against a

null-hypothesis
Hypothesis testing in a nutshell
1. Formulate a null hypothesis – no difference (no effect)
2. Calculate a ‘test statistic’ from your data
3. Obtain a P value

(Covered in more detail in Foundations of Biostatistics)

What is P?
The probability of obtaining data like yours, or more extreme, if
the null hypothesis is true

NOT: The probability the null hypothesis is true

Reminder: Probabilities range from 0 (the event will never occur)

to 1 (the event will always occur)
Statistical significance
Black and white? Shades of grey
1.0 1.0

Little or no evidence

0.10
Weak evidence
0.05 0.05
Evidence
0.01
Significant Strong evidence
0.001
0 Very strong evidence 0
Probability of obtaining a RR
of 1.011 (or larger), just by
chance, given these data
Objective: To assess whether an easy-to-use multifaceted
intervention for children presenting to primary care with respiratory Not the probability that
tract infections would reduce antibiotic dispensing, without there is no difference
increasing hospital admissions for respiratory tract infection.
between groups
Result: No evidence was found that antibiotic dispensing differed
between intervention practices … and control practices … (rate ratio
1.011, 95% confidence interval 0.992 to 1.029; P=0.25).

BMJ 2023;381:e072488 | doi: 10.1136/bmj-2022-072488

Our conclusion might be wrong
Reality vs our study

There’s no effect
(Null hypothesis is TRUE)

There is an effect
(Null hypothesis is FALSE)

Image by Monika Grafik from Pixabay

Our conclusion might be wrong
Reality vs our study

Reject Do not reject

null hypothesis null hypothesis

Probability = 𝜶
There’s no effect Type I error TRUE negative
(Null hypothesis is TRUE)
 ✓
There is an effect TRUE positive Type II error
(Null hypothesis is FALSE)
✓ 
Power (𝟏 − 𝜷) Probability = 𝜷

Image by Monika Grafik from Pixabay

First, the citizens commit a type I error
by believing there is a wolf when there
is not.

Second, the citizens commit a type II

error by believing there is no wolf when
there is one.

https://www.microsoft.com/en-au/p/the-boy-who-cried-
wolf/8d6kgwzxmmst?activetab=pivot%3aoverviewtab

https://www.ncbi.nlm.nih.gov/books/NBK557530/
Clinical vs statistical significance

Webb, Bain & Page, 2020 (Chapter 6)

Other pitfalls
Multiple testing
• If 𝛼 = 0.05 then 1 in 20 chance of false positive
• Can make alpha smaller (e.g. 0.01) or adjust for multiple
testing

Complex sample designs

• Stratified, clustered and multi-stage sampling
• Analysis is complex (use advanced techniques)
“Pet owners had significantly lower systolic blood pressure
and plasma triglycerides than non-owners. “

https://onlinelibrary.wiley.com/doi/epdf/10.5694/j.1326-5377.1992.tb137178.x
“While pet owners and non-pet owners had similar levels of systolic blood
pressure, those with pets had significantly higher diastolic blood pressure. “

https://onlinelibrary.wiley.com/doi/epdf/10.5694/j.1326-5377.2003.tb05649.x

Statistics For Dummies
100% (3)
Statistics For Dummies
41 pages
GHG Emissions Calculator Ver01.1 Web
0% (1)
GHG Emissions Calculator Ver01.1 Web
91 pages
Ionic Equilibria: Ostwald'S Dilution Law
No ratings yet
Ionic Equilibria: Ostwald'S Dilution Law
39 pages
Anticancer Drugs Classification
100% (1)
Anticancer Drugs Classification
19 pages
Estimate TRR SDF Roads & Drains 18-8-2023
No ratings yet
Estimate TRR SDF Roads & Drains 18-8-2023
750 pages
Philosophers Way Thinking Critically About Profound Ideas 5th Edition Chaffee Test Bank 1
100% (76)
Philosophers Way Thinking Critically About Profound Ideas 5th Edition Chaffee Test Bank 1
14 pages
Module 3 - Lecture Notes
No ratings yet
Module 3 - Lecture Notes
6 pages
Is Bigger Better?: An Introduction To Sample Size Calculations
No ratings yet
Is Bigger Better?: An Introduction To Sample Size Calculations
52 pages
Cognizance On Fish 1
No ratings yet
Cognizance On Fish 1
12 pages
Shell Analysis Manual
No ratings yet
Shell Analysis Manual
848 pages
BioStats and Epidemiology BNB Notes
No ratings yet
BioStats and Epidemiology BNB Notes
39 pages
3rd Grade Unit 2 Planner Weather - 1
100% (3)
3rd Grade Unit 2 Planner Weather - 1
9 pages
Things To Know PDF
No ratings yet
Things To Know PDF
56 pages
Inferential Statistics
100% (1)
Inferential Statistics
57 pages
Science General Chemistry 1: Whole Brain Learning System Outcome-Based Education
No ratings yet
Science General Chemistry 1: Whole Brain Learning System Outcome-Based Education
20 pages
TM AHU 60R410A Onoff T SA NA 171205
No ratings yet
TM AHU 60R410A Onoff T SA NA 171205
67 pages
WUC111 Final Exam
No ratings yet
WUC111 Final Exam
4 pages
Mercuria Energy&Commodities Brochure
No ratings yet
Mercuria Energy&Commodities Brochure
6 pages
Passmedicine Statistics Note 2021: Prepared by DR - Abohaneen Mrcpase Telegram Group
No ratings yet
Passmedicine Statistics Note 2021: Prepared by DR - Abohaneen Mrcpase Telegram Group
25 pages
Inferential Statistics Powerpoint
No ratings yet
Inferential Statistics Powerpoint
65 pages
Statistics For Dummies Rachel Enriquez
No ratings yet
Statistics For Dummies Rachel Enriquez
41 pages
Basic Biostats, 2
No ratings yet
Basic Biostats, 2
58 pages
Hypothesis Testing Ug
No ratings yet
Hypothesis Testing Ug
66 pages
Energy Efficiency@festo
No ratings yet
Energy Efficiency@festo
60 pages
Epidemiology 8: Fundamentals of Statistical & Epidemiological Inference
No ratings yet
Epidemiology 8: Fundamentals of Statistical & Epidemiological Inference
31 pages
MD115 Wk05
No ratings yet
MD115 Wk05
86 pages
Topic 7 - P Value, CI
No ratings yet
Topic 7 - P Value, CI
48 pages
Slides CH 14
No ratings yet
Slides CH 14
50 pages
Confidence Interval: DR - Renj U
No ratings yet
Confidence Interval: DR - Renj U
71 pages
Statistical Inferences
No ratings yet
Statistical Inferences
46 pages
Çıkarımsal İstatistik
No ratings yet
Çıkarımsal İstatistik
30 pages
Lecture Slides Sec 5 Power
No ratings yet
Lecture Slides Sec 5 Power
41 pages
Introduction To Key Statistical Concepts - 2024
No ratings yet
Introduction To Key Statistical Concepts - 2024
27 pages
Point Estimation and Interval Estimation: Learning Objectives
No ratings yet
Point Estimation and Interval Estimation: Learning Objectives
58 pages
Lec 9 (Hypothesis Testing)
No ratings yet
Lec 9 (Hypothesis Testing)
53 pages
What Do P-Values and Confidence Intervals Really Tell Us?
No ratings yet
What Do P-Values and Confidence Intervals Really Tell Us?
52 pages
Science
No ratings yet
Science
26 pages
Biostatistics
No ratings yet
Biostatistics
32 pages
Understanding P - Values and CI 20nov08
No ratings yet
Understanding P - Values and CI 20nov08
37 pages
Confidence Limits in Statistics
No ratings yet
Confidence Limits in Statistics
30 pages
14632practicalsignificance 161017020922
No ratings yet
14632practicalsignificance 161017020922
25 pages
Estimation
No ratings yet
Estimation
29 pages
Phân Tích Dữ Liệu Và Xác Định Phép Kiểm Thống Kê
No ratings yet
Phân Tích Dữ Liệu Và Xác Định Phép Kiểm Thống Kê
50 pages
Module 4 - Hypothesis Testing - 1pp
No ratings yet
Module 4 - Hypothesis Testing - 1pp
55 pages
P Value
No ratings yet
P Value
31 pages
STAB22 Final Exam Review Seminar (WINTER 2021)
No ratings yet
STAB22 Final Exam Review Seminar (WINTER 2021)
65 pages
Hypothesis Testing-2 PDF
No ratings yet
Hypothesis Testing-2 PDF
16 pages
2 Intro Inferrential Statistics
No ratings yet
2 Intro Inferrential Statistics
24 pages
PSM 201 Sampling Distributions and Hypothesis Testing
No ratings yet
PSM 201 Sampling Distributions and Hypothesis Testing
31 pages
2 Intro To Inferential Stat
No ratings yet
2 Intro To Inferential Stat
37 pages
2 2020 12 21!07 45 30 PM
No ratings yet
2 2020 12 21!07 45 30 PM
40 pages
95% Confidence Interval
No ratings yet
95% Confidence Interval
18 pages
Borang ISIP
No ratings yet
Borang ISIP
8 pages
Statistical Inference: Interval Estimation: Laboratory Session 4
No ratings yet
Statistical Inference: Interval Estimation: Laboratory Session 4
14 pages
Statics (Medicalstudyzone - Com)
No ratings yet
Statics (Medicalstudyzone - Com)
26 pages
7 Colin Shelley Facts Global Energy
No ratings yet
7 Colin Shelley Facts Global Energy
13 pages
6 - Hypothesis Testing
No ratings yet
6 - Hypothesis Testing
27 pages
375 6 2 +chance+5 6 24
No ratings yet
375 6 2 +chance+5 6 24
44 pages
Endangered Species
No ratings yet
Endangered Species
14 pages
Confidence Intervals (Cis) : DR Trevor Bryant
No ratings yet
Confidence Intervals (Cis) : DR Trevor Bryant
13 pages
PPT10 14 - Nov7
No ratings yet
PPT10 14 - Nov7
26 pages
Physiology of Lactation
No ratings yet
Physiology of Lactation
16 pages
IM (VM 8.2.1.5) Significance Testing (Lecture)
No ratings yet
IM (VM 8.2.1.5) Significance Testing (Lecture)
24 pages
Basic Concepts
No ratings yet
Basic Concepts
13 pages
Original Research Article: Chemical Composition and Antioxidant Activity of Essential Oil of Achillea
No ratings yet
Original Research Article: Chemical Composition and Antioxidant Activity of Essential Oil of Achillea
14 pages
Final Review Sp25
No ratings yet
Final Review Sp25
59 pages
Gas Pressure Regulator Series 240Pl: Serving The Gas Industry Worldwide
No ratings yet
Gas Pressure Regulator Series 240Pl: Serving The Gas Industry Worldwide
11 pages
Indigenous Environmental Education For Cultural Survival: Leanne Simpson, Trent University, Canada
No ratings yet
Indigenous Environmental Education For Cultural Survival: Leanne Simpson, Trent University, Canada
13 pages
Biostatistics 3 Two Samples, Discrete Case
No ratings yet
Biostatistics 3 Two Samples, Discrete Case
20 pages
Bhu 18
No ratings yet
Bhu 18
10 pages
Introduction To Hypothesis Testing - 23nov2023 - Updates
No ratings yet
Introduction To Hypothesis Testing - 23nov2023 - Updates
11 pages
Ap Statistics 2017-2018 Term 3 Assignment Sohee Han
No ratings yet
Ap Statistics 2017-2018 Term 3 Assignment Sohee Han
7 pages
Nciph ERIC2
No ratings yet
Nciph ERIC2
7 pages
Harvard MPH 65 Curriculum Guide 2021 2022 FINAL 08.20.2021
No ratings yet
Harvard MPH 65 Curriculum Guide 2021 2022 FINAL 08.20.2021
32 pages
AP Review Part IV - Harrison
No ratings yet
AP Review Part IV - Harrison
8 pages
1 PB
No ratings yet
1 PB
9 pages
Guarniz Flores, Joel Luis
No ratings yet
Guarniz Flores, Joel Luis
8 pages
Draft Article Chest CT
No ratings yet
Draft Article Chest CT
6 pages
Supplementary Material ADA
No ratings yet
Supplementary Material ADA
5 pages
Making Meaningful Inferences About Magnitudes: Invited Commentary
No ratings yet
Making Meaningful Inferences About Magnitudes: Invited Commentary
9 pages
03 Fact Sheet HME712 Bos - 3 General Principles of Hypothesis Testing
No ratings yet
03 Fact Sheet HME712 Bos - 3 General Principles of Hypothesis Testing
2 pages
Names of Member: Rev.C.O Fatoye Pastor Matthew O.Bamise
No ratings yet
Names of Member: Rev.C.O Fatoye Pastor Matthew O.Bamise
6 pages
Hypothesis Testing, P Values, Confidence Intervals, and Significance
No ratings yet
Hypothesis Testing, P Values, Confidence Intervals, and Significance
6 pages
1st Q EXAM
No ratings yet
1st Q EXAM
4 pages
Cerebral Toxoplasmosis in An Immunocompetent Patient: Mycobacterium Tuberculosis
No ratings yet
Cerebral Toxoplasmosis in An Immunocompetent Patient: Mycobacterium Tuberculosis
3 pages
Ranganathan Et Al., 2015 - Valores de P e IC
No ratings yet
Ranganathan Et Al., 2015 - Valores de P e IC
2 pages
1 Vocab Reasoning
No ratings yet
1 Vocab Reasoning
3 pages
Bridget 1
No ratings yet
Bridget 1
2 pages
Quiz3 Sol
No ratings yet
Quiz3 Sol
2 pages
CS 191x Courseware4
No ratings yet
CS 191x Courseware4
3 pages
Submittal Data: End Suction Stainless Steel Pumps
No ratings yet
Submittal Data: End Suction Stainless Steel Pumps
3 pages
Reading Froudekrylov
No ratings yet
Reading Froudekrylov
6 pages
What Is A P Value
No ratings yet
What Is A P Value
4 pages
Naked Truth of Covid Test
From Everand
Naked Truth of Covid Test
Dr. Biswaroop Roy Chowdhury
5/5 (1)

Module 3b - Random Sampling and Sampling Error

Uploaded by

Module 3b - Random Sampling and Sampling Error

Uploaded by

Module 3b – Random sampling

and sampling error

Requires a sampling frame – a list of people in the source

Random selection produces a representative sample

Confidence intervals can be calculated for many types of

Most common are 95% confidence intervals

Mean SBP Mean SBP

Can be interpreted as level of confidence (95%) we have that the

Results: 22.9% (95% confidence interval 20.4% to 25.6%) of individuals

BMJ 2023;381:e074425 | doi: 10.1136/bmj-2022-074425

Does not give a quantitative assessment of evidence against a

(Covered in more detail in Foundations of Biostatistics)

NOT: The probability the null hypothesis is true

Reminder: Probabilities range from 0 (the event will never occur)

BMJ 2023;381:e072488 | doi: 10.1136/bmj-2022-072488

Image by Monika Grafik from Pixabay

Reject Do not reject

Image by Monika Grafik from Pixabay

Second, the citizens commit a type II

Webb, Bain & Page, 2020 (Chapter 6)

Complex sample designs

You might also like