Sample Size Calculation & Software
KWENTI E. TEBIT
Why do sample size calculations?
Prospective study design with sample size calculation helps to avoid studies that are:
• Too small: leads to equivocal results. An underpowered study may dismiss a potentially beneficial
treatment, or may fail to detect an important relationship.
• Too large: wastes resources.
Both sample size errors create ethical issues when using humans or animals.
• Too small: you have exposed them to harm with little likelihood of learning anything.
• Too big: you have exposed more of them to harm than was necessary.
Why do sample size calculations?
Secondary benefit: Makes for better studies. Before you can do a sample size calculation, you will
have to:
• Define the scientific issue you are addressing.
• Translate the issue into research questions or hypotheses.
• Determine what data are needed.
• Formulate the questions or hypotheses in terms of parameters describing the distribution of the
data to be collected.
• Map out the statistical analysis plans
Cont…
The process of sample size calculation can substantially improve study design. It requires one to
think through:
• definition of the scientific issue
• how the scientific issue is being formulated as an empirical question
• sampling plan
• variables to be collected
• statistical analysis plan
• expected results
In general, if the details of implementation have been glossed over, this will become obvious
during sample size calculation.
Recall that the p-value from a hypothesis test can be used to
1. decide whether to reject the null hypothesis (reject if p-value less than α)
2. summarize the evidence against the null
For the purposes of designing a study we use the first method.
Typically α = 0.05.
When we run the study we can also interpret the p-value as evidence against the null.
Definitions:
The power of a test is the probability of the correct decision when the null hypothesis is false.
Power = Pr(reject H0 | H0 is false)
That is, the power is the probability of finding an effect when an effect exists.
Power = Pr(reject H0 | H0 is false)
= 1 − Pr(fail to reject H0 | H0 is false) = 1 − β
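The definition Power = Pr(reject H0 | H0 is false) can be illustrated by simulation (a sketch of my own, not from the slides): repeatedly draw two samples from populations that truly differ, test them, and count how often H0 is rejected. The values below assume a true standardized difference d = 0.5, n = 64 per group, and a normal-approximation z-test with known SD = 1.

```python
import math
import random
from statistics import NormalDist

def simulate_power(d=0.5, n=64, alpha=0.05, sims=2000, seed=1):
    """Estimate power by simulation: the fraction of simulated studies that
    reject H0 when the true standardized mean difference is d
    (two-sided z-test, both groups with known SD = 1)."""
    random.seed(seed)
    z_crit = NormalDist().inv_cdf(1 - alpha / 2)  # 1.96 for alpha = 0.05
    rejections = 0
    for _ in range(sims):
        a = [random.gauss(0.0, 1.0) for _ in range(n)]
        b = [random.gauss(d, 1.0) for _ in range(n)]
        se = math.sqrt(2.0 / n)  # SE of the difference in means
        z = (sum(b) / n - sum(a) / n) / se
        if abs(z) > z_crit:
            rejections += 1
    return rejections / sims

print(simulate_power())  # close to the theoretical power of about 0.80
```

With these settings the estimate lands near 0.80, i.e. β ≈ 0.20: about one in five such studies would miss a real effect of this size.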
Cont…
Overpowered:
If the sample size is too large the study will be able to detect very small differences.
This is a waste of money and time if the difference is so small it is scientifically or clinically
unimportant.
If the intervention is risky you have put too many individuals at risk.
Underpowered:
If the sample size is too small the study will be unable to detect differences that are scientifically or
clinically important.
The risk taken by the individuals in the study was unnecessary because the study was unlikely to
detect clinically important effects.
Also a waste of money and time.
Methods of calculating sample sizes
There are three main methods of estimating sample sizes: online calculators, software, and manual
calculation.
Online
There are websites that can be used for calculating sample sizes hosted by a number of
organisations and are open for the public such as
www.surveysystem.com/sscalc.htm
www.nss.gov.au/nss/home.nsf/pages/Sample+size+calculator
www.raosoft.com/samplesize.html
https://www.surveymonkey.com/mp/sample-size-calculator/
powerandsamplesize.com/
www.calculator.net
https://fluidsurveys.com/survey-sample-size-calculator/
Manual computation
Using formulae and calculators you can estimate the sample size required for your study such
as
n = Z² × P(1 − P) / e²  } Lorentz formula
where Z is the standard normal value for the desired confidence level, P is the expected
proportion, and e is the margin of error.
There are formulae for case control studies, comparison of proportions, comparison of means,
diagnostic accuracy and other descriptive studies. (see Hajian-Tilaki K. Journal of Biomedical
Informatics 2014; 48: 193–204).
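As a small worked sketch of the Lorentz formula (assuming the conventional values Z = 1.96 for 95% confidence, an expected proportion P = 0.5, and a margin of error e = 0.05):

```python
import math

def lorentz_n(z=1.96, p=0.5, e=0.05):
    """Sample size for estimating a proportion: n = Z^2 * P(1 - P) / e^2."""
    n = (z ** 2) * p * (1 - p) / (e ** 2)
    return math.ceil(n)  # always round up to a whole participant

print(lorentz_n())        # 385
print(lorentz_n(e=0.03))  # 1068 (tighter margin of error needs more people)
```

P = 0.5 is the conservative choice: it maximizes P(1 − P), so the resulting n is sufficient whatever the true proportion turns out to be.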
Software
Statistical software can also be used to calculate sample size e.g. Epi Info, SPSS, Stata, SAS,
R, Minitab etc.
Many of these give you little direct control over your type 1 and type 2 errors.
G*Power offers great control over your type 1 and type 2 errors and provides a more
accurate approach to estimating your sample size.
Introduction to G*Power
Many papers published in the scientific literature do not have enough power to support firm
conclusions.
G*Power is an easy-to-use program for performing various types of power analysis.
G*Power version 3.1.9.2 was written by Franz Faul, Universität Kiel, Germany.
It is the most widely used software for power analysis.
Types of power analyses
The two most common types are a priori and post-hoc power analysis.
a priori
An a priori analysis is done before a study takes place.
It is the ideal type of power analysis because it provides users with a method to control both
the type 1 error probability α and the type 2 error probability β
By implication, it also controls the power of the test, that is, the complement of the type-2
error probability (1 - β) (i.e., the probability of correctly rejecting H0 when it is in fact false).
An a priori analysis is used to determine the necessary sample size N of a test given a desired α
level, a desired power level (1 - β), and the size of the effect to be detected
i.e. a measure of the difference between the H0 and the H1
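An a priori calculation for a two-group comparison of means can be sketched with the common normal-approximation formula n per group = 2(z₁₋α/2 + z₁₋β)² / d² (my assumption here; G*Power itself uses the exact noncentral t distribution, so its answers run slightly larger):

```python
import math
from statistics import NormalDist

def a_priori_n(d, alpha=0.05, power=0.80):
    """Per-group n for a two-sided, two-sample comparison of means
    (normal approximation; G*Power's exact t-based result is slightly larger)."""
    z = NormalDist()
    z_alpha = z.inv_cdf(1 - alpha / 2)  # critical value for two-sided alpha
    z_beta = z.inv_cdf(power)           # quantile for the desired power
    n = 2 * (z_alpha + z_beta) ** 2 / d ** 2
    return math.ceil(n)

print(a_priori_n(0.5))  # 63 per group (the exact t-test answer is 64)
```

Note how n scales with 1/d²: halving the effect size you wish to detect quadruples the required sample size.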
post-hoc analysis
A post-hoc analysis is typically performed after a study has been conducted, so the sample
size N is already a matter of fact. Given N, α, and a specified effect size, this type of analysis
returns the power (1 − β) and, equivalently, the type 2 error probability β of the test.
Obviously, post-hoc analyses are less ideal than a priori analyses because only α is controlled, not β.
Both β and its complement (1 - β) are assessed but not controlled in post-hoc analyses.
Thus, post-hoc power analyses can be characterized as instruments providing for a critical
evaluation of the (often surprisingly large) error probability β associated with a false decision in
favor of the H0.
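A post-hoc calculation under the same normal approximation (a sketch of my own, not G*Power's exact t-based algorithm) takes the achieved per-group n, α, and effect size d, and returns the approximate power, from which β follows:

```python
import math
from statistics import NormalDist

def post_hoc_power(n_per_group, d, alpha=0.05):
    """Approximate power of a two-sided, two-sample comparison of means,
    given per-group sample size and standardized effect size d."""
    z = NormalDist()
    z_alpha = z.inv_cdf(1 - alpha / 2)
    ncp = d * math.sqrt(n_per_group / 2)  # noncentrality (normal approx.)
    # probability the test statistic falls in either rejection region
    return z.cdf(ncp - z_alpha) + z.cdf(-ncp - z_alpha)

p = post_hoc_power(30, 0.5)
print("power ~", round(p, 2), "so beta ~", round(1 - p, 2))
```

With 30 per group and d = 0.5, power is only about 0.5: even a genuine medium-sized effect would be missed roughly half the time, which is exactly the "surprisingly large β" the slide warns about.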
Getting to know G*Power
When you open G*Power for the first time, it presents three windows: the input parameters, the
output parameters, and a window presenting the distribution plot.
Above, you have the menu bar with commands such as File, Edit, View, Tests,
Calculator, etc.
Below the distribution plot window, you have three drop down menus; test family, statistical
test, type of power analysis.
Below the input and output parameters, you have commands to plot an X-Y graph for a range
of values.
How to use G*Power
1) Observational studies
• Descriptive studies
• Ecological studies (note the risk of the ecological fallacy)
• Cross-sectional studies
• Case-control studies
• Cohort studies
2) Experimental studies
• Randomised controlled trials
• Field trials
• Community trials
Effect size
An effect size is simply an objective and standardized measure of the magnitude of an observed
effect (Field, 2005).
The fact that the measure is standardized means that we can compare effect sizes across
different studies that have measured different variables, or have used different scales of
measurement.
The most common measures of effect sizes are Cohen’s d, and Pearson’s correlation
coefficient, r.
Others include Hedges’ g, Glass’ Δ, odds ratios and risk ratios.
1) Correlation coefficients (r)
r is constrained to lie between 0 (no effect) and 1 (a perfect effect), and can also be
used to express the difference between two means or groups.
r is related to the t in the t-test: r can be easily obtained from several common test
statistics.
For example, if a t-test has been used, r is a function of the observed t-value and the
degrees of freedom, df, on which it is based:
r = √( t² / (t² + df) )
When ANOVA has been used and an F-ratio is the test statistic, then when there is 1
degree of freedom for the effect, the following conversion can be used:
r = √( F(1, dfR) / (F(1, dfR) + dfR) )
in which F(1, dfR) is simply the F-ratio for the effect (which must have 1 degree of
freedom) and dfR is the degrees of freedom for the error term on which the F-ratio is
based.
r = 0.10 (small effect): in this case, the effect explains 1% of the total variance.
r = 0.30 (medium effect): the effect accounts for 9% of the total variance.
r = 0.50 (large effect): the effect accounts for 25% of the total variance.
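The t-to-r and F-to-r conversions described above, r = √(t²/(t² + df)) and r = √(F/(F + dfR)) for an effect with 1 numerator degree of freedom, can be sketched as small helpers:

```python
import math

def r_from_t(t, df):
    """Convert a t statistic to effect size r: r = sqrt(t^2 / (t^2 + df))."""
    return math.sqrt(t ** 2 / (t ** 2 + df))

def r_from_f(f, df_error):
    """Convert an F ratio (1 numerator df) to r: r = sqrt(F / (F + dfR))."""
    return math.sqrt(f / (f + df_error))

print(round(r_from_t(2.0, 48), 3))  # 0.277
print(round(r_from_f(4.0, 48), 3))  # 0.277
```

The two calls agree because with 1 numerator degree of freedom F = t², so both conversions recover the same r from the same data.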
2) Cohen’s d
d = (M1 − M2) / pooled SD, where pooled SD = √( (SD1² + SD2²) / 2 )
Among 7th graders in Lowndes County Schools taking the CRCT reading exam (N = 336),
there was a statistically significant difference between the two teaching teams, team 1 (M
= 818.92, SD = 16.11) and team 2 (M = 828.28, SD = 14.09). Compute the effect size.
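A sketch of that computation, using the equal-n pooled SD, √((SD1² + SD2²)/2); this is an approximation here, since the exercise gives only the total N = 336 and not the size of each team:

```python
import math

def cohens_d(m1, sd1, m2, sd2):
    """Cohen's d with the equal-n pooled SD: sqrt((sd1^2 + sd2^2) / 2)."""
    pooled_sd = math.sqrt((sd1 ** 2 + sd2 ** 2) / 2)
    return (m2 - m1) / pooled_sd

d = cohens_d(818.92, 16.11, 828.28, 14.09)
print(round(d, 2))  # 0.62
```

By Cohen's conventions (0.2 small, 0.5 medium, 0.8 large), d ≈ 0.62 is a medium-to-large effect: team 2 scored about six tenths of a standard deviation higher than team 1.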