6.sampling Technique
6.sampling Technique
March, 2024
1
Objective
2
Terminologies
• Sampling frame: the list of all units in the reference population, from
which the sample is to be picked.
3
Sampling technique
4
Why sample?
• The population of interest is usually too large to
attempt to survey all of its members.
5
Steps in sampling
1. Define population (N) to be sampled
6. Select sample
6
Basic conditions in sampling process
• Sample must be well chosen: – Representativeness
• Sample must be sufficiently large: – Minimizes sampling
variation
• There must be adequate coverage of the sample: –
Information should be obtained from almost all
Two keys
1. Selecting the right people
Have to be selected scientifically so that they are representative of the
population
2. Selecting the right number of the right people
To minimize sampling errors I.e. choosing the wrong people by chance
7
Basic terms
A population is a group of individuals persons, objects, or
items from which samples are taken for measurement.
Reference population (or target population): the population
of interest to whom the researchers would like to make
generalizations.
Source population: is the sub set of target population where
samples are drawn.
Study population: the group that is studied, either in total or
by selecting a sample of its members
Study unit: the units on which information will be collected:
persons, families, housing units, health facilities, schools
8
9
Study subjects
Hierarchy of sampling
The actual
participants in
the study
Sample
Subjects who are
selected
Sampling Frame
The list of potential subjects
from which the sample is
drawn
Source population
The Population from whom the study
subjects would be obtained
Target population
The population to whom the results would be 10
Advantage of sampling
11
Drawback of sampling
12
Characteristics of Good
Samples
1. Representation
• Sample surveys are almost never conducted for the purposes
of describing the particular sample under study. Rather they
are conducted for purposes of understanding the larger
population from which the sample was initially selected
13
3 factors that influence sample representativeness
• Sampling procedure
• Sample size
• Participation (response)
14
GOAL OF STATISTICS
Types of sampling method
16
Cont’d…
2. Non-probability sampling is a sampling method where
every item has an unknown chance of being selected
18
1.Simple random sampling
• Involves random selection
• Most common form of probability sampling.
• To use a SRS method:
– Make a numbered list of all the units in the population
(sampling frame)
– Each unit should be numbered from 1 to N (where N is
the size of the population)
- Decide on the size of sample
– Select the required number.
19
Cont’d…
20
Cont’d…
Random number table
• It is a table of random numbers constructed by a process
that
21
22
SIMPLE RANDOM SAMPLING
24
2. Systematic random sampling
25
Cont’d…
26
27
3. Stratified sampling
• It is done when the population is known to have
heterogeneity with regard to some factors and those factors
are used for stratification
28
Cont’d…
1) Proportionate STRS
2) Disproportionate STRS
29
• In the case of Proportionate STRS
- Determine the proportionof each stratum
in the study population
- p = elements (#) in each stratum
total pop. size
• Determine the number of elements to
be selected from each stratum = (n) x
(p)
• Select the required number of elements from
each stratum with SRS technique.
30
• In the case of Disproportionate STRS
-allocate equal sample size to each stratum
- Determine the number of element to be
selected from each stratum = Sample size (n)
No. of strata (k)
31
The advantage of stratified random sampling is that it
increases the likelihood of representation, especially if
the sample size is small
32
4.Cluster sampling
• It is selection of groups of study units (clusters) instead of
the selection of study units individually
33
Cont’d…
34
Steps in cluster sampling
1. Divide the population into groups or clusters
2. A number of clusters are selected randomly to represent
the total population, and then all units within selected
clusters are included in the sample.
3. No units from non-selected clusters are included in the
sample.
4. Differs from stratified sampling, where some units are
selected from each group.
35
5. Multi-stage sampling
• Similar to the cluster sampling, except that it involves picking
a sample from within each chosen cluster, rather than
including all units in the cluster.
36
Advantages
• No need to have a list of all units in the population.
• Saves a great amount of time and effort
Disadvantages
• More information is needed in this type of sampling, which
may not be available
• Error will be multiplied
• Provide less precise estimation
37
2. Non-probability sampling
38
1. Convenience sampling
• Sometimes known as grab or opportunity sampling
or accidental or haphazard sampling.
• For convenience, the study units that are available
at the time of data collection are selected
• Many clinic-based studies
39
2.Quota sampling
• is done until a specific number of units (quotas) for
different categories of populations have been
selected.
• Similar to stratified but does not involve random
selection
• It is based on the researcher’s judgment
40
3. Purposive sampling
• Often used in qualitative studies( such as those conducting
Focus Group Discussion and In-depth interview )
41
4. Snow ball sampling
• Also called chain referral sampling
42
Errors in sampling
1.Sampling error – Random error
The uncertainty associated with an estimate that is
based on data gathered from a sample of the
population rather than the full population is known
as sampling error.
It is an error arising from the sampling process
itself
Sampling error can be minimized by increasing the
size of the sample.
Can not be avoided or totally eliminated
43
2.Non-sampling error (Bias)
It is a type of systematic error in the design or
conduct of a sampling procedure which results in
distortion of the sample, so that it is no longer
representative of the reference population.
44
45
Sample size determination
46
• A sample size determination is the act of choosing the
number of observations or replicates to include in a
statistical sample.
47
Sample Size Determination
The answer will depend on the aims, nature and scope of the
study and on the expected result. All of which should be carefully
considered at the planning stage.
48
Basic things in sample size
determination
• The more heterogeneous a population is, the larger
the sample needs to be
49
Sample……
n
o If sample (“ ”) is
Large
Increase accuracy
Costy / complex
Take
Optimum
Small sample
o Decrease accuracy
o Less costy
How ?
50
Factors to determine sample size
• Size of population
• Resources: subjects, financial, manpower
• Method of sampling: random, stratified
• Degree of difference to be detected
• Degree of Accuracy (or errors)
- Type I error (alpha) p<0.05
- Type II error (beta) less than 0.2 (20%)
- Power of the test : more than 0.8 (80%)
• Statistical Formulae
• Dropout rate, non-compliance to treatment
51
Steps in Estimating Sample Size
52
Steps in Estimating Sample Size...
5. Adjust for population size
6. Adjust for estimated design effect
7. Adjust for expected response rate
• Tips:
• Consider calculating size for each (specific) objectives!
• Consider the number of variables in your research
53
Sample size for single population mean
54
Maximum acceptable difference (d or w): This is the maximum
amount of error that you are willing to accept.
Desired confidence level (Z/2 ) : is your level of certainty that
the sample mean does not differ from the true population mean by
more than the maximum acceptable difference. Commonly we use
a 95% confidence level.
Then the sample size determination formula for single population
mean is defined by:
z22 2
n
w2
55
Sample Size for Single Population Proportion
56
Then the formula for the sample size of single population proportion is defined
as:
z22 * p (1 p )
n 2
w
Where α = the level of significance which can be obtained as 1- confidence level.
P = best estimate of population proportions
W = maximum acceptable difference
z the value under standard normal table for the given value of confidence level
2
57
Example
58
Example 1
Solution:
Margin of error d= 5%
A confidence level of 95% will give the value of as Zα/2=1.96.
Then using the formula :
2 2
Z P (1 P ) Z
0.05 0.457 (1 0.457 )
n 2
2
2
W 0.05 2
1.96 0.457(0.543)
2
0.05 2
382
59
Some Considerations
60
Sample size for case control
study
61
Sample size in cohort study
62
Incorrect sample size will lead to
o Wrong conclusions
o Waste of resources
o Loss of money
o Ethical problems
o Delay in completion
63
Any question??
64
T hank you
65