Methodology of Sampling
Methodology of Sampling
METHODS L-2
Prof Dr Najlaa Fawzi
1
POPULATION:
The largest collection of anything, if this
collection has limits, this is finite population,
and if not, this is infinite population.
It can be
A-Population of entities: is the largest collection
of entities in which we have an interest at
a particular time (e.g. population of humans);
each population member has many variables.
2
B-Population of values: it is the largest collection
3
4
SAMPLING
A sample is “a smaller (but hopefully
representative) collection of units from a
population used to determine truths about
that population” Why sample?
Resources (time, money) and workload
Gives results with known accuracy that can
be calculated mathematically
7
SAMPLING
8
Define the Population
9
Process
The sampling process comprises several
stages:
Defining the population of concern
Specifying a sampling frame, a set of items
or events possible to measure
Specifying a sampling method for selecting
items or events from the frame
Determining the sample size
Implementing the sampling plan
Sampling and data collecting
Reviewing the sampling process 10
Population definition
A population can be defined as including
all people or items with the characteristic
one wishes to understand.
11
Population definition
Note also that the population from which the sample
is drawn may not be the same as the population about
which we actually want information. Often there is
large but not complete overlap between these two
groups due to frame issues etc.
12
SAMPLING FRAME
The sampling frame is the list from which the
potential respondents are drawn
Registrar’s office
Class lists
Must assess sampling frame errors
13
A sampling frame has the property that we
can identify every single element and include
any in our sample .
14
Types of sampling methods: they are probability
methods and non-probability methods, the
15
Types of Samples
Probability (Random) Samples
Simple random sample
Systematic random sample
Stratified random sample
Multistage sample
Multiphase sample
Cluster sample
Non-Probability Samples
Convenience sample
Purposive sample
Quota
16
Snow ball sample
PROBABILITY SAMPLING
18
PROBABILITY SAMPLING
19
Stages in random sampling:
Randomly Systematically
Assign select the select random
Define Develop
each required numbers until
sampling
population frame unit a amount of it meets the
number random sample size
numbers requirements
Disadvantages
It needs complete list of study population ,
which is often difficult to obtain.
22
REPLACEMENT OF SELECTED UNITS
* Computer software.
24
SYSTEMATIC SAMPLING
25
Systematic sampling involves a random start
and then proceeds with the selection of every
kth element from then onwards.
26
Systematic Random Sampling
–– Based on sampling fraction: Every Kth unit is chosen
in the population list,
where K is chosen by sampling interval
–– Sampling Interval (K) Q = Total no. of units in
population/ Total no. of units in
sample
–– Applicable for large, non-homogenous populations
where complete list of individuals is available
28
As described above, systematic sampling is an EPS
method, because all elements have the same
probability of selection (in the example given, one
in ten). It is not 'simple random sampling' because
different subsets of the same size have different
selection probabilities - e.g. the set
{4,14,24,...,994} has a one-in-ten probability of
selection, but the set {4,13,24,34,...} has zero
probability of selection.
29
ADVANTAGES:
Sample easy to select
Suitable sampling frame can be identified easily
Sample evenly spread over entire reference
population
Time and labour for sample collection is
relatively small.
DISADVANTAGES:
Sample may be biased if hidden periodicity in
population coincides with that of selection.
Difficult to assess precision of estimate from
one survey.
30
STRATIFIED SAMPLING
31
Every unit in a stratum has same chance of
being selected. This method gives more
representative sampling than simple random
sampling in a given large population.
32
Stratified Random Sampling
–Non-homogenous population is converted to homogenous
groups/classes
(strata); sample is drawn from each strata at random,
in proportion to its size
–Applicable for large non-homogenous population
–– Gives more representative sample than simple random
sampling
–None of the categories is under or over-represented
–– For example, In a population of 1000, sample of 100
is to be drawn for
Hemoglobin estimation; first convert non-homogenous
population is converted
to homogenous strata (i.e. 700 males and 300 females),
then draw 70 males and
30 females randomly respectively
33
Determine Systematic
Develop
the sampling
sampling
proportion of methods can
Define frame
each then be
population according to
population followed to
characteristi
variable of select sample
cs required
interest unit
34
Using same sampling fraction for all strata
ensures proportionate representation in the
sample.
35
Finally, since each stratum is treated as an
independent population, different sampling
approaches can be applied to different
strata.
36
Types of Stratified Samples
Proportional Stratified Sample:
The number of sampling units drawn
from each stratum is in proportion to
the relative population size of that
stratum
Disproportional Stratified Sample:
The number of sampling units drawn
from each stratum is allocated according
to analytical considerations e.g. as
variability increases sample size of
stratum should increase 37
Optimal allocation stratified sample
The number of sampling units drawn
from each stratum is determined on
the basis of both size and variation.
Calculated statistically
38
Advantage
It is more representative
It gives estimates with increased precision
As the population is more concentrated, the
time and money will be saved.
Disadvantage
Requires accurate information on proportions
of each stratum
It is very difficult task to divide the
population into homogenous strata .
Stratified lists costly to prepare. This may
require considerable time , money and
statistical expertise. 39
Disadvantages to using stratified sampling.
First, sampling frame of entire population has
to be prepared separately for each stratum
Second, when examining multiple criteria,
stratifying variables may be related to some,
but not to others, further complicating the
design, and potentially reducing the utility of
the strata.
Finally, in some cases (such as designs with a
large number of strata, or those with a
specified minimum sample size per group),
stratified sampling can potentially require a
larger sample than would other methods
40
Select a stratified random sample of 20 patients
from 200 patients.
Disease Disease Disease Disease TOTAL
A B C D
No. of 100 60 20 20 200
patients
% 50 30 10 10 100
43
Two types of cluster sampling methods
One-stage sampling. All of the elements within
selected clusters are included in the sample.
44
45
Advantages :
Cuts down on the cost of preparing a sampling
frame.
This can reduce travel and other administrative
costs.
Requires list of all clusters, but only of individuals
within chosen clusters
Can estimate characteristics of both cluster and
population
46
Disadvantages:
Sampling error is higher for a simple random
sample of same size. Clusters cannot be
compared with each other
49
MULTISTAGE SAMPLING
Complex form of cluster sampling in which two or
more levels of units are embedded one in the
other.
51
Multistage Random Sampling
–Is done in successive stages; each successive
sampling unit is nested in the previous sampling
unit. For example, in large country surveys,
states are chosen, then districts, then
villages, then every 10th person in village as
final sampling unit.
52
Advantage:
Introduces flexibility in sampling. This
method is very helpful in many large scale
surveys where population list preparation is
difficult.
It is less expensive and less time consuming.
It permits available resources to be
concentrated on
limited numbers of units of the frame.
Disadvantages
Sampling error is usually increased.
Sampling units will be of unequal size at various
stages resulting in analytical difficulties.
53
MULTI PHASE SAMPLING
• Is done in successive phases
Part of the information collected from whole
sample & part from subsample.
55
MATCHED RANDOM SAMPLING
57
Uses of probability sampling
There are multiple uses of probability sampling.
They are:
•Reduce Sample Bias: Using the probability
sampling method, the bias in the sample derived
from a population is unimportant to non-existent.
The selection of the sample mainly describes the
understanding and the inference of the
researcher. Probability sampling leads to higher
quality data collection as the sample
appropriately represents the population.
58
•Diverse Population: When the population is infinite
and different, it is essential to have adequate
representation so that the data is not tilted towards
one demographic. For example, if Square would like
to understand the people that could make their point-
of-sale devices, a survey conducted from a sample of
people across the US from different industries and
socio-economic backgrounds helps.
59
Example: Identify each of the following examples as
(qualitative) or numerical (quantitative) variables.
60
Example: Identify each of the following as examples of (1)
nominal, (2) ordinal, (3) discrete, or (4) continuous
variables:
61
END OF PART ONE
62