4 Sampling
4 Sampling
Sampling Methods
Learning Objectives
3
• A sample is a collection of individuals selected
from a larger population.
Population
• Therefore, it is essential that a sample should
be correctly defined and organized.
8
Sampling…
• Main reasons for sampling instead of doing a
census.
– Economy
– Timeliness
– The large size of many populations
– Inaccessibility of some of the population
– accuracy
9
Steps needed to select a sample and ensure that this
sample will fulfill its goals.
1. Establish the study's objectives
– The first step in planning a useful and efficient survey
is to specify the objectives with as much detail as
possible.
– Clarifying the aims of the survey is critical to its
ultimate success.
– Without objectives, the survey is unlikely to generate
valuable results.
– The initial users and uses of the data should be
identified at this stage.
2. Define the target population
– The target population is the total population
for which the information is required.
– Specifically, the target population is defined
by the following characteristics:
• Nature of data required
• Geographic location
• Reference period
• Other characteristics, such as socio-demographic
characteristics
3. Decide on the data to be collected
– The data requirements of the survey must be
established.
Study subjects
23
Sampling…
24
• While selecting a SAMPLE, there are basic
questions:
– What is the group of people (STUDY
POPULATION) from which we want to draw
a sample?
– How many people do we need in our sample?
Sample
Errors in sampling
1) Sampling error: Errors introduced due to errors
in the selection of a sample.
– They cannot be avoided or totally eliminated.
2) Non-sampling error:
- Observational error
- Respondent error
- Lack of preciseness of definition
- Errors in editing and tabulation of data
Sampling Methods
Two broad divisions:
• Computer programs
Random number table
• It is a table of random numbers constructed by a
process that
1. In any position in the table, each of the
numbers 0 through 9 has a probability
1/10 of occurring.
2. The occurrence of any number in one part of
the table is independent of the occurrence of
any number in any other part of the table.
Random numbers
…. 8094 2525 8247 1347 7433 3620 1897 ….
…. 3563 2198 8211 9045 2618 2751 2627 ….
…. 1330 6331 3753 9693 8738 6815 1538 ….
…. 3565 0016 2243 6432 4796 6095 5283 ….
…. 7850 5925 5588 7311 2192 4545 3530 ….
…. 4490 5417 9727 6153 5901 4878 9980 ….
…. 6545 9104 9318 8819 7537 2785 9373 ….
Example
• Suppose your school has 500 students and you
need to conduct a short survey on the quality of the
food served in the cafeteria.
• You decide that a sample of 10 students should be
sufficient for your purposes.
• In order to get your sample, you assign a number
from 1 to 500 to each student in your school.
• To select the sample, you use a table of
randomly generated numbers.
41
• SRS has certain limitations:
– Requires a sampling frame.
46
Advantages( merit) of SRS Disadvantages(Demerit)
• Simple to compute • Need sampling frame
• No bias • Units may be scattered
• Small variability and poorly accessible
• Heterogeneous population/
important minorities might
not be taken into account
47
Advantages of SRS Disadvantages
• Sampling frame is not a • If there is any sort of cyclic
must pattern in the ordering of
• Less time consuming and the subjects , the sample
easier to perform will not be representative
• Make geographical spread of the population.
certain if the units are in
“geographical” order.
48
Example
• To select a sample of 100 from a population of
400, you would need a sampling interval of
400 ÷ 100 = 4.
• Therefore, K = 4.
• You will need to select one unit out of every
four units to end up with a total of 100 units in
your sample.
• Select a number between 1 and 4 from a table
of random numbers.
• If you choose 3, the third unit on your frame
would be the first unit included in your
sample;
56
Merit Demerit
• The representativeness of • Sampling frame for the
the sample is improved/ entire population has to be
representation of minority prepared separately for
subgroups . each stratum.
• there is difficulty in
reaching all selected in the
sample
57
• Equal allocation:
– Allocate equal sample size to each stratum
• Proportionate allocation:
n
nj Nj
N
– nj is sample size of the jth stratum
– Nj is population size of the jth stratum
– n = n1 + n2 + ...+ nk is the total sample size
– N = N1 + N2 + ...+ Nk is the total population
size
Example: Proportionate Allocation
• Village A B C D Total
• HHs 100 150 120 130 500
• S. size ? ? ? ? 60
4. Cluster sampling
• Sometimes it is too expensive to carry out SRS
– Population may be large and scattered.
– Complete list of the study population unavailable
– Travel costs can become expensive if interviewers have
to survey people from one end of the country to the
other.
• Cluster sampling is the most widely used to
reduce the cost
• The clusters should be homogeneous, unlike
stratified sampling where the strata are
heterogeneous
Steps in cluster sampling
• Cluster sampling divides the population into groups or
clusters.
• A number of clusters are selected randomly to represent
the total population, and then all units within selected
clusters are included in the sample.
• No units from non-selected clusters are included in the
sample—they are represented by those from selected
clusters.
• This differs from stratified sampling, where some units are
selected from each group.
Example
• In a school based study, we assume students of
the same school are homogeneous.
• Cost reduction
Kebele SSU
Sub-Kebele TSU
HH
• In the first stage, large groups or clusters are
identified and selected. These clusters contain
more population units than are needed for the
final sample.