Chapter 2: Sampling Theory
Chapter 2: Sampling Theory
Sampling Theory
1
Sampling Theory: it’s the study of relationships existing between
the population and the samples drawn from the population.
3
There are different concepts related to sampling in sampling theory,
5
Sampling: the process of conducting the selection of a finite number
of elements from the given population of interest for the purposes of
an inquiry/study.
6
Parameter: any measurable characteristic of a population.
Population mean, standard deviation, median, etc.
population to be sampled.
7
In the studies of “the socio-economic status of the households”,
Measurement errors
Cost constraint; taking sample survey has being less costly than a
census.
population
Probability sampling
Non-Random/non-probability sampling
12
Probability Sampling: method of sampling in which all
chance to be selected.
Cluster sampling
15
Procedures needed to select a sample using lottery method are:
Numbered the tickets representing a unit in the population.
The tickets are thoroughly mixed, and the number of tickets
equal to the sample size (for a sample of 200 students, the
researcher would select 200 tickets).
Then, the sample has to be consists all units of the
population corresponding to the selected tickets.
16
Table of Random Numbers: it’s table of the digits 0, 1, 2,…, 9 and each digit
having an equal chance of selection at any draw.
It’s a list of numbers generated by a computer (Software, such as SPSS,
Stata, etc.).
The procedures for generating random numbers are;
The researcher assigns a number to each unit of the population and
constructs the random table.
The investigator randomly selects a starting point, and goes across the
rows or down the columns, and list the numbers as they appear on the
table.
Members of the population with the selected numbers constitute the
sample.
17
It’s possible for a unit's number to be selected more than once.
For instance, the investigator have been asked to perform a
survey in a prison. The list of all 2000 prisoners has been given
to the investigator. Investigator think that a sample of 300 would
be satisfactory for the investigation. If investigator want to
choose 300 of them for interview randomly, investigator can use
a random number generator to generate 300 numbers between 1
and 2000.
o But, most of the time, investigator would have some repeated
numbers that should be replaced by new numbers.
18
Advantage of Simple Random Sampling
Ensures that the sample has being unbiased, in which every
individual and every sample has an advantage of being chosen.
20
Stratified Sampling
• It is applied in heterogeneous population.
The population will be divided in to non-overlapping (having
different character), but exhaustive groups called strata.
Samples will be chosen from each stratum based on simple
random or systematic sampling method.
Elements in the same strata should be homogeneous, while
different in different strata.
The strata are made according to various homogeneous
characteristics, such as gender, race, region, age, religious,
21
Stratified sampling can be;
Proportionate (drawn proportionate number of elements)
Non-proportionate (equal number of elements are drawn from
each stratum).
24
ii) To get the non-proportional sample size, since there are total of
three stratum (sub-groups) and 30 samples to be selected, the three
stratum has equal sample size from the selected sample of 30, in
which it divided by 3, 10 samples have to selected in each stratum.
Therefore, the investigator have to select 10 elements from the
three stratum regardless of the number of elements in each
stratum (10 from each of 4000, 2400 and 1600 equally).
25
Advantages of stratified sampling
Unlike simple random sampling, Minority sub-groups of
interest in the population be represented in the sample.
Dis-advantages of Stratified Sampling
If there are many variables of interest, dividing a large population in
to representative sub-groups requires a great deal of effort.
If variables are some-what complex or ambiguous (such as; beliefs,
attitudes,…etc.), it’s difficult to separate individuals in to the sub-
groups according to the selected variables.
26
Cluster Sampling: clusters are formed in a way that, population has
to be divided in to non-over-lapping groups (clusters).
Elements within a cluster are heterogeneous, but, elements between
the cluster are homogeneous.
It’s useful when generate a simple random sample has been
ambiguous or costly.
o To estimate the average annual household income in a large city, the
All the units in the selected clusters may under the studies or
the investigator may select samples from each cluster.
o If investigator may select samples from each cluster, the
investigator select a sample of clusters, and then select
elements from each cluster.
28
Cluster sampling has the advantages;
representative.
30
For any number between 1 and k, i.e. 1 ≤ j ≤ k, the units
selected first, and then , , … etc. will be selected until the
required sample size will be reached.
Suppose there are 2000 subjects in the population and a sample
size of 50 subjects are needed.
o Then; the sampling interval (k) = = = 40
31
o Lets, 12 to be the first subject selected, then the sample interval
(k) will be added to each subsequent elements, and the sample
would consist of samples whose numbers were 12, 52, 92, 132,
… etc. until 50 subjects (samples) are obtained in the way , , …
etc.
In systematic random sampling, all members of the population
have no equal chance of being selected, it’s not strictly random.
Suppose a researcher wants to know the impact of micro-finance on
the clients' household income. The researcher wishes to select 10
clients out of 250 clients using systematic sampling.
o How the research assistant to select a systematic sample of 10 clients?
32
Systematic sampling has the advantages;
Less time consuming and easier to perform than SRS.
It’s more convenient to use as compared to SRS.
It provides a good approximation to simple random sampling.
33
o The selection of the starting point could produce a sample of all
defective items or non-defective items depending on the even or odd
sample interval (k).
If the starting point has defective item, and added even ‘k’, the
sample will be all defective item.
If the starting point has non-defective item, and added even ‘k’,
the sample will be all non-defective item.
34
Multi-stage Sampling: it has used for large and widely scattered.
target population.
In multi-stage sampling method, the investigator do several
sampling steps (primary sampling unit, secondary sampling
unit, the third sampling unit, … etc.,), and use different suitable
sampling methods in different sampling steps.
o In studying in district level, the primary sampling unit can be
the districts, the secondary sampling unit can be the kebeles, the
third sampling unit will be village, etc.
It uses both probability and non-probability sampling techniques, but the sample
that represent the population has to be selected using probability sampling.
35
Non -probability Sampling
It’s a sampling technique in which the choice of units selected for
a sample depends on the basis of convenience, and personal
interest.
In non-random sampling, each elements of the population has not
a chance of being included in the sample.
The process of sample selection involves at least some degree of
personal subjectivity.
Units included in the sample have been selected at the discretion
of the researcher, and samples derive from the judgment of the
researcher.
36
In non-probability sampling technique, sampling frame (list of
all elements in the population) doesn’t necessary.
In non-probability sampling technique, there’s non-random
selection of the sample (unrepresentative to the population).
It’s inappropriate if the investigation aims to measure variables
and generalize the findings about the population on the basis of
the sample.
It’s easier, quicker and cheaper to carryout than probability
sampling designs.
37
Dis-advantages of Non-probability Sampling Technique
Non-probability sampling has the dis-advantages are;
38
Advantages of Non-probability Sampling Technique
Non-probability sampling has the advantages are;
Much less complicated, less expensive.
Very convenient
More adequate for vast investigation,
Undesirable to generalize the findings beyond the sample.
39
There are four non- probability sampling methods.
Convenient
Ineffective
42
Problems in Sampling
Problems in sampling may non-sampling errors or sampling
errors.
45
Sampling Error
Sampling errors may occurred due to;
48
Sample Size Estimation using Yamane and Cochran and
Krejcie and Morgan and Green Formulas
1. Taro Yamane (1973) if population is known
49
Sample Size Estimation using Yamane and Cochran and
Krejcie and Morgan and Green Formulas
2. Krejcie & Morgan Formula (Krejcie & Morgan, 1970)
• If the population size is known the sample size n is:
50
Cont’d
3. Cochran Formula (Cochran, 3.2 If the population size is
1977) unknown and the population
3.1 If the population size is proportion is unknown
unknown but a lot, the population
proportion is know Example: from the above example