Unit-9 Sampling Methods
Unit-9 Sampling Methods
Objectives
On successful completion of this unit, you should be able to:
• appreciate why sampling is so common in managerial situations
• identify the potential sampling errors
• list the various sampling methods with their strengths and weaknesses
• distinguish between probability and non-probability sampling
• know when to use the proportional or the disproportional stratified
sampling
• understand the role of multi-stage and multi-phase sampling in large
sampling studies
• appreciate why and how non-probability sampling is used in spite of its
theoretical weaknesses
• recognise the factors which affect the sample size decision.
Structure
9.1 Introduction
9.2 Why Sampling?
9.3 Types of Sampling
9.4 Probability Sampling Methods
9.5 Non-Probability Sampling Methods
9.6 The Sample Size
9.7 Summary
9.8 Self-assessment Exercises
9.9 Further Readings
9.1 INTRODUCTION
Let us take a look at the following five situations to find out the common
features among them, if any:
1) An inspector from the Weights &Measures department of the government
goes to a unit manufacturing vanaspati. He picks up a small number of
packed containers from the day's production, pours out the contents from
each of these selected containers and weighs them individually to
determine if the manufacturing unit is packing enough vanaspati in its
containers to conform to what is claimed as the net weight in the label.
2) The personnel department of a large bank wants to measure the level of
employee motivation and morale so that it can initiate appropriate
measures to help improve the same. It administers a questionnaire to
about 250 employees from different branches and offices all over India
selected from a total of about 30,000 employees and analysis the 149
Sampling and information contained in these 250 filled-in questionnaires to assess the
Sampling
Distributions
morale and motivation levels of all employees.
3) The product development department of a consumer products company
has developed a "new improved" version of its talcum powder. Before
launching the new product, the marketing department gives a container of
the old version first and after a week, a container of the new version to a
group of 400 consumers and gets the feedback of these consumers on
various attributes of the products. These consumer responses will form
the basis for assessing the consumer perception of the new talcum powder
as compared to the old talcum powder.
4) The quality control department of a company manufacturing fluorescent
tubes checks the life of its products by picking up 15 of its tubes at
random and letting them burn till each one of them fuses. The life of all
its products is assessed based on the performance of these 15 tubes.
5) An industrial engineer takes 100 rounds of the shop floor over a period of
six clays and based on these 100 observations, assesses the machine
utilisation on the shop floor.
What is Sampling
On the face of it, there is little that is common among the five situations
described above. Each one refers to a different functional area and the nature
of the problem also is quite different from one situation to another. However,
on closer observation, it appears that in all these situations one is interested in
measuring some attribute of a large or infinite group of elements by studying
only a part of that group. This process of inferring something about a large
group of elements by studying only a part of it, is referred to as sampling.
Most of us use sampling in our daily life, e.g. when we go to buy provisions
from a grocery. We might sample a few grains of rice or wheat to infer the
quality of a whole bag of it. In this unit we shall study why sampling works
and the various methods of sampling available so that we can make the
process of sampling more efficient.
Some Basic Concepts
We shall refer to the collection of all elements about which some inference is
to be made as the population. For example, in situation (ii) above,, the
population is the set of 30,000 employees working in the bank and in
situation (iii), the population comprises of all the consumers of talcum
powder in the country.
We are basically interested in measuring some characteristics of the
population. This could be the average life of a fluorescent tube, the
percentage of consumers of talcum powder who prefer the "new improved"
talcum powder to the old one or the percentage of time a machine is being
used as in situation (v) above. Any characteristic of a population will be
referred to as a parameter of the population.
In sampling, some population parameter is inferred by studying only a part of
the population. We shall refer to the part of the population that has been
150
chosen as a sample. Sampling, therefore, refers to the process of choosing a Sampling Methods
sample from the population so that some inference about the population can
be made by studying the sample. For example, the sample in situation (ii)
consists of the 250 employees from different branches and offices of the
bank.
Any characteristic of a sample is called a statistic. For example, the mean life
of the sample of 15 tubes in situation (iv) above is a sample statistic.
Conventionally, population parameters are denoted by Greek or capital letters
and sample statistics by lower case Roman letters. There can be exceptions to
this form of notation, e.g. population proportions is usually denoted by p and
the sample proportion by p.
Figure I shows the concept of a population and a sample in the form of the
Venn diagram, where the population is shown as the universal set and a
sample is shown as a true subset of the population. The characteristics of a
population and a sample and some symbols for these are presented in Table
1.
Figure I: Population and Sample
POPULATION SAMPLE
Characteristic Parameter Statistic
Symbols Population size = N Sample size = n
Population mean = � Sample mean = �
Population s.d. = � Sample s.d. = s
Population proportion = P Sample proportion = p
Sampling is not the only process available for making inferences about a
population. For small populations, it may be feasible and practical, and
sometimes desirable to examine every member of the population e.g. for
inspection of some aircraft, components. This process is referred to as census
or complete enumeration of the population.
151
Sampling and
Sampling
9.2 WHY SAMPLING?
Distributions
In the example situations given in section 9.1 above, the reasons for resorting
to sampling should be very clear. We give below the various reasons which
make sampling a desirable, and in many cases, the only course open for
making an inference about a population.
Time taken for the Study
Inferring from a sample can be much faster than from a complete
enumeration of the population because fewer elements are being studied. In
situation (iii) above in section 9.1, a complete enumeration of all consumers,
even if feasible, would perhaps take so much time that it is unacceptable for
product launch decisions.
Cost involved for the Study
Sampling also helps in substantial cost reductions as compared to censuses
and as we shall see later in this unit, a better sample design could reduce the
cost of the study further. In many cases, like in situation (ii) above in section
9.1, it may be too costly, although feasible, to contact all the employees in the
bank and get information from them.
Physical Impossibility of Complete Enumeration
In many situations the element being studied gets destroyed while being
tested. The fluorescent tubes in situation (iv) of section 9.1, which are chosen
for testing their lives, get destroyed while being tested. In such cases, a
complete enumeration is impossible as there would be no population left after
such an enumeration.
Practical Infeasibility of Complete Enumeration
Quite often it is practically infeasible to do a complete enumeration due to
many practical difficulties. For example, in situation (iii) of section 9.1, it
would be infeasible to collect information from all the consumers of talcum
powder in India. Some consumers would have moved from one place to
another during the period of study, some others would have stopped
consuming talcum powder just before the period of study whereas some
others would have been users of talcum powder during the period of study
but would have stopped using it some time later. In such situations, although
it is theoretically possible to do a complete enumeration, it is practically
infeasible to do so.
Enough Reliability of Inference based on Sampling
In many eases, sampling provides adequate information so that not much
additional reliability can be gained with complete enumeration in spite of
spending large amounts of additional money and time. It is also possible to
quantify the magnitude of possible error on using; some types of sampling as
will be explained later.
152
Quality of Data Collected Sampling Methods
For large populations, complete enumeration also suffers from the possibility
of spurious or unreliable data collected by the enumerators. On the other
hand, there is greater confidence on the purity of the data collected in
sampling as there can be better interviewing, better training and supervision
of enumerators, better analysis of missing data and so on.
Activity A
When would you prefer complete enumeration to sampling?
………………………………………………………………………………....
………………………………………………………………………………....
………………………………………………………………………………....
………………………………………………………………………………....
………………………………………………………………………………....
………………………………………………………………………………....
Activity B
Name two decisions in each of the following functional areas, where
sampling can be of use:
Functional Area Decision
Manufacturing 1) Inspection of components
2)
Personnel 1)
2)
Marketing । 1)
2)
Finance 1)
2)
154
Simple random sampling is a process which ensures that each of the samples Sampling Methods
156
We can also use a table of random numbers to pick up a simple random Sampling Methods
158 …………………………………………………………………………………
Systematic Sampling Sampling Methods
160
Proportional stratified sampling: After defining the strata, a simple random Sampling Methods
sample is picked up from each of the strata. If we want to have a total sample
of size 100, this number is allocated to the different strata-either in proportion
to the size of the stratum in the population or otherwise.
If the different strata have similar variances of the characteristic being
measured, then the statistical efficiency will be the highest if the sample sizes
for different strata are in the same proportion as the size of the respective
stratum in the population. Such a design is called proportional stratified
sampling and is shown in Table 4 below.
If we want to pick up a proportional stratified sample of size n from a
population of size N, which has been stratified to p different strata with sizes
N�, N� , … … … … . N� respectively, then the sample sizes for different strata,
viz n� , n� , … … n� will be given by
�� �� �� �
= =⋯= =
�� �� �� �
The strata and the samples from each stratum are shown in the form of a
Venn diagram in Figure III below, where S�, S2 etc. refer to the stratum
number 1 and stratum number 2 etc. respectively.
Figure III: Stratified Sampling
where the other symbols have the same meaning as in the previous example.
Suppose the variances of the characteristic we are measuring were different
for each of the three strata of the earlier example and were actually as shown
in Table 5. If the total sample size was still restricted to 50, the statistically
optimal allocation would be as given in Table 5 and one can compare this
Table with Table 4 above to find that the sampling ratio would fall for
Stratum-3 as the variance is smaller here and would go up for Stratum-2
where the variance is larger.
Table 5: Disproportional Stratified Sampling
IV below.
Figure IV: Blocks in a residential colony
…………………………………………………………………………………
9.7 SUMMARY
In this unit we have looked at various sampling methods available when one
wants to make some inferences about a population without enumerating it
166 completely. We started by looking at some situations where sampling was
being done and then found that in many situations sampling may be the only Sampling Methods
169