Unit 10
Unit 10
Structure
10.0 Objectives
10.1 Introduction
10.2 Importance of Sampling
10.3 Types of Sampling Techniques
10.4 Probability based Sampling Techniques
10.4.1 Simple Random Sampling
10.4.2 Systematic Sampling
10.4.3 Stratified Sampling
10.4.4 Cluster Sampling
10.4.5 Multistage Sampling
10.5 Non-Probability based Sampling Techniques
10.5.1 Convenience Sampling
10.5.2 Purposive Sampling
10.5.3 Snowball Sampling
10.5.4 Self-Selection Sampling
10.6 Sample Size Determination
10.6.1 Determining Sample Size for Questions Involving Means
10.6.2 Determining Sample Size for Proportions
10.7 Sampling and Non-Sampling Errors
10.7.1 Random Sampling Errors
10.7.2 Non-Sampling Errors
10.7.3 Control of Errors
10.8 Let Us Sum Up
10.9 Key Words
10.10 References/Suggested Readings
10.11 Answers to Check Your Progress Exercises
10.12 Unit End Questions
1
10.0 OBJECTIVES
After studying the unit the student should be able to:
explain the concept and importance of sampling;
distinguish between probability-based and non-probability based sampling
techniques;
decide on the use of appropriate sampling techniques for research;
determine suitable sample size for study; and
identify different types of sampling errors.
10.1 INTRODUCTION
To begin with this Unit, it is important to understand a few terms that we would be
using quite often during our discussion. These terms are:
i. Population
ii. Census
iii. Sampling
iv. Sampling element
A population is the aggregate of all the elements that share some common set of
characteristics and that comprise the universe for the purpose of the research problem.
Any research study aims to obtain information about the characteristics or parameters
of a population.
The collection of data from the entire population is referred to as a census study. A
census involves a complete listing of all the elements of a population. Collecting data
from the aggregate of all the elements (population) in case, the number of elements is
larger would sometimes render the researcher to incur huge costs and time.
A sample is a subset of a population and the process of selecting units (e.g., people,
organizations) from a population is referred to as sampling. In most cases, it is difficult
in terms of the high cost and time involved in collecting the information from the entire
population. With sampling, the researchers are able to collect the information from a
portion of the population, which otherwise was not possible due to the large size of the
population and other constraints like time and cost. Based on the sample findings, the
parameters for the population are inferred.
The results of the sample studies may be generalized for the population however care is
to be taken so that sampling errors are kept to the minimum. The concept of sampling
and population is shown in Fig 4.1.
2
Fig. 4.1: Population, Sample, and sample elements
3
collect the data, you can afford higher-quality staff. You also can devote more
time to obtaining data from difficult-to-reach cases.
a) Data collection
b) inclusion of every member of the defined target population into the research
study
e) a person or object from the defined target population from which data is
sought
4
d) All of the above are reasons for sampling
5
10.4.1 Simple Random Sampling
Simple random sampling is, in a sense, the prototype of population sampling. Suppose
that a teacher wants to select a sample of 25 students from a class size of 100 students.
The procedure would be very simple. The teacher will randomly select 25 students for
the study. Here every student would be having an equal chance of being selected during
the selection procedure. The scientific basis of drawing a sample based on random
numbers may also be used.
To summarise, a random sample is defined as follows:
Selections are made from a specified and defined population (i.e., the sampling
frame is known).
Each unit is selected with a known probability so that every unit in the
population has an equal chance of being selected.
6
is used for a variety of purposes, particularly for large samples. For example, you could
group your data by type of manufacturing firm or geographical area.
This technique can be detailed in the following steps:
i. Determine and select the cluster grouping for your sampling frame.
ii. Number each of the clusters with a unique number. The first cluster is
numbered 0, the second 1, and so on.
iii. Select your sample using some form of random sampling as discussed earlier.
iv. Selecting clusters randomly makes cluster sampling a probability sampling
technique.
The teacher may form four clusters in the class based on the residential area of a city as
students residing in the north, south, west, and eastern part of the city and then select 5
students randomly from each cluster for the study. This is called a two-stage clustering
technique.
7
10.5.2 Purposive Sampling
A non-probability sample that conforms to certain criteria is called purposive sampling.
There are two major types within this type of sampling:
Judgement sampling and
Quota sampling
i. Judgement Sampling: The sample elements are chosen based on the judgement
that prevails in the researcher’s mind about the prospective individual. The
researcher may use his wisdom to conclude that a particular individual may be
representative of the population in which one is interested.
The distinguishing feature of judgment sampling is that the population elements are
purposively selected. Again, the selection is not based on whether samples are
representative or not, but rather because they can offer the contributions sought. For
example, companies often try out new product ideas for their employees. The
rationale is that one would expect the firm’s employees to be more favourably
disposed towards a new product idea than the public. If the product does not pass
this group, it does not have a prospect of success in the general market.
ii. Quota Sampling: This is often used to improve the representativeness of the study.
Quota sampling is a special case of a two-staged judgemental sampling technique,
wherein the first stage consists of developing quotas based on the different
categories existing in the population. The characteristics may include age, gender
and income level, etc. that are identified on the basis of the researcher’s judgement.
Further, the researcher will set quotas for the identified groups. For example, the
researcher may determine the categories of gender and further decides to study 40%
of men and 60% of women in a population.
In the second stage of the sampling, the researcher will select 40% of men and 60%
of women based on convenience or judgement as discussed in the convenience
sampling.
8
i. Give wide publicity to your sampling requirements. This may be done by
advertising through appropriate media or by asking them to take part by sending
emails or other platforms.
ii. Distribute the questionnaires to those who respond and collect the data.
Sudha’s research was concerned with the amount of time spent browsing Facebook
profiles. Sudha decided to administer her questionnaire using the Internet and therefore
prepared her questionnaire using Google Docs and posted the link on her Facebook
page with a request to fill the same. She publicized her research on a range of other
bulletin boards, asking for volunteers to fill in a questionnaire. Those who volunteered
by clicking on a hyperlink were automatically taken to her online questionnaire.
9
a) Elements within a stratum should be as homogeneous as possible.
5) The purpose of stratified random sampling is to make certain that
a) Every member of the population has an equal chance of being selected for
the sample.
b) The sample proportionately represents individuals from different categories
of the population.
c) The participants chosen for the study are the ones most likely to react to the
treatment.
d) The sample is more representative of the target population than the
accessible population.
6) In which of the following situations would stratified sampling be
recommended?
a) The population has more males than females, and the researcher is studying
a topic related to gender.
b) There are four distinct ethnic groups in New Orleans, and the researcher
wants to be assured each group is represented equally in the sample.
c) A superintendent wants to be sure she selects teachers from elementary,
middle/junior high, and high schools in numbers that are equal to the
proportions of teachers in these types of schools in her district.
d) All of these.
7) Cluster sampling is often preferred to other random sampling strategies because
it:
a) Is more complicated than simple random sampling.
b) Allows you to easily identify subgroups in the population.
c) Equally represents all groups in the population.
d) Can be used when a sampling frame is not available.
8) In the cluster sampling method
a) Every unit in the cluster is chosen to be part of the sample if the cluster they
belong to has been selected.
b) The chance of a unit being chosen to be part of the sample depends on the
size of the cluster.
c) Every unit closely resembles another unit within the same cluster.
d) None of the above
9) What is ‘multi-stage’ sampling?
a) This is a form of random sampling in which a clustered group is used as
representative of the population, thus enabling the researcher to drill right
down to individual data sources within the cluster.
b) Multi-stage sampling is a kind of stratified sampling in which the samples
are investigated at different stages of the processes that are being critically
reviewed.
10
c) Multi-stage sampling is a sampling that takes place in stages over a period
of time.
d) All the above
10) A convenience sample is best described as
a) Participants who are students
b) Participants the experimenter calls or polls on the street
c) Participants who the experimenter has easy access to
d) Experimenters who the participants have easy access to
11) Under which circumstances would judgemental sampling be appropriate?
a) Selection of households in a geographic area
b) Test markets to determine the potential of a new product
c) People on the street interviews
d) Selection of a sample of students to evaluate their experience at the
university
12) Which of the following is NOT a characteristic of quota sampling?
a) The random selection of units makes it possible to calculate the standard
error.
b) It is a relatively fast and cheap way of finding out about public opinions.
c) Those who are available to be surveyed in public places are unlikely to be a
representative sample.
d) The researcher chooses who to approach and so might bias the sample.
e) The sample is subdivided into groups from which the researcher must obtain
a certain number of responses.
13) Quota sampling can be based on:
a) Observation or questioning.
b) A sample frame and sampling units.
c) Group membership or participation.
d) Interviewer discretion or judgment.
14) Snowball sampling can help the researcher to:
a) Overcome the problem of not having an accessible sampling frame.
b) Collect data cost-effectively.
c) Theorise inductively in a qualitative study.
d) Access difficult or hidden populations.
e) All of the above.
15) Which of the following is not a benefit of snowball sampling?
a) It can be used when there is difficulty in creating a sampling frame.
b) It is always representative of the population.
11
c) It can be used within a qualitative research strategy.
d) It can be used to reflect relationships between people by tracing
connections.
Activity 1:
1. Suppose that a researcher is going to conduct a study to determine the
characteristics of graduate school that are most stressful to students. Identify
two variables that a researcher might use to define strata before selecting a
sample of graduate students for the study.
4. Make a proposal for understanding how the Farmers rate the ongoing
12
Government schemes and programmes. Clearly give the objectives for your
study and the sampling plan.
13
A rule of thumb for estimating the value of the standard deviation is to expect it to be
about one-sixth of the range. The range is the difference between the largest and the
smallest observation value. That is if a researcher conducting a study on mobile phone
purchase, expected the price paid to range from Rs.1000 to Rs. 7000, a rule-of-thumb
estimate for the standard deviation would be Rs. 1000 ((Rs. 7000-Rs.1000)/6).
ii. Determine a confidence level
A confidence level is a percentage or decimal value that signifies how confident a
researcher can be about being correct; it states the long-run percentage of confidence
intervals that will include the true population means. A 95% confidence is assumed for
this example.
iii. Decide on the Magnitude of the error
The range of values above and below the sample statistic is called the margin of error.
In simple terms, it is the expected difference between the population finding and the
finding you get from your sample study.
iv. Determine the Sample Size
To estimate the mean of a particular population, the formula for sample size is:
𝑍𝑆 2
𝑛=( )
𝐸
Where,
Z = standardized value obtained from the normal distribution corresponding
to the selected confidence level.
S = sample standard deviation
E = acceptable positive/negative magnitude of the error.
Suppose a business researcher studying expenditures on mobile phones wishes to have
a 95 percent confidence level (Z = 1.96, from the normal distribution table) and a range
of error (E) of less than Rs. 200. If the estimate of the standard deviation is Rs. 2900
(known from similar studies in past), the sample size can be calculated as follows:
𝑍𝑆 2
𝑛=( )
𝐸
(1.96) (2900) 2
=( ) = 28.422 = 808
200
Hence the sample size required for the study is 808.
If a range of error (E) of Rs. 400 is acceptable, then the necessary sample size will be
reduced:
𝑍𝑆 2
𝑛=( )
𝐸
14
(1.96) (2900) 2
= ( ) = 14.212 = 202
400
You can observe that increase in the value of error provides a less sample size.
15
standard deviation to be around Rs. 100. What would be the required sample
size?
............................................................................................................................
................................................................................................................................
................................................................................................................................
................................................................................................................................
16
population is heterogeneous in nature. Intuitively, we know that the larger the sample
the more accurate the research. In fact, the sampling error varies with samples of
different sizes. Increasing the sample size decreases the sampling error.
17
10.7.3 Control of Errors
It is not possible to eliminate completely the sources of errors but we can make efforts
to minimize these sources of errors as much as possible. Some of the steps we should
use are:
a. Designing and executing a good questionnaire
b. Selection of appropriate sampling method
c. Adequate sample size
d. Employing trained investigators to collect the data; and
e. Avoid errors in editing, coding, and entering the data into the computer.
1) A researcher mailed surveys to 100,000 people but only 1,000 responded, resulting
in a response rate of 1%. This study is threatened by _____.
a) Nonresponse error
b) Response error
c) Sampling error
d) Measurement error
e) Coverage error
2) Which of the following is true regarding non-sampling errors?
a) They occur due to the normal variation in data that occurs as a result of each
unique sample.
b) Like sampling error, a non-sampling error can also be measured.
c) Nonsampling errors may not be measured but appropriate steps may be taken
to limit their occurrence.
d) Nonsampling errors may be calculated to four decimal places.
e) Nonsampling errors may be calculated except in cases where interviewers
cheat.
3. As the sample size increases, the sampling error:
a) Increases
b) Decreases
18
c) Remain Constant
d) None of the above
4. The sampling errors arise due to:
a) The investigator’s bias
b) The data processing problem
c) The respondent’s bias
d) The sampling method applied
5. Which of the following is true?
a) The sampling error is unrelated to the sample size.
b) The sampling error becomes progressively larger with the increasing sample
size.
c) The sampling error becomes progressively smaller with the increasing sample
size.
d) The sampling error arises due to administrative errors.
19
Judgement Sampling : In judgement sampling, the researcher or some other
“expert” uses his/her judgement in selecting the units
from the population for study based on the population’s
parameters.
Multistage Sampling : Multistage Sampling is a sampling strategy used when
conducting studies involving a very large population. The
entire population is divided into naturally-occurring
clusters and sub-clusters, from which the researcher
randomly selects the sample.
Purposive Sampling : Purposive sampling technique is a type of non-probability
sampling where the researcher consciously selects
particular elements or subjects for addition in a study to
make sure that the elements will have certain
characteristics pertinent to the study.
Quota Sampling : Quota sampling is a method for selecting survey
participants. In quota sampling, a population is first
segmented into mutually exclusive sub-groups, just as in
stratified sampling.
Simpl Random Sampling : A simple random sample is a sample of size n drawn
from a population of size N in such a way that every
possible sample of size n has the same chance of being
selected.
Sampling Error : Sampling Error is an error that occurs when using
samples to make inferences about the populations from
which they are drawn.
Sampling : Statistical method of obtaining representative data or
observations from a group.
Sampling Frame : A list of sampling units.
Self-Selection Sampling : Self-selection sampling is a type of non-probability
sampling technique. Non-probability sampling focuses on
sampling techniques that are based on the judgement of
the researcher.
Snow Ball Sampling : Snowball sampling is a non-probability sampling
technique that is used by researchers to identify potential
subjects in studies where subjects are hard to locate.
Stratified Sampling : A stratified sample is a probability sampling technique in
which the researcher divides the entire target population
into different subgroups or strata, and then randomly
selects the final subjects proportionally from the different
strata.
20
Systematic Sampling : A method of choosing a random sample from among a
larger population.
21
Check Your Progress Exercise 2
1. True
2. a
3. c
4. b
5. b
6. d
7. d
8. a
9. a
10. c
11. b
12. a
13. c
14. e
15. b
22
10.12 UNIT END QUESTIONS
1. Define the following terms
i. Population
ii. Sampling
iii. Census
iv. Sampling element
2. Discuss the importance of sampling.
3. What are the limitations to conduct a census survey?
4. How is random sampling done? Discuss the procedure.
5. Distinguish between Probability and Non- Probability-based Sampling
Techniques.
6. Discuss the different types of Sampling Techniques.
7. Despite the researcher’s judgment, judgment sampling is still a type of non-
probability sampling? Elaborate.
8. One of the strengths of snowball sampling is that it leads to a very
heterogeneous sample thus reducing bias. Discuss.
9. A self-selecting sample is usually highly representative of the target population.
Elaborate.
10. What do you mean by sample size determination?
11. How do determine the sample size for means?
12. How do determine the sample size for proportion?
13. Differentiate between sampling and non-sampling errors.
23