Module 4
Module 4
INTRODUCTION
The research work cannot be undertaken without use of sampling. All items in any field of
inquiry constitute a ‘Universe’ or ‘Population.’ A complete enumeration of all items in the
‘population’ is known as a census inquiry. Besides, this type of inquiry involves a great deal
of time, money and energy. Therefore, when the field of inquiry is large, the study of the total
population is not possible and it is also impracticable. The practical limitation: cost, time and
other factors in the way of studying the total population. Further, many a time it is not
possible to examine every item in the population, and sometimes it is possible to obtain
sufficiently accurate results by studying only a part of total population. In such cases there is
no utility of census surveys.
The selected respondents constitute what is technically called a ‘sample’ and the
selection process is called ‘sampling technique.’ The survey so conducted is known as
‘sample survey’.
Definition of sampling
W.G. Cocharn defined the term sampling
“In every branch of science we lack the resources, to study more than a fragment of the phenomena that might advance our
knowledge.”
“In the social sciences, it is not possible to collect data from every respondent relevant to our study but only from some fractional
part of the respondents. The process of selecting the fractional part is called sampling.”– David S. Fox
‘Sampling design’ means the joint procedure of selection and estimation. Sampling should
be such that error of estimation is minimum.
SAMPLE DESIGN
“A sample design is a definite plan for obtaining a sample from a given population”. It refers to the technique or the
procedure the researcher would adopt in selecting items for the sample. Sample design may as
well lay down the number of items to
be included in the sample i.e., the size of the sample. Sample design is determined before data
are collected.
Unrestricted sampling
Probability sampling Non-probability sampling
Restricted sampling Simple random sampling Haphazard sampling or
convenience sampling
Complex random sampling Purposive sampling (such as
(such as cluster sampling, quota sampling, judgement
systematic sampling sampling)
stratified sampling etc.)
Non-probability sampling
Characteristics of Non-probability Sampling
The following are the main characteristics of nonprobability sample:
1. There is no idea of population in non-probability sampling.
2. There is no probability of selecting any individual.
3. Non-probability sample has free distribution.
4. The observations of non-probability sample are not used for generalization
purpose.
5. Non-parametric or non-inferential statistics are used in non probability sample.
6. There is no risk for drawing conclusions from non-probability sample.
Types of Non-probability Sample
There are the following four types of non-probability sample:
1. Incidental or accidental sample.
2. Purposive sample.
3. Quota sample.
4. Judgement sample.
1. Incidental or Accidental Assignment
The term incidental or accidental applied to those samples that are taken because they
are most frequently available, i.e. this refers to groups which are used as samples of a
population because they are readily available or because the researcher is unable to employ
more acceptable sampling methods.
Advantages
a. It is very easy method of sampling.
b. It is frequently used in behavioural sciences.
c. It reduces the time, money and energy i.e. it is an economical method.
Disadvantages
a. It is not a representative of the population.
b. It is not free from error.
c. Parametric statistics cannot be used.
2. Purposive Sampling
The purposive sampling is selected by some arbitrary method because it is known to
be representative of the total population, or it is known that it will produce well matched
groups. The Idea is to pick out the sample in relation to some criterion, which are considered
important for the particular study. This method is appropriate when the study places special
emphasis upon the control of certain specific variables.
Advantages
a. Use of the best available knowledge concerning the sample subjects.
b. Better control of significant variables.
c. Sample groups data can be easily matched.
d. Homogeneity of subjects used in the sample.
Disadvantages
a. Reliability of the criterion is questionable.
b. Knowledge of population is essential.
c. Errors in classifying sampling subjects.
d. Inability to utilise the inferential parametric statistics.
e. Inability to make generalization concerning total population.
3. Quota Sampling
This combined both judgement sampling and probability sampling. The population is
classified into several categories: on the basis of judgement or assumption or the previous
knowledge, the proportion of population falling into each category is decided. Thereafter a
quota of cases to be drawn is fixed and the observer is allowed to sample as he likes. Quota
sampling is very arbitrary and likely to figure in Municipal surveys.
Advantages
a. It is an improvement over the judgement sampling.
b. It is an easy sampling technique.
c. It is most frequently used in social surveys.
Disadvantages
a. It is not a representative sample.
b. It is not free from error.
c. It has the influence of regional geographical and social factors.
Since research design is a plan by which research samples may be selected
from a population and under which experimental treatments are administered and
controlled so that their effect upon the sample may be measured. Therefore, a second
step in the establishment of an experimental design is to select the treatments that will
be used to control sources of learning change in the sample subjects.
4. Judgement Sampling
This involves the selection of a group from the population on the basis of available
information thought. It is to be representative of the total population. Or the selection of a
group by intuition on the basis of criterion deemed to be self-evident. Generally investigator
should take the judgement sample so this sampling is highly risky.
Advantages
a. Knowledge of the investigator can be best used in this technique of sampling.
b. This technique of sampling is also economical.
Disadvantages
a. This technique is objective.
b. It is not free from error.
c. It includes uncontrolled variation.
d. Inferential statistics cannot be used for the observations of this sampling, so
generalization is not possible.
Non-probability sampling is that sampling procedure which does not afford any basis
for estimating the probability that each item in the population has of being included in the
sample.
Probability sampling
Probability sampling is also known as ‘random sampling’ or ‘chance sampling’.
Characteristics of Probability Sampling
The following are the main characteristics of probability sampling:
1. In probability sampling we refer from the sample as well as the population.
2. In probability sampling every individual of the population has equal
probability to be taken into the sample.
3. Probability sample may be representative of the population.
4. The observations (data) of the probability sample are used for the inferential
purpose.
5. Probability sample has not from distribution for any variable.
6. Inferential or parametric statistics are used for probability sample.
7. There is a risk for drawing conclusions from probability sample.
8. The probability is comprehensive. Representativeness refers to characteristic.
Advantages
a. It is (more precisely third way) a good representative of the population.
b. It is an improvement over the earlier.
c. It is an objective method of sampling.
d. Observations can be used for inferential purpose.
Disadvantages
a. Serious disadvantage of this method is that it is difficult for the researcher to
decide the relevant criterion for stratification.
b. Only one criterion can be used for stratification, but it generally seems more
than one criterion relevant for stratification.
c. It is costly and time consuming method.
d. Selected sample may be representative with reference to the used criterion but
not for the other.
e. There is a risk in generalization.
4. Multiple or Double or Repetitive Sampling
Generally this is not a new method but only a new application of the samplings we
discussed above. This is most frequently used for establishing the reliability of a sample.
When employing a mailed questionnaire, double sampling is sometimes used to obtain a
‘more representative sample. This is done because some randomly selected subjects who are
sent questionnaires may not return them. Obviously, the missing data will bias the result of the
study, if the people who fail to reply the’ query differ in some fundamental way from the others
in respect to the phenomena being studied. To eliminate this bias, a second sample may be
drawn at random from the non-respondents and the people interviewed to obtain the desired
information. Thus this technique is also known as repeated or multiple sampling. This double
sampling technique enables one to check on the reliability of the information obtained from the
first sample. Thus, double sampling, wherein one sample is analysed, and information
obtained is used to draw the next sample to examine the problem further.
Advantages
a. This sampling procedure leads to the inferences of free determine precision
based on a number of observations.
b. This technique of sampling reduces the error.
c. This method maintains the procedure of the finding evaluate the reliability of
the sample.
Disadvantages
a. This technique of sampling cannot be used for a large sample. It is applicable
only for small sample.
b. This technique is time consuming, costly, and requires more competition.
c. Its planning and administration is more complicated.
5. Multi-Stage Sampling
This sample is more comprehensive and representative of the population. In this type
of sampling primary sample units are inclusive groups and secondary units are sub-groups
within these ultimate units to be selected which belong to one and only one group. Stages of a
population are usually available within a group or population, whenever stratification is done
by the researcher. The Individuals are selected from different stages for constituting the
multi-stage sampling.
Advantages
a. It is a good representative of the population.
b. Multi-stage sampling is an improvement over the earlier methods.
c. It is an objective procedure of sampling.
d. The observations from multi-stage sample may be used for inferential purpose.
Disadvantages
a. It is a difficult and complex method of samplings.
b. It involves errors when we consider the primary and secondary stages.
c. It is again a subjective phenomenon.
6. Cluster Sampling
To select the intact group as a whole is known as a Cluster sampling. In Cluster
sampling the sample units contain groups of elements (clusters) instead of individual
members or items in the population.
Rather than listing all elementary school children in a given city and randomly selecting 15
per cent of these students for the sample, a researcher lists all of the elementary schools in the
city, selects at random 15 per cent of these clusters of units, and uses all of the children in the
selected schools as the sample.
Advantages
a. It may be a good representative of the population.
b. It is an easy method.
c. It is an economical method.
d. It is practicable and highly applicable in education.
e. Observations can be used for inferential purpose.
Disadvantages
a. Cluster sampling is not free from error.
b. It is not comprehensive.
All these above are techniques of probability sampling.
MEASUREMENT SCALES
Statistical information, including numbers and sets of numbers, has specific qualities that are
of interest to researchers. These qualities, including magnitude, equal intervals, and absolute
zero, determine what scale of measurement is being used and therefore what statistical
procedures are best. Magnitude refers to the ability to know if one score is greater than,
equal to, or less than another score. Equal intervals mean that the possible scores are each an
equal distance from each other. And finally, absolute zero refers to a point where none of the
scale exists or where a score of zero can be assigned.
The most widely used classification of measurement scales are:
a. Nominal
b. Ordinal
c. Interval
d. Ratio
Properties of Scales
● Distinctive classification
● Order
● Equal distance
● Fixed origin
a. Nominal scale
In research activities a YES/NO scale is nominal. It has no order and there is no distance between YES and NO.
1. A nominal scale is the simplest scale. In nominal scale the numbers or letters
assigned to objects serve as labels for identifying and classifying objects. The
numbers are just for namesake, and they do not carry any value or order or
meaning.
Ex.: A list of students in alphabetical order, a list of favorite cartoon
characters, or the names on an organizational chart. Classification like STD
codes for cities, codes for various subjects in a university etc.
2. Nominal scales provide convenient ways of keeping track of people, objects
and events. One cannot do much with the numbers involved.
3. Nominal scale is the least powerful level of measurement. It indicates no order
or distance relationship and has no arithmetic origin.
4. The data collected through a nominal scale is known as a nominal data.
5. In spite of all this, nominal scales are still very useful and are widely used in
surveys and other ex-post-facto research when data are being classified by
major sub-groups of the population. Nominal scale is used in marketing
research.
b. Ordinal scale
The simplest ordinal scale is a ranking.
1. An ordinal scale is more powerful than a nominal scale. The ordinal data has
magnitude only. Any set of data that can be placed in order from greatest to
lowest but there is no absolute zero and no equal intervals.
2. The use of an ordinal scale implies a statement of ‘greater than’ or ‘less than’
but not able to state how much greater or less, big or little, strong or weak.
3. An ordinal scale indicates relative position.
4. In this scale type, the numbers assigned to objects or events represent the rank
order (1st, 2nd, 3rd, etc) of the entities assessed.
Ex.: In a class of students, the highest mark is 95, next is 85 and the next is 84, converting marks
to ranks will lead to 1, 2, and 3. Incidentally, it may be noted that the difference in the performance
of the 1st ranker and 2nd ranker is not the same as the 2nd ranker and 3rd ranker. Thus, one can only
conclude that 1st ranker has performed better than 2nd ranker and 2nd ranker better than 3rd ranker
5. A Likert scale is a type of ordinal scale and may also use names with an order
such as:“Bad”, “medium” and “good” “very satisfied”, “satisfied”, “neutral”,
“unsatisfied”, “very unsatisfied”
Ex.: The result of a horse race, which says only which horses arrived first,
second, or third but include no information about race times.
6. In marketing research ordinal scale used to measure relative attitude, opinions,
perceptions and preference.
c. Interval scale
The standard survey rating scale is an interval scale.
1. The highest scale, it allows the researcher to identify or classify objects, and
compare intervals or differences. It is also meaningful to compute ratios of
scale values.
2. Ratio scales are quantitative measures with fixed or true zero. Ex.: The
measurement of length The ratio scale possesses all the properties of the
nominal, ordinal, and interval scale.
3. In the marketing, sales, costs, market share, and number of customers are
available measure on ratio scale.
Ex.: A weighing scale is a ratio scale, height, price, sales, revenue, profit etc. In all these cases
zero implies absence of that characteristic.
The scaling techniques commonly employee in marketing research can be classified into:
1. Comparative scales
▪ Potential problem is rounding error if too few units, use large number of units
number of objects.
This technique uses a rank order procedure in which objects are sorted into
The number of objects to be sorted should not be less than 60 nor more than
140 to 90 objective is reasonable.
It is magnitude estimation.
They assign a number between 0 to 100 to each statement to indicate the
2. Noncomparative scales:
❖ One of two types of scaling techniques in which each stimulus object is scaled
MEANING OF DATA
“Data is information in raw or unorganized form (such as alphabet, number or symbols) that refers to or represent, ideas, object.
Data is limitless and present everywhere in the universe”.
TYPES OF DATA
The data collection is of two types of data viz., primary and secondary.
A. THE PRIMARY DATA
❑ Data that has been collected from first-hand-experience is known as primary data.
Primary data has not been published yet and is more reliable, authentic and objective.
Primary data has not been changed or altered by human beings, therefore its validity
is greater than secondary data
Merit of primary data
❖ They should be honest, sincere, hardworking, and impartial and must possess
❖ Occasional field checks should be made to ensure that interviewers are neither
cheating, nor deviating from instructions given to them for performing their job
efficiently.
investigation.
************
Questions
1. What do you mean by ‘Sample Design’? What points should be taken into
consideration by a researcher in developing a sample design for this research project.
2. How would you differentiate between simple random sampling and complex random
sampling designs?
Explain clearly giving examples.
3. Why probability sampling is generally preferred in comparison to non-probability
sampling? Explain the procedure of selecting a simple random sample.
4. Under what circumstances stratified random sampling design is considered
appropriate? How would you select such sample? Explain by means of an example.
5. Distinguish between:
(a) Restricted and unrestricted sampling;
(b) Convenience and purposive sampling;
(c) Systematic and stratified sampling;
(d) Cluster and area sampling.
6. Under what circumstances would you recommend?
(a) A probability sample?
(b) A non-probability sample?
(c) A stratified sample?
(d) A cluster sample?
7. Explain and illustrate the procedure of selecting a random sample.
8. “A systematic bias results from errors in the sampling procedures”. What do you
mean by such a systematic bias? Describe the important causes responsible for such a
bias.