Sampling and Variables
Sampling and Variables
Structure
6.0 Objectives
6.1 Introduction
6.2 Types of Variables
6.3 Measurement of Qualitative Data
6.4 Census versus Sample Survey
6.5 Sampling Procedure
6.6 Types of Sampling
6.7 Summary
6.8 Answers to Self Check Exercises
6.9 Keywords
6.10 References and Further Reading
6.0 OBJECTIVES
After going through this Unit, you will be able to:
• define a variable;
• distinguish between various types of variables;
• measure ordinal variables through scaling techniques;
• distinguish between census and sample survey;
• explain the steps involved in carrying out a sample survey; and
• distinguish between various types of sampling.
6.1 INTRODUCTION
During the course of research you come across situations where you have to measure
various characteristics. These characteristics could be of various types, viz., age, height
or income level of visitors to a library; educational qualifications, social status or reading
habits of a person; gender, religion, or area of interest of a library user. Note that all
these characteristics are not similar from the point of measurement. While age, height
or income can be measured in quantitative terms (in number of years, in centimetres, in
rupees) religion or gender can be put to certain categories only. In this Unit we discuss
the issue of measurement of qualitative variables, particularly the scaling techniques.
Another issue discussed in the Unit pertains to collection of data on the basis of sample
survey. Very often it is not possible to survey all the units bearing the characteristic
under study. The constraints could be inadequate funds, time limit, and manpower. In
such situations we survey only a subset of the population, called sample. We discuss
various concepts associated with sampling procedure.
We begin with the types of variables and their measurement. 91
Measurement of
6.2 TYPES OF VARIABLES Variables
Let us begin with the concept of a variable. It is a characteristic of the sample or the
population that we intend to measure. Thus age of the reader is a variable, so is gender,
educational level or mother tongue. As we have mentioned earlier all variables are not
similar.
Variables can be of two types: qualitative and quantitative. Qualitative variable is one
that cannot be expressed in numerical terms. For example, marital status is a qualitative
variable. Here we can have two categories: married and single. Of course, if you want
a more detailed categorisation you can further divide single in to widow/widower, divorcee
and never married. Similarly, gender (male or female), mother tongue (Hindi, Bengali,
Oriya, Tamil, Urdu, etc.), subject categories (economics, history, physics, medicine,
etc.), religion (Buddhism, Christianity, Hinduism, Islam, etc.) are examples of qualitative
variables. Here we study an attribute or quality that cannot be quantified, but can be
divided into various categories. Moreover, we cannot say that one category is higher or
greater than another category. Such variables are also called nominal variables.
There is another type of qualitative variable where we can divide the observations into
various categories and also say that one category is higher or greater than another
category. An example could be the educational qualification of a visitor to a library.
Here we can divide the visitors on the basis of their educational qualification into
categories such as ‘secondary’, ‘senior secondary’, ‘graduate’ and ‘post-graduate’. In
this case, obviously, the category ‘Senior Secondary’ is higher than the category
‘Secondary’ in terms of number of years of schooling and expected mental maturity. In
this case we arrange the categories in an ascending or descending order. This sort of
variables are called ordinal variables.
In the case of nominal variables we cannot perform any mathematical operations (such
as addition, subtraction, multiplication, division,) or logical operations (greater than,
less than) across categories. We can simply count the number of observations in each
category. In the case of ordinal variables we can say that one category is greater than
another category. But we cannot quantify the difference between categories. For
example, we cannot express numerically the difference between two categories (say
secondary and senior secondary). Also we cannot say that the difference between two
categories (say secondary and senior secondary) is the same as the difference between
two other categories (say graduate and post-graduate).
A quantitative variable can be expressed in numerical terms. Hence it is also called
numerical variable. Examples of numerical variable could be age, income, weight, height,
distance travelled, etc. This category of variables can be subjected to various mathematical
and logical operations. Thus we can express the monthly income of a librarian in Rupees
and also say by what percentage it exceeds the salary of a library assistant.
Numerical variables can be of two types: discrete and continuous. Discrete variable is
one where the observations assume values in complete numbers. For example, the
number of children in family can only be whole numbers; it cannot be fractions. On the
other hand, continuous variables can assume any value in an interval. For example,
weight of a person can be measured to any precision and thus can take any value in
between two points.
Let us distinguish between variable and data. We obtain data by measuring a variable
(qualitative or quantitative) on certain individuals or units. For example, suppose we
measure the height of 50 employees in a library. Here height is the variable and we 89
obtainfor
Tools 50 Research
observations. These 50 numerical values that we obtain are our data. Thus
we have discrete data or continuous data depending upon whether the variable is discrete
or continuous.
Similarly there are primary data and secondary data. Primary data refers to data collected
by the researcher by undertaking a field survey. On the other hand, secondary data
refers to collection of data from published sources, e.g., census, budget, handbooks,
etc. Thus when you undertake a field survey, collect data, analyse the results and present
it in some forum, it is primary data. But when I use that data for further analysis it
becomes secondary data for me.
6.7 SUMMARY
A variable is a characteristic that we are interested in analysing. It can be nominal,
ordinal or numerical. Numerical variable can be discrete or continuous. While numerical
variable is amenable to mathematical and logical operations nominal and ordinal variables
95
are not. In certain cases it is difficult to measure ordinal variables because we do not
have standard
Tools of measurement. In such cases we need to construct a measurement
for Research
scale. In this we discussed one such scaling techniques, that is summated rating.
Another issue that we covered in this Unit relates to sampling techniques. Because of
inadequate resources or infeasibility we often resort to sampling instead of census of all
the units in the population. There are two broad types of sampling procedures: probability
sampling and non-probability sampling. In probability sampling we have simple random
sampling, systematic random sampling, stratified random sampling and multistage
sampling. On the other hand, non-probability sampling procedures are judgment
sampling, quota sampling and snowball sampling.
97