Unit III Research Methodology
Unit III Research Methodology
UNIT III
CLASSIFICATION OF DATA
The task of data collection begins after a research
problem has been defined and research design/
plan chalked out.
While deciding about the method of data collection
to be used for the study, the researcher should
keep in mind two types of data viz.,
Primary Data
Secondary Data.
The primary data are those which are collected
afresh and for the first time, and thus happen to
be original in character.
The secondary data, on the other hand, are those
which have already been collected by someone else
and which have already been passed through the
statistical process.
The methods of collecting primary and secondary
data differ since primary data are to be originally
collected, while in case of secondary data the
nature of data collection work is merely that of
compilation.
Advantages of primary data:
The primary data are original and relevant to the
topic of the research study so the degree of accuracy
is very high.
Primary data is that it can be collected from a
number of ways like interviews, telephone surveys,
focus groups etc. It can be also collected across the
national borders through emails and posts. It can
include a large population and wide geographical
coverage.
Moreover, primary data is current and it can give a
realistic view to the researcher about the topic under
consideration.
Reliability of primary data is very high because these
are collected by the concerned and reliable party.
Disadvantages of primary data:
For collection of primary data where interview is to be
conducted the coverage is limited and for wider
coverage a more number of researchers are required.
A lot of time and efforts are required for data
collection. By the time the data collected, analyzed and
report is ready the problem of the research becomes
very serious or out dated. So the purpose of the
research may be defeated.
It has design problems like how to design the surveys.
The questions must be simple to understand and
respond.
Some respondents do not give timely responses.
Sometimes, the respondents may give fake, socially
acceptable and sweet answers and try to cover up the
realities.
The data is classified into four categories:
• Nominal data.
• Ordinal data.
• Discrete data.
• Continuous data.
Qualitative or Categorical Data
Qualitative or Categorical Data is data that can’t
be measured or counted in the form of numbers.
These types of data are sorted by category, not by
number. That’s why it is also known as
Categorical Data. These data consist of audio,
images, symbols, or text. The gender of a person,
i.e., male, female, or others, is qualitative data.
SCALING:
• Scaling is an extension of measurement.
• Scaling involves creating a continuum upon which
measured objects are located.
PRIMARY SCALES OF MEASUREMENT
There are four primary scales of measurement:
(a) Nominal Scale
0
Primary Scales of Measurement
Nominal Numbers
Assigned 4 81 9
to
Runners
Interval Performance
Rating on a 0 8.2 9.1 9.6
to 10 Scale
Semantic
Likert Differenti Stapel
al
Comparative scale:
It involves the direct comparison of stimulus objects
with one another.
For eg. Respondents may be asked whether they like
coke or Pepsi.
Comparative scale data must be interpreted in
relative terms and have only ordinal or rank order
properties.
Non-Comparative Scale:
It is also referred to as monadic or metric scales.
Each object in this type of scaling is scaled
independently of the other objects in the stimulus
set.
COMPARATIVE SCALING TECHNIQUE
Paired Comparison Scaling:
A comparative scaling technique in which a
respondent is presented with two objects at a
time and asked to select one object in the pair
according to some criterion.
The data obtained are ordinal in nature.
Paired Comparison
• Please indicate which of the following airlines you
prefer by circling your more preferred airline in
each pair:
Air India Vistara
Vistara Spicejet
Jet Airways Indigo
Spicejet Go Air
Indigo Air India
Go Air Jet Airways
Rank order scaling
A comparative scaling technique in which
respondents are presented with several objects
simultaneously and asked to order or rank them
according to some criterion.
This scaling is commonly used to measure
preferences for brands as well as attributes.
Rank-Order Scales
Rank the following soft-drinks from 1 (best) to 5
(worst) according to your taste preference:
Coca-Cola _____
7-Up _____
Sprite _____
Pepsi-Cola _____
Mountain Dew _____
Research 1 2 3 4 5
Methodology is the
most interesting
subject known to
man
AGREEMENT
•Agree Strongly
•Strongly •Agree
Agree Moderately
•Agree
•Agree •Agree Slightly
•Disagree •Agree •Undecided
•Undecided
Slightly •Disagree
•Disagree •Disagree •Disagree
•Strongly Moderately
Disagree •Disagree
Strongly
•Completely
Agree •Disagree
•Agree Very •Mostly Agree Strongly
Strongly •Slightly •Disagree
•Agree Strongly Agree •Tend to
•Agree •Yes Disagree
•Slightly
•Disagree •No Disagree
•Tend to
•Disagree Strongly Agree
•Disagree Very •Mostly •Agree
Strongly Disagree •Agree
•Completely Strongly
Disagree
FREQUENCY
•Very •Always
Frequently •Very
•Always
•Frequently Frequently •Almost Always
•Usually
•Occasionall •Occasionall •To a Considerable
•About Half the
y y Degree
Time
•Rarely •Rarely •Occasionally
•Seldom
•Very •Very •Seldom
•Never
Rarely Rarely
•Never •Never
•A Great
•Always
Deal •Often
•Very Often
•Much •Sometimes
•Sometimes
•Somewhat •Seldom
•Rarely
•Little •Never
•Never
•Never
IMPORTANCE
•Very Important
•Important
•Moderately Important •Very Important
•Of Little Importance •Moderately Important
•Unimportant •Unimportant
QUALITY
•Extremely
•Very Good Poor
•Good •Below
•Good
•Barely Average
•Fair
Acceptable •Average
•Poor
•Poor •Above
•Very Poor Average
•Excellent
Semantic scales:
This type of scale makes extensive use of words rather
than numbers.
Respondents describe their feelings about the products
or brands on scales with semantic labels.
When bipolar adjectives are used at the end points of
the scales, these are termed semantic differential
scales.
Semantic Differential Scale
XYZ
Poor Product -5 -4 -3 -2 -1 1 2 3 4 5
Selection
Costly Products -5 -4 -3 -2 -1 1 2 3 4 5
Fast Service -5 -4 -3 -2 -1 1 2 3 4 5
High Quality -5 -4 -3 -2 -1 1 2 3 4 5
Products
Innovative -5 -4 -3 -2 -1 1 2 3 4 5
CHARACTERISTICS OF GOOD MEASUREMENT
SCALES
1. Reliability
• The degree to which a measure accurately captures
an individual’s true outcome without error; Accuracy
• synonymous with repetitive consistency
2. Validity
• The degree to which a measure faithfully represents
the underlying concept;
3. Sensitivity
• The ability to discriminate meaningful differences
between attitudes. The more categories the more
sensitive (but less reliable)
4. Generalizability
• How easy is scale to administer and interpret
VALIDITY AND RELIABILITY
POPULATION:
A population is the aggregate of all elements,
sharing some common set of characteristics and
comprise the universe for the purpose of research.
CENSUS:
A census is the complete enumeration of the
elements of a population or study objects.
SAMPLE:
A sample is the sub-group of the population selected
for participation in the study.
THE SAMPLING DESIGN PROCESS
TARGET POPULATION
It is the collection of elements or objects that
possess the information sought by the researcher
and about which inferences are to be made.
ELEMENT
Objects that possess the information sought by the
researcher and about which inferences are to be
made.
SAMPLING UNIT
The basic unit containing the elements of the
population to be sampled.
DETERMINE THE SAMPLING FRAME
SAMPLING FRAME
It is the representation of the elements of the target
population. It consists of a list or set of directions
for identifying the target population
The example of a sampling frame may include the
telephone book, a city directory or a map.
Often it is possible to compile or obtain a list of
population elements, but the list may omit some
elements of the population or include other
elements which does not belong to it.
SELECT A SAMPLING TECHNIQUE
4. Indeterminancy principle:
Sometimes we find that individuals act differently
when kept under observation than what they do when
kept in non-observed situations.
5. Natural bias in the reporting of data:
Natural bias of respondents in the reporting of data is
often the cause of a systematic bias in many inquiries.
There is usually a downward bias in the income data
collected by government taxation department,
whereas we find an upward bias in the income data
collected by some social organization.
People in general understate their incomes if asked
about it for tax purposes, but they overstate the same
if asked for social status or their affluence. Generally
in psychological surveys, people tend to give what they
think is the ‘correct’ answer rather than revealing
their true feelings.
CHARACTERISTICS OF A GOOD SAMPLE
DESIGN
Mall shoppers
Advantages of convenience sampling
1. Simplicity of sampling and the ease of research
List of clients
Random subsample
Random Sampling
Selected by using
chance or random
numbers
Each individual
subject (human or
otherwise) has an
equal chance of
being selected
Examples:
Drawing names
from a hat
Random Numbers
SYSTEMATIC SAMPLING:
A probability sampling technique in which the
sample is chosen by selecting a random starting
point and then picking every i th element in
succession from the sampling frame.
The sampling interval i is determined by dividing
the population size N by the sample size n and
rounding to the nearest integer.
It is different from SRS in that the permissible
samples of size n that can be drawn have a known
and equal probability of selection
It is less costly than SRS.
SYSTEMATIC RANDOM SAMPLING 1 26 51 76
2 27 52 77
N = 100 3 28 53 78
4 29 54 79
5 30 55 80
6 31 56 81
Want n = 20 7 32 57 82
8 33 58 83
9 34 59 84
10 35 60 85
11 36 61 86
N/n = 5 12 37 62 87
13 38 63 88
14 39 64 89
15 40 65 90
Select a random number from 1-5: chose 4 16 41 66 91
17 42 67 92
18 43 68 93
19 44 69 94
20 45 70 95
21 46 71 96
Start with #4 and take every 5th unit 22 47 72 97
23 48 73 98
24 49 74 99
25 50 75 100
STRATIFIED SAMPLING
A probability sampling technique that uses a two
step process to divide the population into sub-
populations , or strata.
Elements are selected from each stratum by a
random procedure.
The strata should be mutually exclusive and
collectively exhaustive.
This means that every population element should be
assigned to one and only one stratum and no
population elements should be omitted.
Technically, only SRS should be employed in
selecting the elements from each stratum.
Stratified sampling differs from quota sampling in
that the sampling elements are selected
probabilistically rather than based on convenience or
judgment.
A major objective of stratified sampling is to increase
precision without increasing cost.
STRATIFIED RANDOM SAMPLING
List of clients
Strata