Measurement - Scaling, Reliability, Validity
Measurement - Scaling, Reliability, Validity
Reliability, Validity
Measurement
• Measurement is the assignment of numbers or other symbols to
characteristics (or attributes) of objects according to a pre-specified
set of rules.
• Objects include persons, SBUs, companies, countries, restaurants, shampoos,
and so on.
• Characteristics include arousal-seeking tendency, achievement motivation,
organizational effectiveness, shopping behavior, ethnic diversity, service
quality etc.
• It is important to realize that you cannot measure objects (for instance a
company); you measure characteristics or attributes of objects (e.g.
organizational effectiveness of company, service quality of restaurant,
shopping behavior of consumer etc.)
Measurement
• Measurement means gathering data in the form of numbers.
• Rating Scales have several response categories and are used to elicit
responses with regard to the object, event, or person studied.
• Likert scale is designed to examine how strongly subjects agree or disagree with
the statements on a five point scale with the following anchors.
Rating Scales
• Itemized Rating Scale: A five point or seven point scale with anchors, as needed,
is provided for each item and the respondent states the appropriate response
against each. This uses an interval scale.
Rating Scales
• Fixed or constant sum scale
• The respondents are here asked to distribute a given number points
across various items as per the example below.
• This is more in the nature of an ordinal scale.
Rating Scales
• Stapel Scale
• This scale is simultaneously measures the direction and intensity of
the attitude toward the item under study.
• Since this does not have an absolute zero point, this is an interval
scale.
Rating Scales
• Graphic Rating Scale
• A graphical representation helps the respondents to indicate on this
scale their answers to a particular question by placing a mark at the
appropriate point on the line.
• This is an ordinal scale.
Ranking Scales
• Ranking scales are used to tap preferences between two or among
more objects or items (ordinal in nature).
• Suppose there are four product lines,
• Manager seeks information that would help decide which product get the
most attention.
• Assume that 35% respondents choose first product,
• 25% the second, 20% choose each of the category.
• Manager cannot conclude that the first product is the most preferred, since
65% of the respondents did not choose the product.
• Alternative methods used are paired comparison, forced choice, and
the comparative scale.
Ranking Scales
• Paired Comparison scale is used when, among a small number of
objects, respondents are asked to choose between two objects at a
time.
• Force Choice enables respondents to rank objects relative to one
another, among the alternative provided.
Ranking Scales
• Comparative Scale provides a benchmark or a point of reference to
assess attitudes towards the current object, event, situation under
study.
Goodness of Measures
• It is important to make sure that the instrument that we develop to
measure a particular concept is indeed accurately measuring the variable,
and that, in fact, we are actually measuring the concept that we set out to
measure.
• First, an item analysis of the responses to the questions tapping the variable is
carried out.
Predicative Validity
Validity Types
3. Construct Validity testifies how well the results obtained from the
use of the measures fit the theories around which the test is designed.