Measuring and Scaling of Quantitative Data: UNIT - III
Measuring and Scaling of Quantitative Data: UNIT - III
Measuring and Scaling of Quantitative Data: UNIT - III
UNIT -iii
Agenda
What are measuring and scaling? Levels of measurement Process of measurement Methods of scaling Types of scales Reliability and validity of scales
If a thing exists, it exists in some amount; and if it exists in some amount, it can be measured.
E. L. Thorndike (1914)
Levels of measurement
Nominal Ordinal Interval Ratio
Nominal
A categorical variable, also called a nominal
variable, is for mutual exclusive, but not ordered, categories.
Ordinal
This scale has the ability to rank the individual
attributes of two items in same group but unit of measurement is not available in this scale, like student A is taller than student B but their actual heights are not available.
Interval
Classifies data into groups or categories Designates an equal-interval ordering The difference in temperature between 20 degrees
F and 25 degrees F is the same as the difference o o between 76 degrees F and 81 degrees F
o o
Ratio
This is the highest level of measurement and has
the properties of other three levels; coupled with fixed origin or zero point.
Process of measurement
Define concepts to be measured Define attributes of the concepts Select level of measurement (data type) Generate items/questions
Wording Response format
Methods of scaling
Rating scales
Have several response categories and are used to obtain responses with regard to the object, event, or person studied. Make comparisons between or among objects, events, persons and obtain the preferred choices and ranking among them.
Ranking scales
Types of scales
Likert scale Semantic differential scale Stapel scale Graphic rating scale Thurstone scale Guttman scale Paired comparison scale Forced choice Comparative scale
Likert scale
Is designed to examine how strongly subjects agree or
disagree with statements on a 5-point scale.
Stapel scale
This scale simultaneously measure both
the direction and intensity of the attitude toward the items under study. It is a slight modification of semantic differential scale. The scale consists of a single adjective in the middle of positive and negative numbers
Stapel scale
Thurstone scale
This technique assesses the extent of agreement
among a group of judges about the proposed items for a scale. For example, one might ask a group of persons to judge how closely 25 different items come to measuring self-esteem. Then, one might select the 10 items that received the highest average scores for having content validity with self-esteem. It can help find the best questions to ask to measure an abstract concept. It does not specify how a question or set of questions should be formatted on a questionnaire.
Guttman scale
Who agrees with an item will also agree with all other
items expressing a less extreme position Using a series of statements to reflect the strength of attitudes
Forced choice
Enables respondents to rank
objects relative to one another, among the alternatives provided.
Comparative scale
Provides a benchmark or a point of
reference to assess attitudes toward the current object, event, or situation under study.
Reliability of scale
Indicates the extent to which it is
without bias (error free) and hence ensures consistent measurement across time and across the various items in the instrument.
Types of reliability
Stability of measures
Split-half reliability
Validity of scale
Ensures the ability of a scale to indeed
measure the concept we want to measure and not something else.
Content validity Criterion related validity Construct validity
Content validity
Ensures that the measure includes an
adequate and representative set of items that tap the concept.
A panel of judges
scale differentiates individuals who are known to be different Predictive validity: indicates the ability of measuring instrument to differentiate among individuals with reference to future criterion
Construct validity
Testifies to how well the results obtained from
the use of the measure fit the theories around which the test is designed.
Convergent validity: established when the scores
obtained with two different instruments measuring the same concept are highly correlated Discriminant validity: established when, based on theory, two variables are predicted to be uncorrelated, and the scores obtained by measuring them are indeed empirically found to be so
THANKS A LOT