
Measurement in Research

Psychology is a broad field that studies behavior. Psychologists work in diverse settings but all study behavior and depend on its measurement. Measurement is the process of assigning numbers to objects or observations according to some rule. It is relatively easy to measure physical properties but more difficult to measure abstract concepts. There are different scales of measurement including nominal, ordinal, interval, and ratio scales, with ratio scales having a true zero point. Researchers must consider sources of error in measurement like respondent characteristics, situational factors, and issues with the measuring instrument or researcher. Tests of sound measurement include validity, reliability, and practicality.

Uploaded by Ann Ross


Psychology is a broad, exciting field. Psychologists work in settings ranging from schools and clinics to
basic research laboratories, pharmaceutical firms, and private international companies. Despite this
diversity, all psychologists have at least two things in common: They all study behavior, and they all
depend to some extent on its measurement.

Measurement in Research
 We measure physical objects and abstract concepts in our daily lives
o When we use a yardstick or a weighing scale to determine the height, weight, or some other
feature of a physical object
o When we judge how much we like a song or a painting, or assess the personalities of our friends
 Measurement is a complex and demanding task – especially when it concerns abstract
phenomena (an event, an occurrence, happening, circumstance, situation)
 In research, measurement means the process of assigning numbers to objects or
observations

When is it difficult?
 Assigning numbers to some properties of objects is easy, but for other properties it is
relatively difficult
 Measuring things such as social conformity, intelligence, or marital adjustment is much less
obvious and requires much closer attention than measuring physical weight, age, or financial
assets
 Properties like weight and height are easy to measure with a standard unit of measurement –
properties like motivation and stress are not
o We expect high accuracy when measuring the length of a pipe with a yardstick
o We are less confident about the accuracy of the results if the concept is
abstract and the measurement tools are not standardized

More technicalities on the definition of measurement

 Measurement is the process of mapping aspects of a domain (X) onto other aspects of a range
(Y) according to some rule of correspondence


 For example:
o In a study of persons who attend some show, we want to find the male-to-female
attendance ratio, so we tabulate those who come to the show according to sex
o This maps the observed physical properties of the audience (the domain) onto a
sex classification (the range)
o The rule of correspondence is: if the person is male, assign “0”; if female,
assign “1”
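
The rule of correspondence above can be sketched in a few lines of Python. This is only an illustration: the audience list and the names `RULE` and `code_sex` are invented for the example and do not come from any actual study.

```python
from collections import Counter

# Rule of correspondence from the example: male -> 0, female -> 1.
RULE = {"male": 0, "female": 1}


def code_sex(observations):
    """Map each observed audience member (domain) to a numeric code (range)."""
    return [RULE[obs] for obs in observations]


# Hypothetical audience, invented for illustration.
audience = ["male", "female", "female", "male", "female"]
codes = code_sex(audience)
counts = Counter(codes)  # how many people received each code

# Male-to-female attendance ratio from the tabulated codes.
male_to_female = counts[0] / counts[1]
print(codes, counts, male_to_female)
```

The codes 0 and 1 only label the two classes; the ratio is computed from the counts of each label, not from the label values themselves.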

Measurement Scales
Nominal Scale
 It is simply a system of assigning number symbols to events in order to label them
o The number is not associated with an ordered scale  you can’t say that 1 is greater
than 0 because the numbers are just labels for the particular class of events and as such
have no quantitative value
 Example:
o Assignment of numbers to basketball players in order to identify them
 Possible arithmetic: only counting the members in each group
 Measure of central tendency: mode
 Test of significance: the chi-square test is commonly used
 Measure of correlation: contingency coefficient
 Least powerful level of measurement
 No order or distance relationship, no arithmetic origin
 Nominal scale simply describes differences between things by assigning them to categories
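
A minimal sketch of the arithmetic a nominal scale allows, counting and the mode. The categories and the sample are hypothetical, chosen only to show the idea.

```python
from collections import Counter

# Hypothetical nominal observations: the values are labels, not quantities.
sample = ["O", "A", "O", "B", "O", "A", "AB"]

# Counting members of each category is the only arithmetic that makes sense.
frequencies = Counter(sample)

# The mode is the category with the highest count.
mode_category, mode_count = frequencies.most_common(1)[0]
print(frequencies, mode_category, mode_count)
```

Averaging the labels would be meaningless here, which is why the mode is the measure of central tendency for nominal data.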

Ordinal Scale
 Places events in order, but makes no attempt to make the intervals of the scale equal in terms
of some rule
 Ordinal scales only permit the ranking of items from highest to lowest. Ordinal measures have
no absolute values, and the real differences between adjacent ranks may not be equal.
 Example:
o Ranks in competitions use an ordinal scale
 Measure of central tendency: median
 Dispersion: percentile or quartile measure
 The median is the score at the middle of all scores, or more formally “the middle value
in a distribution, below and above which lie values with equal total frequencies or probabilities”
(Porkess, 1991, p. 134). This means that at least 50% of the respondents scored equal to or higher
than the median, and at least 50% scored equal to or lower. If, for example, the results of a school
exam indicate a median of 70 (out of 100, with 55 or more being a pass), then we
know that at least 50% of the students passed. From a frequency table, the median can quickly
be found by looking at the cumulative percentages.
 In the example from Table 5, the cumulative percent passes the 50% mark when
it goes from 31.3 to 67.8. So one of the 348 people who chose ‘Not too scientific’ is the one
exactly in the middle. The median is therefore ‘Not too scientific’.
 43350_4.pdf (sagepub.com)
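
Finding the median from cumulative percentages, as described above, can be sketched as follows. Table 5 is not reproduced in these notes, so the frequency table below is hypothetical, and `median_category` is a name made up for this sketch.

```python
def median_category(ordered_counts):
    """Return the first category whose cumulative percent reaches 50%.

    ordered_counts: list of (category, frequency) pairs, already sorted
    from the lowest rank to the highest rank.
    """
    total = sum(freq for _, freq in ordered_counts)
    if total == 0:
        raise ValueError("ordered_counts must contain at least one response")
    cumulative = 0
    for category, freq in ordered_counts:
        cumulative += freq
        # Stop at the category where the cumulative percent passes 50%.
        if 100 * cumulative / total >= 50:
            return category


# Hypothetical ordinal frequency table (not the data from Table 5).
table = [("low", 30), ("medium", 45), ("high", 25)]
print(median_category(table))
```

In this made-up table the cumulative percent goes from 30 to 75 at "medium", so that category holds the middle respondent. If the cumulative percent lands on exactly 50, the median falls between two categories; this sketch simply returns the lower one.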
Interval Scale
 More powerful than the ordinal scale because it incorporates the concept of equality of intervals
 Interval scales lack a true zero – they cannot measure the complete absence
of a trait or characteristic
 Central tendency: mean
 Dispersion: standard deviation
 Statistical significance: t test and the F test

3. Interval Scale –
An interval scale has ordered numbers with meaningful divisions; the magnitudes of
consecutive intervals are equal. Interval scales do not have a true zero, e.g. 0 degrees Celsius
does not mean the absence of heat.
Interval scales have the properties of:
 Identity
 Magnitude
 Equal distance
For example, temperature on a Fahrenheit/Celsius thermometer: 90° is hotter than 45°, and
the difference between 10° and 30° is the same as the difference between 60° and
80°.
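
The equal-distance property in the temperature example can be checked with a short sketch. The helper `c_to_f` is the standard Celsius-to-Fahrenheit conversion; treating the quoted readings as Celsius is an assumption made for the illustration.

```python
def c_to_f(celsius):
    """Convert a Celsius reading to Fahrenheit."""
    return celsius * 9 / 5 + 32


# Equal differences in Celsius...
diff_low_c = 30 - 10
diff_high_c = 80 - 60

# ...are still equal after every reading is converted to Fahrenheit.
diff_low_f = c_to_f(30) - c_to_f(10)
diff_high_f = c_to_f(80) - c_to_f(60)

print(diff_low_c == diff_high_c, diff_low_f == diff_high_f)
```

The size of the difference changes with the unit (20 degrees versus 36 degrees), but the two intervals stay equal to each other on either thermometer, which is what "equal distance" means.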

Ratio Scale
 Ratio scales represent the actual amounts of variables
 They have an absolute or true zero
 Example:
o Length, weight, distance – measures of physical dimensions
 All statistical techniques are usable

4. Ratio Scale –
The ratio scale of measurement is similar to the interval scale in that it also represents quantity
and has equality of units, with one major difference: zero is meaningful (no numbers exist below
zero). The true zero allows us to know how many times greater one case is than another.
Ratio scales have all of the characteristics of the nominal, ordinal and interval scales. The
simplest example of a ratio scale is the measurement of length. Having zero length or zero
money means that there is no length and no money, whereas zero degrees on a Celsius or
Fahrenheit thermometer is not an absolute zero.
Properties of Ratio Scale:
 Identity
 Magnitude
 Equal distance
 Absolute/true zero
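
Why a true zero matters for "how many times greater" statements can be seen in a small sketch. The lengths and temperatures are hypothetical values picked for the illustration.

```python
def c_to_f(celsius):
    """Convert a Celsius reading to Fahrenheit."""
    return celsius * 9 / 5 + 32


# Ratio scale (length): changing the unit keeps the ratio intact.
ratio_metres = 4.0 / 2.0
ratio_centimetres = 400.0 / 200.0

# Interval scale (temperature): the ratio depends on the unit,
# so "twice as hot" is not a meaningful statement.
ratio_celsius = 90 / 45
ratio_fahrenheit = c_to_f(90) / c_to_f(45)  # 194 / 113

print(ratio_metres, ratio_centimetres, ratio_celsius, ratio_fahrenheit)
```

A 4 m pipe is twice as long as a 2 m pipe in any unit, but 90° is "twice" 45° only on the Celsius thermometer, because its zero point is arbitrary.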

Sources of Error in Measurement

See Kaplan and Saccuzzo.
The researcher must know the problems that correct measurement has to address. As far as possible,
the researcher should eliminate or minimize the possible sources of error so that the final results are
not contaminated.
 Measurement should be precise, but this ideal is rarely met in full – that is why we must
be aware of the sources of error in measurement
 Respondent (or subject variables) – test-taker characteristics that limit the respondent's
ability to respond accurately and fully
o Reluctance (unwillingness or hesitation to express strong negative feelings)
o Ignorance (little knowledge about the subject but unwilling to admit it) – “guessing”
o Fatigue, boredom, anxiety, test anxiety (difficulty focusing attention on the test items;
being distracted by other thoughts)
o Health / illness (e.g. when you have a cold or the flu, you might not perform as well as
when you are feeling well)
 Situation – situational factors may also get in the way of correct measurement
o E.g. mode of administration (online, face-to-face)
o E.g. the presence of someone else during an interview may affect rapport
o E.g. when anonymity is not assured, respondents may be reluctant to express feelings/opinions
 Measurer
o Behavior, style, looks may encourage or discourage responses
o Interviewer/researcher may reword or reorder questions
o Training / experience of the measurer
o Carelessness in processing, encoding, etc.
o Expectancy of the measurer  data can sometimes be affected by what the measurer
wants to find
o Drift  in behavioral assessment, observers have a tendency to drift away from the
strict rules followed in training
o Reactivity  in behavioral assessment, reliability increases when observers know that
they are being observed
 Instrument
o Error may arise because of a defective measuring instrument
o Use of complex words, ambiguous meanings, poor printing, inadequate space for
replies, response choice omissions, etc. makes the instrument defective and may result in
measurement errors
o Poor sampling of the universe of items of concern

Tests of Sound Measurement

Sound  in good condition; not damaged; fit; strong
Three considerations should be included when we evaluate a measuring tool: validity, reliability, and
practicality.

Validity – extent to which a test measures what we actually wish to measure; validity is the evidence for
inferences made about a test score
 The use of categories does not imply that there are distinct forms of validity  care should be
exercised in making distinctions because the categories actually overlap (Kaplan & Saccuzzo, 2015)
 Is it really measuring what it is supposed to measure?
o Most critical criterion
o Can be thought of as utility?
 How do we check an instrument’s validity? We seek other relevant evidence that confirms the
answers we have found with our measuring tool
 Content validity – the extent to which the instrument provides adequate coverage of the topic
under study
o E.g. we can have an expert panel to judge how the instrument meets the standards
o No numerical way to express it
 Criterion-related validity – our ability to predict some outcome or estimate the existence of
some current condition; a broad term that covers predictive and concurrent validity
o What do we mean by criterion? A basis, a reference
o The criterion must be: relevant, free from bias, reliable (stable), and available
o Predictive validity – usefulness of a test in predicting some future performance
o Concurrent validity – usefulness of a test in closely relating to other measures of known
validity
 Criterion and measure are taken at the same time
 Example: learning disability test and school performance (Kaplan & Saccuzzo,
2015)
 Here the measure and the criterion are taken at the same time because the test
is designed to explain why the person is now having difficulty in school
o Expression: coefficient of correlation between test scores and some measure of future
performance, or between test scores and scores on another measure of known validity
 Construct validity – the degree to which scores/measurement using a test can be accounted for
by explanatory constructs of a sound theory
o Convergent - Convergent evidence comes from correlations between the test and other
variables that are hypothetically related to the construct.
o Divergent or discriminant validity - Discriminant evidence shows that the measure does
not include superfluous items and that the test measures something distinct from other
tests.
o Construct validity evidence is used when a specific criterion is not well defined.
Reliability and validity are related because it is difficult to obtain evidence for validity
unless a measure has reasonable reliability.
o Construct validity evidence is established through a series of activities in which a
researcher simultaneously defines some construct and develops the instrumentation to
measure it. This process is required when “no criterion or universe of content is
accepted as entirely adequate to define the quality to be measured” (Cronbach &
Meehl, 1955, p. 282; Sackett, 2003). Construct validation involves assembling evidence
about what a test means. This is done by showing the relationship between a test and
other tests and measures. Each time a relationship is demonstrated, one additional bit
of meaning can be attached to the test. Over a series of studies, the meaning of the test
gradually begins to take shape. The gathering of construct validity evidence is an
ongoing process that is similar to amassing support for a complex scientific theory.
Although no single set of observations provides crucial or critical evidence, many
observations over time gradually clarify what the test means.
As we saw in Chapter 4, if a test measures whatever it measures well, its scores may be deemed to be
reliable (consistent, precise, or trustworthy), but they are not necessarily valid in the contemporary,
fuller sense of the term. In other words, test scores may be relatively free of measurement error, and
yet may not be very useful as bases for making the inferences we need to make.

According to the testing pioneer Lee Cronbach, it may not be appropriate to continue to divide validity
into three parts: “All validation is one, and in a sense all is construct validation” (1980, p. 99). Recall
that the 2012 edition of Standards for Educational and Psychological Testing no longer recognizes
different categories of validity. Instead, it recognizes different categories of evidence for validity.
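
The correlation coefficient mentioned under criterion-related validity can be computed as in the sketch below. It uses the Pearson formula; the scores are hypothetical and `pearson_r` is a helper written for this example, not a function taken from the sources cited in these notes.

```python
import math


def pearson_r(x, y):
    """Pearson correlation coefficient between two equal-length score lists."""
    if len(x) != len(y) or len(x) < 2:
        raise ValueError("need two lists of equal length with at least 2 scores")
    mean_x = sum(x) / len(x)
    mean_y = sum(y) / len(y)
    # Sum of cross-products and sums of squares of deviations from the means.
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    var_x = sum((a - mean_x) ** 2 for a in x)
    var_y = sum((b - mean_y) ** 2 for b in y)
    return cov / math.sqrt(var_x * var_y)


# Hypothetical data: test scores and a later performance measure (the criterion).
test_scores = [52, 61, 70, 75, 88]
later_performance = [2.1, 2.6, 3.0, 3.1, 3.8]

validity_coefficient = pearson_r(test_scores, later_performance)
print(round(validity_coefficient, 2))
```

For predictive validity the criterion is collected later; for concurrent validity the second list would hold scores on another measure taken at the same time. The sketch assumes both lists vary (a list of identical scores would cause a division by zero).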

Reliability – accuracy and precision of a measurement procedure

A reliable measuring instrument provides consistent results.
 Relationship with validity: reliability of a measuring instrument contributes to validity  but a
reliable instrument is not necessarily a valid instrument; a valid instrument is always reliable
 Easier to assess reliability compared to validity
 When a test is reliable, we can be confident that the transient and situational factors are not
interfering
 Stability aspect  consistent results with repeated measurements of the same person and with
the same instrument
 Equivalence aspect  consistent results even when there are different investigators or different
samples of items being studied
o Example: equivalent/parallel forms of a test
 How to improve reliability?
o Make sure the conditions under which measurement takes place are standardized – we must
ensure that external sources of variation such as boredom, fatigue, etc. are minimized to the
extent possible  improves stability aspect
 From Determining Reliability of a Test: 4 Methods (yourarticlelibrary.com): The time
gap before the retest should not be more than six months. A retest gap of a
fortnight (2 weeks) gives an accurate index of reliability.
o Design directions for measurement with no variation from group to group, use
trained and motivated persons to conduct the research, and broaden the
sample of items used  improves equivalence aspect
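
One simple way to look at the equivalence aspect with different investigators is percent agreement between two observers. The notes do not name this index, so treat it as an illustrative assumption; the observation codes below are invented.

```python
def percent_agreement(ratings_a, ratings_b):
    """Share of observations on which two observers gave the same code."""
    if len(ratings_a) != len(ratings_b) or not ratings_a:
        raise ValueError("need two non-empty rating lists of equal length")
    matches = sum(a == b for a, b in zip(ratings_a, ratings_b))
    return matches / len(ratings_a)


# Hypothetical codes from two observers watching the same five intervals.
observer_1 = ["on-task", "off-task", "on-task", "on-task", "off-task"]
observer_2 = ["on-task", "off-task", "off-task", "on-task", "off-task"]

agreement = percent_agreement(observer_1, observer_2)  # 4 of 5 codes match
print(agreement)
```

Percent agreement does not correct for agreement that would occur by chance, so it is only a rough first check.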

Practicality – concerned with factors of economy, convenience, and interpretability

 Economy – there is a trade-off between the ideal research project and what the budget, and
even the available time, can afford
o Generally, more items give greater reliability, but we may have to use only a few items in the
interest of limiting interview or observation time
 Convenience – should be easy to administer
o Test/questionnaires should have clear instructions
 Interpretability – the instrument should be accompanied or supplemented by (a) detailed
instructions for administering the test; (b) scoring keys; (c) evidence about the reliability and (d)
guides for using the test and for interpreting results
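
The point under Economy that more items generally give greater reliability is commonly quantified with the Spearman-Brown formula. The notes do not mention this formula, so the sketch below is an added illustration, and the starting reliability of 0.60 is a made-up value.

```python
def spearman_brown(reliability, length_factor):
    """Predicted reliability when a test is lengthened by length_factor.

    length_factor = 2 means twice as many comparable items;
    length_factor = 0.5 means the test is cut in half.
    """
    return (length_factor * reliability) / (1 + (length_factor - 1) * reliability)


current = 0.60  # hypothetical reliability of the existing instrument
doubled = spearman_brown(current, 2)    # roughly 0.75
halved = spearman_brown(current, 0.5)   # roughly 0.43
print(round(doubled, 2), round(halved, 2))
```

Under the formula's assumption that added items are comparable to the existing ones, doubling the length raises the predicted reliability and halving it lowers it, which is the trade-off against interview or observation time.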
Developing Measurement Tools
It is a four-stage process consisting of:

 concept development – the researcher should arrive at an understanding of the major concepts
pertaining to the study
 specification of concept dimensions – the researcher specifies the dimensions of the concepts
developed in the first stage
o via deduction (intuitive approach) or by empirical correlation of the individual
dimensions with the total concept and/or other concepts
o example: company image
 dimensions may be thought as (1) product reputation; (2) customer treatment;
(3) corporate leadership; (4) concern for individual; (5) sense of social
responsibility; etc.
 selection of indicators – researcher develops indicators to measure each dimension/element of
the concept
o indicators – these are specific questions, scales, or other devices by which respondent’s
knowledge, opinion, expectation, etc. are measured
o no one perfect indicator! The researcher should consider several alternatives for the
purpose. The use of more than one indicator gives stability to the scores and it also
improves their validity
 formation of index – researcher combines the indicators into an index
o we combine the several dimensions of a concept into one single index
o example: we can provide scale values to the responses and then sum up the
corresponding scores
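
The last stage, assigning scale values to responses and summing them into an index, can be sketched as follows. The five-point scale values and the respondent's answers are hypothetical; the dimension names reuse the company image example above.

```python
# Hypothetical scale values for the response options (an assumption of this sketch).
SCALE_VALUES = {
    "strongly disagree": 1,
    "disagree": 2,
    "neutral": 3,
    "agree": 4,
    "strongly agree": 5,
}


def index_score(responses):
    """Sum the scale values of one respondent's answers into a single index."""
    return sum(SCALE_VALUES[answer] for answer in responses.values())


# One hypothetical respondent, with one indicator per company-image dimension.
respondent = {
    "product reputation": "agree",
    "customer treatment": "strongly agree",
    "corporate leadership": "neutral",
    "concern for individual": "agree",
    "sense of social responsibility": "disagree",
}

company_image_index = index_score(respondent)
print(company_image_index)
```

A simple sum weights every dimension equally; whether that is appropriate depends on the concept being measured.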
