Chapter 2 - Types of Variables and Levels of Measurement
Chapter 2 - Types of Variables and Levels of Measurement
STATISTICS
Types of Variables and
Levels of Measurement
Roy REVILLA
Variables and Constants
• The first thing you must be familiar with in statistics is the concept of a variable. A variable is, quite simply,
something that varies. It is a coding scheme used to measure a particular characteristic of interest. It is also a
characteristic that describes people, objects, or places and takes on multiple values in a sample or population.
• Variables sit in contrast to constants, which are characteristics that assume only one value in a sample. A
constant has characteristic that describes people, objects, or places and takes on only one value in a sample or
population.
Units of Analysis
• It seems rather self-evident, but nonetheless bears explicit mention, that every
scientific study contain something that the researcher conducting the study gathers
and examines. These “somethings” can be objects or entities such as rocks, people,
molecules, or prisons.
• This “something” is called the unit of analysis, and it is, essentially, whatever the
sample under study consists of. It is the object or target of a research study.
• In criminal justice and criminology research, individual people are often the units
of analysis. These individuals might be probationers, police officers, criminal
defendants, or judges. Prisons, police departments, criminal incidents, or court
records can also be units of analysis. Larger units are also popular; for example,
many studies focus on census tracks, block groups, cities, states, or even countries.
Independent Variables and Dependent Variables
• Researchers in criminal justice and criminology typically seek to examine relationships between two or more
variables. Observed or empirical phenomena give rise to questions about the underlying forces driving
them. To be empirical means to have the qualities of being measurable, observable, or tangible. Empirical
phenomena are detectable with senses such as sight, hearing, or touch.
• Researchers undertaking quantitative studies must specify dependent variables (DVs) and independent
variables (IVs). Dependent variables are the empirical events that a researcher is attempting to explain.
They are phenomena that a researcher wishes to study, explain, or predict. Independent variables are factors
a researcher believes might affect the DV. They are also factors or characteristic that is used to try to explain
or predict a dependent variable. The designation of a certain phenomenon as an IV or a DV depends on the
nature of the research study.
Relationships Between Variables: A Cautionary
Note
• It is vital to understand that independent and dependent are not synonymous with cause and effect,
respectively.
• A particular IV might be related to a certain DV, but this is far from definitive proof that the former is the
cause of the latter. To establish causality, researchers must demonstrate that their studies meet three criteria.
The Three Criteria
• First is temporal ordering, meaning that the IV must occur prior to the DV. The causality requirement holding
that an independent variable must precede a dependent variable. It would be illogical, for instance, to predict
that adolescents’ participation in delinquency will impact their gender; conversely, it does make sense to predict
that adolescents’ gender affects the likelihood they will commit delinquent acts.
• The second causality requirement is that there be an empirical relationship between the IV and the DV. The
causality requirement holding that the independent and dependent variables possess an observed relationship
with one another. This is a basic necessity—it does not make sense to try to delve into the nuances of a
nonexistent connection between two variables. For example, if a researcher predicts that people living in high-
crime areas are more likely to own handguns for self-protection, but then finds no relationship between
neighborhood-level crime rates and handgun ownership, the study cannot proceed.
• The last requirement is that the relationship between the IV and the DV be nonspurious, or that the causality
requirement holding that the relationship between the independent variable and dependent variable not be the
product of a third variable that has been erroneously omitted from the analysis. This third criterion is frequently
the hardest to overcome in criminology and criminal justice research (indeed, all social sciences) because
human behavior is complicated, and each action a person engages in has multiple causes. Disentangling these
causal factors can be difficult or impossible.
Spuriousness and Omitted Bias
• The reason spuriousness is a problem is that there could be a third variable that explains the DV as well as,
or even better than, the IV does. This third variable might partially or fully account for the relationship
between the IV and DV. The inadvertent exclusion of one or more important variables can result in erroneous
conclusions because the researcher might mistakenly believe that the IV strongly predicts the DV when, in
fact, the relationship is actually partially or entirely due to intervening factors.
• Another term for this problem is omitted variable bias. This is an error that occurs as a result of
unrecognized spuriousness and a failure to include important third variables in an analysis, leading to
incorrect conclusions about the relationship between the independent and dependent variables. When omitted
variable bias (i.e., spuriousness) is present in an IV–DV relationship but erroneously goes unrecognized,
people can reach the wrong conclusion about a phenomenon.
Examination of Aggregate Trends
• A final caution with respect to causality is that statistical analyses are examinations of aggregate trends.
• Uncovering an association between an IV and a DV means only that the presence of the IV has the tendency to be related to either
an increase or a reduction in the DV in the sample as a whole—it is not an indication that the IV–DV link inevitably holds true for
every single person or object in the sample.
• For example, victims of early childhood trauma are more likely than nonvictims to develop substance abuse disorders later in life
(see Dass-Brailsford & Myrick, 2010). Does this mean that every person who was victimized as a child has substance abuse
problems as an adult? Certainly not! Many people who suffer childhood abuse do not become addicted to alcohol or other drugs.
Early trauma is a risk factor that elevates the risk of substance abuse, but it is not a guarantee of this outcome. Associations present
in a large group are not uniformly true of all members of that group.
• In sum, you should always be cautious when interpreting IV–DV relationships. It is better to think of IVs as predictors and DVs as
outcomes rather than to view them as causes and effects. As the adage goes, correlation does not mean causation. Variables of all
kinds are related to each other, but it is important not to leap carelessly to causal conclusions on the basis of statistical associations.
Levels of Measurement
• Continuous variables differ from categorical ones in that the former are represented not by categories but
rather by numbers. Interval variables are numerical scales in which there are equal distances between all
adjacent points on those scales and are quantitative variables that numerically measures the extent to which a
particular characteristic is present or absent
• Ambient temperature is a classic example of an interval variable. This scale is measured using numbers
representing degrees, and every point on the scale is exactly one degree away from the nearest points on each
side. Twenty degrees Fahrenheit, for instance, is exactly 1 degree cooler than 21 degrees and exactly 4
degrees warmer than 16 degrees.
• An example of an interval-level variable is the GSS’s scoring of respondents’ occupational prestige. The GSS
uses a ranking system to assign each person a number representing how prestigious his or her occupation is.
Figure 2.2 displays the results. The scores range from 16 to 80 and so are presented as a chart rather than a
table.
Figure 2.2 Respondents’ Occupational
Prestige Scores
Ratio Variables
• Ratio variables are the other subtype within the continuous level of measurement. They are a quantitative
variable that numerically measures the extent to which a particular characteristic is present or absent and has
a true zero point. The ratio level resembles the interval level in that ratio, too, is numerical and has equal and
known distance between adjacent points. The difference is that ratio-level scales, unlike interval ones, have
meaningful zero points that represent the absence of a given characteristic.
• Temperature, for instance, is not ratio level because the zeros in the various temperature scales are just
placeholders. Zero does not signify an absence of temperature. Likewise, the data presented in Figure 2.2 and
Tables 2.7, as discussed previously, do not have meaningful zero points and therefore cannot be multiplied or
divided. You could not, for instance, say that someone who scores a 4 on the punitiveness scale is twice as
punitive as a person with a 2 on this measure. Ratio-level data permit this higher level of detail.
• Criminal justice and criminology researchers deal with many ratio-level variables. Age is one example.
Although it is strange to think of someone as having zero age, age can be traced close enough to zero to make
analytically reasonable to think of this variable as ratio. A 40-year-old person is twice as old as someone who
is 20.
Table 2.7 Respondents’ Political Views
Interval and Ratio Variables
• In the real world of statistical analysis, the terms interval and ratio variables are often used interchangeably.
It is the overarching categorical-versus-continuous distinction that usually matters most when it comes to
statistical analyses. When a researcher is collecting data and has a choice about level of measurement, the
best strategy is to always use the highest level possible. A continuous variable can always be made categorical
later, but a categorical variable can never be made continuous.
• Level of measurement is a very important concept. It might be difficult to grasp if this is the first time you
have been exposed to this idea; however, it is imperative that you gain a firm understanding because level of
measurement determines what analyses can and cannot be conducted. This fundamental point will form an
underlying theme of this entire book, so be sure you understand it. Do not proceed with the book until you
can readily identify a given variable’s level of measurement. Table 2.11 summarizes the basic characteristics
that define each level and distinguish it from the others.