Types of Data New
Types of Data New
A variable can
take different values for different individuals. Variables can be
categorical or quantitative. Per S. S. Stevens…
•Nominal - Categorical
variables with no inherent
order or ranking sequence
such as names or classes (e.g.,
gender). Value may be a
numerical, but without numerical
value (e.g., I, II, III). The only
operation that can be applied to
Nominal variables is enumeration.
•Ordinal - Variables with an
inherent rank or order, e.g.
mild, moderate, severe. Can
be compared for equality, or
greater or less, but not how
much greater or less.
•Interval - Values of the variable
are ordered as in Ordinal, and
additionally, differences between
values are meaningful, however,
the scale is not absolutely
anchored. Calendar dates and
temperatures on the Fahrenheit
scale are examples. Addition and
subtraction, but not multiplication
and division are meaningful
operations.
•Ratio - Variables with all
properties of Interval plus an
absolute, non-arbitrary zero
point, e.g. age, weight,
temperature (Kelvin). Addition,
subtraction, multiplication, and
division are all meaningful
operations.
Basics of Statistics
Definition: Science of collection,
presentation, analysis, and
reasonable interpretation of data.
Statistics presents a rigorous scientific method for gaining
insight into data. For example, suppose we measure the
weight of 100 patients in a study. With so many
measurements, simply looking at the data fails to provide
an informative account. However statistics can give an
instant overall picture of data based on graphical
presentation or numerical summarization irrespective to
the number of data points. Besides data summarization,
another important task of statistics is to make inference
and predict relations of variables.
A Taxonomy of Statistics
Statistical Description of Data
•Statistics describes a
numeric set of data by its
• Center
• Variability
• Shape
•Statistics describes a
categorical set of data by
• Frequency, percentage or
proportion of each category
The center is found using a statistic
such as mean, median, midrange, or
mode, and provides a single value
that is representative of the data.