Lecture1 Olive's File
Lecture1 Olive's File
Nominal
Qualitative
Ordinal
Variable
Discrete
Quantitative
Continuous
Nominal Variable: A qualitative variable that categorizes (or
describes, or names) an element of a population.
15
A nominal scale is an unordered set of
categories identified only by name.
Nominal measurements only permit you to
determine whether two individuals are the
same or different. Examples…
An ordinal scale is an ordered set of
categories. Ordinal measurements tell you
the direction of difference between two
individuals. Examples…
16
An interval scale is an ordered series of equal-sized
categories. Interval measurements identify the direction and
magnitude of a difference. The zero point is located
arbitrarily on an interval scale. Interval scales are numeric
scales in which we know not only the order, but also the
exact differences between the values. The classic example of
an interval scale is Celsius temperature because the
difference between each value is the same. For example, the
difference between 60 and 50 degrees is a measurable 10
degrees, as is the difference between 80 and 70
degrees. Time is another good example of an interval scale
in which the increments are known, consistent, and
measurable.
17
• Here’s the problem with interval scales: they don’t have a
“true zero.” For example, there is no such thing as “no
temperature.” Without a true zero, it is impossible to
compute ratios. With interval data, we can add and
subtract, but cannot multiply or divide. 10 degrees + 10
degrees = 20 degrees, but 20 degrees is not twice as hot as
10 degrees. Again, there is no such thing as “no
temperature”. Bottom line, interval scales are great, but
we cannot calculate ratios, which brings us to our last
measurement scale.
• A ratio scale is an interval scale where a value of
zero indicates nothing of the variable. Ratio
measurements identify the direction and
magnitude of differences and allow ratio
comparisons of measurements. Ratio scales tell us
about the order, they tell us the exact value
between units, AND they also have an absolute
zero. Good examples of ratio variables include
height and weight.
Measure and Variability
• No matter what the response variable: there
will always be variability in the data.
• One of the primary objectives of statistics:
measuring and characterizing variability.
• Controlling (or reducing) variability in a
manufacturing process: statistical process
control.
Example: A supplier fills cans of soda marked 12 ounces.
How much soda does each can really contain?
Note:
1. Inherent in the concept of randomness: the next result (or
occurrence) is not predictable.
2. Proper procedure for selecting a random sample: use a
random number generator or a table of random numbers.
Example: An employer is interested in the time it takes each
employee to commute to work each morning. A random
sample of 35 employees will be selected and their commuting
time will be recorded.
• Determine the largest and smallest numbers in the raw data and thus
find the range (the difference between the largest and smallest
numbers).
• Divide the range into a convenient number of class intervals having the
same size. If this is not feasible, use class intervals of different sizes or
open class intervals. The number of class intervals is usually between 5
and 20, depending on the data. Class intervals are also chosen so that
the class marks (or midpoints) coincide with the actually observed
data.
• Determine the number of observations falling into each class interval;
that is, find the class frequencies. This is best done by using a tally, or
score sheet
Forming Frequency Distributions
content.answers.com
Dot Plots
To construct a dot plot
1. Draw and label horizontal line
2. Mark regular values
3. Place a dot above each value on Sodium
the number line in
Cereals
Stem-and-leaf plots
• Summarizes quantitative
variables
• Separate each Sodium in
observation into a stem Cereals
(first part of #) and a
leaf (last digit)
• Write each leaf to the
right of its stem; order
leaves if desired
Boxplot
Rank Causes of death Counts Percent of top 10s Percent of total deaths
3 Cerebrovascular 163,538 9% 7%
5 Accidents 101,537 5% 4%
6 Diabetes mellitus 71,372 4% 3%
7 Flu and pneumonia 62,034 3% 3%
8 Alzheimer’s disease 53,852 3% 2%
9 Kidney disorders 39,480 2% 2%
10 Septicemia 32,238 2% 1%
For each individual who died in the United States in 2001, we record what was
the cause of death. The table above is a summary of that information.
Bar graphs
Each category is represented by one bar. The bar’s height shows
the count (or sometimes the percentage) for that particular
category.
800
700 Top 10 causes of death in the U.S., 2001
Counts (x1000)
600
500 The number of individuals
400 who died of an accident in
300 2001 is approximately
100,000.
200
100
0
ia
ry
us
s
ar
ia
es
ts
rs
er
as
on
to
em
en
ul
ce
as
l it
rd
ira
se
sc
el
um
cid
an
se
ic
so
m
va
sp
di
pt
C
ne
di
Ac
di
s
o
's
re
Se
te
rt
er
ey
eb
ea
ic
be
&
dn
on
er
H
u
ia
ei
Ki
Fl
C
hr
zh
D
C
Al
Counts (x1000)
Counts (x1000) H
ea
rt
Ac d
100
200
300
400
500
600
700
800
0
Al c
100
200
300
400
500
600
700
800
zh id is 0
ei e ea
m nt
s se
er
's s
di
se C
as C an
er ce
e eb rs
C r ov
C an C as
er ce hr
eb rs on cu
r ic l ar
ov
C as re
hr cu sp
on ira
ic la
re r t or
sp y
D ira Ac
ia t D
b et
or
y ia
ci
de
es b nt
m et
es s
Fl el
u lit m
& u s Fl
pn u el
lit
e & u
um pn s
H on Al e
ea ia zh um
rt ei
d is m on
ia
Ki e er
dn as 's
ey es di
Ki se
di
so
dn
ey a se
rd
e rs di
Top 10 causes of death in the U.S., 2001
Se so
Easy to analyze
p rd
tic e rs
em
Much less useful
Se
ia p
Sorted alphabetically
tic
em
ia
Bar graph sorted by rank
Multiple Bar Chart