Lec 2
Lec 2
Graphs
Topics
• Exploring a
Distribution of
Numerical Data
• Dotplots, Frequency
/ Relative Frequency
Histograms, and
Stemplots
What Is a Distribution?
Keep in mind:
• Using a picture to display the data will help to
see patterns.
• Different visual representations capture different
aspects in the data.
• The picture must:
– Record the data values.
– Indicate the frequency (count) of the data
values.
Visualizing Data: Dotplots
Record data values on a number line with a dot
Dotplot above the number line for each data value
observed.
Value Frequency Example: Here is the dotplot for
9 1
11 4
our example data.
12 1
13 5
14 3
15 2
16 4
18 2
19 2
20 2
21 1
35 1
Dotplot: Example (1 of 2)
Stemplots
• Also called stem-and-leaf plots
• Like dotplots, show all individual data values
• Useful when technology is not available or when
the data set is not too large
Stemplot: Example
A collection of college students who said that they drink
alcohol were asked how many alcoholic drinks they had
consumed in the last seven days. Their answers were:
1,1,1,1,1, 2, 2, 2, 3, 3, 3, 3, 3, 4, 5, 5, 5, 6, 6, 6, 8,10,10,15
17, 20, 25, 30, 30, 40
Symmetric
Left and right side roughly the same
Shape: Skewed
Skewed
Most of the data is on one side with a long tail
(or skew) on the other side
Shape: Mounds (1 of 2)
Bar Chart
Similar to a histogram, record data
categories along the horizontal axis with the
height corresponding to the frequency of the
data
Visualizing Data: Bar Chart (2 of 2)
Example:
Here is the bar chart for the class standing of
students interested in a UCLA statistics class.
Class Frequency
Unknown 7
First-year student 0
Sophomore 3
Junior 4
Senior 5
Graduate 1
Total 20
Bar Chart vs. Histogram
Note: Bar charts and histograms are different!
Key differences:
Blank Histogram Bar Chart
Bars: May touch Do not touch
Bar Width: Corresponds to bin Can be any desired
width width (all the same)
Horizontal labels: Numerical Order No inherent order
Pie Chart
A circle divided into pieces (the area of each
piece is proportional to the relative frequency, or
percent, of the data in that piece)
Visualizing Data: Pie Chart (2 of 2)
Example:
Here is the pie chart for the class standing of
students interested in a U C L A statistics class.
Class Frequency
Unknown 7
First-year student 0
Sophomore 3
Junior 4
Senior 5
Graduate 1
Total 20
SECTION 2.4 SUMMARIZING
CATEGORICAL DISTRIBUTIONS
Recall:
To describe a numerical distribution, we record
shape, center, and spread.
Since categorical data has no inherent order, these
measures do not make sense for categorical data.
Two Main Components of a Categorical
Distribution:
1. Mode (typical, or most frequent, outcome)
2. Variability (or diversity in outcomes)
Mode
Mode The category that occurs the most frequently
Key difference in the mode for categorical and
numerical data:
– Numerical data: Modes do not need to be the same height.
• Making Appropriate
Graphs
• Misleading Graphs
Appropriate Graphs
Recall:
The type of data you are dealing with
determines the type of graph you use!