Lesson 2 Frequency Distribution and Graphs
Lesson 2 Frequency Distribution and Graphs
Definitions
Raw Data
Data collected in original form.
Frequency
The number of times a certain value or class of values occurs.
Frequency Distribution
The organization of raw data in table form with classes and
frequencies.
Categorical Frequency Distribution
A frequency distribution in which the data is only nominal or ordinal.
Ungrouped Frequency Distribution
A frequency distribution of numerical data. The raw data is not
grouped.
Grouped Frequency Distribution
A frequency distribution where several numbers are grouped into one
class.
Class Limits
Separate one class in a grouped frequency distribution from another.
The limits could actually appear in the data and have gaps between
the upper limit of one class and the lower limit of the next.
Class Boundaries
Separate one class in a grouped frequency distribution from another.
The boundaries have one more decimal place than the raw data and
therefore do not appear in the data. There is no gap between the
upper boundary of one class and the lower boundary of the next class.
The lower class boundary is found by subtracting 0.5 units from the
lower class limit and the upper class boundary is found by adding 0.5
units to the upper class limit.
Class Width
The difference between the upper and lower boundaries of any class.
The class width is also the difference between the lower limits of two
consecutive classes or the upper limits of two consecutive classes. It is
not the difference between the upper and lower limits of the same
class.
Class Mark (Midpoint)
The number in the middle of the class. It is found by adding the upper
and lower limits and dividing by two. It can also be found by adding the
upper and lower boundaries and dividing by two.
Cumulative Frequency
The number of values less than the upper class boundary for the
current class. This is a running total of the frequencies.
Relative Frequency
The frequency divided by the total frequency. This gives the percent of
values falling in that class.
Example:
Consider a data of the Table 1, which represents the lives of 40 similar
car batteries recorded to the nearest tenth of a year. The batteries were
guaranteed to last 3 years.
Table I.
Car Battery Lives
Steps
Further computations
For the class midpoint or class mark (average of upper and lower class limits)
Example: (lower class limit + upper class limit) / 2
(1.5 + 1.9)/2 = 1.7
Relative frequency
Bar chart
16
14
12
10
Frequency
8 15
6
10
4
2 4 5
2 3
0 1
1.5 – 1.9 2.0 – 2.4 2.5 – 2.9 3.0 – 3.4 3.5 – 3.9 4.0 – 4.4 4.5 – 4.9
Batter lives (year)
Class Interval
Frequency Histogram
Frequency
A histogram differs from a bar chart in the baes of each bar are the
class boundaries rather than class limits. The use of class boundaries for the
bases eliminates the spaces between the bar to give the solid appearance.
Frequency polygon
Frequency Polygon
16
14
12
10
frequency
8
6
4
2
0
1.2 1.7 2.2 2.7 3.2 3.7 4.2 4.7 5.2
Battery Lives in years
class midpoint
Frequency ogive
frequency ogive
45
40
cumulative frequency
35
30
25
20
15
10
5
0
1.45 1.95 2.45 2.95 3.45 3.95 4.45 4.95
Battery Lives (years)
class boundaries
Pie chart
Box plot
What is a Boxplot?
Click here to have example of creating box and whiskers plot in excel.
https://support.microsoft.com/en-us/office/create-a-box-plot-10204530-
8cdf-40fe-a711-2eb9785e510f
battery_lives Stati
stic
Mean 3.41
25
95% Lower 3.18
Confiden Bound 77
ce Upper 3.63
Interval Bound 73
for
Mean
5% Trimmed Mean 3.43
33
Median 3.40
00
Variance .494
Std. Deviation .702
81
Minimum 1.60
Maximum 4.70
Range 3.10
Interquartile Range .77
Skewness -.36
4
Kurtosis .359
Stem-and-Leaf Plot
1.00 2 . 2
4.00 2 . 5669
15.00 3 . 001111222333444
10.00 3 . 5567778899
5.00 4 . 11234
3.00 4 . 577