Also Called: Box-And-Whisker Plot
Also Called: Box-And-Whisker Plot
Fig 1
Data for box
plot example
2. Median. There is an even number of scores, so the median
is the average of the two middle ones. We must count n/2
and n/2 + 1 from one end.
n/2 = 14/2 = 7 and n/2 + 1 = 8
Count to the seventh and eighth scores in each group and
average them.
3. Hinges. We must find two medians, first of values 1
through 7 and then of values 8 through 14. There are seven
values in each half, an odd number, so we count (7 + 1)/2 =
4 from either end.
lower hinge A = 142 upper hinge A = 160
lower hinge B = 152 upper hinge B = 163
4. H-Spread. The distance between hinges is
H-spread = upper hinge – lower hinge
H-spread A = 160 – 142 = 18
H-spread B = 163 – 152 = 11
5. Inner fences.
upper inner fence = upper hinge + 1.5 × H-spread
upper inner fence A = 160 + 1.5 × 18 = 160 + 27 = 187
upper inner fence B = 163 + 1.5 × 11 = 163 + 16.5 = 179.5
lower inner fence = lower hinge – 1.5 × H-spread
lower inner fence A = 142 – 27 = 115
lower inner fence B = 152 – 16.5 = 135.5
Fig 2
Box plot
example
6. Outer fences.
upper outer fence = upper inner fence + 1.5 × H-spread
upper outer fence A = 187 + 27 = 214
upper outer fence B = 179.5 + 16.5 = 196
lower outer fence = lower inner fence – 1.5 × H-spread
lower outer fence A = 115 – 27 = 88
lower outer fence B = 135.5 – 16.5 = 119
Figure 2 is the box plot of the two teams’ scores. While the
Avengers have a star and the Bulldogs have a poor player,
overall the Bulldogs tend to score higher than the Avengers.
The Bulldogs’ smaller spread also indicates they score more
consistently.
Variations
The box plot was created by John W. Tukey. Many variations
have been proposed for calculating, drawing, and using box
plots. Whenever you use a variation on the basic box plot,
draw solid lines beyond the hinges to indicate that you are not
conforming to Tukey’s rules. Some variations are:
Simple box plot. Instead of calculating and drawing fences
and outliers, draw lines from the ends of the box (hinge
values) to the highest and lowest data values.
Modified box plot. Calculate the arithmetic average of all
the data values and show it with a dot on the box plot. The
closer the average is to the median, the more symmetrical
the distribution.
Modified-width box plot. When using two or more box
plots to compare several data sets, the widths of the boxes
can be drawn proportional to the sample size of the data
sets.
Parentheses can be drawn on the plot to represent 95%
confidence limits.
Ghost box plot or box-plot control chart. A box plot can be
drawn with dotted lines directly on a control chart or other
graph of individual data points to show a summary of the
data. This variation is especially useful if several plots are
drawn showing sequential subgroups of the data. For
example, draw one ghost box plot in the middle of a set of
15 data points prior to a process change and another in the
middle of the next set of 15 data points after the change.