Box Whisker Plot
Box Whisker Plot
A box and whisker plot is a graph that exhibits data from a five-number summary, including one
of the measures of central tendency. It does not display the distribution as accurately as a stem
and leaf plot or histogram does. But, it is principally used to show whether a distribution is
skewed or not and if there are potential unusual observations present in the data set, which are
also called outliers. Boxplots are also very useful when huge numbers of data collections are
involved or compared.
Since the centre, spread and overall range are instantly apparent, using these boxplots the
arrangements can be matched easily.
A box and whisker plot is a way of compiling a set of data outlined on an interval scale. It is also
used for descriptive data interpretation.
The box and whisker plot displays how the data is spread out. In the box and whisker diagram,
it has five pieces of information (also called a five-number summary).
Example:
Solution:
Skewness:
In the figure below a symmetric distribution is represented in the histogram and in the boxplot.
The characteristics of this boxplot are that the whiskers are about the same length and the median
is located about halfway along the box.
The figure below shows a negatively skewed distribution. In such a distribution, the data peak to
the right on the histogram and trail off to the left. In corresponding fashion on the boxplot, the
bunching of the data to the right means that the left-hand whisker is longer and the right-hand
whisker is shorter; that is, the lower 25% of data are sparse and spread out whereas the top 25%
of data are bunched up. The median occurs further towards the right end of the box.
In the figure below we have a positively skewed distribution. In such a distribution, the data peak
to the left on the histogram and trail off to the right. In corresponding fashion on the boxplot, the
bunching of the data that the left-hand whisker is shorter and the right-hand whisker is longer;
that is, the upper 25% of data are sparse and spread out whereas the lower 25% of data are
bunched up. The median occurs further towards the left end of the box.
Example:
Example:
Parallel Boxplot
Example: