0% found this document useful (0 votes)
2 views6 pages

Unit 3 Stats

Uploaded by

dyi1ng.dying
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
2 views6 pages

Unit 3 Stats

Uploaded by

dyi1ng.dying
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 6

BOX PLOT AND

OUTLIERS
OUTLIERS:
● Outliers are data points that differ significantly from the other observations in a
dataset.
● They lie far away from the majority of data, making them look unusual.
● Outliers can arise due to variability in the data or errors.
● Example: 45,43,56,53,59,42,39,105,40,56,03. Here, 3 and 105 are outliers
because it deviates drastically from the other values.
● Box plots and scatter plots are the most common tools for visually identifying
outliers.
● Box plots display outliers as separate points outside the main range (whiskers)
of the data whereas, Scatter plots show outliers as points that are isolated from
the overall data pattern.
BOX PLOT:
SCATTER PLOT
BOX PLOT:
● A boxplot is a graphical representation that summarizes the distribution of a
dataset, highlighting its central value, variability, and outliers.
● It is also known as box-and-whisker plot.
● It helps in detecting the outliers.
● Components of a box plot:
i. Box: It represents the interquartile range (IQR). The bottom of the box
represents the First Quartile ( Q1 ), the top of the box is the Upper Quartile (Q3 )
and the line inside the box represents the median ( Q2 ).
ii. Whiskers: These are lines that extend from edges of the box to show the range.
The upper whisker extends from the top of the box (Q3) to the largest data point
which is equal to Q 3 + (1.5 * IQR) and The lower whisker extends from the
bottom of the box (Q 1) to the largest data point which is equal to Q1 - (1.5 * IQR).
Any data point that fall beyond these whiskers are considered as Outliers.
STEPS TO CONSTRUCT A BOX PLOT:
● Sort the dataset in ascending order.
● Find Q1 ,Q2 and Q3
● Find IQR( Interquartile range)= Q3 - Q1
● Determine the Whiskers:
Upper whisker= Q 3 + (1.5 * IQR)
Lower whisker= Q 1 - (1.5 * IQR)
● Draw a rectangular box from Q1 to Q 3
● Inside the box, draw a line at median.
● Extend the lower whisker from Q 1 and Upper whisker from Q 3
● Data points that fall outside the whiskers are considered outliers and are
plotted individually, typically as circles or stars.

You might also like