Lecture 4 - Test of Outliers and Test of SKewness
Lecture 4 - Test of Outliers and Test of SKewness
Lecture Four
Test of Outliers - Box Plot
*** **
LB UB
Q1 Q3
Yes No
Symmetric Skewed
Median
Mean Median
IQR
SD IQR
Example
a) Is this sample containing any extreme values? Justify your answer with a
suitable test.
Answer
Test for the outliers - Box Plot
Step 1: put the values in order from the smallest to the largest
50 65 67 70 72 75 77 80 82 112
Step 5:
Lower bound = Q1 – 1.5 IQR = 66.5 – (1.5 X 14) = 45.5 million $
Upper bound = Q3 + 1.5 IQR = 80.5 + (1.5 X 14) = 101.5 million $
Example
* 112
45.5 101.5
Comment: ???
Example
b) According to your conclusion in part (a), calculate the best central and
the best absolute dispersion measure.
Answer
IQR = 14 million $
50 65 67 70 72 75 77 80 82 112
Step2: location of the median (even sample size) – Case 2
𝒏 𝟏𝟎 𝟏𝟎
= = = 𝟓 and +𝟏=𝟔
𝟐 𝟐 𝟐
Step 3: value of the median= (72+75)/2 = 73.5 million $
Example
C) Assuming that the outlier(s) are not found, what would be the best central measure
Answer
After removing 112
Median
Step 1: put the values in order from the smallest to the largest
50 65 67 70 72 75 77 80 82
Step2: location of the median (odd sample size)
𝒏+𝟏 𝟗+𝟏
= =𝟓
𝟐 𝟐
Step 3: value of the median= 72 million $
𝑋𝑖 𝑋 − 𝑏𝑎𝑟 𝑋 − 𝑥𝑏𝑎𝑟 (𝑋 − 𝑥𝑏𝑎𝑟)^2
70.89 − 72
=3 = −0.34
9.68
Comment: ???
Coefficient of Variation
Can be used to compare the variability of two or more sets of
data measured in different units.
S
CV 100%
X
Rule: The lower CV is the higher level of homogeneity