Finding Outliers 2 Ways: Z-Score and Interquartile Range

The document discusses using Z-scores to identify outliers in data that follows a normal distribution. It explains that a Z-score indicates how many standard deviations an observation is from the mean, and values more than 3 standard deviations out are considered outliers. However, outliers can skew the calculation of Z-scores by influencing the mean and standard deviation. The document then introduces an alternative method using interquartile range to calculate inner and outer fences to identify outliers. Values outside the outer fences would be outliers.

Uploaded by

Ana Chikovani

Using Z-scores to Detect Outliers

Z-scores can quantify how unusual an observation is when your data follow the normal distribution. A Z-score
is the number of standard deviations that a value falls above or below the mean. For example, a Z-score
of 2 indicates that an observation is two standard deviations above the mean, while a Z-score of -2 signifies it
is two standard deviations below the mean. A Z-score of zero represents a value that equals the mean.
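
The definition above can be sketched in a few lines of Python. This is a minimal illustration, not the article's own code: the data values are hypothetical, and using the sample standard deviation (n - 1 denominator) is an assumption, since the text does not say which form it uses.

```python
from statistics import mean, stdev

def z_scores(values):
    """Return the Z-score of each value: (value - mean) / standard deviation."""
    m = mean(values)
    s = stdev(values)  # assumption: sample standard deviation (n - 1 denominator)
    return [(v - m) / s for v in values]

# Hypothetical data, not taken from the article.
data = [2.0, 4.0, 6.0, 8.0, 10.0]
scores = z_scores(data)

for value, z in zip(data, scores):
    print(f"{value:5.1f} -> Z = {z:+.3f}")
```

The value equal to the mean (6.0) gets a Z-score of zero, and values on either side get negative or positive scores in proportion to their distance from the mean.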

The further away an observation’s Z-score is from zero, the more unusual it is. A standard cut-off for
finding outliers is a Z-score of +/- 3 or further from zero. The probability distribution below displays the
distribution of Z-scores in a standard normal distribution. Z-scores beyond +/- 3 are so extreme you can barely
see the shading under the curve.

In a population that follows the normal distribution, Z-score values more extreme than +/- 3 have a probability
of 0.0027 (2 * 0.00135), which is about 1 in 370 observations. However, if your data don’t follow the normal
distribution, this approach might not be accurate.
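
The 0.0027 figure can be checked numerically. The sketch below uses the complementary error function from Python's standard library; the helper name `two_sided_tail` is made up for this illustration.

```python
import math

def two_sided_tail(z):
    """P(|Z| > z) for a standard normal variable, via the complementary error function."""
    return math.erfc(z / math.sqrt(2))

p = two_sided_tail(3)
print(round(p, 4))   # 0.0027
print(round(1 / p))  # 370, i.e. about 1 in 370 observations
```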

Also, note that the outlier’s presence throws off the Z-scores because it inflates the mean and standard deviation
as we saw earlier. Notice how all the Z-scores are negative except the outlier’s value. If we calculated Z-scores
without the outlier, they’d be different! Be aware that if your dataset contains outliers, Z-scores are biased such
that they appear to be less extreme (i.e., closer to zero).
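
A small experiment makes this masking effect concrete. The dataset here is hypothetical (the article's own values are not reproduced in this text), and the sample standard deviation is an assumption. The extreme value is scored twice: once against statistics that include it, and once against the mean and standard deviation of the remaining values.

```python
from statistics import mean, stdev

# Hypothetical data: seven typical values plus one extreme value.
typical = [1.7, 1.8, 1.9, 2.0, 2.1, 2.2, 2.3]
outlier = 10.0
full = typical + [outlier]

def z_score(value, sample):
    """Z-score of value relative to the mean and sample SD of sample."""
    return (value - mean(sample)) / stdev(sample)

# Z-scores computed with the outlier included in the mean and SD.
z_with = [z_score(v, full) for v in full]
z_outlier_masked = z_with[-1]

# The same extreme value scored against the typical values only.
z_outlier_clean = z_score(outlier, typical)

print([round(z, 2) for z in z_with[:-1]])  # all negative
print(f"{z_outlier_masked:.2f}")           # 2.47, below the cut-off of 3
print(f"{z_outlier_clean:.1f}")            # 37.0
```

With the outlier included, every other Z-score is negative and the outlier itself does not even reach 3, so a strict +/- 3 rule would miss it in this small sample.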

To calculate the outlier fences, do the following:

1. Take your IQR (the interquartile range, Q3 – Q1) and multiply it by 1.5 and by 3. We’ll use these values
to obtain the inner and outer fences. For our example, the IQR equals 0.222.
Consequently, 0.222 * 1.5 = 0.333 and 0.222 * 3 = 0.666. We’ll use 0.333
and 0.666 in the following steps.
2. Calculate the inner and outer lower fences. Take the Q1 value and subtract the two values from step 1. The two
results are the lower inner and lower outer fences. For our example, Q1 is 1.714. So, the lower inner fence =
1.714 – 0.333 = 1.381 and the lower outer fence = 1.714 – 0.666 = 1.048.
3. Calculate the inner and outer upper fences. Take the Q3 value and add the two values from step 1. The two
results are the upper inner and upper outer fences. For our example, Q3 is 1.936. So, the upper inner fence =
1.936 + 0.333 = 2.269 and the upper outer fence = 1.936 + 0.666 = 2.602.
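
The three steps above can be sketched as a short function. The function and key names are chosen for this illustration only; the Q1 and Q3 inputs are the example values from the text.

```python
def outlier_fences(q1, q3):
    """Return the inner and outer fences computed from the first and third quartiles."""
    iqr = q3 - q1
    return {
        "lower_outer": q1 - 3 * iqr,
        "lower_inner": q1 - 1.5 * iqr,
        "upper_inner": q3 + 1.5 * iqr,
        "upper_outer": q3 + 3 * iqr,
    }

# Q1 and Q3 from the article's example (IQR = 0.222).
fences = outlier_fences(1.714, 1.936)
for name, value in fences.items():
    print(f"{name}: {value:.3f}")
```

Rounded to three decimals, this reproduces the fences worked out by hand: 1.048, 1.381, 2.269, and 2.602.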

Using the Outlier Fences with Our Example Dataset

For our example dataset, the values for these fences are 1.048, 1.381, 2.269, and 2.602. Almost all of our data
should fall between the inner fences, which are 1.381 and 2.269. At this point, we look at our data values and
determine whether any qualify as major or minor outliers. Values between an inner fence and its outer fence are
minor outliers; values beyond an outer fence are major outliers. 14 out of the 15 data points fall inside the inner
fences, so they are not outliers. The 15th data point falls outside the upper outer fence, so it is a major or extreme
outlier.
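
As a rough sketch, the classification rule can be written as follows. The fence values are the ones from the example; the three test values are hypothetical, since the 15 data points are not listed here. Treating a value that lies exactly on a fence as inside it is an assumption.

```python
# Fence values from the article's example.
LOWER_OUTER, LOWER_INNER = 1.048, 1.381
UPPER_INNER, UPPER_OUTER = 2.269, 2.602

def classify(value):
    """Label a value using the inner and outer fences."""
    if LOWER_INNER <= value <= UPPER_INNER:
        return "not an outlier"
    if LOWER_OUTER <= value <= UPPER_OUTER:
        return "minor outlier"   # between an inner fence and its outer fence
    return "major outlier"       # beyond an outer fence

# Hypothetical values for illustration only.
for v in (1.9, 2.4, 3.0):
    print(v, "->", classify(v))
```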

The IQR method is helpful because it uses percentiles, which do not depend on a specific distribution.
Additionally, percentiles are relatively robust to the presence of outliers compared to methods based on the
mean and standard deviation, such as Z-scores. Values that fall inside the two inner fences are not outliers, as
the example dataset shows.
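
The robustness of percentiles can be illustrated with Python's `statistics.quantiles`. The data are hypothetical, and the choice of the "inclusive" method is an assumption: different quartile conventions give slightly different Q1 and Q3 values, and the text does not say which one it uses.

```python
from statistics import quantiles

def quartiles_and_iqr(values):
    """Return (Q1, Q3, IQR) using the 'inclusive' quartile method (an assumption)."""
    q1, _median, q3 = quantiles(values, n=4, method="inclusive")
    return q1, q3, q3 - q1

# Hypothetical data: the second list replaces the largest value with an extreme one.
clean = [1, 2, 3, 4, 5, 6, 7, 8, 9]
extreme = [1, 2, 3, 4, 5, 6, 7, 8, 900]

print(quartiles_and_iqr(clean))
print(quartiles_and_iqr(extreme))  # same quartiles and IQR as the clean data
```

Changing the largest value from 9 to 900 leaves Q1, Q3, and the IQR untouched, whereas the mean and standard deviation of the same data would shift substantially.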
