Anomaly Detection

Uploaded by dipyamanbiswas2004

ANOMALY DETECTION
- Prof Madhurima Paul
Financial transactions

Normal: Routine purchases and consistent spending by an individual in London.

Outlier: A massive withdrawal from Ireland from the same account, hinting at potential fraud.
Network traffic in cybersecurity

Normal: Regular communication, steady data transfer, and adherence to protocol.

Outlier: Abrupt increase in data transfer or use of unknown protocols, signaling a potential breach or malware.
Patient vital signs monitoring

Normal: Stable heart rate and consistent blood pressure.

Outlier: Sudden increase in heart rate and decrease in blood pressure, indicating a potential emergency or equipment failure.
The Importance of Anomaly Detection in Data Science

Data is the most precious commodity in data science, and anomalies are
among the most disruptive threats to its quality. Poor data quality means unreliable:

• Statistical tests
• Dashboards
• Machine learning models
• Decisions
Types of Anomalies
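• Anomaly detection encompasses two broad practices: outlier detection and novelty detection. Outlier detection looks for abnormal points within the data you already have; novelty detection checks whether new, previously unseen observations differ from the data seen so far.

• Identifying the type of anomaly is crucial, as it allows you to choose the right algorithm to detect it.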

Example
Imagine that a city installs a new, more accurate weather monitoring
station. As a result, its temperature dataset starts consistently recording
slightly higher temperatures than before, ranging from 25°C to 35°C. This
sustained increase in temperatures is a novelty: a new pattern
introduced by the improved monitoring system.
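
To make the weather-station example concrete, here is a minimal sketch of a novelty check. It is an illustration under stated assumptions, not a method taken from the slides: the readings are invented, the helper names `fit_reference` and `is_novel` are hypothetical, and the rule used (a z-score against the historical mean, with a threshold of 3) is just one simple choice.

```python
from statistics import mean, stdev

def fit_reference(history):
    """Summarise historical readings (assumed clean) by their mean and standard deviation."""
    return mean(history), stdev(history)

def is_novel(value, reference, threshold=3.0):
    """Return True if a new reading lies more than `threshold` standard deviations from the historical mean."""
    mu, sigma = reference
    return abs(value - mu) / sigma > threshold

# Hypothetical readings (deg C) from the old station; these values are not from the slides.
old_station = [21.0, 22.5, 23.0, 21.5, 22.0, 23.5, 22.5, 21.0, 22.0, 23.0]
reference = fit_reference(old_station)

# Hypothetical readings from the new station, inside the 25-35 deg C range of the example.
new_station = [27.0, 29.5, 31.0, 28.0, 33.5]
novel_flags = [is_novel(t, reference) for t in new_station]
print(novel_flags)
```

The point of the sketch is the workflow: the reference is learned from past data only, and new observations are judged against it. In outlier detection, by contrast, the same dataset is both the reference and the thing being checked.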
Types of Outliers

As there are two types of anomalies, there are two types of outliers as
well: univariate and multivariate. Depending on the type, we will use
different detection algorithms.

1. Univariate outliers exist in a single variable or feature in isolation. They are extreme or abnormal values that deviate from the typical range of values for that specific feature.

2. Multivariate outliers are found by combining the values of multiple variables at the same time. Each value may look normal on its own; it is the combination that is unusual.
Anomaly Detection Methods
For univariate outlier detection, the most popular methods are:

1. Z-score (standard score): the z-score measures how many standard deviations a data point is away from the mean. Generally, instances with an absolute z-score over 3 are flagged as outliers.

2. Interquartile range (IQR): the IQR is the difference between the third
quartile (Q3) and the first quartile (Q1) of a distribution. An instance
is considered an outlier when it falls below Q1 or above Q3 by more than
some multiple of the IQR. The most common multiplier is 1.5, so values
outside the range [Q1 - 1.5 * IQR, Q3 + 1.5 * IQR] are outliers.

3. Modified z-scores: similar to z-scores, but modified z-scores use the median and a measure called Median Absolute Deviation (MAD) to find outliers. Since mean and standard deviation are easily skewed by outliers, modified z-scores are generally considered more robust.
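
The three univariate rules above can be sketched with Python's standard library. This is an illustrative sketch, not a reference implementation. The sample data is invented. The function names are hypothetical. The constant 0.6745 and the cut-off of 3.5 for the modified z-score are a common convention that the slides do not state. Quartiles are computed with the "inclusive" method, which is one of several definitions.

```python
from statistics import mean, median, pstdev, quantiles

def zscore_outliers(values, threshold=3.0):
    """Values whose absolute z-score (distance from the mean in standard deviations) exceeds the threshold."""
    mu, sigma = mean(values), pstdev(values)
    return [x for x in values if abs(x - mu) / sigma > threshold]

def iqr_outliers(values, k=1.5):
    """Values outside [Q1 - k * IQR, Q3 + k * IQR]."""
    q1, _, q3 = quantiles(values, n=4, method="inclusive")
    iqr = q3 - q1
    low, high = q1 - k * iqr, q3 + k * iqr
    return [x for x in values if x < low or x > high]

def modified_zscore_outliers(values, threshold=3.5):
    """Values whose modified z-score, 0.6745 * |x - median| / MAD, exceeds the threshold.

    Assumes the MAD is non-zero, i.e. fewer than half the values equal the median.
    """
    med = median(values)
    mad = median([abs(x - med) for x in values])
    return [x for x in values if 0.6745 * abs(x - med) / mad > threshold]

# Hypothetical sample: eleven ordinary readings and one extreme value.
sample = [10, 12, 11, 13, 12, 11, 10, 12, 13, 11, 12, 50]
print(zscore_outliers(sample))
print(iqr_outliers(sample))
print(modified_zscore_outliers(sample))
```

On this toy sample all three rules flag only the value 50. The z-score rule only just catches it (about 3.3), because the extreme value itself inflates the mean and standard deviation. That inflation is why the median-based score is considered more robust.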
For multivariate outliers, we generally use machine learning algorithms,
which can pick up intricate patterns across many features in complex
datasets:

1. Isolation Forest: uses a collection of isolation trees (similar to decision trees) that recursively divide complex datasets until each instance is isolated. The instances that get isolated the quickest, that is, with the fewest splits, are considered outliers.

2. Local Outlier Factor (LOF): LOF measures the local density deviation of a sample compared to its neighbours. Points with significantly lower density than their neighbours are chosen as outliers.

3. Clustering techniques: techniques such as k-means or hierarchical clustering divide the dataset into groups. Points that fit poorly into every group (for example, far from all cluster centres) or that end up in tiny clusters of their own are considered outliers.

4. Angle-based Outlier Detection (ABOD): ABOD measures the
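
As a rough illustration of the density idea behind LOF (item 2 above), the following is a small pure-Python sketch. It is written from the standard LOF definition (k-distance, reachability distance, local reachability density) and is not taken from the slides. The function name, the choice of k = 3 and the sample points are assumptions made for illustration. It also assumes there are no duplicate points and that there are more than k points.

```python
from math import dist

def local_outlier_factors(points, k=3):
    """Return one LOF score per point; scores well above 1 suggest an outlier."""
    n = len(points)
    neighbours = []   # indices of the k nearest neighbours of each point
    k_distance = []   # distance from each point to its k-th nearest neighbour
    for i, p in enumerate(points):
        ranked = sorted((dist(p, q), j) for j, q in enumerate(points) if j != i)
        neighbours.append([j for _, j in ranked[:k]])
        k_distance.append(ranked[k - 1][0])

    def reach_dist(i, j):
        # Reachability distance of point i from neighbour j.
        return max(k_distance[j], dist(points[i], points[j]))

    # Local reachability density: inverse of the mean reachability distance to the neighbours.
    lrd = [k / sum(reach_dist(i, j) for j in neighbours[i]) for i in range(n)]

    # LOF: mean density of the neighbours divided by the point's own density.
    return [sum(lrd[j] for j in neighbours[i]) / (k * lrd[i]) for i in range(n)]

# Hypothetical data: a tight group of five points plus one point far away.
points = [(0, 0), (0, 1), (1, 0), (1, 1), (0.5, 0.5), (5, 5)]
scores = local_outlier_factors(points, k=3)
for point, score in zip(points, scores):
    print(point, round(score, 2))
```

Points inside the tight group get scores close to 1, because their density matches that of their neighbours. The isolated point gets a much larger score, because it is far less dense than the neighbours it is compared with. For real datasets a library implementation would normally be preferable to this quadratic-time sketch.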
