Anomaly / Fraud Detection
Anomaly detection (also known as outlier detection) is the process of finding data objects
whose behavior differs markedly from expected behavior. Such objects are called outliers
or anomalies.
It has many applications in business, from intrusion detection (identifying strange patterns
in network traffic that could signal a hack) to health monitoring (spotting a malignant
tumor in an MRI scan), and from fraud detection in credit card transactions to fault
detection in operating environments.
Anomalies can be broadly categorized as:
1. Point anomalies: A single instance of data is anomalous if it's too far off from the
rest. Business use case: Detecting credit card fraud based on "amount spent."
The simplest approach to identifying irregularities in data is to flag the data points that
deviate from common statistical properties of a distribution, such as the mean, median,
mode, and quantiles. Let's say an anomalous data point is one that deviates from the mean
by more than a certain number of standard deviations.
Algorithm: compute the mean and standard deviation of the data, then flag any point whose
distance from the mean exceeds the chosen multiple of the standard deviation.
Example data: 10, 11, 15, 25, 35, 30, 7, 68
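This statistical check can be sketched in a few lines of plain Python, using the example
data above. The two-standard-deviation cutoff is an illustrative assumption, not a rule
from the text:

```python
from statistics import mean, pstdev

def zscore_outliers(data, threshold=2.0):
    """Flag points more than `threshold` standard deviations from the mean."""
    mu = mean(data)
    sigma = pstdev(data)  # population standard deviation
    return [x for x in data if abs(x - mu) / sigma > threshold]

points = [10, 11, 15, 25, 35, 30, 7, 68]
print(zscore_outliers(points))  # -> [68]
```

Here 68 sits about 2.3 standard deviations above the mean (25.125), so it is the only
point flagged at this threshold.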
A second approach is distance- and density-based. Assumption: normal data points occur in
dense neighborhoods, while abnormalities lie far away.
The nearest set of data points is evaluated using a score, which could be the Euclidean
distance or a similar measure depending on the type of data (categorical or numerical).
These techniques fall broadly into two algorithms:
1. K-nearest neighbor: k-NN is a simple, non-parametric lazy-learning technique that
classifies data based on similarity under distance metrics such as Euclidean, Manhattan,
Minkowski, or Hamming distance.
2. Relative density of data: This is better known as the local outlier factor (LOF). This
concept is based on a distance metric called reachability distance, and compares the local
density around a point with the densities around its neighbors.
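The nearest-neighbor idea above can be sketched in plain Python: score each point by the
distance to its k-th nearest neighbor, so points in sparse regions score highest. The
sample points, the choice of k, and the function name are illustrative assumptions:

```python
import math

def knn_outlier_scores(points, k=2):
    """Score each point by the Euclidean distance to its k-th nearest
    neighbor. Larger scores mean sparser neighborhoods, i.e. likelier
    outliers under the density assumption."""
    scores = []
    for i, p in enumerate(points):
        dists = sorted(
            math.dist(p, q) for j, q in enumerate(points) if i != j
        )
        scores.append(dists[k - 1])
    return scores

# Three points in a dense cluster plus one far-away point.
pts = [(1, 1), (1.2, 0.9), (0.9, 1.1), (8, 8)]
scores = knn_outlier_scores(pts, k=2)
print(max(range(len(pts)), key=scores.__getitem__))  # -> 3, the isolated point
```

Libraries such as scikit-learn ship production versions of both algorithms
(`KNeighborsClassifier`, `LocalOutlierFactor`); this sketch only illustrates the scoring idea.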
Clustering is one of the most popular concepts in the domain of unsupervised learning.
Assumption: Data points that are similar tend to belong to similar groups or clusters, as
determined by their distance from local centroids.
K-means is a widely used clustering algorithm. It partitions the data into 'k' clusters of
similar points. Data instances that fall far from every cluster could potentially be marked
as anomalies.