
International Journal of Research p-ISSN: 2348-6848

e-ISSN: 2348-795X
Available at https://edupediapublications.org/journals
Volume 03 Issue 10
June 2016

Reverse Accessible in Local Outlier Factor Density Based Recognition

N V S K Vijaya Lakshmi K¹ & David Raju Kuppala²
¹Assistant Professor, Dept. of IT, Sir C R Reddy College of Engineering, Eluru, Andhra Pradesh.
²Assistant Professor, Dept. of CSE, K L University, Vaddeswaram, Guntur, Andhra Pradesh.

Abstract: Recent data mining research has shown that, as dimensionality increases, datasets exhibit hubs and anti-hubs: hubs are points that frequently occur in k-nearest-neighbor (kNN) lists, while anti-hubs are points that appear in kNN lists only rarely. This paper develops and compares unsupervised outlier detection models, focusing on the development and analysis of two detection methods, the Local Outlier Factor (LOF) and the Local Distance-Based Outlier Factor (LDOF), which improve on previous systems with respect to speed, complexity, and efficiency. Classification algorithms are used to find relevant features and classify instances according to the chosen criteria in data mining, but these techniques suffer as the complexity, size, and variety of datasets increase. The proposed incremental LOF algorithm achieves detection performance equivalent to the iterated static LOF algorithm while requiring significantly less computation time. In addition, the incremental LOF algorithm adapts dynamically as data points change, which matters in applications whose data profiles change over time. Finally, we give a broad comparison of a number of outlier factor models.
Index Terms: clustering-based, density-based and model-based approaches; nearest neighbour; outlier detection; discrimination; outliers; data mining; clustering; neural network.

1. INTRODUCTION

Outlier detection, or anomaly detection, means detecting data patterns that do not conform to, or lie distant from, the other observations [5]. Outliers can have many anomalous causes: they arise from changes in system behavior, fraudulent behavior, human error, instrument error, or simply natural deviations in populations, and they may carry critical, actionable information in fraud detection, intrusion detection, and medical diagnosis. Data mining is a non-trivial method of identifying valid, novel, potentially useful, and ultimately understandable patterns [1]. It has become an important tool for converting data into information and is widely used in fraud detection, marketing, and scientific discovery; it refers to extracting hidden, interesting patterns from large datasets and databases [2]. Mining uncovers the patterns of the data and can be carried out on a sample of the data, but the mining process will fail if the samples are not a good representation of the larger body of data. Automated identification of suspicious behavior and objects [3] based on information extracted from video streams is currently an active research area; other potential applications include traffic control and surveillance of commercial and residential
Available online: http://internationaljournalofresearch.org/ Page | 147



buildings. These tasks are characterized by the need for real-time processing and by dynamic, non-stationary, and often noisy environments. Hence there is a need for incremental outlier detection that can adapt to novel behavior and provide timely identification of unusual events [4]. The task of identifying outliers can be classified as supervised, semi-supervised, or unsupervised, depending on the availability of labels for outliers and/or normal instances. Supervised outlier detection techniques have an explicit notion of normal and outlier behavior, so precise models can be built; the drawback is that precisely labeled training data are required. Among these categories, unsupervised methods are the most widely applied, because the other categories require accurate and representative labels that are often prohibitively expensive to obtain [5]. Unsupervised methods include distance-based methods, which mainly rely on a measure of distance or similarity to detect outliers. The formulation of outlier detection depends on various factors, such as the input data type and distribution, the availability of data, and the resource constraints introduced by the application domain. Detecting unexpected entries in databases ultimately uncovers errors, frauds, or valid but unexpected entries [6]. With so many applications, precise detection of outliers becomes a must. Many outlier detection methods have been proposed to date; we categorize and review some of the existing methods in the following sections. An overview of the techniques to be discussed is given in the figure below:

Fig 1: Modes of operation of outlier detection techniques

2. Related Work

The classification and recognition of individual characteristics and behaviors constitute a preliminary step and an important objective in the behavioral sciences. Current statistical methods do not always give satisfactory results [3]. To improve performance in this area, we present a methodology based on one of the principles of artificial neural networks: the back-propagation gradient. In classification tasks, a data set that conforms to a certain representation or classification model is considered. If one were to perturb a few data instances by making small changes to some of their attribute values, the original classification model representing the data set would change; likewise, if one were to remove those data instances, the original model could change significantly [7]. The magnitude of the changes to the original model provides clues to the criticality of such data instances, as more critical data instances tend to impact the model more significantly than comparatively noncritical ones. The hubness phenomenon has recently been observed in several application areas, including audio and image data.
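Hubness can be made concrete through the k-occurrence count Nk(x), the number of times a point x appears among the k nearest neighbors of the other points: hubs have unusually large counts, anti-hubs unusually small ones. The following is a minimal pure-Python sketch (the toy points and the choice of k are ours, not from the paper):

```python
from math import dist  # Euclidean distance (Python 3.8+)

def k_occurrences(points, k):
    """For each point x, count N_k(x): how many times x appears
    in the k-nearest-neighbor lists of the other points."""
    n = len(points)
    counts = [0] * n
    for i in range(n):
        # indices of all other points, ordered by distance to point i
        others = sorted((j for j in range(n) if j != i),
                        key=lambda j: dist(points[i], points[j]))
        for j in others[:k]:  # the k nearest neighbors of i
            counts[j] += 1
    return counts

# Toy data: a tight cluster plus one distant point.
pts = [(0, 0), (0, 1), (1, 0), (1, 1), (10, 10)]
nk = k_occurrences(pts, k=2)
print(nk)  # → [2, 3, 2, 3, 0]
```

The isolated point never appears in anyone's kNN list (Nk = 0), making it an anti-hub; in high dimensions such skew of the Nk distribution becomes pronounced.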


We briefly mention hubness in the context of graph construction for semi-supervised learning. There have also been attempts to avoid the influence of hub points in 1-NN time-series classification, apparently without clear awareness of the existence of the phenomenon (Islam et al., 2008), and to account for possible skewness of the distribution of N1 in reverse nearest-neighbor search [8], where Nk(x) denotes the number of times point x occurs among the k nearest neighbors of every other point in the data set. None of these papers, however, really analyzes the causes of hubness or generalizes it to other applications. The outlier detection algorithms proposed initially determine outliers once all data records (samples) are present in the dataset; we refer to these as static outlier detection algorithms. In contrast, incremental outlier detection techniques identify outliers as soon as a new data record appears in the dataset. Incremental outlier detection has also been used within the more general framework of activity monitoring [18]. In addition, [19] proposed broad requirements that incremental algorithms need to meet, and [21] used on-line discounting distributional learning of a Gaussian mixture model with scoring based on the estimated probability density function. [8] proposes an outlier ranking based on an object's deviation in a set of relevant subspace projections; it excludes irrelevant projections showing no clear difference between outliers and the relevant objects, and finds objects that deviate in multiple relevant subspaces. The study in [9] distinguishes three problems caused by the "curse of dimensionality" in the context of data mining, searching, and indexing applications: poor discrimination of distances caused by concentration, and the presence of irrelevant and redundant attributes, all of which impair the usability of traditional similarity and distance measures. A parameter-free outlier detection algorithm [10] computes the Ordered Distance Difference Outlier Factor: it formulates a new outlier score for each instance by considering the differences of ordered distances, and then uses this value to compute the outlier score.

3. Density Based approaches

Distance-based approaches are known to face the local density problem created by the varying degrees of cluster density that exist in a dataset. To solve this problem, density-based approaches have been proposed. Their basic idea is that the density around an outlier differs markedly from the density around its neighbors [14]: the density of an object's neighborhood is compared with that of its neighbors' neighborhoods, and if there is a significant difference between the densities, the object can be considered an outlier. Several outlier detection methods implementing this idea have been developed recently [11], estimating the density around an object in different ways. [15] developed the local outlier factor (LOF), which is among the most commonly used methods in outlier detection; LOF has inspired variants such as the local correlation integral (LOCI) [16], the local distance-based outlier factor (LDOF) [17], and local outlier probabilities (LoOP) [18]. Below we review some density-based outlier detection techniques. Many outlier methods have been proposed to date; the existing methods can be broadly classified as distribution (statistical)-based, clustering-based, density-based, and model-based approaches [13]. Statistical approaches [12] assume that the data follows some standard or predetermined distribution, and this type of approach aims to find the outliers which do not follow such


distributions. The methods in this category always assume that typical examples follow a particular data distribution. Nevertheless, we cannot always have this kind of a priori distribution information in practice, particularly for high-dimensional real data sets [13].

Fig -2: Classification of Outlier Detection

A. NEURAL NETWORK METHODS:
Neural network approaches are usually non-parametric and model-based; they suit hidden patterns well and are capable of learning large, complex class boundaries. The entire data set has to be traversed several times to allow the network to settle and model the data correctly. Neural networks are comparatively less susceptible to the curse of dimensionality than statistical methods. They come in two types: supervised and unsupervised neural methods [16]. Supervised neural networks use the classification of the data to drive the learning process; if this classification is unavailable, the network is known as an unsupervised neural network. Unsupervised neural networks contain nodes which compete to represent portions of the data set. As with perceptron-based neural networks, decision trees, or k-means, they require a training dataset to allow the network to learn. They autonomously cluster the input vectors through node placement, so that the underlying data distribution can be modeled and the normal/abnormal classes differentiated [18]. They assume that related vectors have common feature values and rely on identifying these features and their values to topologically model the data distribution. The neural network uses the class to adjust the weights and thresholds so that it can correctly classify the whole data set. These methods are also used to detect noise and novel data [19]. The neural network is thus a crucial methodology that plays an important role in outlier detection.

Fig. 3. Structure of a Neural Network

4. METHODOLOGY OF OUTLIER DETECTION ALGORITHMS

Clustering and outlier detection are among the major tasks on high-dimensional data. Clustering approaches are supported by outlier detection in new optimistic approaches; the thrust of the new optimistic approach is to apply a nearest-neighbor-based clustering method and detect outliers in high-dimensional data.
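The competitive node placement described above can be sketched in a few lines. This is a generic competitive-learning illustration, not the network used in the paper; the node count, learning rate, and toy data are our own assumptions:

```python
from math import dist

def competitive_fit(data, n_nodes=2, lr=0.2, epochs=20):
    """Competitive learning sketch: for each input vector, the closest
    ("winning") node is pulled toward it, so the nodes come to
    represent separate portions of the data set."""
    # simple deterministic init: spread the starting nodes over the data
    step = max(1, len(data) // n_nodes)
    nodes = [list(data[i * step]) for i in range(n_nodes)]
    for _ in range(epochs):
        for x in data:
            w = min(nodes, key=lambda node: dist(node, x))  # winner
            for d in range(len(w)):
                w[d] += lr * (x[d] - w[d])  # move the winner toward x
    return nodes

# Two well-separated groups; each node settles near one group's center.
data = [(0.0, 0.1), (0.1, 0.0), (9.9, 10.0), (10.0, 9.9)]
nodes = competitive_fit(data)
```

A new vector whose distance to its nearest node is large relative to the training data can then be flagged as abnormal, which is the normal/abnormal differentiation mentioned above.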


A. Local outlier factor (LOF):
LOF compares the local density of an instance with the densities of its neighborhood instances and then assigns an anomaly score to the instance. For a data instance to be considered normal rather than an outlier, its LOF score, the ratio of the average local density of the instance's k nearest neighbors to the local density of the instance itself, should be close to one. To find the local density of a data instance, we find the radius of a small hypersphere centered at the instance; the local density is computed by dividing k, the number of nearest neighbors, by the volume of this hypersphere. Each object is thus assigned a degree of being an outlier, known as its local outlier factor, and this degree determines how isolated the object is with respect to its surrounding neighborhood [20]. Instances lying in dense regions are normal if their local density is similar to that of their neighbors; instances are outliers if their local density is lower than that of their nearest neighbors. LOF is most reliable when used in a top-n manner, hence the name top-n LOF: the instances with the highest LOF values are considered outliers.

B. Local distance based outlier factor (LDOF):
The local distance-based outlier factor measures an object's outlierness in scattered datasets. It uses the relative location of an object with respect to its neighbors to determine the object's degree of deviation from its neighborhood instances; here a scattered neighborhood is considered. The higher the deviation degree a data instance has, the more likely it is an outlier [21]. The algorithm calculates the local distance-based outlier factor for each object, then sorts and ranks the objects by LDOF value; the first n objects with the highest LDOF values are considered outliers.
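As an illustration, both factors can be sketched in plain Python. This is a toy implementation for small datasets; the sample points and the choice of k are ours, and the paper's own implementation is in Java:

```python
from math import dist

def knn(points, i, k):
    """Indices of the k nearest neighbors of points[i]."""
    return sorted((j for j in range(len(points)) if j != i),
                  key=lambda j: dist(points[i], points[j]))[:k]

def lof_scores(points, k=2):
    """LOF: ratio of the average local reachability density (lrd) of a
    point's k nearest neighbors to the point's own lrd; scores well
    above 1 indicate outliers."""
    n = len(points)
    nb = [knn(points, i, k) for i in range(n)]
    # k-distance of a point = distance to its k-th nearest neighbor
    kdist = [dist(points[i], points[nb[i][-1]]) for i in range(n)]

    def reach(i, j):  # reachability distance from i to its neighbor j
        return max(kdist[j], dist(points[i], points[j]))

    lrd = [k / sum(reach(i, j) for j in nb[i]) for i in range(n)]
    return [sum(lrd[j] for j in nb[i]) / (k * lrd[i]) for i in range(n)]

def ldof(points, i, k=2):
    """LDOF: average distance to the k nearest neighbors divided by the
    average pairwise distance among those neighbors."""
    nb = knn(points, i, k)
    d_to_nb = sum(dist(points[i], points[j]) for j in nb) / k
    pairs = [dist(points[a], points[b]) for a in nb for b in nb if a < b]
    return d_to_nb * len(pairs) / sum(pairs)

pts = [(0, 0), (0, 1), (1, 0), (1, 1), (6, 6)]
scores = lof_scores(pts, k=2)
# the distant point (index 4) receives by far the largest LOF and LDOF
```

For the cluster points both scores stay near 1, while the isolated point scores several times higher, which is exactly the top-n ranking behavior described above.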


Fig. 4. Proposed System Architecture

5. NOVEL BOUNDARY BASED CLASSIFICATION APPROACH (NBBC)

The proposed novel boundary-based classification, including the imputation methods and ordinal classification methods, is explained in this section, together with a description of the WDBC dataset. WDBC dataset: the Wisconsin Diagnostic Breast Cancer (WDBC) dataset contains various attributes, namely a diagnosis, an ID number, and real-valued features. There are ten real-valued features, namely radius, area, perimeter, smoothness, texture, compactness, concave points, concavity, symmetry, and fractal dimension, computed from a digitized image of a breast mass [8]. For each class, an additional classification representation is built, or trained, by first deriving a data set which is a partition of the original training data set; the new data set is derived by relabeling a subset of the original data records into two new classes.

A. The Greedy Algorithm

Overview: our greedy algorithm takes the number of desired outliers (say k) as input and selects points as outliers in a greedy manner. Initially the set of outliers (denoted OS) is empty and all points are marked as non-outliers; k scans over the dataset are needed to select k points as outliers [3]. In each scan, each point labeled as a non-outlier is temporarily removed from the dataset as an outlier, and the entropy objective is re-evaluated. The point that achieves the maximal entropy impact, i.e. the maximal decrease in entropy caused by removing it, is selected as the outlier of the current scan and added to OS. The algorithm terminates when the size of OS reaches k. In the initialization phase of the greedy algorithm, each record is labeled as a non-outlier, and hash tables for the attributes are constructed and updated (steps 01-04). In the greedy procedure we scan over the dataset, reading each record t that is labeled as a non-outlier; its label is changed to outlier and the changed entropy value is computed. A record that achieves the maximal entropy impact is selected as the outlier of the current


scan and added to the set of outliers (steps 05-13). In this algorithm the key step is computing the changed value of entropy. Using a hashing technique, we can determine the frequency of an attribute value in the corresponding hash table in O(1) expected time. Hence we can determine the decreased entropy value in O(m) expected time, since the changed values depend only on the attribute values of the record to be temporarily removed [17].
One of the simplest methods for showing that a greedy algorithm is correct is to use a "greedy stays ahead" argument. This style of proof works by showing that, according to some measure, the greedy algorithm is always at least as far ahead as the optimal solution during each iteration. Once this is established, the fact can be used to show that the greedy algorithm must be optimal.
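The scan-and-re-evaluate loop described above can be sketched as follows. This is a simplified illustration over categorical records; the helper names and toy data are ours, and Counter plays the role of the per-attribute hash tables:

```python
from collections import Counter
from math import log2

def total_entropy(freqs, n):
    """Sum of per-attribute entropies, given value-frequency tables."""
    h = 0.0
    for table in freqs:
        for c in table.values():
            p = c / n
            h -= p * log2(p)
    return h

def greedy_outliers(records, k):
    """Select k records whose removal causes the largest drop in the
    dataset's total attribute entropy (one record per scan)."""
    outliers, rows = [], list(records)
    # one Counter per attribute gives O(1) frequency lookups
    freqs = [Counter(r[a] for r in rows) for a in range(len(rows[0]))]
    for _ in range(k):
        best, best_h = None, None
        for r in rows:
            for a, v in enumerate(r):  # temporarily remove r
                freqs[a][v] -= 1
                if freqs[a][v] == 0:
                    del freqs[a][v]
            h = total_entropy(freqs, len(rows) - 1)  # re-evaluate
            if best_h is None or h < best_h:
                best, best_h = r, h
            for a, v in enumerate(r):  # put r back
                freqs[a][v] += 1
        rows.remove(best)  # remove the chosen outlier for real
        for a, v in enumerate(best):
            freqs[a][v] -= 1
            if freqs[a][v] == 0:
                del freqs[a][v]
        outliers.append(best)
    return outliers

data = [("a", "x"), ("a", "x"), ("a", "x"), ("a", "x"), ("b", "y")]
print(greedy_outliers(data, 1))  # → [('b', 'y')]
```

Removing the rare record leaves a perfectly uniform dataset (entropy zero), so it is the greedy pick; each candidate's re-evaluation touches only that record's own attribute values, matching the O(m) bound above.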

Fig No 5 Greedy Algorithm Example

For the comparable fractional problem, however, the greedy strategy, which takes item 1 first, does yield an optimal solution. Taking item 1 does not work in the 0-1 problem because the thief is unable to fill his knapsack to capacity, and the empty space lowers the effective value per pound of his load. In the 0-1 problem, when we consider an item for inclusion in the knapsack, we must compare the solution to the subproblem in which the item is included with the solution to the subproblem in which the item is excluded before we can make the choice. The problem formulated in this way gives rise to many overlapping subproblems, a hallmark of dynamic programming.

6. EXPERIMENTAL SETUP AND RESULTS

System requirements: our algorithms were run on a high-dimensional dataset, the Cover Type dataset from the UCI Machine Learning Repository. The experiments were performed on an Intel Core i5 CPU at 2.53 GHz with 4 GB of RAM, running Windows 7. The algorithms were implemented in Java to process data instances in high-dimensional data. Results: Figure 6 shows the data insertion time


required in five datasets, and Figure 7 shows a comparative study of the outlier detection rate of the existing and proposed algorithms.

Fig 6: Data insertion time required

Figure 7: Outlier detected

We studied outlier-detection methods and the hubness phenomenon, extending previous examinations of (anti)hubness to large values of k and exploring the relationship between hubness and data sparsity. Based on this analysis, we formulated the IQR, Greedy, and AntiHub methods for semi-supervised and unsupervised outlier detection, discussed their properties, and proposed a derived method which improves speed and accuracy, reducing the false positive and false negative rates and improving the efficiency of density-based outlier detection.

7. CONCLUSIONS

Outlier detection is very important and has applications in a wide variety of fields, so it becomes important to learn how to detect outliers. The main objective of this paper is to review various outlier detection techniques and to study how they are categorized. We can conclude that the methods used for outlier detection are application specific. The training and testing algorithms are used for training and testing the class; restricting the search to near the class boundaries saves computation time in identifying such nuggets. Results from the evaluation on the real-world WDBC data sets revealed that the proposed approach achieves better performance than the existing classification algorithms. We proposed a derived method which improves speed and accuracy, reducing the false positive and false negative rates and improving the efficiency of density-based outlier detection. Future implementations lie in machine learning techniques such as supervised and semi-supervised methods.

8. FUTURE WORK

Future work on deleting data records from the database is needed. More specifically, it would be interesting to design an algorithm with exponential decay of weights, where the most recent data records have the highest influence on the local density estimation. In addition, an extension of the proposed methodology to create incremental versions of other emerging outlier detection algorithms, such as the Connectivity-based Outlier Factor (COF), is also worth considering. Additional real-life data sets will be used to evaluate the proposed algorithm, and ROC curves will be applied to quantify the algorithm's performance.


9. REFERENCES

[1] Jun Wang, "A Knowledge Network Constructed by Integrating Classification, Thesaurus, and Metadata in Digital Library," Intl. Inform. & Libr. Rev., vol. 35, issue 3, 2003, pp. 383-397.

[2] K. Ord, "Outliers in statistical data: V. Barnett and T. Lewis, 1994, 3rd edition (John Wiley & Sons, Chichester), 584 pp., ISBN 0-471-93094-6," International Journal of Forecasting, 12(1):175-176, 1996.

[3] S. Chawla, D. Hand, and V. Dhar, "Outlier detection special issue," Data Min. Knowl. Discov., 20(2):189-190, 2010.

[4] Nilam Upasani and Hari Om, "Evolving fuzzy min-max neural network for outlier detection," in International Conference on Advanced Computing Technologies and Applications (ICACTA-2015), Elsevier.

[5] N., Zadrozny, B., and Langford, J., "Outlier detection by active learning," in Proceedings of the 12th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, ACM Press, New York, NY, USA, 2006, pp. 504-509.

[6] V. Chandola, A. Banerjee, and V. Kumar, "Anomaly detection: a survey," ACM Comput. Surv., 41 (2009), 15.

[7] Milos Radovanovic, Alexandros Nanopoulos, and Mirjana Ivanovic, "Reverse Nearest Neighbors in Unsupervised Distance-Based Outlier Detection," IEEE Transactions on Knowledge and Data Engineering, 2014.

[8] Karanjit Singh and Shuchita Upadhyaya, "Outlier Detection: Applications and Techniques," IJCSI International Journal of Computer Science Issues, vol. 9, issue 1, no. 3, January 2012.

[9] Dasgupta, D. and Majumdar, N., "Outlier detection in multidimensional data using negative selection algorithm," in Proceedings of the IEEE Conference on Evolutionary Computation, Hawaii, 2002, pp. 1039-1044.

[10] K. S. Beyer, J. Goldstein, R. Ramakrishnan, and U. Shaft, "When is "nearest neighbor" meaningful?" in Proc. 7th Int. Conf. on Database Theory (ICDT), 1999, pp. 217-235.

[11] V. Chandola, A. Banerjee, and V. Kumar, "Anomaly detection: A survey," ACM Comput. Surv., vol. 41, no. 3, p. 15, 2009.

[12] M. M. Breunig, H.-P. Kriegel, R. T. Ng, and J. Sander, "LOF: Identifying density-based local outliers," SIGMOD Rec., vol. 29, no. 2, pp. 93-104, 2000.

[13] K. Zhang, M. Hutter, and H. Jin, "A new local distance-based outlier detection approach for scattered real-world data," in Proc. 13th Pacific-Asia Conf. on Knowledge Discovery and Data Mining (PAKDD), 2009, pp. 813-822.

[14] W. Jin, A. K. H. Tung, J. Han, and W. Wang, "Ranking outliers using symmetric neighborhood relationship," in Proc. 10th Pacific-Asia Conf. on Advances in Knowledge Discovery and Data Mining (PAKDD), 2006, pp. 577-593.


[15] C. Lijun, L. Xiyin, Z. Tiejun, Z. Zhongping, and L. Aiyong, "A data stream outlier detection algorithm based on reverse k nearest neighbors," pp. 236-239, 2010.

[16] Shu-Ching Chen, Mei-Ling Shyu, Chengcui Zhang, and Rangasami L. Kashyap, "Video Scene Change Detection Method Using Unsupervised Segmentation and Object Tracking," in Proc. ICME, 2001.

[17] Y. Tao, D. Papadias, and X. Lian, "Reverse kNN search in arbitrary dimensionality," in Proceedings of the 30th International Conference on Very Large Data Bases, Toronto, Canada, September 2004.

[18] Amit Singh, Hakan Ferhatosmanoglu, and Ali Tosun, "High Dimensional Reverse Nearest Neighbor Queries," in Proceedings of the ACM International Conference on Information and Knowledge Management (CIKM'03), New Orleans, LA, November 2003.

[19] Barnett, V. and Lewis, T., Outliers in Statistical Data, 3rd edition, John Wiley & Sons, 1994.

[20] Huber, P., Robust Statistics, Wiley, New York, 1974.

[21] Grubbs, F. E., "Procedures for detecting outlying observations in samples," Technometrics, 11, 1969.

Authors Profile:

N V S K Vijaya Lakshmi K is working as an Assistant Professor in the Dept. of IT, Sir C R Reddy College of Engineering, Eluru, Andhra Pradesh. She has 5 years of teaching experience.

David Raju Kuppala is working as an Assistant Professor in the Dept. of CSE, K L University, Vaddeswaram, Guntur, Andhra Pradesh. He has 3 years of teaching experience.