Improved Data Discrimination in Wireless Sensor Networks: B. A. Sabarish, S. Shanmugapriya
Improved Data Discrimination in Wireless Sensor Networks: B. A. Sabarish, S. Shanmugapriya
ABSTRACT
In Wireless Sensors Networks, the computational power and storage capacity is limited. Wireless Sensor Networks are
operated in low power batteries, mostly not rechargeable. The amount of data processed is incremental in nature, due to
deployment of various applications in Wireless Sensor Networks, thereby leading to high power consumption in the
network. For effectively processing the data and reducing the power consumption the discrimination of noisy, redundant
and outlier data has to be performed. In this paper we focus on data discrimination done at node and cluster level employing Data Mining Techniques. We propose an algorithm to collect data values both at node and cluster level and
finding the principal component using PCA techniques and removing outliers resulting in error free data. Finally a
comparison is made with the Statistical and Bucket-width outlier detection algorithm where the efficiency is improved
to an extent.
Keywords: Wireless Sensor Networks (WSN); Data Mining; Clustering; Anomaly Detection; Outlier Detection
1. Introduction
2. Related Works
WSN
118
B. A. SABARISH ET AL.
3. Implementation
The proposed discrimination model is implemented in
two levels mainly at node level and clustering level. In
which the raw data set D is finally partitioned into three
subsets, Devent, Derror and Dordinary. The clustering is done
using incremental algorithm, Cluster head is selected
using the petersons algorithm [8]. Cluster heads are
scheduled to rotate themselves with the period of time
based on the energy level of the nodes (Residual energy).
The cluster head will include the process of data aggregation and forwarding to the Base Station. The premature
death of nodes will lead to hole in the network. Hence
the cluster heads are selected from the nodes with the
high residual energy (Advanced Nodes).
The Implementation of Discrimination [9] in two levels are:
Node level: If the value or gradient of a sample exceeds the range of some physical constant, it is apparently an erroneous sample that should be put into Derror.
Otherwise, the set of discrete errors are picked out and
the samples of continuous errors are marked by interval
for further processing. In our work, linear regression is
used based on a fixed size of sliding window. The value
differences of the predicted samples and the real-sensed
samples reflect the temporal pattern of the sample sequence. Both events and errors would incur significant
change of pattern and thus a higher value difference in
prediction, according to which the involved samples are
marked for further discrimination. The rest of the samples are put into Dordinary.
Cluster level: The local fusion center evaluates the
samples in D' with reference to Dordinary. The event samples are finally selected from D' and constitute Devent.
We use deviation-based ranking strategy to evaluate the
samples in D' because it has been assumed that there is
little chance for all of the nearby nodes (within a cluster)
to get similar wrong readings.
Datasets are generated using SimPy simulator and
stored in database. Dataset of LED was taken into consideration and node level and cluster level discrimination
was done using Python. Peterson algorithm was used to
select the cluster head. Incremental clustering and PCA
was also carried out. Final graph was plotted with the
collected data. Data Aggregation has been carried our
using READA to eliminate the redundant information
Copyright 2012 SciRes.
5. Conclusion
In this paper, self umpiring system which performs incremental cluster algorithm, PCA and outlier detection
was tested. In our system each node in the path from
B. A. SABARISH ET AL.
[2]
[3]
L. Deri, S. Suin and G. Maselli, Design and Implementation of an Anomaly Detection System: An Empirical
Approach, TERENA Networking Conference in Association with the CARNET Users Conference, Zagreb, 19-22
May 2003, pp. 1-20.
[4]
[5]
[6]
[7]
S. Rajasegarar, C. Leckie, M. Palaniswami and J. C. Bezdek, Distributed Anomaly Detection in Wireless Sensor
Networks, 10th IEEE Singapore International Conference on Communication Systems, Singapore, 30 October-2
November 2006, pp. 1-5.
doi:10.1109/ICCS.2006.301508
[8]
[9]
Statistical Method
87
22
Incremental Method
87
24
REFERENCES
[1]
119
[10] K. Khedo, R. Doomun and S. Aucharuz, READA: Redundancy Elimination for Accurate Data Aggregation in
Wireless Sensor Networks, Wireless Sensor Network,
Vol. 2, No. 4, 2010, pp. 302-308.
doi:10.4236/wsn.2010.24041
[11] A. Weingessel and K. Hornik, Local PCA Algorithms,
IEEE Transactions on Neural Networks, Vol. 11, No. 6,
2000, pp. 1242-1250. doi:10.1109/72.883408
[12] D.-A. Le Borgne, S. Raybaud and G. Bontempi, Distributed Principal Component Analysis for Wireless Sensor
Networks, Sensors, Vol. 8, No. 8, 2008, pp. 4821-4850.
doi:10.3390/s8084821
WSN