0% found this document useful (0 votes)
121 views27 pages

Autoencoder Forest For IoT's Anomaly Detection

This document summarizes a presentation on using an autoencoder forest for anomaly detection in IoT time series data. It discusses how autoencoders can be used to model normal equipment behavior and detect anomalies. However, a single autoencoder may not generalize well. The presentation proposes an autoencoder forest that trains multiple independent autoencoders on different time windows to better capture the full behavior. An end-to-end workflow is described that preprocesses data, extracts windows, trains the forest, and then uses it to score anomalies. Experimental results on real-world datasets found the forest was able to detect anomalies that a single autoencoder could miss.

Uploaded by

jwtan2010
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
121 views27 pages

Autoencoder Forest For IoT's Anomaly Detection

This document summarizes a presentation on using an autoencoder forest for anomaly detection in IoT time series data. It discusses how autoencoders can be used to model normal equipment behavior and detect anomalies. However, a single autoencoder may not generalize well. The presentation proposes an autoencoder forest that trains multiple independent autoencoders on different time windows to better capture the full behavior. An end-to-end workflow is described that preprocesses data, extracts windows, trains the forest, and then uses it to score anomalies. Experimental results on real-world datasets found the forest was able to detect anomalies that a single autoencoder could miss.

Uploaded by

jwtan2010
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 27

Data Council Singapore 2019

Autoencoder Forest for Anomaly


Detection from IoT Time Series

<Title>
Yiqun Hu, SP Group
Agenda
• Condition monitoring & anomaly detection
• Autoencoder for anomaly detection
• Autoencoder Forest
• End-to-end workflow
• Experiment results
Conditional monitoring & Anomaly
Detection
Condition monitoring
Time-series anomaly detection
• Manual monitoring
– Huge human effort
– Boring task with low quality

• Rule-based method
– Cannot differentiate different
environment
– Cannot adapt to different
condition of the equipment

• Data-driven method
– Model the common behavior of
the equipment
Autoencoder for Anomaly Detection
Autoencoder
• What is autoencoder Autoencoder Neural Network

– A encoder-decoder type of neural network


architecture that is used for self-learning from
unlabeled data

• The idea of autoencoder


– Learn how to compress data into a concise
representation to allow for the reconstruction
with minimum error

• Different variants of autoencoder


– Variational Autoencoder
– LSTM Autoencoder
– Etc.
Autoencoder for anomaly detection
Offline Training Reconstruction errors

Online Detection

Anomaly score
Autoencoder Forest
A key challenge of autoencoder
Single Autoencoder
The idea of autoencoder forest

+
+
+ + +
x x xx x ++
xx x
x

oo
o o
oo
o o
o
Clustering subsequence is meaningless

[1]. Eamonn Keogh, Jessica Lin, Clustering of Time Series Subsequences is Meaningless:
Implications for Previous and Future Research
Autoencoder forest based on time

0:00 1:00 1:30 22:00 23:30


Training autoencoder forest
• Structure is fixed for every
Decoder Layer 2 (window_size, 1)
autoencoder. (try to make it
as generic as possible)
Decoder layer 1 (window_size/2, 1)
• Each autoencoder within
forest is independent. So
Encoder
(window_size/4, 1) the training is naturally
layer 2
parallelizable
Encoder layer 1 (window_size/2, 1) • Using early stopping
mechanism, the training of
Input Layer (window_size, 1) individual autoencoder can
be stopped at similar
accuracy.
Autoencoder Forest
Single Autoencoder Autoencoder Forest
End-to-end Workflow
Automatic end-to-end workflow

Time series Train Data Train Window Autoencoder


Training analysis Preprocessing Extraction Forest Training

Anomaly Test Data Test Window


Anomaly scoring
Preprocessing Extraction
detection
Periodic pattern analysis
• Automatic determine the
repeating period in time
series
– Calculate autocorrelations of
different lags
– Find the strong local maximum
of autocorrelation
– Calculate the interval of any
two local maximum
– Find the mode of intervala
Missing data handling
Misalignment

… • No need to impute

3:05 3:10 3:15 3:20 … 16:15 16:21 16:24 16:30

Missing
• If missing gap is small,
? ?? impute with

neighbouring points;

• If missing gap is large,
3:05 3:10 3:15 3:20 … 16:15 (16:20 – 16:40) 16:45 impute with the same

time of other periods;
Anomaly scoring

Median profile

 
.. Compute
..
.. reconstruction error
Extract the sequence Corresponding as anomaly score
window end at time t autoencoder reconstruct
the sequence window at
time t

Learned
autoencoder forest
Experiment Results
Cooling tower – return water temperature
Chiller – chilled water return temperature
Smart meter – half hour consumption
Normal data Top 3 Detected Anomaly

2018-12-03 22:00:00 2018-09-27 14:30:00 2018-10-06 22:30:00 2018-09-07 15:30:00


A common platform for time series data, with built-in
AI capabilities
powering the nation

You might also like