Multivariate Multi Step Time Series Forecasting Using Stacked LSTM Sequence To Sequence Autoencoder in Tensorflow 2 0 Keras

This document summarizes a stacked LSTM sequence-to-sequence model for multi-step time series forecasting. It introduces sequence-to-sequence learning and describes a model with an encoder and decoder. It then explains stacking additional LSTM layers to create a more complex representation (E2D2 model). The model is trained on individual household electric power consumption data to forecast future consumption values over multiple time steps.

Uploaded by

Anish Shah

Multivariate Multi-step Time Series Forecasting using Stacked LSTM sequence to sequence Autoencoder in Tensorflow 2.0 / Keras

Overview

This article shows how to build a stacked sequence-to-sequence LSTM model for time series
forecasting in Keras / TF 2.0.

Prerequisites: The reader should already be familiar with neural networks and, in particular, recurrent neural networks (RNNs). Knowledge of LSTM or GRU models is also preferable. If you are not familiar with LSTM, I would suggest reading LSTM - Long Short-Term Memory first.

Introduction

In Sequence to Sequence Learning, an RNN model is trained to map an input sequence to an output
sequence. The input and output need not necessarily be of the same length. The seq2seq model contains
two RNNs, e.g., LSTMs. They can be treated as an encoder and decoder. The encoder part converts the
given input sequence to a fixed-length vector, which acts as a summary of the input sequence.

This fixed-length vector is called the context vector. The context vector is given as input to the decoder,
and the final encoder state is used as the initial decoder state, to predict the output sequence. Sequence to
sequence learning is used in language translation, speech recognition, time series
forecasting, etc.

We will use sequence to sequence learning for time series forecasting. This architecture makes it
easy to produce a multistep forecast. We will add two layers to the architecture: a repeat vector layer
and a time distributed dense layer.

A repeat vector layer repeats the context vector we get from the encoder so that it can be passed as input to
the decoder at every output step. We repeat it n times (n is the number of future steps you want to forecast).
The decoder returns one output vector per time step. The time distributed dense layer applies the same
fully connected dense layer to each of these time steps, producing a separate output for each one. Time
distributed is a wrapper that allows applying a layer to every temporal slice of an input.
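To make the role of these two layers concrete, here is a minimal NumPy sketch of what they do to array shapes. It does not use Keras; the names `context`, `W`, and `b` and all sizes are illustrative assumptions, and the dense step is applied directly to the repeated context for brevity (in the model it is applied to the decoder LSTM outputs).

```python
import numpy as np

rng = np.random.default_rng(0)

batch, units, n_future, n_features = 2, 4, 5, 3  # illustrative sizes

# Context vector from the encoder: one vector per sample.
context = rng.normal(size=(batch, units))

# What RepeatVector(n_future) does: copy the context once per future step.
repeated = np.repeat(context[:, np.newaxis, :], n_future, axis=1)
print(repeated.shape)  # (2, 5, 4)

# What TimeDistributed(Dense(n_features)) does: apply the same weights W
# and bias b independently to every time step.
W = rng.normal(size=(units, n_features))
b = np.zeros(n_features)
outputs = repeated @ W + b
print(outputs.shape)  # (2, 5, 3)
```

Because the same `W` and `b` are shared across time steps, the number of dense parameters does not grow with the forecast horizon.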

We will stack additional layers on the encoder part and the decoder part of the sequence to sequence
model. Stacking LSTMs may increase the ability of our model to learn more complex
representations of our time-series data in its hidden layers, by capturing information at different levels.

Code

The data used is the Individual household electric power consumption dataset. You can download the dataset from this link.

Importing Libraries

    import pandas as pd
    import numpy as np
    from sklearn.preprocessing import MinMaxScaler
    import matplotlib.pyplot as plt
    import tensorflow as tf
    import os

Now load the dataset into a pandas data frame.

    df = pd.read_csv(r'household_power_consumption.txt', sep=';', header=0,
                     low_memory=False, infer_datetime_format=True,
                     parse_dates={'datetime': [0, 1]}, index_col=['datetime'])
    df.head()

Imputing Null Values

    # Missing readings are marked with '?' in the raw file.
    df = df.replace('?', np.nan)
    df.isnull().sum()

Now we will create a function that imputes missing values by replacing each one with the value
recorded at the same time on the previous day.

    def fill_missing(values):
        one_day = 60 * 24  # number of one-minute readings in a day
        for row in range(values.shape[0]):
            for col in range(values.shape[1]):
                if np.isnan(values[row, col]):
                    values[row, col] = values[row - one_day, col]

    df = df.astype('float32')
    fill_missing(df.values)
    df.isnull().sum()
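The previous-day rule is easier to check on a toy array. The sketch below uses a hypothetical `period` argument (not in the original function) so that a "day" can be 3 rows instead of 60 * 24.

```python
import numpy as np

def fill_missing(values, period):
    """Replace each NaN with the value `period` rows earlier, in place."""
    for row in range(values.shape[0]):
        for col in range(values.shape[1]):
            if np.isnan(values[row, col]):
                values[row, col] = values[row - period, col]

# Toy series with a "day" of 3 rows instead of 60 * 24 minutes.
data = np.array([[1.0], [2.0], [3.0],
                 [np.nan], [5.0], [np.nan]], dtype="float32")
fill_missing(data, period=3)
print(data.ravel())  # [1. 2. 3. 1. 5. 3.]
```

Note that a NaN within the first `period` rows produces a negative index, which NumPy resolves from the end of the array, so a gap on the first day would be filled from the last day of the data.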

Downsampling of Data from minutes to Days

There are more than 2 million observations recorded, one per minute. Let's make the data simpler by
downsampling it from a frequency of minutes to days.

    daily_df = df.resample('D').sum()
    daily_df.head()
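As a rough illustration of what a daily sum does, the NumPy sketch below groups one-minute rows into days and sums them. It assumes complete days that start at midnight and uses toy sizes; the pandas resample call also handles partial days and calendar boundaries, which this sketch does not.

```python
import numpy as np

minutes_per_day, n_days, n_features = 1440, 3, 2  # toy sizes

# One row per minute; every reading is 1.0 so the sums are easy to check.
minute_values = np.ones((n_days * minutes_per_day, n_features))

# Group rows into (day, minute, feature) and sum over the minute axis.
daily_values = minute_values.reshape(n_days, minutes_per_day, n_features).sum(axis=1)

print(daily_values.shape)  # (3, 2)
print(daily_values[0])     # [1440. 1440.]
```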

Train - Test Split

After downsampling, the number of instances is 1442. We will split the dataset into train and test data in a
75% / 25% ratio of the instances (0.75 × 1442 ≈ 1081).

    train_df, test_df = daily_df[1:1081], daily_df[1081:]

Scaling the values

The columns in the data frame are on different scales. We will scale each column to the range -1 to 1 for
faster training of the models. Each scaler is fitted on the training data only and then reused for the test data.

    train = train_df.copy()
    scalers = {}
    for i in train_df.columns:
        scaler = MinMaxScaler(feature_range=(-1, 1))
        s_s = scaler.fit_transform(train[i].values.reshape(-1, 1))
        s_s = np.reshape(s_s, len(s_s))
        scalers['scaler_' + i] = scaler
        train[i] = s_s

    test = test_df.copy()
    for i in train_df.columns:
        scaler = scalers['scaler_' + i]  # reuse the scaler fitted on the training data
        s_s = scaler.transform(test[i].values.reshape(-1, 1))
        s_s = np.reshape(s_s, len(s_s))
        test[i] = s_s
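The arithmetic behind scaling to the range (-1, 1) can be sketched in plain NumPy. The helper names `fit_min_max`, `scale`, and `unscale` are hypothetical and only illustrate the idea; they are not part of scikit-learn.

```python
import numpy as np

def fit_min_max(x):
    """Return the (min, max) of the training values."""
    return float(x.min()), float(x.max())

def scale(x, lo, hi):
    """Map [lo, hi] linearly onto [-1, 1]."""
    return 2.0 * (x - lo) / (hi - lo) - 1.0

def unscale(x_scaled, lo, hi):
    """Invert scale(): map [-1, 1] back to [lo, hi]."""
    return (x_scaled + 1.0) / 2.0 * (hi - lo) + lo

train_col = np.array([10.0, 20.0, 30.0])
test_col = np.array([15.0, 40.0])  # 40 lies outside the training range

lo, hi = fit_min_max(train_col)   # statistics come from the training data only
print(scale(train_col, lo, hi))   # [-1.  0.  1.]
print(scale(test_col, lo, hi))    # [-0.5  2. ]
print(unscale(scale(test_col, lo, hi), lo, hi))  # [15. 40.]
```

Because the minimum and maximum come from the training data, test values outside that range map outside (-1, 1), as the value 40 does here.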

Converting the series to samples

Now we will write a function that uses a sliding window to transform our series into samples, each made of
past observations as input and future observations as output, so that supervised learning algorithms can be used.

    def split_series(series, n_past, n_future):
        # n_past ==> number of past observations
        # n_future ==> number of future observations
        X, y = list(), list()
        for window_start in range(len(series)):
            past_end = window_start + n_past
            future_end = past_end + n_future
            if future_end > len(series):
                break
            # slicing the past and future parts of the window
            past, future = series[window_start:past_end, :], series[past_end:future_end, :]
            X.append(past)
            y.append(future)
        return np.array(X), np.array(y)

For this case, let's assume that, given the observations of the past 10 days, we need to forecast the
observations of the next 5 days.

    n_past = 10
    n_future = 5
    n_features = 7
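As a quick check of the windowing logic, the sketch below runs a copy of split_series on a toy array of 20 rows (the array `toy` is an illustrative assumption, not the real dataset) and prints the resulting shapes.

```python
import numpy as np

def split_series(series, n_past, n_future):
    """Slide a window over `series` and return (past, future) sample arrays."""
    X, y = [], []
    for window_start in range(len(series)):
        past_end = window_start + n_past
        future_end = past_end + n_future
        if future_end > len(series):
            break
        X.append(series[window_start:past_end, :])
        y.append(series[past_end:future_end, :])
    return np.array(X), np.array(y)

n_past, n_future, n_features = 10, 5, 7
toy = np.arange(20 * n_features, dtype="float32").reshape(20, n_features)

X, y = split_series(toy, n_past, n_future)
print(X.shape)  # (6, 10, 7)
print(y.shape)  # (6, 5, 7)

# The first future window starts right where the first past window ends.
print(y[0, 0, 0] == toy[n_past, 0])  # True
```

In general, a series with N rows yields N - n_past - n_future + 1 samples, so 20 rows give 6 samples here.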

Now convert both the train and test data into samples using the split_series function.

    X_train, y_train = split_series(train.values, n_past, n_future)
    X_train = X_train.reshape((X_train.shape[0], X_train.shape[1], n_features))
    y_train = y_train.reshape((y_train.shape[0], y_train.shape[1], n_features))

    X_test, y_test = split_series(test.values, n_past, n_future)
    X_test = X_test.reshape((X_test.shape[0], X_test.shape[1], n_features))
    y_test = y_test.reshape((y_test.shape[0], y_test.shape[1], n_features))

Model Architecture

Now we will create two models with the architectures described below.

E1D1 ==> Sequence to Sequence Model with one encoder layer and one decoder layer.

    # E1D1
    # n_features ==> number of features at each timestep in the data.

    # Encoder: a single LSTM that also returns its final hidden and cell states.
    encoder_inputs = tf.keras.layers.Input(shape=(n_past, n_features))
    encoder_l1 = tf.keras.layers.LSTM(100, return_state=True)
    encoder_outputs1 = encoder_l1(encoder_inputs)
    encoder_states1 = encoder_outputs1[1:]

    # Repeat the encoder output (the context vector) once per future step.
    decoder_inputs = tf.keras.layers.RepeatVector(n_future)(encoder_outputs1[0])

    # Decoder: initialised with the final encoder states.
    decoder_l1 = tf.keras.layers.LSTM(100, return_sequences=True)(decoder_inputs, initial_state=encoder_states1)
    decoder_outputs1 = tf.keras.layers.TimeDistributed(tf.keras.layers.Dense(n_features))(decoder_l1)

    model_e1d1 = tf.keras.models.Model(encoder_inputs, decoder_outputs1)
    model_e1d1.summary()

E2D2 ==> Sequence to Sequence Model with two encoder layers and two decoder layers.

    # E2D2
    # n_features ==> number of features at each timestep in the data.

    # Encoder: two stacked LSTMs; the first returns the full sequence for the second.
    encoder_inputs = tf.keras.layers.Input(shape=(n_past, n_features))
    encoder_l1 = tf.keras.layers.LSTM(100, return_sequences=True, return_state=True)
    encoder_outputs1 = encoder_l1(encoder_inputs)
    encoder_states1 = encoder_outputs1[1:]
    encoder_l2 = tf.keras.layers.LSTM(100, return_state=True)
    encoder_outputs2 = encoder_l2(encoder_outputs1[0])
    encoder_states2 = encoder_outputs2[1:]

    # Repeat the output of the last encoder layer once per future step.
    decoder_inputs = tf.keras.layers.RepeatVector(n_future)(encoder_outputs2[0])

    # Decoder: each layer is initialised with the states of the matching encoder layer.
    decoder_l1 = tf.keras.layers.LSTM(100, return_sequences=True)(decoder_inputs, initial_state=encoder_states1)
    decoder_l2 = tf.keras.layers.LSTM(100, return_sequences=True)(decoder_l1, initial_state=encoder_states2)
    decoder_outputs2 = tf.keras.layers.TimeDistributed(tf.keras.layers.Dense(n_features))(decoder_l2)

    model_e2d2 = tf.keras.models.Model(encoder_inputs, decoder_outputs2)
    model_e2d2.summary()

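
One way to see how stacking changes model size is to count trainable parameters by hand. The sketch below assumes standard LSTM layers with a bias term and 100 units, as in the two models; the helper names `lstm_params` and `dense_params` are hypothetical, and the totals should correspond to what the model summaries report.

```python
def lstm_params(input_dim, units):
    """Trainable parameters of one LSTM layer: 4 gates, each with
    input weights, recurrent weights and a bias."""
    return 4 * (units * (input_dim + units) + units)

def dense_params(input_dim, units):
    """Trainable parameters of a dense layer (weights plus bias)."""
    return input_dim * units + units

n_features, units = 7, 100

e1d1 = (
    lstm_params(n_features, units)      # encoder LSTM
    + lstm_params(units, units)         # decoder LSTM (input: repeated context)
    + dense_params(units, n_features)   # time distributed dense output
)

e2d2 = (
    lstm_params(n_features, units)      # encoder layer 1
    + lstm_params(units, units)         # encoder layer 2
    + lstm_params(units, units)         # decoder layer 1
    + lstm_params(units, units)         # decoder layer 2
    + dense_params(units, n_features)   # time distributed dense output
)

print(e1d1)  # 124307
print(e2d2)  # 285107
```

Under these assumptions the stacked model has a little more than twice as many parameters, which is one reason it can overfit more easily on a small dataset.
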
Training the models

I have used the Adam optimizer with Huber loss as the loss function, together with a learning rate schedule
that decays the rate by 10% each epoch. Let's compile and train the models.

    reduce_lr = tf.keras.callbacks.LearningRateScheduler(lambda x: 1e-3 * 0.90 ** x)

    model_e1d1.compile(optimizer=tf.keras.optimizers.Adam(), loss=tf.keras.losses.Huber())
    history_e1d1 = model_e1d1.fit(X_train, y_train, epochs=25, validation_data=(X_test, y_test),
                                  batch_size=32, verbose=0, callbacks=[reduce_lr])

    model_e2d2.compile(optimizer=tf.keras.optimizers.Adam(), loss=tf.keras.losses.Huber())
    history_e2d2 = model_e2d2.fit(X_train, y_train, epochs=25, validation_data=(X_test, y_test),
                                  batch_size=32, verbose=0, callbacks=[reduce_lr])
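The two training ingredients can be inspected without Keras. The sketch below evaluates the same decay rule as the scheduler lambda and implements a plain NumPy version of Huber loss, assuming a threshold `delta` of 1.0; the function names are illustrative.

```python
import numpy as np

def learning_rate(epoch):
    """Same rule as the scheduler lambda: 1e-3 decayed by 10% per epoch."""
    return 1e-3 * 0.90 ** epoch

for epoch in (0, 1, 10, 24):
    print(epoch, f"{learning_rate(epoch):.6f}")

def huber(y_true, y_pred, delta=1.0):
    """Huber loss: quadratic for small errors, linear for large ones."""
    error = np.abs(y_true - y_pred)
    quadratic = 0.5 * error ** 2
    linear = delta * (error - 0.5 * delta)
    return np.mean(np.where(error <= delta, quadratic, linear))

print(huber(np.array([0.0, 0.0]), np.array([0.5, 3.0])))  # 1.3125
```

With 25 epochs (indices 0 to 24), the rate falls from 1e-3 to roughly 8e-5. The Huber example shows why the loss is less sensitive to outliers than squared error: the error of 3.0 contributes 2.5 rather than 4.5.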

Prediction on test samples

    pred_e1d1 = model_e1d1.predict(X_test)
    pred_e2d2 = model_e2d2.predict(X_test)

Inverse Scaling of the predicted values

Now we will convert the predictions back to their original scale.

    for index, i in enumerate(train_df.columns):
        scaler = scalers['scaler_' + i]
        pred_e1d1[:, :, index] = scaler.inverse_transform(pred_e1d1[:, :, index])
        pred_e2d2[:, :, index] = scaler.inverse_transform(pred_e2d2[:, :, index])
        y_train[:, :, index] = scaler.inverse_transform(y_train[:, :, index])
        y_test[:, :, index] = scaler.inverse_transform(y_test[:, :, index])

Checking Error

Now we will calculate the mean absolute error for each feature and each forecast day.

    from sklearn.metrics import mean_absolute_error

    for index, i in enumerate(train_df.columns):
        print(i)
        for j in range(1, 6):
            print("Day ", j, ":")
            print("MAE-E1D1 : ", mean_absolute_error(y_test[:, j-1, index], pred_e1d1[:, j-1, index]), end=", ")
            print("MAE-E2D2 : ", mean_absolute_error(y_test[:, j-1, index], pred_e2d2[:, j-1, index]))
        print()
        print()
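The same per-day, per-feature errors can also be computed in one step by averaging over the sample axis. The sketch below uses toy arrays (`y_true` and `y_pred` are illustrative stand-ins for the test targets and a model's predictions).

```python
import numpy as np

# Toy arrays shaped (samples, n_future, n_features), like y_test and the predictions.
y_true = np.zeros((4, 5, 2))
y_pred = np.ones((4, 5, 2))
y_pred[:, :, 1] = 3.0  # second feature is off by 3 everywhere

# Average the absolute error over the sample axis only.
mae = np.mean(np.abs(y_true - y_pred), axis=0)

print(mae.shape)  # (5, 2): one value per forecast day and feature
print(mae[0])     # [1. 3.]
```

Each row of `mae` is a forecast day and each column a feature, which makes it easy to see whether the error grows with the forecast horizon.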

From the above output, we can observe that, in some cases, the E2D2 model performs better than the
E1D1 model, with lower error. Training several models with different numbers of stacked layers and
combining them into an ensemble may also perform well.

Note: The results vary with respect to the dataset. If we stack more layers, it may also lead to overfitting. So
the number of layers to be stacked acts as a hyperparameter.

Links

Here’s the link for the code.

Conclusion

Congratulations, you have learned how to implement multivariate multi-step time series forecasting using
TF 2.0 / Keras. This is my first attempt at writing a blog. So please share your opinion in the comments
section below.

Thanks for reading.


Article Url - [Link]

forecasting-using-stacked-lstm-sequence-to-sequence-autoencoder-in-tensorflow-2-0-keras/

Jagadeesh23
