
Time Series Forecasting: Kick-Start Your Project With My New Book

The document describes how to develop convolutional neural network (CNN) models for different types of time series forecasting problems. It provides tutorials on developing CNN models for univariate time series forecasting, multivariate time series forecasting, multi-step time series forecasting, and combinations of multivariate and multi-step forecasting. The tutorials include examples of data preparation, defining CNN model architectures, training the models, and making predictions. The overall goal is to provide templates for applying CNN models to different time series forecasting tasks.

Uploaded by

Waqas Hameed


Convolutional Neural Network models, or CNNs for short, can be applied to time series forecasting.

There are many types of CNN models that can be used for each specific type of time series forecasting problem.

In this tutorial, you will discover how to develop a suite of CNN models for a range of
standard time series forecasting problems.

The objective of this tutorial is to provide standalone examples of each model on each type
of time series problem as a template that you can copy and adapt for your specific time
series forecasting problem.

After completing this tutorial, you will know:

- How to develop CNN models for univariate time series forecasting.
- How to develop CNN models for multivariate time series forecasting.
- How to develop CNN models for multi-step time series forecasting.

This is a large and important post; you may want to bookmark it for future reference.

Let’s get started.
How to Develop Convolutional Neural Network Models for Time Series Forecasting


Tutorial Overview
In this tutorial, we will explore how to develop a suite of different types of CNN models for
time series forecasting.

The models are demonstrated on small contrived time series problems intended to give the
flavor of the type of time series problem being addressed. The chosen configuration of the
models is arbitrary and not optimized for each problem; that was not the goal.

This tutorial is divided into four parts; they are:

1. Univariate CNN Models
2. Multivariate CNN Models
3. Multi-Step CNN Models
4. Multivariate Multi-Step CNN Models

Univariate CNN Models
Although traditionally developed for two-dimensional image data, CNNs can be used to
model univariate time series forecasting problems.

A univariate time series is a dataset composed of a single series of observations with a
temporal ordering. A model is required to learn from the series of past observations to
predict the next value in the sequence.

This section is divided into two parts; they are:

1. Data Preparation
2. CNN Model

Data Preparation
Before a univariate series can be modeled, it must be prepared.

The CNN model will learn a function that maps a sequence of past observations as input to
an output observation. As such, the sequence of observations must be transformed into
multiple examples from which the model can learn.

Consider a given univariate sequence:

[10, 20, 30, 40, 50, 60, 70, 80, 90]

We can divide the sequence into multiple input/output patterns called samples, where three
time steps are used as input and one time step is used as output for the one-step prediction
that is being learned.

X,            y
10, 20, 30    40
20, 30, 40    50
30, 40, 50    60
...

The split_sequence() function below implements this behavior and will split a given
univariate sequence into multiple samples where each sample has a specified number of
time steps and the output is a single time step.

# split a univariate sequence into samples
def split_sequence(sequence, n_steps):
    X, y = list(), list()
    for i in range(len(sequence)):
        # find the end of this pattern
        end_ix = i + n_steps
        # check if we are beyond the sequence
        if end_ix > len(sequence)-1:
            break
        # gather input and output parts of the pattern
        seq_x, seq_y = sequence[i:end_ix], sequence[end_ix]
        X.append(seq_x)
        y.append(seq_y)
    return array(X), array(y)

We can demonstrate this function on our small contrived dataset above.

The complete example is listed below.

# univariate data preparation
from numpy import array

# split a univariate sequence into samples
def split_sequence(sequence, n_steps):
    X, y = list(), list()
    for i in range(len(sequence)):
        # find the end of this pattern
        end_ix = i + n_steps
        # check if we are beyond the sequence
        if end_ix > len(sequence)-1:
            break
        # gather input and output parts of the pattern
        seq_x, seq_y = sequence[i:end_ix], sequence[end_ix]
        X.append(seq_x)
        y.append(seq_y)
    return array(X), array(y)

# define input sequence
raw_seq = [10, 20, 30, 40, 50, 60, 70, 80, 90]
# choose a number of time steps
n_steps = 3
# split into samples
X, y = split_sequence(raw_seq, n_steps)
# summarize the data
for i in range(len(X)):
    print(X[i], y[i])

Running the example splits the univariate series into six samples where each sample has
three input time steps and one output time step.

[10 20 30] 40
[20 30 40] 50
[30 40 50] 60
[40 50 60] 70
[50 60 70] 80
[60 70 80] 90
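
The same six samples can also be produced without an explicit loop. The sketch below is an alternative to the tutorial's split_sequence(), not a replacement for it; the name split_sequence_vectorized() is made up for illustration, and it assumes NumPy 1.20 or later, where sliding_window_view() is available.

```python
import numpy as np
from numpy.lib.stride_tricks import sliding_window_view

def split_sequence_vectorized(sequence, n_steps):
    """Return (X, y) with the same contents as the loop-based split_sequence()."""
    seq = np.asarray(sequence)
    # each window holds n_steps inputs followed by the single target value
    windows = sliding_window_view(seq, n_steps + 1)
    # copy so the results are ordinary writable arrays rather than read-only views
    return windows[:, :-1].copy(), windows[:, -1].copy()

raw_seq = [10, 20, 30, 40, 50, 60, 70, 80, 90]
X, y = split_sequence_vectorized(raw_seq, 3)
print(X.shape, y.shape)
for x_row, y_val in zip(X, y):
    print(x_row, y_val)
```

For a sequence this short the loop is just as good; the windowed version mainly helps readability and speed on long series.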

Now that we know how to prepare a univariate series for modeling, let’s look at developing
a CNN model that can learn the mapping of inputs to outputs.


CNN Model
A one-dimensional CNN is a CNN model that has a convolutional hidden layer that operates
over a 1D sequence. In some cases, such as very long input sequences, this may be followed
by a second convolutional layer, and then by a pooling layer whose job is to distill the
output of the convolutional layer to its most salient elements.

The convolutional and pooling layers are followed by a dense fully connected layer that
interprets the features extracted by the convolutional part of the model. A flatten layer is
used between the convolutional layers and the dense layer to reduce the feature maps to a
single one-dimensional vector.

We can define a 1D CNN Model for univariate time series forecasting as follows.

# define model
model = Sequential()
model.add(Conv1D(filters=64, kernel_size=2, activation='relu', input_shape=(n_steps, n_features)))
model.add(MaxPooling1D(pool_size=2))
model.add(Flatten())
model.add(Dense(50, activation='relu'))
model.add(Dense(1))
model.compile(optimizer='adam', loss='mse')

Key in the definition is the shape of the input; that is what the model expects as input for
each sample in terms of the number of time steps and the number of features.

We are working with a univariate series, so the number of features is one, for one variable.

The number of time steps as input is the number we chose when preparing our dataset as
an argument to the split_sequence() function.

The input shape for each sample is specified in the input_shape argument on the definition
of the first hidden layer.

We almost always have multiple samples; therefore, the model will expect the input
component of training data to have the dimensions or shape:

[samples, timesteps, features]

Our split_sequence() function in the previous section outputs the X with the shape
[samples, timesteps], so we can easily reshape it to have an additional dimension for the
one feature.

# reshape from [samples, timesteps] into [samples, timesteps, features]
n_features = 1
X = X.reshape((X.shape[0], X.shape[1], n_features))

The CNN does not actually view the data as having time steps. Instead, the data is treated
as a sequence over which convolutional read operations can be performed, like a
one-dimensional image.

In this example, we define a convolutional layer with 64 filter maps and a kernel size of 2.
This is followed by a max pooling layer and a dense layer to interpret the input features. An
output layer is specified that predicts a single numerical value.
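
To make the layer shapes concrete, here is a rough NumPy-only sketch of what the convolution, pooling, and flatten steps do to a single sample. It is not how Keras implements these layers: the weights are random stand-ins, and it assumes a stride of 1 with no padding for the convolution and non-overlapping pooling windows, which are the Keras defaults.

```python
import numpy as np

rng = np.random.default_rng(0)
n_steps, n_features, n_filters, kernel_size, pool_size = 3, 1, 64, 2, 2

# one input sample with shape [timesteps, features]
x = np.array([10.0, 20.0, 30.0]).reshape(n_steps, n_features)
# random stand-in weights; a trained model would have learned these
kernels = rng.normal(size=(kernel_size, n_features, n_filters))
bias = np.zeros(n_filters)

# convolution: the kernel is read across n_steps - kernel_size + 1 positions
conv_len = n_steps - kernel_size + 1
conv = np.empty((conv_len, n_filters))
for t in range(conv_len):
    window = x[t:t + kernel_size]  # shape (kernel_size, n_features)
    conv[t] = np.tensordot(window, kernels, axes=([0, 1], [0, 1])) + bias
conv = np.maximum(conv, 0.0)  # relu activation

# max pooling: keep the largest value in each block of pool_size positions
pool_len = conv_len // pool_size
pooled = conv[:pool_len * pool_size].reshape(pool_len, pool_size, n_filters).max(axis=1)

# flatten: one long vector that a dense layer can interpret
flat = pooled.reshape(-1)
print(conv.shape, pooled.shape, flat.shape)
```

With three time steps and a kernel size of 2, the convolution yields two positions per filter, pooling reduces these to one, and flattening leaves a vector of 64 values for the dense layer.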

The model is fit using the efficient Adam version of stochastic gradient descent and
optimized using the mean squared error, or 'mse', loss function.

Once the model is defined, we can fit it on the training dataset.

# fit model
model.fit(X, y, epochs=1000, verbose=0)
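
For reference, the 'mse' loss minimized during fitting is the mean of the squared differences between the targets and the predictions. The small sketch below computes it by hand; the predicted values are made up for illustration and do not come from the model.

```python
import numpy as np

def mse(y_true, y_pred):
    """Mean squared error between two equal-length sequences."""
    y_true = np.asarray(y_true, dtype=float)
    y_pred = np.asarray(y_pred, dtype=float)
    return float(np.mean((y_true - y_pred) ** 2))

y_true = [40, 50, 60]
y_pred = [41, 48, 60]  # made-up predictions, for illustration only
print(mse(y_true, y_pred))
```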

After the model is fit, we can use it to make a prediction.

We can predict the next value in the sequence by providing the input:

[70, 80, 90]

And expecting the model to predict something like:

[100]

The model expects the input shape to be three-dimensional with [samples, timesteps,
features]; therefore, we must reshape the single input sample before making the prediction.

# demonstrate prediction
x_input = array([70, 80, 90])
x_input = x_input.reshape((1, n_steps, n_features))
yhat = model.predict(x_input, verbose=0)

We can tie all of this together and demonstrate how to develop a 1D CNN model for
univariate time series forecasting and make a single prediction.

# univariate cnn example
from numpy import array
from keras.models import Sequential
from keras.layers import Dense
from keras.layers import Flatten
# Conv1D and MaxPooling1D are imported from keras.layers; the older
# keras.layers.convolutional path is not available in recent Keras releases
from keras.layers import Conv1D
from keras.layers import MaxPooling1D

# split a univariate sequence into samples
def split_sequence(sequence, n_steps):
    X, y = list(), list()
    for i in range(len(sequence)):
        # find the end of this pattern
        end_ix = i + n_steps
        # check if we are beyond the sequence
        if end_ix > len(sequence)-1:
            break
        # gather input and output parts of the pattern
        seq_x, seq_y = sequence[i:end_ix], sequence[end_ix]
        X.append(seq_x)
        y.append(seq_y)
    return array(X), array(y)

# define input sequence
raw_seq = [10, 20, 30, 40, 50, 60, 70, 80, 90]
# choose a number of time steps
n_steps = 3
# split into samples
X, y = split_sequence(raw_seq, n_steps)
# reshape from [samples, timesteps] into [samples, timesteps, features]
n_features = 1
X = X.reshape((X.shape[0], X.shape[1], n_features))
# define model
model = Sequential()
model.add(Conv1D(filters=64, kernel_size=2, activation='relu', input_shape=(n_steps, n_features)))
model.add(MaxPooling1D(pool_size=2))
model.add(Flatten())
model.add(Dense(50, activation='relu'))
model.add(Dense(1))
model.compile(optimizer='adam', loss='mse')
# fit model
model.fit(X, y, epochs=1000, verbose=0)
# demonstrate prediction
x_input = array([70, 80, 90])
x_input = x_input.reshape((1, n_steps, n_features))
yhat = model.predict(x_input, verbose=0)
print(yhat)

Running the example prepares the data, fits the model, and makes a prediction.

Note: Your results may vary given the stochastic nature of the algorithm or evaluation
procedure, or differences in numerical precision. Consider running the example a few times
and comparing the average outcome.

We can see that the model predicts the next value in the sequence.

[[101.67965]]
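
The note's advice to run the example several times and compare the average can be wrapped in a small helper. This is only a sketch: average_over_runs() and simulated_run() are hypothetical names, and simulated_run() merely draws a noisy number near 100 as a stand-in for building, fitting, and predicting with the model. In practice you would replace it with a function that performs those steps and returns yhat as a float.

```python
import numpy as np

def average_over_runs(run_once, n_repeats=5):
    """Call run_once(seed) n_repeats times; return the mean and std of the results."""
    results = np.array([run_once(seed) for seed in range(n_repeats)], dtype=float)
    return results.mean(), results.std()

def simulated_run(seed):
    # stand-in for "define, fit, and predict"; this is NOT a real model
    rng = np.random.default_rng(seed)
    return 100.0 + rng.normal(scale=1.5)

mean_pred, std_pred = average_over_runs(simulated_run, n_repeats=10)
print('mean=%.2f std=%.2f' % (mean_pred, std_pred))
```

Reporting the spread alongside the mean gives a sense of how much a single run can be trusted.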

Multivariate CNN Models

Multivariate time series data means data where there is more than one observation for each
time step.

There are two main models that we may require with multivariate time series data; they are:

1. Multiple Input Series.
2. Multiple Parallel Series.

Let’s take a look at each in turn.

Multiple Input Series

A problem may have two or more parallel input time series and an output time series that is
dependent on the input time series.

The input time series are parallel because each series has observations at the same time
steps.

We can demonstrate this with a simple example of two parallel input time series where the
output series is the simple addition of the input series.

# define input sequence
in_seq1 = array([10, 20, 30, 40, 50, 60, 70, 80, 90])
in_seq2 = array([15, 25, 35, 45, 55, 65, 75, 85, 95])
out_seq = array([in_seq1[i]+in_seq2[i] for i in range(len(in_seq1))])

We can reshape these three arrays of data as a single dataset where each row is a time
step and each column is a separate time series.

This is a standard way of storing parallel time series in a CSV file.

# convert to [rows, columns] structure
in_seq1 = in_seq1.reshape((len(in_seq1), 1))
in_seq2 = in_seq2.reshape((len(in_seq2), 1))
out_seq = out_seq.reshape((len(out_seq), 1))
# horizontally stack columns
dataset = hstack((in_seq1, in_seq2, out_seq))

The complete example is listed below.

# multivariate data preparation
from numpy import array
from numpy import hstack

# define input sequence
in_seq1 = array([10, 20, 30, 40, 50, 60, 70, 80, 90])
in_seq2 = array([15, 25, 35, 45, 55, 65, 75, 85, 95])
out_seq = array([in_seq1[i]+in_seq2[i] for i in range(len(in_seq1))])
# convert to [rows, columns] structure
in_seq1 = in_seq1.reshape((len(in_seq1), 1))
in_seq2 = in_seq2.reshape((len(in_seq2), 1))
out_seq = out_seq.reshape((len(out_seq), 1))
# horizontally stack columns
dataset = hstack((in_seq1, in_seq2, out_seq))
print(dataset)

Running the example prints the dataset with one row per time step and one column for each
of the two input and one output parallel time series.

[[ 10  15  25]
 [ 20  25  45]
 [ 30  35  65]
 [ 40  45  85]
 [ 50  55 105]
 [ 60  65 125]
 [ 70  75 145]
 [ 80  85 165]
 [ 90  95 185]]
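
Because this row-per-time-step layout matches how parallel series are commonly stored in a CSV file, a dataset in the same shape can be read directly with NumPy. The sketch below is an illustration only: the column names and the in-memory csv_text are hypothetical, and in practice you would pass a file path to loadtxt() instead.

```python
import io
import numpy as np

# hypothetical CSV contents: one row per time step, one column per series
csv_text = """in1,in2,out
10,15,25
20,25,45
30,35,65
"""

# skiprows=1 skips the header line; the result has shape [rows, columns]
dataset = np.loadtxt(io.StringIO(csv_text), delimiter=',', skiprows=1)
print(dataset.shape)

# the last column is the output series, the others are the input series
inputs, target = dataset[:, :-1], dataset[:, -1]
```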

As with the univariate time series, we must structure these data into samples with input and
output samples.

A 1D CNN model needs sufficient context to learn a mapping from an input sequence to an
output value. CNNs can support parallel input time series as separate channels, like red,
green, and blue components of an image. Therefore, we need to split the data into samples
maintaining the order of observations across the two input sequences.

If we chose three input time steps, then the first sample would look as follows:

Input:

10, 15
20, 25
30, 35

Output:

65

That is, the first three time steps of each parallel series are provided as input to the model
and the model associates this with the value in the output series at the third time step, in
this case, 65.

In transforming the time series into input/output samples to train the model, we will have
to discard some values from the output time series where we do not have values in the
input time series at prior time steps. In turn, the number of input time steps chosen will
have an important effect on how much of the training data is used.
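
The effect of the window size can be quantified with a little arithmetic. The sketch below assumes, as in this section, that the target is taken from the same row as the last input time step, so a dataset with n_rows rows yields n_rows - n_steps + 1 samples; the helper name n_samples() is made up for illustration.

```python
def n_samples(n_rows, n_steps):
    """Samples produced when the target sits on the window's last row."""
    return max(n_rows - n_steps + 1, 0)

n_rows = 9
for n_steps in (2, 3, 5, 9):
    usable = n_samples(n_rows, n_steps)
    # output values with too few prior input rows are never used as targets
    discarded = n_rows - usable
    print('n_steps=%d samples=%d discarded_targets=%d' % (n_steps, usable, discarded))
```

For the nine-row dataset and three time steps, this gives seven samples, with the first two output values discarded.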

We can define a function named split_sequences() that will take a dataset as we have
defined it with rows for time steps and columns for parallel series and return input/output
samples.

# split a multivariate sequence into samples
def split_sequences(sequences, n_steps):
    X, y = list(), list()
    for i in range(len(sequences)):
        # find the end of this pattern
        end_ix = i + n_steps
        # check if we are beyond the dataset
        if end_ix > len(sequences):
            break
        # gather input and output parts of the pattern
        seq_x, seq_y = sequences[i:end_ix, :-1], sequences[end_ix-1, -1]
        X.append(seq_x)
        y.append(seq_y)
    return array(X), array(y)

We can test this function on our dataset using three time steps for each input time series as
input.

The complete example is listed below.

# multivariate data preparation
from numpy import array
from numpy import hstack

# split a multivariate sequence into samples
def split_sequences(sequences, n_steps):
    X, y = list(), list()
    for i in range(len(sequences)):
        # find the end of this pattern
        end_ix = i + n_steps
        # check if we are beyond the dataset
        if end_ix > len(sequences):
            break
        # gather input and output parts of the pattern
        seq_x, seq_y = sequences[i:end_ix, :-1], sequences[end_ix-1, -1]
        X.append(seq_x)
        y.append(seq_y)
    return array(X), array(y)

# define input sequence
in_seq1 = array([10, 20, 30, 40, 50, 60, 70, 80, 90])
in_seq2 = array([15, 25, 35, 45, 55, 65, 75, 85, 95])
out_seq = array([in_seq1[i]+in_seq2[i] for i in range(len(in_seq1))])
# convert to [rows, columns] structure
in_seq1 = in_seq1.reshape((len(in_seq1), 1))
in_seq2 = in_seq2.reshape((len(in_seq2), 1))
out_seq = out_seq.reshape((len(out_seq), 1))
# horizontally stack columns
dataset = hstack((in_seq1, in_seq2, out_seq))
# choose a number of time steps
n_steps = 3
# convert into input/output
X, y = split_sequences(dataset, n_steps)
print(X.shape, y.shape)
# summarize the data
for i in range(len(X)):
    print(X[i], y[i])

Running the example first prints the shape of the X and y components.

We can see that the X component has a three-dimensional structure.

The first dimension is the number of samples, in this case 7. The second dimension is the
number of time steps per sample, in this case 3, the value specified to the function. Finally,
the last dimension specifies the number of parallel time series or the number of variables, in
this case 2 for the two parallel series.

This is the exact three-dimensional structure expected by a 1D CNN as input. The data is
ready to use without further reshaping.

We can then see that the input and output for each sample are printed, showing the three
time steps for each of the two input series and the associated output for each sample.

(7, 3, 2) (7,)

[[10 15]
 [20 25]
 [30 35]] 65
[[20 25]
 [30 35]
 [40 45]] 85
[[30 35]
 [40 45]
 [50 55]] 105
[[40 45]
 [50 55]
 [60 65]] 125
[[50 55]
 [60 65]
 [70 75]] 145
[[60 65]
 [70 75]
 [80 85]] 165
[[70 75]
 [80 85]
 [90 95]] 185

We are now ready to fit a 1D CNN model on this data, specifying the number of time steps
and features to expect for each input sample, in this case three and two respectively.

# define model
model = Sequential()
model.add(Conv1D(filters=64, kernel_size=2, activation='relu', input_shape=(n_steps, n_features)))
model.add(MaxPooling1D(pool_size=2))
model.add(Flatten())
model.add(Dense(50, activation='relu'))
model.add(Dense(1))
model.compile(optimizer='adam', loss='mse')

When making a prediction, the model expects three time steps for two input time series.

We can predict the next value in the output series providing the input values of:

80, 85
90, 95
100, 105

The shape of the one sample with three time steps and two variables must be [1, 3, 2].

We would expect the next value in the sequence to be 100 + 105 or 205.

# demonstrate prediction
x_input = array([[80, 85], [90, 95], [100, 105]])
x_input = x_input.reshape((1, n_steps, n_features))
yhat = model.predict(x_input, verbose=0)

The complete example is listed below.

# multivariate cnn example
from numpy import array
from numpy import hstack
from keras.models import Sequential
from keras.layers import Dense
from keras.layers import Flatten
# Conv1D and MaxPooling1D are imported from keras.layers; the older
# keras.layers.convolutional path is not available in recent Keras releases
from keras.layers import Conv1D
from keras.layers import MaxPooling1D

# split a multivariate sequence into samples
def split_sequences(sequences, n_steps):
    X, y = list(), list()
    for i in range(len(sequences)):
        # find the end of this pattern
        end_ix = i + n_steps
        # check if we are beyond the dataset
        if end_ix > len(sequences):
            break
        # gather input and output parts of the pattern
        seq_x, seq_y = sequences[i:end_ix, :-1], sequences[end_ix-1, -1]
        X.append(seq_x)
        y.append(seq_y)
    return array(X), array(y)

# define input sequence
in_seq1 = array([10, 20, 30, 40, 50, 60, 70, 80, 90])
in_seq2 = array([15, 25, 35, 45, 55, 65, 75, 85, 95])
out_seq = array([in_seq1[i]+in_seq2[i] for i in range(len(in_seq1))])
# convert to [rows, columns] structure
in_seq1 = in_seq1.reshape((len(in_seq1), 1))
in_seq2 = in_seq2.reshape((len(in_seq2), 1))
out_seq = out_seq.reshape((len(out_seq), 1))
# horizontally stack columns
dataset = hstack((in_seq1, in_seq2, out_seq))
# choose a number of time steps
n_steps = 3
# convert into input/output
X, y = split_sequences(dataset, n_steps)
# the dataset knows the number of features, e.g. 2
n_features = X.shape[2]
# define model
model = Sequential()
model.add(Conv1D(filters=64, kernel_size=2, activation='relu', input_shape=(n_steps, n_features)))
model.add(MaxPooling1D(pool_size=2))
model.add(Flatten())
model.add(Dense(50, activation='relu'))
model.add(Dense(1))
model.compile(optimizer='adam', loss='mse')
# fit model
model.fit(X, y, epochs=1000, verbose=0)
# demonstrate prediction
x_input = array([[80, 85], [90, 95], [100, 105]])
x_input = x_input.reshape((1, n_steps, n_features))
yhat = model.predict(x_input, verbose=0)
print(yhat)

Note: Your results may vary given the stochastic nature of the algorithm or evaluation
procedure, or differences in numerical precision. Consider running the example a few times
and comparing the average outcome.

Running the example prepares the data, fits the model, and makes a prediction.

[[206.0161]]

There is another, more elaborate way to model the problem.

Each input series can be handled by a separate CNN and the output of each of these
submodels can be combined before a prediction is made for the output sequence.

We can refer to this as a multi-headed CNN model. It may offer more flexibility or better
performance depending on the specifics of the problem that is being modeled. For example,
it allows you to configure each sub-model differently for each input series, such as the
number of filter maps and the kernel size.

This type of model can be defined in Keras using the Keras functional API.

First, we can define the first input model as a 1D CNN with an input layer that expects
vectors with n_steps and 1 feature.

# first input model
visible1 = Input(shape=(n_steps, n_features))
cnn1 = Conv1D(filters=64, kernel_size=2, activation='relu')(visible1)
cnn1 = MaxPooling1D(pool_size=2)(cnn1)
cnn1 = Flatten()(cnn1)

We can define the second input submodel in the same way.

# second input model
visible2 = Input(shape=(n_steps, n_features))
cnn2 = Conv1D(filters=64, kernel_size=2, activation='relu')(visible2)
cnn2 = MaxPooling1D(pool_size=2)(cnn2)
cnn2 = Flatten()(cnn2)

Now that both input submodels have been defined, we can merge the output from each
model into one long vector which can be interpreted before making a prediction for the
output sequence.

# merge input models
merge = concatenate([cnn1, cnn2])
dense = Dense(50, activation='relu')(merge)
output = Dense(1)(dense)

We can then tie the inputs and outputs together.

model = Model(inputs=[visible1, visible2], outputs=output)

The image below provides a schematic for how this model looks, including the shape of the
inputs and outputs of each layer.

Plot of Multi-Headed 1D CNN for Multivariate Time Series Forecasting

This model requires input to be provided as a list of two elements where each element in
the list contains data for one of the submodels.

In order to achieve this, we can split the 3D input data into two separate arrays of input
data; that is, from one array with the shape [7, 3, 2] to two 3D arrays with the shape [7, 3, 1].

# one time series per head
n_features = 1
# separate input data
X1 = X[:, :, 0].reshape(X.shape[0], X.shape[1], n_features)
X2 = X[:, :, 1].reshape(X.shape[0], X.shape[1], n_features)
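
An equivalent way to separate the series, shown here only as an alternative sketch, is to slice the feature axis with a range so that the trailing dimension is kept and no reshape is needed. The X array below is a stand-in with the same [samples, timesteps, features] layout, filled with arbitrary numbers.

```python
import numpy as np

# stand-in array with the same [samples, timesteps, features] layout as X
X = np.arange(7 * 3 * 2).reshape(7, 3, 2)

# slicing with a range (0:1 rather than 0) keeps the feature axis
X1 = X[:, :, 0:1]
X2 = X[:, :, 1:2]
print(X1.shape, X2.shape)

# np.split generalizes this to any number of input series
heads = np.split(X, X.shape[2], axis=2)
```

The list returned by np.split() has one [samples, timesteps, 1] array per series, in column order, which is the layout a multi-headed model takes as its list of inputs.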

These data can then be provided in order to fit the model.

# fit model
model.fit([X1, X2], y, epochs=1000, verbose=0)

Similarly, we must prepare the data for a single sample as two separate arrays, each
reshaped to [1, n_steps, n_features], when making a single one-step prediction.

x_input = array([[80, 85], [90, 95], [100, 105]])
x1 = x_input[:, 0].reshape((1, n_steps, n_features))
x2 = x_input[:, 1].reshape((1, n_steps, n_features))

We can tie all of this together; the complete example is listed below.

# multivariate multi-headed 1d cnn example
from numpy import array
from numpy import hstack
from keras.models import Model
from keras.layers import Input
from keras.layers import Dense
from keras.layers import Flatten
# Conv1D, MaxPooling1D and concatenate are imported from keras.layers; the older
# keras.layers.convolutional and keras.layers.merge paths are not available in
# recent Keras releases
from keras.layers import Conv1D
from keras.layers import MaxPooling1D
from keras.layers import concatenate

# split a multivariate sequence into samples
def split_sequences(sequences, n_steps):
    X, y = list(), list()
    for i in range(len(sequences)):
        # find the end of this pattern
        end_ix = i + n_steps
        # check if we are beyond the dataset
        if end_ix > len(sequences):
            break
        # gather input and output parts of the pattern
        seq_x, seq_y = sequences[i:end_ix, :-1], sequences[end_ix-1, -1]
        X.append(seq_x)
        y.append(seq_y)
    return array(X), array(y)

# define input sequence
in_seq1 = array([10, 20, 30, 40, 50, 60, 70, 80, 90])
in_seq2 = array([15, 25, 35, 45, 55, 65, 75, 85, 95])
out_seq = array([in_seq1[i]+in_seq2[i] for i in range(len(in_seq1))])
# convert to [rows, columns] structure
in_seq1 = in_seq1.reshape((len(in_seq1), 1))
in_seq2 = in_seq2.reshape((len(in_seq2), 1))
out_seq = out_seq.reshape((len(out_seq), 1))
# horizontally stack columns
dataset = hstack((in_seq1, in_seq2, out_seq))
# choose a number of time steps
n_steps = 3
# convert into input/output
X, y = split_sequences(dataset, n_steps)
# one time series per head
n_features = 1
# separate input data
X1 = X[:, :, 0].reshape(X.shape[0], X.shape[1], n_features)
X2 = X[:, :, 1].reshape(X.shape[0], X.shape[1], n_features)
# first input model
visible1 = Input(shape=(n_steps, n_features))
cnn1 = Conv1D(filters=64, kernel_size=2, activation='relu')(visible1)
cnn1 = MaxPooling1D(pool_size=2)(cnn1)
cnn1 = Flatten()(cnn1)
# second input model
visible2 = Input(shape=(n_steps, n_features))
cnn2 = Conv1D(filters=64, kernel_size=2, activation='relu')(visible2)
cnn2 = MaxPooling1D(pool_size=2)(cnn2)
cnn2 = Flatten()(cnn2)
# merge input models
merge = concatenate([cnn1, cnn2])
dense = Dense(50, activation='relu')(merge)
output = Dense(1)(dense)
model = Model(inputs=[visible1, visible2], outputs=output)
model.compile(optimizer='adam', loss='mse')
# fit model
model.fit([X1, X2], y, epochs=1000, verbose=0)
# demonstrate prediction
x_input = array([[80, 85], [90, 95], [100, 105]])
x1 = x_input[:, 0].reshape((1, n_steps, n_features))
x2 = x_input[:, 1].reshape((1, n_steps, n_features))
yhat = model.predict([x1, x2], verbose=0)
print(yhat)

Note: Your results may vary given the stochastic nature of the algorithm or evaluation
procedure, or differences in numerical precision. Consider running the example a few times
and comparing the average outcome.

Running the example prepares the data, fits the model, and makes a prediction.

[[205.871]]

Multiple Parallel Series

An alternate time series problem is the case where there are multiple parallel time series
and a value must be predicted for each.

For example, given the data from the previous section:

[[ 10  15  25]
 [ 20  25  45]
 [ 30  35  65]
 [ 40  45  85]
 [ 50  55 105]
 [ 60  65 125]
 [ 70  75 145]
 [ 80  85 165]
 [ 90  95 185]]

We may want to predict the value for each of the three time series for the next time step.

This might be referred to as multivariate forecasting.

Again, the data must be split into input/output samples in order to train a model.

The first sample of this dataset would be:

Input:

10, 15, 25
20, 25, 45
30, 35, 65

Output:

40, 45, 85

The split_sequences() function below will split multiple parallel time series with rows for time
steps and one series per column into the required input/output shape.

# split a multivariate sequence into samples
def split_sequences(sequences, n_steps):
    X, y = list(), list()
    for i in range(len(sequences)):
        # find the end of this pattern
        end_ix = i + n_steps
        # check if we are beyond the dataset
        if end_ix > len(sequences)-1:
            break
        # gather input and output parts of the pattern
        seq_x, seq_y = sequences[i:end_ix, :], sequences[end_ix, :]
        X.append(seq_x)
        y.append(seq_y)
    return array(X), array(y)

We can demonstrate this on the contrived problem; the complete example is listed below.

# multivariate output data prep
from numpy import array
from numpy import hstack

# split a multivariate sequence into samples
def split_sequences(sequences, n_steps):
    X, y = list(), list()
    for i in range(len(sequences)):
        # find the end of this pattern
        end_ix = i + n_steps
        # check if we are beyond the dataset
        if end_ix > len(sequences)-1:
            break
        # gather input and output parts of the pattern
        seq_x, seq_y = sequences[i:end_ix, :], sequences[end_ix, :]
        X.append(seq_x)
        y.append(seq_y)
    return array(X), array(y)

# define input sequence
in_seq1 = array([10, 20, 30, 40, 50, 60, 70, 80, 90])
in_seq2 = array([15, 25, 35, 45, 55, 65, 75, 85, 95])
out_seq = array([in_seq1[i]+in_seq2[i] for i in range(len(in_seq1))])
# convert to [rows, columns] structure
in_seq1 = in_seq1.reshape((len(in_seq1), 1))
in_seq2 = in_seq2.reshape((len(in_seq2), 1))
out_seq = out_seq.reshape((len(out_seq), 1))
# horizontally stack columns
dataset = hstack((in_seq1, in_seq2, out_seq))
# choose a number of time steps
n_steps = 3
# convert into input/output
X, y = split_sequences(dataset, n_steps)
print(X.shape, y.shape)
# summarize the data
for i in range(len(X)):
    print(X[i], y[i])

Running the example first prints the shape of the prepared X and y components.

The shape of X is three-dimensional, including the number of samples (6), the number of
time steps chosen per sample (3), and the number of parallel time series or features (3).

The shape of y is two-dimensional, as we might expect: the number of samples (6) and
the number of variables per sample to be predicted (3).

The data is ready to use in a 1D CNN model that expects three-dimensional input and
two-dimensional output shapes for the X and y components of each sample.

Then, each of the samples is printed showing the input and output components of each
sample.

(6, 3, 3) (6, 3)

[[10 15 25]
 [20 25 45]
 [30 35 65]] [40 45 85]
[[20 25 45]
 [30 35 65]
 [40 45 85]] [ 50  55 105]
[[ 30  35  65]
 [ 40  45  85]
 [ 50  55 105]] [ 60  65 125]
[[ 40  45  85]
 [ 50  55 105]
 [ 60  65 125]] [ 70  75 145]
[[ 50  55 105]
 [ 60  65 125]
 [ 70  75 145]] [ 80  85 165]
[[ 60  65 125]
 [ 70  75 145]
 [ 80  85 165]] [ 90  95 185]
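
As a cross-check on these shapes, the same input/output pairs can be built without a loop. This is a sketch under the assumption that NumPy 1.20 or later is installed, since it relies on sliding_window_view(); it rebuilds the contrived dataset with column_stack() rather than the reshape and hstack() steps used in the tutorial.

```python
import numpy as np
from numpy.lib.stride_tricks import sliding_window_view

# rebuild the contrived dataset: two input series and their sum
in_seq1 = np.arange(10, 100, 10)
in_seq2 = in_seq1 + 5
dataset = np.column_stack((in_seq1, in_seq2, in_seq1 + in_seq2))

n_steps = 3
# windows along the time axis come back as [windows, features, n_steps];
# transpose to the [windows, n_steps, features] layout used in this tutorial
windows = sliding_window_view(dataset, n_steps, axis=0).transpose(0, 2, 1)
X = windows[:-1]        # the final window has no following row to predict
y = dataset[n_steps:]   # the row immediately after each window
print(X.shape, y.shape)
```

Each row of y is simply the dataset row that follows the corresponding window, which is exactly the relationship the loop-based split_sequences() encodes.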

We are now ready to fit a 1D CNN model on this data.

In this model, the number of time steps and parallel series (features) are specified for the
input layer via the input_shape argument.
The number of parallel series is also used in the specification of the number of values to
predict by the model in the output layer; again, this is three.

# define model
model = Sequential()
model.add(Conv1D(filters=64, kernel_size=2, activation='relu', input_shape=(n_steps, n_features)))
model.add(MaxPooling1D(pool_size=2))
model.add(Flatten())
model.add(Dense(50, activation='relu'))
model.add(Dense(n_features))
model.compile(optimizer='adam', loss='mse')
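To see how the three input time steps shrink as they pass through the layers above, the arithmetic can be sketched in plain Python. The two helper functions are hypothetical names introduced only for illustration, and the sketch assumes the Keras defaults used above: 'valid' padding, a convolution stride of 1, and a pooling stride equal to the pool size.

```python
def conv1d_valid_len(n_steps, kernel_size):
    # 'valid' padding with stride 1: the kernel must fit entirely inside the input
    return n_steps - kernel_size + 1

def maxpool1d_len(n_steps, pool_size):
    # non-overlapping pooling (stride equal to pool_size), no padding
    return (n_steps - pool_size) // pool_size + 1

n_steps, n_features, filters = 3, 3, 64
conv_len = conv1d_valid_len(n_steps, kernel_size=2)  # positions after Conv1D
pool_len = maxpool1d_len(conv_len, pool_size=2)      # positions after MaxPooling1D
flat_size = pool_len * filters                       # values passed to Dense(50)
print(conv_len, pool_len, flat_size)
```

With only three time steps, a single pooled position remains, so the flattened vector has one value per filter.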

We can predict the next value in each of the three parallel series by providing an input of
three time steps for each series.

70, 75, 145
80, 85, 165
90, 95, 185

The shape of the input for making a single prediction must be 1 sample, 3 time steps, and 3
features, or [1, 3, 3].

# demonstrate prediction
x_input = array([[70,75,145], [80,85,165], [90,95,185]])
x_input = x_input.reshape((1, n_steps, n_features))
yhat = model.predict(x_input, verbose=0)

We would expect the vector output to be:

[100, 105, 205]

We can tie all of this together and demonstrate a 1D CNN for multivariate output time series
forecasting below.

# multivariate output 1d cnn example
from numpy import array
from numpy import hstack
from keras.models import Sequential
from keras.layers import Dense
from keras.layers import Flatten
from keras.layers import Conv1D
from keras.layers import MaxPooling1D

# split a multivariate sequence into samples
def split_sequences(sequences, n_steps):
    X, y = list(), list()
    for i in range(len(sequences)):
        # find the end of this pattern
        end_ix = i + n_steps
        # check if we are beyond the dataset
        if end_ix > len(sequences)-1:
            break
        # gather input and output parts of the pattern
        seq_x, seq_y = sequences[i:end_ix, :], sequences[end_ix, :]
        X.append(seq_x)
        y.append(seq_y)
    return array(X), array(y)

# define input sequence
in_seq1 = array([10, 20, 30, 40, 50, 60, 70, 80, 90])
in_seq2 = array([15, 25, 35, 45, 55, 65, 75, 85, 95])
out_seq = array([in_seq1[i]+in_seq2[i] for i in range(len(in_seq1))])
# convert to [rows, columns] structure
in_seq1 = in_seq1.reshape((len(in_seq1), 1))
in_seq2 = in_seq2.reshape((len(in_seq2), 1))
out_seq = out_seq.reshape((len(out_seq), 1))
# horizontally stack columns
dataset = hstack((in_seq1, in_seq2, out_seq))
# choose a number of time steps
n_steps = 3
# convert into input/output
X, y = split_sequences(dataset, n_steps)
# the dataset knows the number of features, e.g. 3
n_features = X.shape[2]
# define model
model = Sequential()
model.add(Conv1D(filters=64, kernel_size=2, activation='relu', input_shape=(n_steps, n_features)))
model.add(MaxPooling1D(pool_size=2))
model.add(Flatten())
model.add(Dense(50, activation='relu'))
model.add(Dense(n_features))
model.compile(optimizer='adam', loss='mse')
# fit model
model.fit(X, y, epochs=3000, verbose=0)
# demonstrate prediction
x_input = array([[70,75,145], [80,85,165], [90,95,185]])
x_input = x_input.reshape((1, n_steps, n_features))
yhat = model.predict(x_input, verbose=0)
print(yhat)

Note: Your results may vary given the stochastic nature of the algorithm or evaluation
procedure, or differences in numerical precision. Consider running the example a few times
and comparing the average outcome.

Running the example prepares the data, fits the model, and makes a prediction.

[[100.11272 105.32213 205.53436]]

As with multiple input series, there is another, more elaborate way to model the problem.

Each output series can be handled by a separate output layer on top of a shared CNN.

We can refer to this as a multi-output CNN model. It may offer more flexibility or better
performance depending on the specifics of the problem that is being modeled.

This type of model can be defined in Keras using the Keras functional API.

First, we can define the input model as a 1D CNN model.

# define model
visible = Input(shape=(n_steps, n_features))
cnn = Conv1D(filters=64, kernel_size=2, activation='relu')(visible)
cnn = MaxPooling1D(pool_size=2)(cnn)
cnn = Flatten()(cnn)
cnn = Dense(50, activation='relu')(cnn)

We can then define one output layer for each of the three series that we wish to forecast,
where each output submodel will forecast a single time step.

# define output 1
output1 = Dense(1)(cnn)
# define output 2
output2 = Dense(1)(cnn)
# define output 3
output3 = Dense(1)(cnn)

We can then tie the input and output layers together into a single model.

# tie together
model = Model(inputs=visible, outputs=[output1, output2, output3])
model.compile(optimizer='adam', loss='mse')

The schematic of the model shows the three separate output layers and the input and
output shapes of each layer.

[Figure: Plot of Multi-Output 1D CNN for Multivariate Time Series Forecasting]

When training the model, it will require three separate output arrays per sample. We can
achieve this by converting the output training data that has the shape [6, 3] to three arrays
with the shape [6, 1].

# separate output
y1 = y[:, 0].reshape((y.shape[0], 1))
y2 = y[:, 1].reshape((y.shape[0], 1))
y3 = y[:, 2].reshape((y.shape[0], 1))

These arrays can be provided to the model during training.

# fit model
model.fit(X, [y1,y2,y3], epochs=2000, verbose=0)
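The column-by-column separation of the targets can also be written as a single split. The following is a minimal sketch rather than the tutorial's own code: it uses NumPy's hsplit and a stand-in y array whose values are copied from the samples printed earlier.

```python
import numpy as np

# stand-in for the prepared targets: 6 samples, 3 series
y = np.array([[40, 45, 85], [50, 55, 105], [60, 65, 125],
              [70, 75, 145], [80, 85, 165], [90, 95, 185]])

# split into one [samples, 1] array per output series
y1, y2, y3 = np.hsplit(y, y.shape[1])
print(y1.shape, y2.shape, y3.shape)
```

Each resulting array keeps its second dimension, which is the [6, 1] shape that each output layer expects.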

Tying all of this together, the complete example is listed below.

# multivariate output 1d cnn example
from numpy import array
from numpy import hstack
from keras.models import Model
from keras.layers import Input
from keras.layers import Dense
from keras.layers import Flatten
from keras.layers import Conv1D
from keras.layers import MaxPooling1D

# split a multivariate sequence into samples
def split_sequences(sequences, n_steps):
    X, y = list(), list()
    for i in range(len(sequences)):
        # find the end of this pattern
        end_ix = i + n_steps
        # check if we are beyond the dataset
        if end_ix > len(sequences)-1:
            break
        # gather input and output parts of the pattern
        seq_x, seq_y = sequences[i:end_ix, :], sequences[end_ix, :]
        X.append(seq_x)
        y.append(seq_y)
    return array(X), array(y)

# define input sequence
in_seq1 = array([10, 20, 30, 40, 50, 60, 70, 80, 90])
in_seq2 = array([15, 25, 35, 45, 55, 65, 75, 85, 95])
out_seq = array([in_seq1[i]+in_seq2[i] for i in range(len(in_seq1))])
# convert to [rows, columns] structure
in_seq1 = in_seq1.reshape((len(in_seq1), 1))
in_seq2 = in_seq2.reshape((len(in_seq2), 1))
out_seq = out_seq.reshape((len(out_seq), 1))
# horizontally stack columns
dataset = hstack((in_seq1, in_seq2, out_seq))
# choose a number of time steps
n_steps = 3
# convert into input/output
X, y = split_sequences(dataset, n_steps)
# the dataset knows the number of features, e.g. 3
n_features = X.shape[2]
# separate output
y1 = y[:, 0].reshape((y.shape[0], 1))
y2 = y[:, 1].reshape((y.shape[0], 1))
y3 = y[:, 2].reshape((y.shape[0], 1))
# define model
visible = Input(shape=(n_steps, n_features))
cnn = Conv1D(filters=64, kernel_size=2, activation='relu')(visible)
cnn = MaxPooling1D(pool_size=2)(cnn)
cnn = Flatten()(cnn)
cnn = Dense(50, activation='relu')(cnn)
# define output 1
output1 = Dense(1)(cnn)
# define output 2
output2 = Dense(1)(cnn)
# define output 3
output3 = Dense(1)(cnn)
# tie together
model = Model(inputs=visible, outputs=[output1, output2, output3])
model.compile(optimizer='adam', loss='mse')
# fit model
model.fit(X, [y1,y2,y3], epochs=2000, verbose=0)
# demonstrate prediction
x_input = array([[70,75,145], [80,85,165], [90,95,185]])
x_input = x_input.reshape((1, n_steps, n_features))
yhat = model.predict(x_input, verbose=0)
print(yhat)

Note: Your results may vary given the stochastic nature of the algorithm or evaluation
procedure, or differences in numerical precision. Consider running the example a few times
and comparing the average outcome.

Running the example prepares the data, fits the model, and makes a prediction.

[array([[100.96118]], dtype=float32),
 array([[105.502686]], dtype=float32),
 array([[205.98045]], dtype=float32)]
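Because the multi-output model returns a list with one array per output, it can be convenient to join the pieces back into a single row for comparison with the vector-output model. The sketch below is illustrative only: the yhat list is a stand-in with the values copied from the run above, not a live call to model.predict().

```python
import numpy as np

# stand-in for the list returned by the multi-output model
yhat = [np.array([[100.96118]], dtype=np.float32),
        np.array([[105.502686]], dtype=np.float32),
        np.array([[205.98045]], dtype=np.float32)]

# join the per-series outputs into one [samples, series] array
yhat_row = np.hstack(yhat)
print(yhat_row.shape)
```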

Multi-Step CNN Models

In practice, there is little difference to the 1D CNN model in predicting a vector output that
represents different output variables (as in the previous example), or a vector output that
represents multiple time steps of one variable.

Nevertheless, there are subtle and important differences in the way the training data is
prepared. In this section, we will demonstrate the case of developing a multi-step forecast
model using a vector model.

Before we look at the specifics of the model, let’s first look at the preparation of data for
multi-step forecasting.

Data Preparation
As with one-step forecasting, a time series used for multi-step time series forecasting must
be split into samples with input and output components.

Both the input and output components will be comprised of multiple time steps and may or
may not have the same number of steps.

For example, given the univariate time series:

[10, 20, 30, 40, 50, 60, 70, 80, 90]

We could use the last three time steps as input and forecast the next two time steps.

The first sample would look as follows:

Input:

[10, 20, 30]

Output:

[40, 50]

The split_sequence() function below implements this behavior and will split a given
univariate time series into samples with a specified number of input and output time steps.

# split a univariate sequence into samples
def split_sequence(sequence, n_steps_in, n_steps_out):
    X, y = list(), list()
    for i in range(len(sequence)):
        # find the end of this pattern
        end_ix = i + n_steps_in
        out_end_ix = end_ix + n_steps_out
        # check if we are beyond the sequence
        if out_end_ix > len(sequence):
            break
        # gather input and output parts of the pattern
        seq_x, seq_y = sequence[i:end_ix], sequence[end_ix:out_end_ix]
        X.append(seq_x)
        y.append(seq_y)
    return array(X), array(y)

We can demonstrate this function on the small contrived dataset.

The complete example is listed below.

# multi-step data preparation
from numpy import array

# split a univariate sequence into samples
def split_sequence(sequence, n_steps_in, n_steps_out):
    X, y = list(), list()
    for i in range(len(sequence)):
        # find the end of this pattern
        end_ix = i + n_steps_in
        out_end_ix = end_ix + n_steps_out
        # check if we are beyond the sequence
        if out_end_ix > len(sequence):
            break
        # gather input and output parts of the pattern
        seq_x, seq_y = sequence[i:end_ix], sequence[end_ix:out_end_ix]
        X.append(seq_x)
        y.append(seq_y)
    return array(X), array(y)

# define input sequence
raw_seq = [10, 20, 30, 40, 50, 60, 70, 80, 90]
# choose a number of time steps
n_steps_in, n_steps_out = 3, 2
# split into samples
X, y = split_sequence(raw_seq, n_steps_in, n_steps_out)
# summarize the data
for i in range(len(X)):
    print(X[i], y[i])

Running the example splits the univariate series into input and output time steps and prints
the input and output components of each.

[10 20 30] [40 50]
[20 30 40] [50 60]
[30 40 50] [60 70]
[40 50 60] [70 80]
[50 60 70] [80 90]
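The number of samples produced follows directly from the window sizes: each sample consumes n_steps_in + n_steps_out consecutive values. The helper below, n_samples(), is a hypothetical name introduced here to illustrate that arithmetic; it is not part of the tutorial code.

```python
def n_samples(series_len, n_steps_in, n_steps_out):
    # each sample needs n_steps_in + n_steps_out consecutive values,
    # and the window can start at every position where it still fits
    return max(series_len - n_steps_in - n_steps_out + 1, 0)

multi_step = n_samples(9, 3, 2)  # the nine-value series with 3 steps in, 2 steps out
one_step = n_samples(9, 3, 1)    # same series with a single output step
print(multi_step, one_step)
```

For the nine-value series above this gives five samples, matching the five rows printed; longer output windows leave fewer samples to train on.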

Now that we know how to prepare data for multi-step forecasting, let’s look at a 1D CNN
model that can learn this mapping.

Vector Output Model

The 1D CNN can output a vector directly that can be interpreted as a multi-step forecast.
This approach was seen in the previous section, where one time step of each output time
series was forecast as a vector.

As with the 1D CNN models for univariate data in a prior section, the prepared samples
must first be reshaped. The CNN expects data to have a three-dimensional structure of
[samples, timesteps, features], and in this case, we only have one feature so the reshape is
straightforward.

# reshape from [samples, timesteps] into [samples, timesteps, features]
n_features = 1
X = X.reshape((X.shape[0], X.shape[1], n_features))

With the number of input and output steps specified in the n_steps_in and n_steps_out
variables, we can define a multi-step time-series forecasting model.

# define model
model = Sequential()
model.add(Conv1D(filters=64, kernel_size=2, activation='relu', input_shape=(n_steps_in, n_features)))
model.add(MaxPooling1D(pool_size=2))
model.add(Flatten())
model.add(Dense(50, activation='relu'))
model.add(Dense(n_steps_out))
model.compile(optimizer='adam', loss='mse')

The model can make a prediction for a single sample. We can predict the next two steps
beyond the end of the dataset by providing the input:

[70, 80, 90]

We would expect the predicted output to be:

[100, 110]

As expected by the model, the shape of the single sample of input data when making the
prediction must be [1, 3, 1] for the 1 sample, 3 time steps of the input, and the single
feature.

# demonstrate prediction
x_input = array([70, 80, 90])
x_input = x_input.reshape((1, n_steps_in, n_features))
yhat = model.predict(x_input, verbose=0)

Tying all of this together, the 1D CNN for multi-step forecasting with a univariate time series
is listed below.

# univariate multi-step vector-output 1d cnn example
from numpy import array
from keras.models import Sequential
from keras.layers import Dense
from keras.layers import Flatten
from keras.layers import Conv1D
from keras.layers import MaxPooling1D

# split a univariate sequence into samples
def split_sequence(sequence, n_steps_in, n_steps_out):
    X, y = list(), list()
    for i in range(len(sequence)):
        # find the end of this pattern
        end_ix = i + n_steps_in
        out_end_ix = end_ix + n_steps_out
        # check if we are beyond the sequence
        if out_end_ix > len(sequence):
            break
        # gather input and output parts of the pattern
        seq_x, seq_y = sequence[i:end_ix], sequence[end_ix:out_end_ix]
        X.append(seq_x)
        y.append(seq_y)
    return array(X), array(y)

# define input sequence
raw_seq = [10, 20, 30, 40, 50, 60, 70, 80, 90]
# choose a number of time steps
n_steps_in, n_steps_out = 3, 2
# split into samples
X, y = split_sequence(raw_seq, n_steps_in, n_steps_out)
# reshape from [samples, timesteps] into [samples, timesteps, features]
n_features = 1
X = X.reshape((X.shape[0], X.shape[1], n_features))
# define model
model = Sequential()
model.add(Conv1D(filters=64, kernel_size=2, activation='relu', input_shape=(n_steps_in, n_features)))
model.add(MaxPooling1D(pool_size=2))
model.add(Flatten())
model.add(Dense(50, activation='relu'))
model.add(Dense(n_steps_out))
model.compile(optimizer='adam', loss='mse')
# fit model
model.fit(X, y, epochs=2000, verbose=0)
# demonstrate prediction
x_input = array([70, 80, 90])
x_input = x_input.reshape((1, n_steps_in, n_features))
yhat = model.predict(x_input, verbose=0)
print(yhat)

Note: Your results may vary given the stochastic nature of the algorithm or evaluation
procedure, or differences in numerical precision. Consider running the example a few times
and comparing the average outcome.

Running the example forecasts and prints the next two time steps in the sequence.

[[102.86651 115.08979]]

Multivariate Multi-Step CNN Models

In the previous sections, we have looked at univariate, multivariate, and multi-step time
series forecasting.

It is possible to mix and match the different types of 1D CNN models presented so far for
the different problems. This too applies to time series forecasting problems that involve
multivariate and multi-step forecasting, but it may be a little more challenging.

In this section, we will explore short examples of data preparation and modeling for
multivariate multi-step time series forecasting as a template to ease this challenge,
specifically:

1. Multiple Input Multi-Step Output.

2. Multiple Parallel Input and Multi-Step Output.

Perhaps the biggest stumbling block is in the preparation of data, so this is where we will
focus our attention.

Multiple Input Multi-Step Output

There are those multivariate time series forecasting problems where the output series is
separate but dependent upon the input time series, and multiple time steps are required for
the output series.

For example, consider our multivariate time series from a prior section:

[[ 10 15 25]
 [ 20 25 45]
 [ 30 35 65]
 [ 40 45 85]
 [ 50 55 105]
 [ 60 65 125]
 [ 70 75 145]
 [ 80 85 165]
 [ 90 95 185]]

We may use three prior time steps of each of the two input time series to predict two time
steps of the output time series.

Input:

10, 15
20, 25
30, 35

Output:

65
85

The split_sequences() function below implements this behavior.

# split a multivariate sequence into samples
def split_sequences(sequences, n_steps_in, n_steps_out):
    X, y = list(), list()
    for i in range(len(sequences)):
        # find the end of this pattern
        end_ix = i + n_steps_in
        out_end_ix = end_ix + n_steps_out-1
        # check if we are beyond the dataset
        if out_end_ix > len(sequences):
            break
        # gather input and output parts of the pattern
        seq_x, seq_y = sequences[i:end_ix, :-1], sequences[end_ix-1:out_end_ix, -1]
        X.append(seq_x)
        y.append(seq_y)
    return array(X), array(y)
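The index arithmetic in this function is easy to misread, because the output window starts at the last input row (end_ix-1) rather than after it. The sketch below reproduces only the index logic of split_sequences() for a nine-row dataset and prints which rows feed the input and output of each sample; the variable names input_rows, output_rows and pairs are introduced here for illustration.

```python
n_rows, n_steps_in, n_steps_out = 9, 3, 2

pairs = []
for i in range(n_rows):
    end_ix = i + n_steps_in
    out_end_ix = end_ix + n_steps_out - 1
    # stop once the output window would run past the last row
    if out_end_ix > n_rows:
        break
    input_rows = list(range(i, end_ix))
    # the output window overlaps the final input row
    output_rows = list(range(end_ix - 1, out_end_ix))
    pairs.append((input_rows, output_rows))
    print(input_rows, output_rows)
```

The overlap is possible because the output series is a separate column: its value at the last input step is not part of the input.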

We can demonstrate this on our contrived dataset. The complete example is listed below.

# multivariate multi-step data preparation
from numpy import array
from numpy import hstack

# split a multivariate sequence into samples
def split_sequences(sequences, n_steps_in, n_steps_out):
    X, y = list(), list()
    for i in range(len(sequences)):
        # find the end of this pattern
        end_ix = i + n_steps_in
        out_end_ix = end_ix + n_steps_out-1
        # check if we are beyond the dataset
        if out_end_ix > len(sequences):
            break
        # gather input and output parts of the pattern
        seq_x, seq_y = sequences[i:end_ix, :-1], sequences[end_ix-1:out_end_ix, -1]
        X.append(seq_x)
        y.append(seq_y)
    return array(X), array(y)

# define input sequence
in_seq1 = array([10, 20, 30, 40, 50, 60, 70, 80, 90])
in_seq2 = array([15, 25, 35, 45, 55, 65, 75, 85, 95])
out_seq = array([in_seq1[i]+in_seq2[i] for i in range(len(in_seq1))])
# convert to [rows, columns] structure
in_seq1 = in_seq1.reshape((len(in_seq1), 1))
in_seq2 = in_seq2.reshape((len(in_seq2), 1))
out_seq = out_seq.reshape((len(out_seq), 1))
# horizontally stack columns
dataset = hstack((in_seq1, in_seq2, out_seq))
# choose a number of time steps
n_steps_in, n_steps_out = 3, 2
# convert into input/output
X, y = split_sequences(dataset, n_steps_in, n_steps_out)
print(X.shape, y.shape)
# summarize the data
for i in range(len(X)):
    print(X[i], y[i])

Running the example first prints the shape of the prepared training data.

We can see that the shape of the input portion of the samples is three-dimensional,
comprised of six samples, with three time steps and two variables for the two input time
series.

The output portion of the samples is two-dimensional for the six samples and the two time
steps for each sample to be predicted.

The prepared samples are then printed to confirm that the data was prepared as we
specified.

(6, 3, 2) (6, 2)

[[10 15]
 [20 25]
 [30 35]] [65 85]
[[20 25]
 [30 35]
 [40 45]] [ 85 105]
[[30 35]
 [40 45]
 [50 55]] [105 125]
[[40 45]
 [50 55]
 [60 65]] [125 145]
[[50 55]
 [60 65]
 [70 75]] [145 165]
[[60 65]
 [70 75]
 [80 85]] [165 185]

We can now develop a 1D CNN model for multi-step predictions.

In this case, we will demonstrate a vector output model. The complete example is listed
below.

# multivariate multi-step 1d cnn example
from numpy import array
from numpy import hstack
from keras.models import Sequential
from keras.layers import Dense
from keras.layers import Flatten
from keras.layers import Conv1D
from keras.layers import MaxPooling1D

# split a multivariate sequence into samples
def split_sequences(sequences, n_steps_in, n_steps_out):
    X, y = list(), list()
    for i in range(len(sequences)):
        # find the end of this pattern
        end_ix = i + n_steps_in
        out_end_ix = end_ix + n_steps_out-1
        # check if we are beyond the dataset
        if out_end_ix > len(sequences):
            break
        # gather input and output parts of the pattern
        seq_x, seq_y = sequences[i:end_ix, :-1], sequences[end_ix-1:out_end_ix, -1]
        X.append(seq_x)
        y.append(seq_y)
    return array(X), array(y)

# define input sequence
in_seq1 = array([10, 20, 30, 40, 50, 60, 70, 80, 90])
in_seq2 = array([15, 25, 35, 45, 55, 65, 75, 85, 95])
out_seq = array([in_seq1[i]+in_seq2[i] for i in range(len(in_seq1))])
# convert to [rows, columns] structure
in_seq1 = in_seq1.reshape((len(in_seq1), 1))
in_seq2 = in_seq2.reshape((len(in_seq2), 1))
out_seq = out_seq.reshape((len(out_seq), 1))
# horizontally stack columns
dataset = hstack((in_seq1, in_seq2, out_seq))
# choose a number of time steps
n_steps_in, n_steps_out = 3, 2
# convert into input/output
X, y = split_sequences(dataset, n_steps_in, n_steps_out)
# the dataset knows the number of features, e.g. 2
n_features = X.shape[2]
# define model
model = Sequential()
model.add(Conv1D(filters=64, kernel_size=2, activation='relu', input_shape=(n_steps_in, n_features)))
model.add(MaxPooling1D(pool_size=2))
model.add(Flatten())
model.add(Dense(50, activation='relu'))
model.add(Dense(n_steps_out))
model.compile(optimizer='adam', loss='mse')
# fit model
model.fit(X, y, epochs=2000, verbose=0)
# demonstrate prediction
x_input = array([[70, 75], [80, 85], [90, 95]])
x_input = x_input.reshape((1, n_steps_in, n_features))
yhat = model.predict(x_input, verbose=0)
print(yhat)

Running the example fits the model and predicts the next two time steps of the output
sequence beyond the dataset.

We would expect the next two steps to be [185, 205].

Note: Your results may vary given the stochastic nature of the algorithm or evaluation
procedure, or differences in numerical precision. Consider running the example a few times
and comparing the average outcome.

It is a challenging framing of the problem with very little data, and the arbitrarily configured
version of the model gets close.

[[185.57011 207.77893]]
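The expected values of [185, 205] can be derived by hand from the input rows. The sketch below does this with NumPy under one stated assumption: both input series continue with the same constant step they show in the last two rows. The names step, next_row and expected are introduced here for illustration.

```python
import numpy as np

# last three input rows used for the prediction above
x_input = np.array([[70, 75], [80, 85], [90, 95]])

# assumption: both input series keep their constant step of 10
step = x_input[-1] - x_input[-2]
next_row = x_input[-1] + step

# the output window starts at the last input step, so it covers that row and the next one
expected = np.array([x_input[-1].sum(), next_row.sum()])
print(expected)
```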

Multiple Parallel Input and Multi-Step Output

A problem with parallel time series may require the prediction of multiple time steps of each
time series.

For example, consider our multivariate time series from a prior section:

[[ 10 15 25]
 [ 20 25 45]
 [ 30 35 65]
 [ 40 45 85]
 [ 50 55 105]
 [ 60 65 125]
 [ 70 75 145]
 [ 80 85 165]
 [ 90 95 185]]

We may use the last three time steps from each of the three time series as input to the
model, and predict the next two time steps of each of the three time series as output.

The first sample in the training dataset would be the following.

Input:

10, 15, 25
20, 25, 45
30, 35, 65

Output:

40, 45, 85
50, 55, 105

The split_sequences() function below implements this behavior.

# split a multivariate sequence into samples
def split_sequences(sequences, n_steps_in, n_steps_out):
    X, y = list(), list()
    for i in range(len(sequences)):
        # find the end of this pattern
        end_ix = i + n_steps_in
        out_end_ix = end_ix + n_steps_out
        # check if we are beyond the dataset
        if out_end_ix > len(sequences):
            break
        # gather input and output parts of the pattern
        seq_x, seq_y = sequences[i:end_ix, :], sequences[end_ix:out_end_ix, :]
        X.append(seq_x)
        y.append(seq_y)
    return array(X), array(y)

We can demonstrate this function on the small contrived dataset.

The complete example is listed below.

# multivariate multi-step data preparation
from numpy import array
from numpy import hstack

# split a multivariate sequence into samples
def split_sequences(sequences, n_steps_in, n_steps_out):
    X, y = list(), list()
    for i in range(len(sequences)):
        # find the end of this pattern
        end_ix = i + n_steps_in
        out_end_ix = end_ix + n_steps_out
        # check if we are beyond the dataset
        if out_end_ix > len(sequences):
            break
        # gather input and output parts of the pattern
        seq_x, seq_y = sequences[i:end_ix, :], sequences[end_ix:out_end_ix, :]
        X.append(seq_x)
        y.append(seq_y)
    return array(X), array(y)

# define input sequence
in_seq1 = array([10, 20, 30, 40, 50, 60, 70, 80, 90])
in_seq2 = array([15, 25, 35, 45, 55, 65, 75, 85, 95])
out_seq = array([in_seq1[i]+in_seq2[i] for i in range(len(in_seq1))])
# convert to [rows, columns] structure
in_seq1 = in_seq1.reshape((len(in_seq1), 1))
in_seq2 = in_seq2.reshape((len(in_seq2), 1))
out_seq = out_seq.reshape((len(out_seq), 1))
# horizontally stack columns
dataset = hstack((in_seq1, in_seq2, out_seq))
# choose a number of time steps
n_steps_in, n_steps_out = 3, 2
# convert into input/output
X, y = split_sequences(dataset, n_steps_in, n_steps_out)
print(X.shape, y.shape)
# summarize the data
for i in range(len(X)):
    print(X[i], y[i])

Running the example first prints the shape of the prepared training dataset.

We can see that both the input (X) and output (y) elements of the dataset are
three-dimensional, for the number of samples, time steps, and variables or parallel time
series respectively.

The input and output elements of each sample are then printed side by side so that we can
confirm that the data was prepared as we expected.

(5, 3, 3) (5, 2, 3)

[[10 15 25]
 [20 25 45]
 [30 35 65]] [[ 40 45 85]
 [ 50 55 105]]
[[20 25 45]
 [30 35 65]
 [40 45 85]] [[ 50 55 105]
 [ 60 65 125]]
[[ 30 35 65]
 [ 40 45 85]
 [ 50 55 105]] [[ 60 65 125]
 [ 70 75 145]]
[[ 40 45 85]
 [ 50 55 105]
 [ 60 65 125]] [[ 70 75 145]
 [ 80 85 165]]
[[ 50 55 105]
 [ 60 65 125]
 [ 70 75 145]] [[ 80 85 165]
 [ 90 95 185]]

We can now develop a 1D CNN model for this dataset.

We will use a vector-output model in this case. As such, we must flatten the three-
dimensional structure of the output portion of each sample in order to train the model. This
means, instead of predicting two steps for each series, the model is trained on and
expected to predict a vector of six numbers directly.

# flatten output
n_output = y.shape[1] * y.shape[2]
y = y.reshape((y.shape[0], n_output))

The complete example is listed below.

# multivariate output multi-step 1d cnn example
from numpy import array
from numpy import hstack
from keras.models import Sequential
from keras.layers import Dense
from keras.layers import Flatten
from keras.layers import Conv1D
from keras.layers import MaxPooling1D

# split a multivariate sequence into samples
def split_sequences(sequences, n_steps_in, n_steps_out):
    X, y = list(), list()
    for i in range(len(sequences)):
        # find the end of this pattern
        end_ix = i + n_steps_in
        out_end_ix = end_ix + n_steps_out
        # check if we are beyond the dataset
        if out_end_ix > len(sequences):
            break
        # gather input and output parts of the pattern
        seq_x, seq_y = sequences[i:end_ix, :], sequences[end_ix:out_end_ix, :]
        X.append(seq_x)
        y.append(seq_y)
    return array(X), array(y)

# define input sequence
in_seq1 = array([10, 20, 30, 40, 50, 60, 70, 80, 90])
in_seq2 = array([15, 25, 35, 45, 55, 65, 75, 85, 95])
out_seq = array([in_seq1[i]+in_seq2[i] for i in range(len(in_seq1))])
# convert to [rows, columns] structure
in_seq1 = in_seq1.reshape((len(in_seq1), 1))
in_seq2 = in_seq2.reshape((len(in_seq2), 1))
out_seq = out_seq.reshape((len(out_seq), 1))
# horizontally stack columns
dataset = hstack((in_seq1, in_seq2, out_seq))
# choose a number of time steps
n_steps_in, n_steps_out = 3, 2
# convert into input/output
X, y = split_sequences(dataset, n_steps_in, n_steps_out)
# flatten output
n_output = y.shape[1] * y.shape[2]
y = y.reshape((y.shape[0], n_output))
# the dataset knows the number of features, e.g. 3
n_features = X.shape[2]
# define model
model = Sequential()
model.add(Conv1D(filters=64, kernel_size=2, activation='relu', input_shape=(n_steps_in, n_features)))
model.add(MaxPooling1D(pool_size=2))
model.add(Flatten())
model.add(Dense(50, activation='relu'))
model.add(Dense(n_output))
model.compile(optimizer='adam', loss='mse')
# fit model
model.fit(X, y, epochs=7000, verbose=0)
# demonstrate prediction
x_input = array([[60, 65, 125], [70, 75, 145], [80, 85, 165]])
x_input = x_input.reshape((1, n_steps_in, n_features))
yhat = model.predict(x_input, verbose=0)
print(yhat)

Running the example fits the model and predicts the values of each of the three time series
for the two time steps that follow the provided input.

We would expect the values for these series and time steps to be as follows:

90, 95, 185
100, 105, 205

[[ 90.47855 95.621284 186.02629 100.48118 105.80815 206.52821 ]]
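Since the model was trained on flattened targets, the six predicted numbers can be reshaped back into one row per forecast step and one column per series. The sketch below is illustrative: yhat is a stand-in holding the values printed above, and the name forecast is introduced here.

```python
import numpy as np

n_steps_out, n_series = 2, 3
# stand-in for the model output, with values copied from the run above
yhat = np.array([[90.47855, 95.621284, 186.02629, 100.48118, 105.80815, 206.52821]])

# undo the flattening: one row per forecast step, one column per series
forecast = yhat.reshape((n_steps_out, n_series))
print(forecast)
```

This mirrors the reshape applied to y before training, so the row and column order matches the expected values listed above.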

Summary

In this tutorial, you discovered how to develop a suite of CNN models for a range of
standard time series forecasting problems.

Specifically, you learned:

• How to develop CNN models for univariate time series forecasting.
• How to develop CNN models for multivariate time series forecasting.
• How to develop CNN models for multi-step time series forecasting.

Experiment 2
No ratings yet
Experiment 2
7 pages
AN Overview Artificial Neural Network Approach AN Overview Artificial Neural Network Approach
No ratings yet
AN Overview Artificial Neural Network Approach AN Overview Artificial Neural Network Approach
34 pages
Class Notes Unit 5
No ratings yet
Class Notes Unit 5
13 pages
18 Rnns
No ratings yet
18 Rnns
57 pages
Decoder Only Foundation Model For Time Series Forecasting: Reprint
No ratings yet
Decoder Only Foundation Model For Time Series Forecasting: Reprint
21 pages
Unit 4 Part 3 DL - 1
No ratings yet
Unit 4 Part 3 DL - 1
5 pages
Exp. No.: Aim Code:: AIML634P Neural Network Lab 2262034
No ratings yet
Exp. No.: Aim Code:: AIML634P Neural Network Lab 2262034
11 pages
Time Series Forecasting With Deep Learning: A Survey: Research
No ratings yet
Time Series Forecasting With Deep Learning: A Survey: Research
13 pages
19 - Introduction To Neural Networks
No ratings yet
19 - Introduction To Neural Networks
7 pages
Unit 3 NNDL-1
No ratings yet
Unit 3 NNDL-1
31 pages
Evaluation of Deep Learning Models For Multi-Step Ahead Time Series Prediction
No ratings yet
Evaluation of Deep Learning Models For Multi-Step Ahead Time Series Prediction
22 pages
Lire Pattern Learning NN
No ratings yet
Lire Pattern Learning NN
33 pages
Neural Network (RNN & CNN)
No ratings yet
Neural Network (RNN & CNN)
31 pages
Introduction To Deep Learning
No ratings yet
Introduction To Deep Learning
47 pages
Tensor Flow Guide
No ratings yet
Tensor Flow Guide
25 pages