Chapter 1: AutoRegressive Integrated Moving Average (ARIMA)

Prepared by: Benjur Emmanuel L. Borja and Maria Eloisa M. Ventura

In this notebook, we will introduce our first approach to time-series forecasting: ARIMA, or AutoRegressive Integrated Moving Average. This notebook will discuss:

1. Definition and Formulation of ARIMA models
2. Model Parameters (p, d, and q) and Special Cases of ARIMA models
3. Model Statistics and How to Interpret Them
4. Implementation and Forecasting using ARIMA

# Import required libraries
import numpy as np, pandas as pd
from statsmodels.graphics.tsaplots import plot_acf, plot_pacf
import matplotlib.pyplot as plt
from statsmodels.tsa.stattools import adfuller
from numpy import log
from statsmodels.tsa.arima_model import ARIMA  # legacy ARIMA API (statsmodels <= 0.12), used throughout this notebook
from sklearn.metrics import mean_squared_error, mean_absolute_error
from math import sqrt
from pandas import read_csv
import multiprocessing as mp

# Just to remove warnings to prettify the notebook
import warnings
warnings.filterwarnings("ignore")

Introduction to ARIMA
ARIMA, or AutoRegressive Integrated Moving Average, is a set of models that explains a time series using its own previous values, given by the lags (AutoRegressive) and lagged errors (Moving Average), while accounting for stationarity through differencing (the opposite of integration). In other words, ARIMA assumes that the time series is described by autocorrelations in the data rather than by trends and seasonality. In this context, we define trends and seasonality as follows:

Trend: A time series has a trend if there is an overall long-term increase or decrease in the data, which is not necessarily linear.
Seasonality: A time series has seasonality when it is affected by seasonal factors such as the time of the year or the day of the week. Seasonality is apparent as a pattern recurring at a fixed frequency.

Example 1: Stationary and Non-Stationary Univariate Time Series

To illustrate, let's look at the following examples of time series (source):

(a) Google stock price for 200 consecutive days
(b) Daily change in the Google stock price for 200 consecutive days
(c) Annual number of strikes in the US
(d) Monthly sales of new one-family houses sold in the US
(e) Annual price of a dozen eggs in the US (constant dollars)
(f) Monthly total of pigs slaughtered in Victoria, Australia
(g) Annual total of lynx trapped in the McKenzie River district of north-west Canada
(h) Monthly Australian beer production
(i) Monthly Australian electricity production


Which of these series are stationary?

Obvious seasonality rules out series (d), (h), and (i). Trends and changing levels rule out series (a), (c), (e), (f), and (i). Increasing variance also rules out (i). That leaves only (b) and (g) as stationary series. At first glance, (g) might seem non-stationary, but its cycles are aperiodic and depend on the complex interactions between the lynx population and the availability of its food source. The timing of these cycles is unpredictable; hence, this time series is stationary.

Now notice the Google stock price (a) and the daily change in the Google stock price (b). These time series came from the same system but exhibit different dynamics. This illustrates that we can make a non-stationary time series stationary by taking the difference between consecutive observations, i.e., differencing. Differencing stabilizes our time series by removing changes in its level, thereby reducing trend and seasonality.
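To make this concrete, here is a minimal sketch on synthetic data (our own illustration, not the chapter's dataset): differencing a random walk recovers its stationary white-noise increments.

import numpy as np
import pandas as pd

rng = np.random.default_rng(0)
walk = pd.Series(rng.normal(size=200).cumsum())  # random walk: non-stationary level
increments = walk.diff().dropna()                # first difference: stationary noise again
print(walk.var(), increments.var())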

Model Components
As previously mentioned, ARIMA models are built from the following key components:

AR: Autoregression. A model that uses the dependent relationship between an observation and some number of lagged observations.
I: Integrated. The use of differencing of raw observations (e.g., subtracting an observation from the observation at the previous time step) in order to make the time series stationary.
MA: Moving Average. A model that uses the dependency between an observation and the residual errors from a moving average model applied to lagged observations.

Each of these components is explicitly specified in the model as a parameter. A standard notation, ARIMA(p,d,q), is used, where the parameters are substituted with integer values to quickly indicate the specific ARIMA model being used:
p: The number of lag observations included in the model, also called the lag order (deals with the window of $X_t$)
d: The number of times that the raw observations are differenced, also called the degree of differencing (deals with the order of differencing of $X_t$)
q: The size of the moving average window, also called the order of the moving average (deals with residuals)

Given this, the general case of ARIMA(p,d,q) can be written as:

$$X_t = \alpha_1 X_{t-1} + \dots + \alpha_p X_{t-p} + \varepsilon_t + \theta_1 \varepsilon_{t-1} + \dots + \theta_q \varepsilon_{t-q} \tag{1}$$

Or in words:

Predicted $X_t$ = constant + linear combination of lags of $X$ (up to $p$ lags) + linear combination of lagged forecast errors (up to $q$ lags), provided that the time series has already been differenced ($d$ times) to ensure stationarity.
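To see equation (1) in action, here is a small simulation sketch (our own illustration; the coefficient values are arbitrary but chosen to keep the process stationary). Note that statsmodels' ArmaProcess uses the lag-polynomial convention, so AR coefficients enter with flipped signs:

import numpy as np
from statsmodels.tsa.arima_process import ArmaProcess

ar = np.array([1, -0.6, -0.2])  # lag polynomial 1 - 0.6L - 0.2L^2, i.e. alpha_1 = 0.6, alpha_2 = 0.2
ma = np.array([1, 0.65])        # lag polynomial 1 + 0.65L, i.e. theta_1 = 0.65
sample = ArmaProcess(ar, ma).generate_sample(nsample=500)  # one realization of an ARMA(2,1) process
print(sample[:5])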

Model parameters p, d, and q and Special Cases

Before we discuss how we determine the p, d, and q that best represent a time series, let's first take a look at special cases of ARIMA models that should help illustrate the formulation of the ARIMA equation.

Case 1: ARIMA(p,0,0) = autoregressive model: if the series is stationary and autocorrelated, perhaps it can be predicted as a multiple of its own
previous value, plus a constant.

The forecasting equation for ARIMA(1,0,0) is:

$$X_t = \mu + \alpha_1 X_{t-1} \tag{2}$$

while ARIMA(2,0,0) is:

$$X_t = \mu + \alpha_1 X_{t-1} + \alpha_2 X_{t-2} \tag{3}$$

or, in general, ARIMA(p,0,0):

$$X_t = \mu + \alpha_1 X_{t-1} + \alpha_2 X_{t-2} + \dots + \alpha_p X_{t-p} \tag{4}$$

Case 2: ARIMA(0,0,q) = moving average model: if the series is stationary but is correlated with the errors of previous values, we can regress on the past forecast errors.

The forecasting equation for ARIMA(0,0,1) is given by:

$$X_t = \varepsilon_t + \theta_1 \varepsilon_{t-1} \tag{5}$$

or, in similar fashion to p, this can be generalized to ARIMA(0,0,q):

$$X_t = \varepsilon_t + \theta_1 \varepsilon_{t-1} + \dots + \theta_q \varepsilon_{t-q} \tag{6}$$

where $\theta_q$ is the coefficient of the residual $\varepsilon_{t-q}$.

Case 3: ARIMA(0,1,0) = Random Walk: if the series is non-stationary, the simplest model we can use is a random walk (with drift $\mu$), which is given by:

$$X_t = \mu + X_{t-1} + \varepsilon_t \tag{7}$$

This can be further generalized to ARIMA(0,d,0), similar to the first two cases.

As shown in our cases, we can use a value of 0 for a parameter, which tells the model not to use that element. This way, the ARIMA model can be configured to perform the function of an ARMA model, and even a simple AR, I, or MA model. It is good to note that ARIMA(0,1,1) is equivalent to a Simple Exponential Smoothing (SES) model, but we'll leave that for another discussion.
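For the curious, the equivalence can be stated compactly (a standard result; the relation below follows from matching one-step forecasts):

$$\text{ARIMA}(0,1,1):\; X_t = X_{t-1} + \varepsilon_t + \theta_1 \varepsilon_{t-1} \qquad \text{SES}:\; \hat{X}_{t+1} = \alpha X_t + (1-\alpha)\hat{X}_t$$

The two produce identical one-step forecasts when $\alpha = 1 + \theta_1$.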

Implementing an ARIMA model for a time series assumes that the observations are generated by an ARIMA process. To implement ARIMA, a linear regression model is constructed including the specified number and type of terms, and the data is prepared by a degree of differencing to make it stationary, i.e., to remove trend and seasonal structures that would negatively affect the regression model.

Determining p, d, and q
We need to explicitly indicate p, d, and q when implementing ARIMA models. Selecting appropriate values can be difficult, but there are several methods of automating this process. For example, we can use grid search in Python to scan through different values and check which model is optimal (discussed later); R provides the auto.arima() function to do this automatically. For our case, we will use the Augmented Dickey-Fuller Test (ADF), AutoCorrelation Function (ACF), and Partial AutoCorrelation Function (PACF) to determine our model parameters.

Finding the order of differencing d

As stated before, ARIMA models assume stationarity, and differencing can induce stationarity in many time series. The quickest way to determine d for our models is to difference repeatedly and run the ADF test after each pass to check for stationarity. We can also look at the PACF and ACF to see if our time series is stationary after d rounds of differencing.
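A minimal sketch of this procedure (our own helper; estimate_d is a hypothetical name):

from statsmodels.tsa.stattools import adfuller

def estimate_d(series, alpha=0.05, max_d=3):
    # return the smallest d whose differenced series passes the ADF test
    for d in range(max_d + 1):
        if adfuller(series.dropna())[1] < alpha:  # index 1 is the p-value
            return d
        series = series.diff()
    return max_d  # fall back if still non-stationary at max_d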

To illustrate, let’s take a look at the following example:

# We're using sample data from
# https://raw.githubusercontent.com/selva86/datasets/master/wwwusage.csv

# Import data
df = pd.read_csv('../data/wwwusage.csv', names=['value'], header=0)
plt.figure(figsize=(15, 2))
plt.plot(df)
plt.title('Original Series')
plt.xlabel('Time')
plt.ylabel('Value')
plt.show()

Initial eyeballing shows that this time series has a trend and is non-stationary. Checking using ADF:

result = adfuller(df.value.dropna())
print('ADF Statistic: %f' % result[0])
print('p-value: %f' % result[1])

ADF Statistic: -2.464240
p-value: 0.124419

The null hypothesis of the ADF test is that the time series is non-stationary. So, if the p-value of the test is less than the significance level (0.05), we reject the null hypothesis and infer that the time series is indeed stationary. For our example, p = 0.124 > 0.05, so we fail to reject the null hypothesis and treat the series as non-stationary.

Next we difference our time series and check the results of the ADF test. We will also look at the ACF.


plt.rcParams.update({'figure.figsize': (15, 8), 'figure.dpi': 120})

# Original series
fig, axes = plt.subplots(3, 2, sharex=True)
axes[0, 0].plot(df.value); axes[0, 0].set_title('Original Series')
plot_acf(df.value, ax=axes[0, 1])

# 1st differencing
axes[1, 0].plot(df.value.diff()); axes[1, 0].set_title('1st Order Differencing')
plot_acf(df.value.diff().dropna(), ax=axes[1, 1])

# 2nd differencing
axes[2, 0].plot(df.value.diff().diff()); axes[2, 0].set_title('2nd Order Differencing')
plot_acf(df.value.diff().diff().dropna(), ax=axes[2, 1])

plt.show()

print('ADF Statistic for 1st Order Differencing')
result = adfuller(df.value.diff().dropna())
print('ADF Statistic: %f' % result[0])
print('p-value: %f' % result[1])
print('Critical Values:')
for key, value in result[4].items():
    print('\t%s: %.3f' % (key, value))

print('\nADF Statistic for 2nd Order Differencing')
result = adfuller(df.value.diff().diff().dropna())
print('ADF Statistic: %f' % result[0])
print('p-value: %f' % result[1])
print('Critical Values:')
for key, value in result[4].items():
    print('\t%s: %.3f' % (key, value))

ADF Statistic for 1st Order Differencing
ADF Statistic: -2.722238
p-value: 0.070268
Critical Values:
	1%: -3.500
	5%: -2.892
	10%: -2.583

ADF Statistic for 2nd Order Differencing
ADF Statistic: -9.929762
p-value: 0.000000
Critical Values:
	1%: -3.500
	5%: -2.892
	10%: -2.583

Given the results of our ACF and ADF tests, we can see that our time series reaches stationarity after two orders of differencing. However, the ACF of the 2nd-order differenced series goes into the negative zone fairly quickly, which may indicate that the series has been over-differenced. It is now up to us whether to consider first- or second-order differencing for our ARIMA models.

Finding the order of the AutoRegressive term p

As discussed previously, we can look at the PACF plot to determine the lag order for our AR terms. Partial autocorrelation can be imagined as the correlation between the series and its lag after excluding the contributions from the intermediate lags. In other words, the PACF conveys the pure correlation between a lag and the series, which tells us whether that lag is needed in the AR term or not.

# PACF plot of 1st differenced series
plt.rcParams.update({'figure.figsize': (15, 2.5), 'figure.dpi': 120})

fig, axes = plt.subplots(1, 2, sharex=True)
axes[0].plot(df.value.diff()); axes[0].set_title('1st Differencing')
axes[1].set(ylim=(0, 5))
plot_pacf(df.value.diff().dropna(), ax=axes[1])
plt.show()

Immediately, we can observe that our PACF shows significance at Lag 1 and Lag 2, meaning it crosses the significance limit (blue region). We can also observe significance at higher-order terms, but note that, given the number of lags we are testing, it is statistically probable to see a few random spikes in our PACF and ACF plots. Such spikes can also be attributed to seasonality, which will be tackled separately.
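We can confirm this reading numerically by pulling the PACF values and their confidence band (a sketch; the band mirrors the shaded region in the plot above):

from statsmodels.tsa.stattools import pacf

vals, confint = pacf(df.value.diff().dropna(), nlags=10, alpha=0.05)
half_band = confint[:, 1] - vals  # the interval is centered on each estimate
print([lag for lag in range(1, len(vals)) if abs(vals[lag]) > half_band[lag]])  # significant lags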

With this, we can now decide to use p = 2 for our ARIMA model.

https://phdinds-aim.github.io/time_series_handbook/01_AutoRegressiveIntegratedMovingAverage/01_AutoRegressiveIntegratedMovingAverage.html 4/12
2/22/25, 3:23 PM Chapter 1: AutoRegressive Integrated Moving Average (ARIMA) — Time Series Analysis Handbook

Finding the order of the Moving Average term q

Similar to how we determined p, we now look at the ACF to determine the number of q terms for our MA component. The ACF tells us how many MA terms are required to remove autocorrelation from the stationarized series.

# ACF plot of 1st differenced series
plt.rcParams.update({'figure.figsize': (15, 4), 'figure.dpi': 120})

fig, axes = plt.subplots(1, 2, sharex=True)
axes[0].plot(df.value.diff()); axes[0].set_title('1st Differencing')
axes[1].set(ylim=(0, 1))
plot_acf(df.value.diff().dropna(), ax=axes[1])
plt.show()
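The same numeric check we used for the PACF works here as well (again a sketch) to shortlist candidate values of q:

from statsmodels.tsa.stattools import acf

vals, confint = acf(df.value.diff().dropna(), nlags=10, alpha=0.05)
half_band = confint[:, 1] - vals
print([lag for lag in range(1, len(vals)) if abs(vals[lag]) > half_band[lag]])  # significant lags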

The ACF results are not as clear-cut as the PACF's: several ACF terms sit above our significance level. This may be attributed to the weak stationarity of our series, or to the fact that our time series is not a pure MA process but an ARIMA process. For now, let's consider q = 3.

Building the ARIMA model

Now that we've determined the values of p, d, and q, we have everything needed to fit the ARIMA model. Let's first implement it on this dataset before we take a deeper look at the implementation of ARIMA in the next section, using the ARIMA() implementation in the statsmodels package. As computed, we will use ARIMA(2,1,3):

model = ARIMA(df.value, order=(2,1,3))
model_fit = model.fit(disp=0)
print(model_fit.summary())

                             ARIMA Model Results
==============================================================================
Dep. Variable:                D.value   No. Observations:                   99
Model:                 ARIMA(2, 1, 3)   Log Likelihood                -251.701
Method:                       css-mle   S.D. of innovations              3.050
Date:                Wed, 24 Feb 2021   AIC                            517.402
Time:                        14:54:10   BIC                            535.568
Sample:                             1   HQIC                           524.752
=================================================================================
                    coef    std err          z      P>|z|      [0.025      0.975]
---------------------------------------------------------------------------------
const             1.0049      1.584      0.635      0.526      -2.099       4.109
ar.L1.D.value     0.5502      0.299      1.840      0.066      -0.036       1.136
ar.L2.D.value     0.2364      0.217      1.091      0.275      -0.188       0.661
ma.L1.D.value     0.6316      0.289      2.187      0.029       0.066       1.198
ma.L2.D.value    -0.1730      0.219     -0.789      0.430      -0.603       0.257
ma.L3.D.value    -0.3149      0.142     -2.211      0.027      -0.594      -0.036
                                    Roots
=============================================================================
                  Real          Imaginary           Modulus         Frequency
-----------------------------------------------------------------------------
AR.1            1.1994           +0.0000j            1.1994            0.0000
AR.2           -3.5268           +0.0000j            3.5268            0.5000
MA.1            1.7100           -0.0000j            1.7100           -0.0000
MA.2           -1.1296           -0.7624j            1.3628           -0.4055
MA.3           -1.1296           +0.7624j            1.3628            0.4055
-----------------------------------------------------------------------------

There’s quite a bit of information to unpack from the summary. Generally, we are interested in Akaike’s Information Criterion (AIC), the coefficients of our AR and MA terms (coef), and the p-values of the terms (P>|z|). We need the p-values to be less than 0.05 to be significant, and several of our AR and MA terms fail to reach significance here. Let’s try to be conservative and use small values for p and q, i.e., ARIMA(1,1,1), as suggested by the p-values of AR1 and MA1 in our results.
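Rather than reading these off the printed summary, we can pull them from the fitted results object (attribute names per the legacy ARIMAResults API used in this notebook):

print(model_fit.aic)      # Akaike Information Criterion
print(model_fit.params)   # fitted coefficients (const, AR, and MA terms)
print(model_fit.pvalues)  # p-values for each term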

model = ARIMA(df.value, order=(1,1,1))
model_fit = model.fit(disp=0)
print(model_fit.summary())


                             ARIMA Model Results
==============================================================================
Dep. Variable:                D.value   No. Observations:                   99
Model:                 ARIMA(1, 1, 1)   Log Likelihood                -253.790
Method:                       css-mle   S.D. of innovations              3.119
Date:                Wed, 24 Feb 2021   AIC                            515.579
Time:                        14:54:11   BIC                            525.960
Sample:                             1   HQIC                           519.779
=================================================================================
                    coef    std err          z      P>|z|      [0.025      0.975]
---------------------------------------------------------------------------------
const             1.1205      1.286      0.871      0.384      -1.400       3.641
ar.L1.D.value     0.6344      0.087      7.317      0.000       0.464       0.804
ma.L1.D.value     0.5297      0.089      5.932      0.000       0.355       0.705
                                    Roots
=============================================================================
                  Real          Imaginary           Modulus         Frequency
-----------------------------------------------------------------------------
AR.1            1.5764           +0.0000j            1.5764            0.0000
MA.1           -1.8879           +0.0000j            1.8879            0.5000
-----------------------------------------------------------------------------

Immediately, we can see that the p-values are now well below 0.05 for both the AR and MA terms, which are highly significant. We will now check the residuals of our time series to ensure that they show no patterns and have a constant mean and variance.

# Plot residual errors
residuals = pd.DataFrame(model_fit.resid)
fig, ax = plt.subplots(1, 2, figsize=(15, 2.5))
residuals.plot(title='Residuals', ax=ax[0])
residuals.plot(kind='kde', title='Density', ax=ax[1])
plt.show()

The residuals are a good final check for our ARIMA models. Ideally, the residual errors should be Gaussian with zero mean and uniform variance. With this, we can now proceed with fitting our initial time series with our model.

# Actual vs Fitted
fig, ax = plt.subplots(figsize=(15, 2))
ax = df.plot(ax=ax)
fig = model_fit.plot_predict(85, 100, dynamic=False, ax=ax, plot_insample=False)
plt.show()

When we set dynamic=False, the in-sample lagged values are used for prediction. That is, the model is trained on everything up to the previous value to make the next prediction. We can also call this walk-forward or one-step-ahead prediction.
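Equivalently, we can request both modes from the fitted results directly (a sketch assuming the legacy ARIMAResults.predict API, where typ='levels' returns forecasts on the original, undifferenced scale):

one_step = model_fit.predict(start=85, end=100, typ='levels', dynamic=False)   # walk-forward
multi_step = model_fit.predict(start=85, end=100, typ='levels', dynamic=True)  # recursive forecast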

However, we should note two things:

1. We used the entire time series to train our model.
2. Some of the use cases that we will be tackling will require us to forecast t steps ahead.

We will address the first point in the next section by tackling real-world datasets. The second point can be addressed immediately using the dynamic=True argument, shown below.

# Actual vs Fitted
fig, ax = plt.subplots(figsize=(15, 2))
ax = df.plot(ax=ax)
fig = model_fit.plot_predict(90, 110, dynamic=True, ax=ax, plot_insample=False)
plt.show()

Notice that our confidence interval widens as we move farther beyond our given data set, since errors compound over our forecasted values. Also note that, in general, our ARIMA model captured the direction of the trend of our time series but is consistently lower than the actual values. We can correct this by varying the parameters of our ARIMA models and validating with performance metrics.


Implementation and Forecasting using ARIMA

Given the steps discussed in the first three sections of this notebook, we can now create a framework for how to approach ARIMA models in general. We can use a modified version of the Hyndman-Khandakar algorithm (Hyndman & Khandakar, 2008), which combines unit-root tests, minimisation of the AIC, and MLE to obtain an ARIMA model. The steps are outlined below:

1. The differencing term d is determined using repeated ADF tests.
2. The values of p and q are chosen based on the AIC, ACF, and PACF of our differenced time series.
3. Use a step-wise traversal of our parameter space (±1 in p, d, and q) to find a lower AIC, using insights from the ACF and PACF to decide whether to add or reduce p, d, and q (see the sketch after this list).
4. Check the residuals for a Gaussian distribution to validate the fit.
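A minimal sketch of the step-wise traversal in step 3 (our own illustration with a hypothetical stepwise_search helper; the notebook itself proceeds with a full grid search below):

import itertools

def stepwise_search(series, start_order=(1, 1, 1), max_iter=20):
    # greedily move to any (p, d, q) neighbour (each component +/- 1) that lowers the AIC
    def fit_aic(order):
        try:
            return ARIMA(series, order=order).fit(disp=0).aic
        except Exception:
            return float('inf')  # treat non-converging orders as unusable

    best_order, best_aic = start_order, fit_aic(start_order)
    for _ in range(max_iter):
        neighbours = []
        for dp, dd, dq in itertools.product((-1, 0, 1), repeat=3):
            cand = (best_order[0] + dp, best_order[1] + dd, best_order[2] + dq)
            if cand != best_order and min(cand) >= 0:
                neighbours.append(cand)
        cand_aic, cand_order = min((fit_aic(o), o) for o in neighbours)
        if cand_aic >= best_aic:
            break  # no neighbour improves the AIC
        best_order, best_aic = cand_order, cand_aic
    return best_order, best_aic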

In our case, we will instead use a grid-search algorithm to automatically configure our ARIMA model and find our hyperparameters. We will search combinations of values of p, d, and q (skipping those that fail to converge) and find the combination that results in the best performance on the test set. Grid search explores all combinations within a subset of integer values.

Specifically, we will search all combinations of the following parameters:

p: 0 to 4. d: 0 to 3. q: 0 to 4. This is 5 × 4 × 5, or 100, potential runs of the test harness and will take some time to execute.

The complete worked example with the grid search version of the test harness is listed below.

# create a differenced series
def difference(dataset, interval=1):
    diff = list()
    for i in range(interval, len(dataset)):
        value = dataset[i] - dataset[i - interval]
        diff.append(value)
    return np.array(diff)

# invert a differenced value
def inverse_difference(history, yhat, interval=1):
    return yhat + history[-interval]

# evaluate an ARIMA model for a given order (p,d,q) and return its MAE
def evaluate_arima_model(X, arima_order, train_size=None):
    # prepare training dataset
    X = X.astype('float32')
    if train_size is None:
        train_size = int(len(X) * 0.50)
    else:
        train_size = int(train_size)
    train, test = X[0:train_size], X[train_size:]
    history = [x for x in train]
    # make predictions (walk-forward, one step at a time)
    predictions = list()
    for t in range(len(test)):
        # difference data
        diff = difference(history, 1)
        model = ARIMA(diff, order=arima_order)
        model_fit = model.fit(trend='nc', disp=0)
        yhat = model_fit.forecast()[0]
        yhat = inverse_difference(history, yhat, 1)
        predictions.append(yhat)
        history.append(test[t])
    # calculate out-of-sample error
    mae = mean_absolute_error(test, predictions)
    return mae

# evaluate combinations of p, d and q values for an ARIMA model
def evaluate_models(dataset, p_values, d_values, q_values):
    dataset = dataset.astype('float32')
    best_score, best_cfg = float('inf'), None
    for p in p_values:
        for d in d_values:
            for q in q_values:
                order = (p, d, q)
                try:
                    mae = evaluate_arima_model(dataset, order)
                    if mae < best_score:
                        best_score, best_cfg = mae, order
                    print('ARIMA%s MAE=%.3f' % (order, mae))
                except:
                    continue
    print('Best ARIMA%s MAE=%.3f' % (best_cfg, best_score))

Example 2: Forecasting Temperature in the Jena Climate Data

We now use the method above to perform multi-step (24-step) forecasting on the Jena temperature data.

train_data = pd.read_csv('../data/train_series.csv', index_col=0).loc[:, ['T (degC)']]
val_data = pd.read_csv('../data/val_series.csv', index_col=0).loc[:, ['T (degC)']]
temp = pd.concat([train_data, val_data]).values  # temperature (in degrees Celsius)
plt.figure(figsize=(15, 2))
plt.plot(range(len(temp)), temp)
plt.ylabel('Temperature \n(degree Celsius)')
plt.xlabel('Time (every hour)')
plt.show()


Model order selection (hyperparameter tuning)

Before we proceed with the grid-search algorithm, we perform some preliminary tests on the time series to narrow down our parameter search. We start by checking the stationarity of the signal to estimate d. Then, we plot the PACF and ACF of the time series to estimate p and q, respectively.

result = adfuller(temp)
print('ADF Statistic: %f' % result[0])
print('p-value: %f' % result[1])
print('Critical Values:')
for key, value in result[4].items():
    print('\t%s: %.3f' % (key, value))

ADF Statistic: -7.958617
p-value: 0.000000
Critical Values:
	1%: -3.430
	5%: -2.862
	10%: -2.567

Observation: Based on the results of the ADF test, the signal is stationary. This implies that d = 0 so we no longer need to perform differencing.

plt.rcParams.update({'figure.figsize': (15, 2.5), 'figure.dpi': 120})

fig, axes = plt.subplots(1, 2)
axes[0].plot(temp.flatten())
axes[0].set_title('Temperature Signal')
axes[1].set(xlim=(0, 20))
plot_pacf(temp.flatten(), ax=axes[1])
plt.show()

Observation: Based on the plot of the PACF, we can see that the function drops to almost zero for lags > 2. So, we can restrict p to the range 0 to 2.

plt.rcParams.update({'figure.figsize': (15, 2.5), 'figure.dpi': 120})

fig, axes = plt.subplots(1, 2)
axes[0].plot(temp.flatten())
axes[0].set_title('Temperature Signal')
plot_acf(pd.Series(temp.flatten()), ax=axes[1])
plt.show()

Observation: Unlike the PACF, which displayed a sharp cutoff, the ACF decays more slowly. We can say that the series has an autoregressive signature, which means that the autocorrelation pattern can be explained more easily by adding AR terms than by adding MA terms. Let's test values of q from 0 to 2.

Due to constraints in computing resources, we will limit the length of the training data (from 35045 to 17524). We will also reduce the number of folds used (from 730 to 20).

# evaluate an ARIMA model for a given order (p,d,q) and return the mean MAE across folds
def evaluate_arima_jena_24hrstep(X, arima_order, train_size=35045):
    # prepare training dataset
    X = X.astype('float32')
    train, test = X[:train_size], X[train_size:]
    test = test[:len(X[train_size:]) - len(X[train_size:]) % 24]
    test_24 = test.reshape(-1, 24)
    history = train.flatten()

    mae_cv = []
    # There are 730 folds (24-hr chunks); for faster computation,
    # we limit the number of folds to 20 only
    for t in range(len(test_24))[::37]:
        x_cv = np.hstack([history, test_24[:t, :].flatten()])
        y_cv = test_24[t]
        model = ARIMA(x_cv, order=arima_order)
        model_fit = model.fit(disp=0)
        y_hat = model_fit.forecast(steps=24)[0]
        mae_cv.append(mean_absolute_error(y_cv, y_hat))
    mean_mae = np.mean(mae_cv)
    return mean_mae


## Uncomment the lines to perform model order selection
# arima_orders = [(0, 0, 0),
#                 (1, 0, 0),
#                 (0, 0, 1),
#                 (2, 0, 0),
#                 (1, 0, 1),
#                 (1, 0, 2),
#                 (2, 0, 1),
#                 (2, 0, 2)]
# mae_ao_list = []
# for ao in arima_orders:
#     mae_ao = evaluate_arima_jena_24hrstep(temp[-2*len(val_data):],
#                                           arima_order=ao,
#                                           train_size=len(val_data))
#     mae_ao_list.append((ao, mae_ao))
#     print(ao, mae_ao)

# selected_order, best_mae = np.array(mae_ao_list)[np.argmin(np.array(mae_ao_list)[:,-1])]
# print(f'Selected order: {selected_order}, MAE: {best_mae}')

## Expected results from code above:
## (p, d, q)  MAE
## ---------------------------
## (0, 0, 0)  6.035602297003233
## (1, 0, 0)  3.025640357465662
## (0, 0, 1)  5.922866295058727
## (2, 0, 0)  3.124825200033295
## (1, 0, 1)  2.967933864811228
## (1, 0, 2)  2.9481444828018253
## (2, 0, 1)  3.5091372748649037
## (2, 0, 2)  3.4143813707023325
## Selected order: (1, 0, 2), MAE: 2.9481444828018253

selected_order = (1, 0, 2)

Evaluate model performance on the test set

We use the multiprocessing package, which supports spawning processes using an API similar to the threading module.

def wrapper_fit_arima(x_vals, order=(1, 0, 2)):
    model = ARIMA(x_vals, order=order)
    model_fit = model.fit(disp=0)
    y_hat = model_fit.forecast(steps=24)[0]
    return y_hat

def evaluate_arima_jena_mp(X, arima_order, train_size=35045):
    # prepare training dataset
    X = X.astype('float32')
    train, test = X[:train_size], X[train_size:]
    test = test[:len(X[train_size:]) - len(X[train_size:]) % 24]
    test_24 = test.reshape(-1, 24)
    history = train.flatten()

    # build the cross-validation folds
    X_cv = []
    Y_cv = []
    for t in range(len(test_24)):
        x_cv = np.hstack([history, test_24[:t, :].flatten()])
        y_cv = test_24[t]
        X_cv.append(x_cv)
        Y_cv.append(y_cv)

    # fit and forecast the folds in parallel
    pool = mp.Pool(processes=mp.cpu_count() - 4)
    y_hats = pool.map(wrapper_fit_arima, X_cv)

    mae_cv = []
    for t in range(len(test_24)):
        mae_cv.append(mean_absolute_error(Y_cv[t], y_hats[t]))
    mean_mae = np.mean(mae_cv)
    return mean_mae

# Load test data
test_data = pd.read_csv('../data/test_series.csv', index_col=0).loc[:, ['T (degC)']]
temp2 = pd.concat([val_data, test_data]).values  # temperature (in degrees Celsius)

We fit the ARIMA(1,0,2) model using the validation set to predict the 24-hour chunks of temperature measurements in the test set.

## Uncomment code below to fit and predict temperature values in the test set
## With a CPU count of 28, the code ran for about 70 mins.
# MAE = evaluate_arima_jena_mp(temp2, arima_order=selected_order,
#                              train_size=len(val_data))
# print(f'ARIMA MAE: {np.mean(MAE)}')

## Expected result from running the code above:
## ARIMA MAE: 3.191548582794122
print('ARIMA MAE for Jena test data: 3.191548582794122')

ARIMA MAE for Jena test data: 3.191548582794122

Observation/s: The ARIMA(1,0,2) model performed worse on the test set (MAE: 3.19) than our baseline models (naive MAE: 3.18, seasonal naive MAE: 2.61).

Forecast 24 hours beyond test set


%%time
future_data = wrapper_fit_arima(temp2)
future_data

plt.figure(figsize=(15, 2))
plt.plot(temp2[-500:])
plt.plot(np.arange(24) + 500, future_data, label='forecast')
plt.legend()
plt.ylabel('Temperature (degree Celsius)')
plt.show()

Example 3: Using ARIMA for Multivariate Time Series

In this final example, we use ARIMA to forecast different time-series signals from an Air Quality Dataset measured in a polluted area of an Italian city. To illustrate, we forecast the last 24 hours of data for CO, NO2, and RH. The following figure shows the training data for each of these parameters.

train_aq = pd.read_csv('../data/AirQualityUCI/train_data.csv', index_col=0).set_index('Date_Time')
test_aq = pd.read_csv('../data/AirQualityUCI/test_data.csv', index_col=0).set_index('Date_Time')
train_aq.head()

                     CO(GT)  NO2(GT)         RH
Date_Time
2004-10-01 01:00:00     1.6     74.0  69.275002
2004-10-01 02:00:00     1.3     69.0  70.775000
2004-10-01 03:00:00     0.8     61.5  65.833332
2004-10-01 04:00:00     0.6     54.0  67.174999
2004-10-01 05:00:00     0.7     60.0  70.275000

fig, ax = plt.subplots(3, figsize=(15, 5), sharex=True)
train_aq.plot(ax=ax, subplots=True)
plt.xlabel('')
plt.show()

def evaluate_arima_multistep(X, arima_order, train_size=None):
    # prepare training dataset
    X = X.astype('float32')
    if train_size is None:
        train_size = int(len(X) * 0.50)
    else:
        train_size = int(train_size)
    train, test = X[0:train_size], X[train_size:]
    history = [x for x in train]
    # make a single multi-step forecast over the whole test horizon
    model = ARIMA(history, order=arima_order)
    model_fit = model.fit(disp=0)
    yhat = model_fit.forecast(steps=len(test))
    predictions = yhat[0]
    return mean_absolute_error(test, predictions)

def evaluate_models_mae(dataset, p_values, d_values, q_values, train_size=None):
    dataset = dataset.astype('float32')
    best_score, best_cfg = float('inf'), None
    for p in p_values:
        for d in d_values:
            for q in q_values:
                order = (p, d, q)
                try:
                    mae = evaluate_arima_multistep(dataset, order,
                                                   train_size=train_size)
                    if mae < best_score:
                        best_score, best_cfg = mae, order
                    print('ARIMA%s MAE=%.3f' % (order, mae))
                except:
                    continue
    print('Best ARIMA%s MAE=%.3f' % (best_cfg, best_score))

To make predictions, we train 3 ARIMA models—one for CO, one for NO2, and one for RH. We perform grid search for the following parameters:


p_values = range(0, 5)
d_values = range(0, 2)
q_values = range(0, 2)

arima_orders = []
for p in p_values:
    for d in d_values:
        for q in q_values:
            arima_orders.append((p, d, q))

var_col = train_aq.columns
print(var_col)

Index(['CO(GT)', 'NO2(GT)', 'RH'], dtype='object')

for c in var_col:
    df_var = train_aq[c].values.reshape(-1, 1)

    mae_order_list_var = []
    for ao in arima_orders:
        metric = evaluate_arima_multistep(df_var, arima_order=ao,
                                          train_size=len(train_aq) - len(test_aq))
        mae_order_list_var.append((ao, metric))

    print(f'{c}: {np.array(mae_order_list_var)[np.argmin(np.array(mae_order_list_var)[:, -1])]}')

CO(GT): [(0, 1, 0) 0.23812050839538637]
NO2(GT): [(0, 1, 0) 20.849329460041616]
RH: [(3, 1, 1) 9.583232972711079]

# For model order selection, see the grid search above
selected_order = {'CO(GT)': [(0, 1, 0)],
                  'NO2(GT)': [(0, 1, 0)],
                  'RH': [(3, 1, 1)]}

%%time
forecast_arima = {}
for c in var_col:
    model = ARIMA(train_aq[c].values, order=selected_order[c][0])
    model_fit = model.fit(disp=0)
    y_hat = model_fit.forecast(steps=24)[0]
    plt.figure(figsize=(15, 2))
    plt.plot(train_aq[c].values, label='train')
    plt.plot(np.arange(len(test_aq)) + len(train_aq), test_aq[c].values,
             label='test')
    plt.plot(np.arange(len(test_aq)) + len(train_aq), y_hat,
             label='forecast')
    plt.legend()
    plt.ylabel(c)
    plt.xlim(xmin=3500, xmax=4460)
    plt.show()

CPU times: user 6.32 s, sys: 192 ms, total: 6.52 s
Wall time: 1.9 s

Preview of the next Chapter

The ARIMA model is just one of the many algorithms that we will discuss, and it has inherent limitations. In particular, we are not considering:

1. Trends and seasonality
2. Long-range dependence
3. Multivariate time series

These factors appear frequently and are important in several applications. For example, trend is important in technical analysis for stock trading. This can be addressed using momentum forecasting, which will be discussed in Chapter 2, among other methods.

In Chapter 3, we will revisit the Air Quality example and use a multivariate approach. We'll introduce vector autoregressive (VAR) methods, a family of models that expresses each variable as a linear function of its own past lags and the past lags of the other variables, and show that we can improve the forecasts by extending this to a multivariate time-series problem.
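As a tiny preview (a sketch only, using statsmodels' VAR class on the air-quality training frame from Example 3; Chapter 3 develops this properly):

from statsmodels.tsa.api import VAR

var_fit = VAR(train_aq[['CO(GT)', 'NO2(GT)', 'RH']]).fit(maxlags=24, ic='aic')
# forecast the next 24 hours of all three series jointly
var_forecast = var_fit.forecast(train_aq.values[-var_fit.k_ar:], steps=24)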

References
The contents of this notebook are compiled from the following sources:

https://www.machinelearningplus.com/time-series/arima-model-time-series-forecasting-python/
https://otexts.com/fpp2/arima.html
https://people.duke.edu/~rnau/411arim.htm
https://www.analyticsvidhya.com/blog/2018/02/time-series-forecasting-methods/
https://machinelearningmastery.com/time-series-forecast-study-python-monthly-sales-french-champagne/
https://medium.com/@josemarcialportilla/using-python-and-auto-arima-to-forecast-seasonal-time-series-90877adff03c
https://en.wikipedia.org/wiki/Autoregressive_integrated_moving_average
C. Monterola, "Notebook 10 Time Series Forecasting Method — ARIMA MSDS2021"
Hyndman, R. J., & Khandakar, Y. (2008). Automatic time series forecasting: The forecast package for R. Journal of Statistical Software, 27(3), 1–22. https://doi.org/10.18637/jss.v027.i03

By students of PhD in Data Science Batch 2023 at the Asian Institute of Management
© Copyright 2020.
