0% found this document useful (0 votes)

66 views32 pages

Automatic Time Series Forecasting: The Forecast Package For R

This document describes the forecast package in R, which implements automatic time series forecasting methods. It discusses two main approaches: exponential smoothing and ARIMA modeling. Exponential smoothing forecasts are based on innovations state space models and the document provides the forecasting equations for the Holt-Winters additive method as an example. It also includes a table summarizing the point forecast equations for all exponential smoothing methods. The package is designed to automatically select and estimate exponential smoothing or ARIMA models for univariate time series forecasting.

Uploaded by

Angelo Villalobos

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

66 views32 pages

Automatic Time Series Forecasting: The Forecast Package For R

Uploaded by

Angelo Villalobos

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 32

See discussions, stats, and author profiles for this publication at: https://www.researchgate.

net/publication/222105759

Automatic Time Series Forecasting: The forecast Package for R

Article in Journal of statistical software · July 2008

DOI: 10.18637/jss.v027.i03

CITATIONS READS

1,553 2,339

2 authors, including:

Rob J Hyndman
Monash University (Australia)
300 PUBLICATIONS 20,183 CITATIONS

SEE PROFILE

Some of the authors of this publication are also working on these related projects:

Novel Approaches for Linking Air Quality Mixtures, Climate, and Human Health View project

Stress-testing algorithms: generating new test instances to elicit insights View project

All content following this page was uploaded by Rob J Hyndman on 31 May 2014.

The user has requested enhancement of the downloaded file.

ISSN 1440-771X

Department of Econometrics and Business Statistics

http://www.buseco.monash.edu.au/depts/ebs/pubs/wpapers/

Automatic time series forecasting:

the forecast package for R
Rob J Hyndman and Yeasmin Khandakar

June 2007

Working Paper 06/07

Automatic time series forecasting:
the forecast package for R

Rob J Hyndman
Department of Econometrics and Business Statistics,
Monash University, VIC 3800
Australia.
Email: [email protected]

Yeasmin Khandakar
Department of Econometrics and Business Statistics,
Monash University, VIC 3800
Australia.
Email: [email protected]

11 June 2007

JEL classification: C53,C22,C52

Automatic time series forecasting:
the forecast package for R

Abstract: Automatic forecasts of large numbers of univariate time series are often needed
in business and other contexts. We describe two automatic forecasting algorithms that have
been implemented in the forecast package for R. The first is based on innovations state space
models that underly exponential smoothing methods. The second is a step-wise algorithm for
forecasting with ARIMA models. The algorithms are applicable to both seasonal and non-
seasonal data, and are compared and illustrated using four real time series. We also briefly
describe some of the other functionality available in the forecast package.

Keywords: ARIMA models, automatic forecasting, exponential smoothing, prediction inter-

vals, state space models, time series, R.
Automatic time series forecasting: the forecast package for R

Automatic forecasts of large numbers of univariate time series are often needed in busi-
ness. It is common to have over one thousand product lines that need forecasting at least
monthly. Even when a smaller number of forecasts is required, there may be nobody
suitably trained in the use of time series models to produce them. In these circumstances,
an automatic forecasting algorithm is an essential tool. Automatic forecasting algorithms
must determine an appropriate time series model, estimate the parameters and compute
the forecasts. They must be robust to unusual time series patterns, and applicable to large
numbers of series without user intervention. The most popular automatic forecasting al-
gorithms are based on either exponential smoothing or ARIMA models.

In this article, we discuss the implementation of two automatic univariate forecasting

methods in the forecast package for R. The forecast package is part of the forecasting
bundle which also contains the packages fma and Mcomp. The forecast package con-
tains functions for univariate forecasting, while the other two packages contain large col-
lections of real time series data that are suitable for testing forecasting methods. The fma
package contains the 90 data sets from Makridakis et al. (1998), and the Mcomp package
contains the 1001 time series from the M-competition (Makridakis et al., 1982) and the
3003 time series from the M3-competition (Makridakis and Hibon, 2000).

The forecast package implements automatic forecasting using exponential smoothing,

ARIMA models, the Theta method (Assimakopoulos and Nikolopoulos, 2000), cubic
splines (Hyndman et al., 2005a), as well as other common forecasting methods. In this ar-
ticle, we only discuss the exponential smoothing approach (in Section 1) and the ARIMA
modelling approach (in Section 2) to automatic forecasting.

1 Exponential smoothing

Although exponential smoothing methods have been around since the 1950s, a mod-
elling framework incorporating procedures for model selection was not developed until
relatively recently. Ord et al. (1997), Hyndman et al. (2002) and Hyndman et al. (2005b)
have shown that all exponential smoothing methods (including non-linear methods) are
optimal forecasts from innovations state space models.

Exponential smoothing methods were originally classified by Pegels’ (1969) taxonomy.

Hyndman and Khandakar: June 2007 2

Automatic time series forecasting: the forecast package for R

This was later extended by Gardner (1985), modified by Hyndman et al. (2002), and ex-
tended again by Taylor (2003), giving a total of fifteen methods seen in the following
table.

Seasonal Component
Trend N A M
Component (None) (Additive) (Multiplicative)

N (None) N,N N,A N,M

A (Additive) A,N A,A A,M

Ad (Additive damped) Ad ,N Ad ,A Ad ,M

M (Multiplicative) M,N M,A M,M

Md (Multiplicative damped) Md ,N Md ,A Md ,M

Some of these methods are better known under other names. For example, cell (N,N)
describes the simple exponential smoothing (or SES) method, cell (A,N) describes Holt’s
linear method, and cell (Ad ,N) describes the damped trend method. The additive Holt-
Winters’ method is given by cell (A,A) and the multiplicative Holt-Winters’ method is
given by cell (A,M). The other cells correspond to less commonly used but analogous
methods.

1.1 Point forecasts for all methods

We denote the observed time series by y1 , y2 , . . . , yn . A forecast of yt+h based on all of the
data up to time t is denoted by ŷt+h|t . To illustrate the method, we give the point forecasts
and updating equations for method (A,A), the Holt-Winters’ additive method:

Level: ℓt = α(yt − st−m ) + (1 − α)(ℓt−1 + bt−1 ) (1a)

Growth: bt = β ∗ (ℓt − ℓt−1 ) + (1 − β ∗ )bt−1 (1b)

Seasonal: st = γ(yt − ℓt−1 − bt−1 ) + (1 − γ)st−m (1c)

Forecast: ŷt+h|t = ℓt + bt h + st−m+h+

m
. (1d)

Hyndman and Khandakar: June 2007 3

Automatic time series forecasting: the forecast package for R

where m is the length of seasonality (e.g., the number of months or quarters in a year),
ℓt represents the level of the series, bt denotes the growth, st is the seasonal component,
ŷt+h|t is the forecast for h periods ahead, and h+

m = (h − 1) mod m + 1. To use method

(1), we need values for the initial states ℓ0 , b0 and s1−m , . . . , s0 , and for the smoothing
parameters α, β ∗ and γ. All of these will be estimated from the observed data.

Equation (1c) is slightly different from the usual Holt-Winters equations such as those in
Makridakis et al. (1998) or Bowerman et al. (2005). These authors replace (1c) with

st = γ ∗ (yt − ℓt ) + (1 − γ ∗ )st−m .

If ℓt is substituted using (1a), we obtain

st = γ ∗ (1 − α)(yt − ℓt−1 − bt−1 ) + {1 − γ ∗ (1 − α)}st−m .

Thus, we obtain identical forecasts using this approach by replacing γ in (1c) with γ ∗ (1 −
α). The modification given in (1c) was proposed by Ord et al. (1997) to make the state
space formulation simpler. It is equivalent to Archibald’s (1990) variation of the Holt-
Winters’ method.

Table 1 gives recursive formulae for computing point forecasts h periods ahead for all of
the exponential smoothing methods. In each case, ℓt denotes the series level at time t,
bt denotes the slope at time t, st denotes the seasonal component of the series at time
t and m denotes the number of seasons in a year; α, β ∗ , γ and φ are constants, and
φh = φ + φ2 + · · · + φh .

Some interesting special cases can be obtained by setting the smoothing parameters to
extreme values. For example, if α = 0, the level is constant over time; if β ∗ = 0, the slope
is constant over time; and if γ = 0, the seasonal pattern is constant over time. At the other
extreme, naı̈ve forecasts (i.e., ŷt+h|t = yt for all h) are obtained using the (N,N) method
with α = 1. Finally, the additive and multiplicative trend methods are special cases of
their damped counterparts obtained by letting φ = 1.

Hyndman and Khandakar: June 2007 4

Hyndman and Khandakar: June 2007

Automatic time series forecasting: the forecast package for R

Seasonal
Trend N A M
ℓt = αyt + (1 − α)ℓt−1 ℓt = α(yt − st−m ) + (1 − α)ℓt−1 ℓt = α(yt /st−m ) + (1 − α)ℓt−1
N st = γ(yt − ℓt−1 ) + (1 − γ)st−m st = γ(yt /ℓt−1 ) + (1 − γ)st−m
ŷt+h|t = ℓt ŷt+h|t = ℓt + st−m+h+ m
ŷt+h|t = ℓt st−m+h+ m
ℓt = αyt + (1 − α)(ℓt−1 + bt−1 ) ℓt = α(yt − st−m ) + (1 − α)(ℓt−1 + bt−1 ) ℓt = α(yt /st−m ) + (1 − α)(ℓt−1 + bt−1 )
A bt = β ∗ (ℓt − ℓt−1 ) + (1 − β ∗ )bt−1 bt = β ∗ (ℓt − ℓt−1 ) + (1 − β ∗ )bt−1 bt = β ∗ (ℓt − ℓt−1 ) + (1 − β ∗ )bt−1
st = γ(yt − ℓt−1 − bt−1 ) + (1 − γ)st−m st = γ(yt /(ℓt−1 − bt−1 )) + (1 − γ)st−m
ŷt+h|t = ℓt + hbt ŷt+h|t = ℓt + hbt + st−m+h+ m
ŷt+h|t = (ℓt + hbt )st−m+h+ m
ℓt = αyt + (1 − α)(ℓt−1 + φbt−1 ) ℓt = α(yt − st−m ) + (1 − α)(ℓt−1 + φbt−1 ) ℓt = α(yt /st−m ) + (1 − α)(ℓt−1 + φbt−1 )
Ad bt = β ∗ (ℓt − ℓt−1 ) + (1 − β ∗ )φbt−1 bt = β ∗ (ℓt − ℓt−1 ) + (1 − β ∗ )φbt−1 bt = β ∗ (ℓt − ℓt−1 ) + (1 − β ∗ )φbt−1
st = γ(yt − ℓt−1 − φbt−1 ) + (1 − γ)st−m st = γ(yt /(ℓt−1 − φbt−1 )) + (1 − γ)st−m
ŷt+h|t = ℓt + φh bt ŷt+h|t = ℓt + φh bt + st−m+h+ m
ŷt+h|t = (ℓt + φh bt )st−m+h+ m
ℓt = αyt + (1 − α)ℓt−1 bt−1 ℓt = α(yt − st−m ) + (1 − α)ℓt−1 bt−1 ℓt = α(yt /st−m ) + (1 − α)ℓt−1 bt−1
M bt = β ∗ (ℓt /ℓt−1 ) + (1 − β ∗ )bt−1 bt = β ∗ (ℓt /ℓt−1 ) + (1 − β ∗ )bt−1 bt = β ∗ (ℓt /ℓt−1 ) + (1 − β ∗ )bt−1
st = γ(yt − ℓt−1 bt−1 ) + (1 − γ)st−m st = γ(yt /(ℓt−1 bt−1 )) + (1 − γ)st−m
ŷt+h|t = ℓt bht ŷt+h|t = ℓt bht + st−m+h+ m
ŷt+h|t = ℓt bht st−m+h+ m
ℓt = αyt + (1 − α)ℓt−1 bφt−1 ℓt = α(yt − st−m ) + (1 − α)ℓt−1 bφt−1 ℓt = α(yt /st−m ) + (1 − α)ℓt−1 bφt−1
Md bt = β ∗ (ℓt /ℓt−1 ) + (1 − β ∗ )bφt−1 bt = β ∗ (ℓt /ℓt−1 ) + (1 − β ∗ )bφt−1 bt = β ∗ (ℓt /ℓt−1 ) + (1 − β ∗ )bφt−1
st = γ(yt − ℓt−1 bφt−1 ) + (1 − γ)st−m st = γ(yt /(ℓt−1 bφt−1 )) + (1 − γ)st−m
ŷt+h|t = ℓt bφt h ŷt+h|t = ℓt bφt h + st−m+h+m
ŷt+h|t = ℓt bφt h st−m+h+m

Table 1: Formulae for recursive calculations and point forecasts. In each case, ℓt denotes the series level at time t, bt denotes the slope at time t,
st denotes the seasonal component of ∗
2 h +
t, and m denotes the number of seasons in a year; α, β , γ and φ are constants,
the series at time
φh = φ + φ + · · · + φ and hm = (h − 1) mod m + 1.
5
Automatic time series forecasting: the forecast package for R

1.2 Innovations state space models

For each exponential smoothing method in Table 1, Hyndman et al. (2002) describe two
possible innovations state space models, one corresponding to a model with additive
errors and the other to a model with multiplicative errors. If the same parameter values
are used, these two models give equivalent point forecasts, although different prediction
intervals. Thus there are 30 potential models described in this classification.

Historically, the nature of the error component has often been ignored, because the dis-
tinction between additive and multiplicative errors makes no difference to point fore-
casts.

We are careful to distinguish exponential smoothing methods from the underlying state
space models. An exponential smoothing method is an algorithm for producing point
forecasts only. The underlying stochastic state space model gives the same point fore-
casts, but also provides a framework for computing prediction intervals and other prop-
erties.

To distinguish the models with additive and multiplicative errors, we add an extra letter
to the front of the method notation. The triplet (E,T,S) refers to the three components:
error, trend and seasonality. So the model ETS(A,A,N) has additive errors, additive trend
and no seasonality—in other words, this is Holt’s linear method with additive errors.
Similarly, ETS(M,Md ,M) refers to a model with multiplicative errors, a damped multi-
plicative trend and multiplicative seasonality. The notation ETS(·,·,·) helps in remember-
ing the order in which the components are specified.

Once a model is specified, we can study the probability distribution of future values
of the series and find, for example, the conditional mean of a future observation given
knowledge of the past. We denote this as µt+h|t = E(yt+h | xt ), where xt contains the
unobserved components such as ℓt , bt and st . For h = 1 we use µt ≡ µt+1|t as a short-
hand notation. For many models, these conditional means will be identical to the point
forecasts given in Table 1, so that µt+h|t = ŷt+h|t . However, for other models (those with
multiplicative trend or multiplicative seasonality), the conditional mean and the point
forecast will differ slightly for h ≥ 2.

We illustrate these ideas using the damped trend method of Gardner and McKenzie

Hyndman and Khandakar: June 2007 6

Automatic time series forecasting: the forecast package for R

(1985).

Additive error model: ETS(A,Ad,N)

Let µt = ŷt = ℓt−1 + bt−1 denote the one-step forecast of yt assuming that we know the
values of all parameters. Also, let εt = yt − µt denote the one-step forecast error at time
t. From the equations in Table 1, we find that

yt = ℓt−1 + φbt−1 + εt (2)

ℓt = ℓt−1 + φbt−1 + αεt (3)

bt = φbt−1 + β ∗ (ℓt − ℓt−1 − φbt−1 ) = φbt−1 + αβ ∗ εt . (4)

We simplify the last expression by setting β = αβ ∗ . The three equations above constitute
a state space model underlying the damped Holt’s method. Note that it is an innovations
state space model (Anderson and Moore, 1979; Aoki, 1987) because the same error term
appears in each equation. We can write it in standard state space notation by defining the
state vector as xt = (ℓt , bt )′ and expressing (2)–(4) as

yt = [1 φ] xt−1 + εt (5a)
   
1 φ α
xt =   xt−1 +   εt . (5b)
0 φ β

The model is fully specified once we state the distribution of the error term εt . Usually
we assume that these are independent and identically distributed, following a normal
distribution with mean 0 and variance σ 2 , which we write as εt ∼ NID(0, σ 2 ).

Multiplicative error model: ETS(M,Ad,N)

A model with multiplicative error can be derived similarly, by first setting εt = (yt −
µt )/µt , so that εt is the relative error. Then, following a similar approach to that for
additive errors, we find

yt = (ℓt−1 + φbt−1 )(1 + εt )

Hyndman and Khandakar: June 2007 7

Automatic time series forecasting: the forecast package for R

ℓt = (ℓt−1 + φbt−1 )(1 + αεt )

bt = φbt−1 + β(ℓt−1 + φbt−1 )εt ,

yt = [1 φ] xt−1 (1 + εt )
   
1 φ α
xt =   xt−1 + [1 φ] xt−1   εt .
0 φ β

Again we assume that εt ∼ NID(0, σ 2 ).

Of course, this is a nonlinear state space model, which is usually considered difficult to
handle in estimating and forecasting. However, that is one of the many advantages of the
innovations form of state space models — we can still compute forecasts, the likelihood
and prediction intervals for this nonlinear model with no more effort than is required for
the additive error model.

1.3 State space models for all exponential smoothing methods

We now give the state space models for all 30 exponential smoothing variations. The
general model involves a state vector xt = (ℓt , bt , st , st−1 , . . . , st−m+1 )′ and state space
equations of the form

yt = w(xt−1 ) + r(xt−1 )εt (6a)

xt = f (xt−1 ) + g(xt−1 )εt (6b)

where {εt } is a Gaussian white noise process with mean zero and variance σ 2 , and µt =
w(xt−1 ). The model with additive errors has r(xt−1 ) = 1, so that yt = µt + εt . The model
with multiplicative errors has r(xt−1 ) = µt , so that yt = µt (1 + εt ). Thus, εt = (yt − µt )/µt
is the relative error for the multiplicative model. The models are not unique. Clearly, any
value of r(xt−1 ) will lead to identical point forecasts for yt .

All of the methods in Table 1 can be written in the form (6a) and (6b). The underlying
equations for additive error models are given in Table 2. We use β = αβ ∗ to simplify
the notation. Multiplicative error models are obtained by replacing εt with µt εt in the

Hyndman and Khandakar: June 2007 8

Hyndman and Khandakar: June 2007

Automatic time series forecasting: the forecast package for R

Trend Seasonal
N A M
N µt = ℓt−1 µt = ℓt−1 + st−m µt = ℓt−1 st−m
ℓt = ℓt−1 + αεt ℓt = ℓt−1 + αεt ℓt = ℓt−1 + αεt /st−m
st = st−m + γεt st = st−m + γεt /ℓt−1

µt = ℓt−1 + bt−1 µt = ℓt−1 + bt−1 + st−m µt = (ℓt−1 + bt−1 )st−m

A ℓt = ℓt−1 + bt−1 + αεt ℓt = ℓt−1 + bt−1 + αεt ℓt = ℓt−1 + bt−1 + αεt /st−m
bt = bt−1 + βεt bt = bt−1 + βεt bt = bt−1 + βεt /st−m
st = st−m + γεt st = st−m + γεt /(ℓt−1 + bt−1 )

µt = ℓt−1 + φbt−1 µt = ℓt−1 + φbt−1 + st−m µt = (ℓt−1 + φbt−1 )st−m

Ad ℓt = ℓt−1 + φbt−1 + αεt ℓt = ℓt−1 + φbt−1 + αεt ℓt = ℓt−1 + φbt−1 + αεt /st−m
bt = φbt−1 + βεt bt = φbt−1 + βεt bt = φbt−1 + βεt /st−m
st = st−m + γεt st = st−m + γεt /(ℓt−1 + φbt−1 )

µt = ℓt−1 bt−1 µt = ℓt−1 bt−1 + st−m µt = ℓt−1 bt−1 st−m

M ℓt = ℓt−1 bt−1 + αεt ℓt = ℓt−1 bt−1 + αεt ℓt = ℓt−1 bt−1 + αεt /st−m
bt = bt−1 + βεt /ℓt−1 bt = bt−1 + βεt /ℓt−1 bt = bt−1 + βεt /(st−m ℓt−1 )
st = st−m + γεt st = st−m + γεt /(ℓt−1 bt−1 )

µt = ℓt−1 bφt−1 µt = ℓt−1 bφt−1 + st−m µt = ℓt−1 bφt−1 st−m

Md ℓt = ℓt−1 bφt−1 + αεt ℓt = ℓt−1 bφt−1 + αεt ℓt = ℓt−1 bφt−1 + αεt /st−m
bt = bφt−1 + βεt /ℓt−1 bt = bφt−1 + βεt /ℓt−1 bt = bφt−1 + βεt /(st−m ℓt−1 )
st = st−m + γεt st = st−m + γεt /(ℓt−1 bφt−1 )

Table 2: State space equations for each additive error model in the classification.
9
Hyndman and Khandakar: June 2007

Automatic time series forecasting: the forecast package for R

Trend Seasonal
N A M
N µt = ℓt−1 µt = ℓt−1 + st−m µt = ℓt−1 st−m
ℓt = ℓt−1 (1 + αεt ) ℓt = ℓt−1 + α(ℓt−1 + st−m )εt ℓt = ℓt−1 (1 + αεt )
st = st−m + γ(ℓt−1 + st−m )εt st = st−m (1 + γεt )

µt = ℓt−1 + bt−1 µt = ℓt−1 + bt−1 + st−m µt = (ℓt−1 + bt−1 )st−m

A ℓt = (ℓt−1 + bt−1 )(1 + αεt ) ℓt = ℓt−1 + bt−1 + α(ℓt−1 + bt−1 + st−m )εt ℓt = (ℓt−1 + bt−1 )(1 + αεt )
bt = bt−1 + β(ℓt−1 + bt−1 )εt bt = bt−1 + β(ℓt−1 + bt−1 + st−m )εt bt = bt−1 + β(ℓt−1 + bt−1 )εt
st = st−m + γ(ℓt−1 + bt−1 + st−m )εt st = st−m (1 + γεt )

µt = ℓt−1 + φbt−1 µt = ℓt−1 + φbt−1 + st−m µt = (ℓt−1 + φbt−1 )st−m

Ad ℓt = (ℓt−1 + φbt−1 )(1 + αεt ) ℓt = ℓt−1 + φbt−1 + α(ℓt−1 + φbt−1 + st−m )εt ℓt = (ℓt−1 + φbt−1 )(1 + αεt )
bt = φbt−1 + β(ℓt−1 + φbt−1 )εt bt = φbt−1 + β(ℓt−1 + φbt−1 + st−m )εt bt = φbt−1 + β(ℓt−1 + φbt−1 )εt
st = st−m + γ(ℓt−1 + φbt−1 + st−m )εt st = st−m (1 + γεt )

µt = ℓt−1 bt−1 µt = ℓt−1 bt−1 + st−m µt = ℓt−1 bt−1 st−m

M ℓt = ℓt−1 bt−1 (1 + αεt ) ℓt = ℓt−1 bt−1 + α(ℓt−1 bt−1 + st−m )εt ℓt = ℓt−1 bt−1 (1 + αεt )
bt = bt−1 (1 + βεt ) bt = bt−1 + β(ℓt−1 bt−1 + st−m )εt /ℓt−1 bt = bt−1 (1 + βεt )
st = st−m + γ(ℓt−1 bt−1 + st−m )εt st = st−m (1 + γεt )

µt = ℓt−1 bφt−1 µt = ℓt−1 bφt−1 + st−m µt = ℓt−1 bφt−1 st−m

Md ℓt = ℓt−1 bφt−1 (1 + αεt ) ℓt = ℓt−1 bφt−1 + α(ℓt−1 bφt−1 + st−m )εt ℓt = ℓt−1 bφt−1 (1 + αεt )
bt = bφt−1 (1 + βεt ) bt = bφt−1 + β(ℓt−1 bφt−1 + st−m )εt /ℓt−1 bt = bφt−1 (1 + βεt )
st = st−m + γ(ℓt−1 bφt−1 + st−m )εt st = st−m (1 + γεt )

Table 3: State space equations for each multiplicative error model in the classification.
10
Automatic time series forecasting: the forecast package for R

equations of Table 2. The resulting multiplicative error equations are given in Table 3.

Some of the combinations of trend, seasonality and error can occasionally lead to numer-
ical difficulties; specifically, any model equation that requires division by a state compo-
nent could involve division by zero. This is a problem for models with additive errors
and either multiplicative trend or multiplicative seasonality, as well as for the model with
multiplicative errors, multiplicative trend and additive seasonality. These models should
therefore be used with caution.

The multiplicative error models are useful when the data are strictly positive, but are not
numerically stable when the data contain zeros or negative values. So when the time
series is not strictly positive, only the six fully additive models may be applied.

The point forecasts given in Table 1 are easily obtained from these models by iter-
ating equations (6a) and (6b) for t = n + 1, n + 2, . . . , n + h, setting εn+j = 0 for
j = 1, . . . , h. In most cases (notable exceptions being models with multiplicative sea-
sonality or multiplicative trend for h ≥ 2), the point forecasts can be shown to be equal to
µt+h|t = E(yt+h | xt ), the conditional expectation of the corresponding state space model.

The models also provide a means of obtaining prediction intervals. In the case of the
linear models, where the forecast distributions are normal, we can derive the conditional
variance vt+h|t = Var(yt+h | xt ) and obtain prediction intervals accordingly. This ap-
proach also works for many of the nonlinear models. Detailed derivations of the results
for many models are given in Hyndman et al. (2005b).

A more direct approach that works for all of the models is to simply simulate many fu-
ture sample paths conditional on the last estimate of the state vector, xt . Then prediction
intervals can be obtained from the percentiles of the simulated sample paths. Point fore-
casts can also be obtained in this way by taking the average of the simulated values at
each future time period. An advantage of this approach is that we generate an estimate
of the complete predictive distribution, which is especially useful in applications such as
inventory planning, where expected costs depend on the whole distribution.

Hyndman and Khandakar: June 2007 11

Automatic time series forecasting: the forecast package for R

1.4 Estimation

In order to use these models for forecasting, we need to know the values of x0 and the
parameters α, β, γ and φ. It is easy to compute the likelihood of the innovations state
space model (6), and so obtain maximum likelihood estimates. Ord et al. (1997) show
that
n
X n
X
∗
L (θ, x0 ) = n log ε2t +2 log |r(xt−1 )| (7)
t=1 t=1

is equal to twice the negative logarithm of the likelihood function (with constant terms
eliminated), conditional on the parameters θ = (α, β, γ, φ)′ and the initial states x0 =
(ℓ0 , b0 , s0 , s−1 , . . . , s−m+1 )′ , where n is the number of observations. This is easily com-
puted by simply using the recursive equations in Table 1. Unlike state space models
with multiple sources of error, we do not need to use the Kalman filter to compute the
likelihood.

The parameters θ and the initial states x0 can be estimated by minimizing L∗ . Most
implementations of exponential smoothing use an ad hoc heuristic scheme to estimate
x0 . However, with modern computers, there is no reason why we cannot estimate x0
along with θ, and the resulting forecasts are often substantially better when we do.

We constrain the initial states x0 so that the seasonal indices add to zero for additive sea-
sonality, and add to m for multiplicative seasonality. There have been several suggestions
for restricting the parameter space for α, β and γ. The traditional approach is to ensure
that the various equations can be interpreted as weighted averages, thus requiring α,
β ∗ = β/α, γ ∗ = γ/(1 − α) and φ to all lie within (0, 1). This suggests

0 < α < 1, 0 < β < α, 0 < γ < 1 − α, and 0 < φ < 1.

However, Hyndman et al. (2007) show that these restrictions are usually stricter than nec-
essary (although in a few cases they are not restrictive enough).

Hyndman and Khandakar: June 2007 12

Automatic time series forecasting: the forecast package for R

1.5 Model selection

The forecast accuracy measures described in the previous section can be used for selecting
a model for a given set of data, provided the errors are computed from data in a hold-
out set and not from the same data as were used for model estimation. However, there
are often too few out-of-sample errors to draw reliable conclusions. Consequently, a
penalized method based on the in-sample fit is usually better.

One such approach uses a penalized likelihood such as Akaike’s Information Criterion:

AIC = L∗ (θ̂, x̂0 ) + 2q,

where q is the number of parameters in θ plus the number of free states in x0 , and θ̂ and
x̂0 denote the estimates of θ and x0 . We select the model that minimizes the AIC amongst
all of the models that are appropriate for the data.

The AIC also provides a method for selecting between the additive and multiplicative
error models. The point forecasts from the two models are identical so that standard
forecast accuracy measures such as the mean squared error (MSE) or mean absolute per-
centage error (MAPE) are unable to select between the error types. The AIC is able to
select between the error types because it is based on likelihood rather than one-step fore-
casts.

Obviously, other model selection criteria (such as the BIC) could also be used in a similar
manner.

1.6 Automatic forecasting

We combine the preceding ideas to obtain a robust and widely applicable automatic fore-
casting algorithm. The steps involved are summarized below.

1. For each series, apply all models that are appropriate, optimizing the parameters
(both smoothing parameters and the initial state variable) of the model in each case.
2. Select the best of the models according to the AIC.
3. Produce point forecasts using the best model (with optimized parameters) for as
many steps ahead as required.

Hyndman and Khandakar: June 2007 13

Automatic time series forecasting: the forecast package for R

4. Obtain prediction intervals for the best model either using the analytical results of
Hyndman et al. (2005b), or by simulating future sample paths for {yn+1 , . . . , yn+h }
and finding the α/2 and 1 − α/2 percentiles of the simulated data at each forecast-
ing horizon. If simulation is used, the sample paths may be generated using the
normal distribution for errors (parametric bootstrap) or using the resampled errors
(ordinary bootstrap).

Hyndman et al. (2002) applied this automatic forecasting strategy to the M-competition
data (Makridakis et al., 1982) and the IJF-M3 competition data (Makridakis and Hibon,
2000) using a restricted set of exponential smoothing models, and demonstrated that the
methodology is particularly good at short term forecasts (up to about 6 periods ahead),
and especially for seasonal short-term series (beating all other methods in the competi-
tions for these series).

2 ARIMA models

A common obstacle for many people in using Autoregressive Integrated Moving Average
(ARIMA) models for forecasting is that the order selection process is usually considered
subjective and difficult to apply. But it does not have to be. There have been several
attempts to automate ARIMA modelling in the last 25 years.

Hannan and Rissanen (1982) proposed a method to identify the order of an ARMA model
for a stationary series. In their method the innovations can be obtained by fitting a long
autoregressive model to the data, and then the likelihood of potential models is computed
via a series of standard regressions. They established the asymptotic properties of the
procedure under very general conditions.

Gómez (1998) extended the Hannan-Rissanen identification method to include mul-

tiplicative seasonal ARIMA model identification. Gómez and Maravall (1998) imple-
mented this automatic identification procedure in the software TRAMO and SEATS. For
a given series, the algorithm attempts to find the model with the minimum BIC.

Liu (1989) proposed a method for identification of seasonal ARIMA models using a fil-
tering method and certain heuristic rules; this algorithm is used in the SCA-Expert soft-
ware. Another approach is described by Mélard and Pasteels (2000) whose algorithm for

Hyndman and Khandakar: June 2007 14

Automatic time series forecasting: the forecast package for R

univariate ARIMA models also allows intervention analysis. It is implemented in the

software package “Time Series Expert” (TSE-AX).

Other algorithms are in use in commercial software, although they are not documented in
the public domain literature. In particular, Forecast Pro (Goodrich, 2000) is well-known
for its excellent automatic ARIMA algorithm which was used in the M3-forecasting com-
petition (Makridakis and Hibon, 2000). Another proprietary algorithm is implemented
in AutoBox (Reilly, 2000). Ord and Lowe (1996) provide an early review of some of the
commercial software that implement automatic ARIMA forecasting.

2.1 Choosing the model order using unit root tests and the AIC

A non-seasonal ARIMA(p, d, q) process is given by

(1 − B d )yt = c + φ(B)yt + θ(B)εt

where {εt } is a white noise process with mean zero and variance σ 2 , B is the backshift
operator, and φ(z) and θ(z) are polynomials of order p and q respectively. To ensure
causality and invertibility, it is assumed that φ(z) and θ(z) have no roots for |z| < 1
(Brockwell and Davis, 1991). If c 6= 0, there is an implied polynomial of order d in the
forecast function.

The seasonal ARIMA(p, d, q)(P, D, Q)m process is given by

(1 − B m )D (1 − B)d yt = c + Φ(B m )φ(B)yt + Θ(B m )θ(B)εt

where Φ(z) and Θ(z) are polynomials of orders P and Q respectively, each containing no
roots inside the unit circle. If c 6= 0, there is an implied polynomial of order d + D in the
forecast function.

The main task in automatic ARIMA forecasting is selecting an appropriate model order,
that is the values p, q, P , Q, D, d. If d and D are known, we can select the orders p, q, P
and Q via an information criterion such as the AIC:

AIC = −2 log(L) + 2(p + q + P + Q + k)

Hyndman and Khandakar: June 2007 15

Automatic time series forecasting: the forecast package for R

where k = 1 if c 6= 0 and 0 otherwise, and L is the maximized likelihood of the model

fitted to the differenced data (1 − B m )D (1 − B)d yt . The likelihood of the full model for yt
is not actually defined and so the value of the AIC for different levels of differencing are
not comparable.

One solution to this difficulty is the “diffuse prior” approach which is outlined in
Durbin and Koopman (2001) and implemented in the arima function in R. In this ap-
proach, the initial values of the time series (before the observed values) are assumed to
have mean zero and a large variance. However, choosing d and D by minimizing the
AIC using this approach tends to lead to over-differencing. For forecasting purposes,
we believe it is better to make as few differences as possible because over-differencing
harms forecasts (Smith and Yadav, 1994) and widens prediction intervals. (Although, see
Hendry (1997) for a contrary view.)

Consequently, we need some other approach to choose d and D. We prefer unit-root tests.
However, most unit-root tests are based on a null hypothesis that a unit root exists which
biases results towards more differences rather than fewer differences. For example, vari-
ations on the Dickey-Fuller test (Dickey and Fuller, 1981) all assume there is a unit root
at lag 1, and the HEGY test of Hylleberg et al. (1990) is based on a null hypothesis that
there is a seasonal unit root. Instead, we prefer unit-root tests based on a null hypothesis
of no unit-root.

For non-seasonal data, we consider ARIMA(p, d, q) models where d is selected based on

successive KPSS unit-root tests (Kwiatkowski et al., 1992). That is, we test the data for a
unit root; if the test result is significant, we test the differenced data for a unit root; and
so on. We stop this procedure when we obtain our first insignificant result.

For seasonal data, we consider ARIMA(p, d, q)(P, D, Q)m models where m is the sea-
sonal frequency and D = 0 or D = 1 depending on an extended Canova-Hansen
test (Canova and Hansen, 1995). Canova and Hansen only provide critical values for
2 < m < 13. In our implementation of their test, we allow any value of m > 1. Let Cm
be the critical value for seasonal period m. We plotted Cm against m for values of m up
to 365 and noted that they fit the line Cm = 0.269m0.928 almost exactly. So for m > 12, we
use this simple expression to obtain the critical value.

Hyndman and Khandakar: June 2007 16

Automatic time series forecasting: the forecast package for R

We note in passing that the null hypothesis for the Canova-Hansen test is not an ARIMA
model as it includes seasonal dummy terms. It is a test for whether the seasonal pattern
changes sufficiently over time to warrant a seasonal unit root, or whether a stable sea-
sonal pattern modelled using fixed dummy variables is more appropriate. Nevertheless,
we have found that the test is still useful for choosing D in a strictly ARIMA framework
(i.e., without seasonal dummy variables). If a stable seasonal pattern is selected (i.e.,
the null hypothesis is not rejected), the seasonality is effectively handled by stationary
seasonal AR and MA terms.

After D is selected, we choose d by applying successive KPSS unit-root tests to the sea-
sonally differenced data (if D = 1) or the original data (if D = 0). Once d (and possibly
D) are selected, we proceed to select the values of p, q, P and Q by minimizing the AIC.
We allow c 6= 0 for models where d + D < 2.

2.2 A step-wise procedure for traversing the model space

Suppose we have seasonal data and we consider ARIMA(p, d, q)(P, D, Q)m models where
p and q can take values from 0 to 3, and P and Q can take values from 0 to 1. When c = 0
there are a total of 288 possible models, and when c 6= 0 there are a total of 192 possible
models, giving 480 models altogether. If the values of p, d, q, P , D and Q are allowed
to range more widely, the number of possible models increases rapidly. Consequently,
it is often not feasible to simply fit every potential model and choose the one with the
lowest AIC. Instead, we need a way of traversing the space of models efficiently in order
to arrive at the model with the lowest AIC value.

We propose a step-wise algorithm as follows.

Step 1: We try four possible models to start with.

• ARIMA(2, d, 2) if m = 1 and ARIMA(2, d, 2)(1, D, 1) if m > 1.

• ARIMA(0, d, 0) if m = 1 and ARIMA(0, d, 0)(0, D, 0) if m > 1.

• ARIMA(1, d, 0) if m = 1 and ARIMA(1, d, 0)(1, D, 0) if m > 1.

• ARIMA(0, d, 1) if m = 1 and ARIMA(0, d, 1)(0, D, 1) if m > 1.

If d + D ≤ 1, these models are fitted with c 6= 0. Otherwise, we set c = 0. Of these

Hyndman and Khandakar: June 2007 17

Automatic time series forecasting: the forecast package for R

four models, we select the one with the smallest AIC value. This is called the “cur-
rent” model and is denoted by ARIMA(p, d, q) if m = 1 or ARIMA(p, d, q)(P, D, Q)m
if m > 1.

Step 2: We consider up to thirteen variations on the current model:

• where one of p, q, P and Q is allowed to vary by ±1 from the current model;

• where p and q both vary by ±1 from the current model;

• where P and Q both vary by ±1 from the current model;

• where the constant c is included if the current model has c = 0 or excluded if

the current model has c 6= 0.

Whenever a model with lower AIC is found, it becomes the new “current” model
and the procedure is repeated. This process finishes when we cannot find a model
close to the current model with lower AIC.

There are several constraints on the fitted models to avoid problems with convergence or
near unit-roots. The constraints are outlined below.

• The values of p and q are not allowed to exceed specified upper bounds (with de-
fault values of 5 in each case).

• The values of P and Q are not allowed to exceed specified upper bounds (with
default values of 2 in each case).

• We reject any model which is “close” to non-invertible or non-causal. Specifically,

we compute the roots of φ(B)Φ(B) and θ(B)Θ(B). If either have a root that is
smaller than 1.001 in absolute value, the model is rejected.

• If there are any errors arising in the non-linear optimization routine used for esti-
mation, the model is rejected. The rationale here is that any model that is difficult
to fit is probably not a good model for the data.

The algorithm is guaranteed to return a valid model because the model space is finite
and at least one of the starting models will be accepted (the model with no AR or MA
parameters). The selected model is used to produce forecasts.

Hyndman and Khandakar: June 2007 18

Automatic time series forecasting: the forecast package for R

3 The forecast package

The algorithms and modelling frameworks for automatic univariate time series forecast-
ing are implemented in the forecast package (Hyndman, 2007) in R. It is available from
CRAN. Version 1.05 of the package was used for this paper.

We illustrate the methods using the following four real time series shown in Figure 1.

• Figure 1(a) shows 125 monthly U.S. government bond yields (percent per annum)
from January 1994 to May 2004.
• Figure 1(b) displays 55 observations of annual U.S. net electricity generation (billion
kwh) for 1949 through 2003.
• Figure 1(c) presents 113 quarterly observations of passenger motor vehicle produc-
tion in the U.K. (thousands of cars) for the first quarter of 1977 through the first
quarter of 2005.
• Figure 1(d) shows 240 monthly observations of the number of short term overseas
visitors to Australia from May 1985 to April 2005.

Figure 1: Four time series showing point forecasts and 80% & 95% prediction intervals obtained
using exponential smoothing state space models.
(a) US 10−year bonds yield (b) US net electricity generation
5000
8
Percentage per annum

Billion kwh

3000
6
5

1000
4
3

0 20 40 60 80 100 120 140 1950 1960 1970 1980 1990 2000 2010

Year Year

(c) UK passenger motor vehicle production (d) Overseas visitors to Australia

800
500

Thousands of people
Thousands of cars

600
400

400
300

200
200

1980 1985 1990 1995 2000 2005 1985 1990 1995 2000 2005

Year Year

Hyndman and Khandakar: June 2007 19

Automatic time series forecasting: the forecast package for R

3.1 Implementation of the automatic exponential smoothing algorithm

The innovations state space modelling framework described in Section 1 is implemented

via the ets() function in the forecast package. For example, to apply it to the US net
electricity generation time series uselec, we use the following commands.

etsfit <- ets(uselec)

fcast <- forecast(etsfit)
plot(fcast)

The object etsfit contains all of the necessary information about the fitted model in-
cluding model parameters, the value of the state vector xt for all t, residuals and so on.
The forecast function computes the required forecasts which are then plotted as in Fig-
ure 1(b).

The models chosen via the algorithm were:

• ETS(A,Ad ,N) for monthly US 10-year bonds yield

(α = 0.99, β = 0.12, φ = 0.80, ℓ0 = 5.30, b0 = 0.71);
• ETS(M,Md ,N) for annual US net electricity generation
(α = 0.99, β = 0.01, φ = 0.97, ℓ0 = 262.5, b0 = 1.12);
• ETS(A,N,A) for quarterly UK motor vehicle production
(α = 0.61, γ = 0.01, ℓ0 = 343.4, s−3 = 24.99, s−2 = 21.40, s−1 = −44.96, s0 = −1.42);
• ETS(M,A,M) for monthly Australian overseas visitors
(α = 0.57, β = 0.01, γ = 0.19, ℓ0 = 86.2, b0 = 2.66, s−11 = 0.851, s−10 = 0.844,
s−9 = 0.985, s−8 = 0.924, s−7 = 0.822, s−6 = 1.006, s−5 = 1.101, s−4 = 1.369,
s−3 = 0.975, s−2 = 1.078, s−1 = 1.087, s0 = 0.958).

Although there is a lot of computation involved, it can be handled remarkably quickly on

modern computers. Each of the forecasts shown in Figure 1 took no more than a few sec-
onds on a standard PC. The US electricity generation series took the longest as there are
no analytical prediction intervals available for the ETS(M,Md ,N) model. Consequently,
the prediction intervals for this series were computed using simulation of 5000 future
sample paths.

Hyndman and Khandakar: June 2007 20

Automatic time series forecasting: the forecast package for R

3.2 The HoltWinters function

There is another implementation of exponential smoothing in R via the HoltWinters

function in the stats package. It implements only the (N,N), (A,N), (A,A) and (A,M)
methods. The initial states x0 are fixed using a heuristic algorithm. Because of the way
the initial states are estimated, a full three years of seasonal data is required to imple-
ment the seasonal forecasts using HoltWinters. (See Hyndman and Kostenko (2007) for
the minimal sample size required.) The smoothing parameters are optimized by mini-
mizing the average squared prediction errors, which is equivalent to minimizing (7) in
the case of additive errors.

There is a predict method for the resulting object which can produce point forecasts and
prediction intervals. Although it is nowhere documented, it appears that the prediction
intervals produced by predict for an object of class HoltWinters are based on an equiva-
lent ARIMA model in the case of the (N,N), (A,N) and (A,A) methods, assuming additive
errors. These prediction intervals are equivalent to the prediction intervals that arise from
the (A,N,N), (A,A,N) and (A,A,A) state space models. For the (A,M) method, the predic-
tion interval provided by predict appears to be based on Chatfield and Yar (1991) which
is an approximation to the true prediction interval arising from the (A,A,M) model. Pre-
diction intervals with multiplicative errors are not possible using the HoltWinters func-
tion.

3.3 Implementation of the automatic ARIMA algorithm

The algorithm of Section 2 is applied to the same four time series. Unlike the exponential
smoothing algorithm, the ARIMA class of models assumes homoscedasticity which is not
always appropriate. Consequently, transformations are sometimes necessary. For these
four time series, we model the raw data for series (a)–(c), but the logged data for series
(d). The prediction intervals are back-transformed with the point forecasts to preserve
the probability coverage.

To apply this algorithm to the US net electricity generation time series uselec, we use the
following commands.

Hyndman and Khandakar: June 2007 21

Automatic time series forecasting: the forecast package for R

Figure 2: Four time series showing point forecasts and 80% & 95% prediction intervals obtained
using ARIMA models.
(a) US 10−year bonds yield (b) US net electricity generation

5000
8
Percentage per annum

Billion kwh

3000
6
5

1000
4
3

0 20 40 60 80 100 120 140 1950 1960 1970 1980 1990 2000 2010

Year Year

(c) UK passenger motor vehicle production (d) Overseas visitors to Australia

800 1000
500

Thousands of people
Thousands of cars

400

600
300

400
200
200

1980 1985 1990 1995 2000 2005 1985 1990 1995 2000 2005

Year Year

arimafit <- auto.arima(uselec)

fcast <- forecast(arimafit)
plot(fcast)

The function auto.arima() implements the algorithm of Section 2 and returns an object
of class Arima. The resulting forecasts are shown in Figure 2. The fitted models are as
follows:

• ARIMA(0,1,1) for monthly US 10-year bonds yield

(θ1 = 0.322);
• ARIMA(2,1,2) with drift for annual US net electricity generation
(φ1 = −1.3031, φ2 = −0.4334, θ1 = 1.5280, θ2 = 0.8338, c = 66.1473);
• ARIMA(3,1,1)(2,0,1)4 model for quarterly UK motor vehicle production
(φ1 = −0.9570, φ2 = −0.4500, φ3 = −0.3314, θ1 = 0.5784, Φ1 = 0.6743, Φ2 = 0.3175,
Θ1 = −0.8364;

Hyndman and Khandakar: June 2007 22

Automatic time series forecasting: the forecast package for R

• ARIMA(1,1,1)(2,0,0)12 model with drift for monthly Australian overseas visitors

(φ1 = 0.4740, θ1 = −0.7393, Φ1 = 0.4728, Φ2 = 0.4780, c = 0.0110).

Note that the R parameterization has θ(B) = (1 + θ1 B + · · · + θq B) and φ(B) = (1 − φ1 B +

· · · − φq B), and similarly for the seasonal terms.

The forecast package also contains the function arima() which is a wrapper to the func-
tion of the same name in the stats package. The arima() function in the forecast package
makes it easier to include a drift term when d + D = 1. (Setting include.mean=TRUE in
the arima() function from the stats package will only work when d + D = 0.)

3.4 The forecast() function

The forecast() function is generic and has S3 methods for a wide range of time se-
ries models. It computes point forecasts and prediction intervals from the time series
model. Methods exist for models fitted using ets(), arima(), ar(), HoltWinters() and
StructTS().

In the latter four cases, there is also a predict() function which is intended to do much
the same thing. Unfortunately, the resulting objects from the predict function contain
different information in each case and so it is not possible to build generic functions
(such as plot and summary) for the results. So, instead, forecast() acts as a wrapper to
predict(), and packages the information obtained in a common format (the forecast
class).

There is also a method for a time series object. If a time series object is passed as the first
argument to forecast(), the function will produce forecasts based on the exponential
smoothing algorithm of Section 1.

3.5 The forecast class

The output from the forecast() function is an object of class “forecast”. Several other
functions in the forecast package also produce objects of this class, but they are not dis-
cussed here. An object of class forecast includes at least the following information:

• the original series;

Hyndman and Khandakar: June 2007 23

Automatic time series forecasting: the forecast package for R

• point forecasts;
• prediction intervals of specified coverage;
• the forecasting method used and information about the fitted model;
• residuals from the fitted model;
• one-step forecasts from the fitted model for the period of the observed data.

There are print, plot and summary methods for the forecast class. Figures 1
and 2 were produced using plot.forecast. Figure 3 shows the summary out-
put for the forecasts plotted in Figures 1(c) and 2(c). The in-sample error mea-
sures (defined in Hyndman and Koehler, 2006) for the two models are almost iden-
tical. Note that the information criteria are not comparable. The prediction inter-
vals are, by default, computed for 80% and 95% coverage, although other values are
possible if requested. Fan charts (Wallis, 1999) are possible using the combination
plot(forecast(model.object,fan=TRUE)).

4 Comparisons

There is a widespread myth that ARIMA models are more general than exponential
smoothing. This is not true. The two classes of models overlap. The linear exponen-
tial smoothing models are all special cases of ARIMA models—the equivalences are dis-
cussed in Hyndman et al. (2007). However, the non-linear exponential smoothing mod-
els have no equivalent ARIMA counterpart. On the other hand, there are many ARIMA
models which have no exponential smoothing counterpart. Thus, the two model classes
overlap and are complimentary; each has its strengths and weaknesses.

The exponential smoothing state space models are all non-stationary. Models with sea-
sonality or non-damped trend (or both) have two unit roots; all other models—that is,
non-seasonal models with either no trend or damped trend—have one unit root. It is
possible to define a stationary model with similar characteristics to exponential smooth-
ing, but this is not normally done. The philosophy of exponential smoothing is that the
world is non-stationary. So if a stationary model is required, ARIMA models are better.

One advantage of the exponential smoothing models is that they can be non-linear. So

Hyndman and Khandakar: June 2007 24

Automatic time series forecasting: the forecast package for R

> summary(forecast(etsfit))

Forecast method: ETS(A,N,A)

Smoothing parameters:
alpha = 0.6063
gamma = 0.01

Initial states:
l = 343.4342
s = -1.4193 -44.9642 21.3933 24.9903

sigma: 25.5668
AIC: 1277.87
AICc: 1278.662
BIC: 1294.234

In-sample error measures:

ME RMSE MAE MPE MAPE MASE
0.917498382 25.469972683 20.449717769 -0.002726478 0.067189284 0.511400816

Forecasts:
Point Forecast Lo 80 Hi 80 Lo 95 Hi 95
2005 Q2 427.6845 394.9193 460.4497 377.5744 477.7945
2005 Q3 361.8133 323.4959 400.1308 303.2119 420.4148
2005 Q4 405.1787 361.8657 448.4917 338.9372 471.4202
2006 Q1 431.5437 383.8920 479.1954 358.6668 504.4207
2006 Q2 427.6845 376.0575 479.3115 348.7278 506.6412
2006 Q3 361.8133 306.4959 417.1307 277.2127 446.4140
2006 Q4 405.1787 346.2906 464.0668 315.1172 495.2403
2007 Q1 431.5437 369.3949 493.6925 336.4953 526.5921

> summary(forecast(arimafit))

Forecast method: ARIMA(3,1,1)(2,0,1)[4]

Coefficients:
ar1 ar2 ar3 ma1 sar1 sar2 sma1
-0.9570 -0.4500 -0.3314 0.5784 0.6743 0.3175 -0.8364
s.e. 0.2063 0.1442 0.1149 0.2173 0.1649 0.1500 0.2151

sigma^2 estimated as 667.4: log likelihood = -527.49

AIC = 1070.98 AICc = 1072.04 BIC = 1090.07

In-sample error measures:

ME RMSE MAE MPE MAPE MASE
1.584763755 25.718591689 20.378247940 0.003401545 0.067228348 0.509613519

Forecasts:
Point Forecast Lo 80 Hi 80 Lo 95 Hi 95
2005 Q2 420.3970 387.2903 453.5037 369.7647 471.0293
2005 Q3 375.4078 336.4292 414.3864 315.7952 435.0204
2005 Q4 407.6859 364.8897 450.4822 342.2347 473.1371
2006 Q1 441.5159 396.1285 486.9033 372.1018 510.9300
2006 Q2 425.1130 376.4140 473.8120 350.6343 499.5917
2006 Q3 377.9051 327.1692 428.6410 300.3113 455.4989
2006 Q4 408.1083 354.5331 461.6834 326.1722 490.0443
2007 Q1 441.1217 385.4706 496.7729 356.0106 526.2328

Figure 3: Example output from the summary method for the forecast class.
Hyndman and Khandakar: June 2007 25
Automatic time series forecasting: the forecast package for R

time series that exhibit non-linear characteristics including heteroscedasticity may be bet-
ter modelled using exponential smoothing state space models.

For seasonal data, there are many more ARIMA models than the 30 possible models in
the exponential smoothing class of Section 1. It may be thought that the larger model
class is advantageous. However, the results in Hyndman et al. (2002) show that the ex-
ponential smoothing models performed better than the ARIMA models for the seasonal
M3 competition data. (For the annual M3 data, the ARIMA models performed better.) In
a discussion of these results, Hyndman (2001) speculates that the larger model space of
ARIMA models actually harms forecasting performance because it introduces additional
uncertainty. The smaller exponential smoothing class is sufficiently rich to capture the
dynamics of almost all real business and economic time series.

Hyndman and Khandakar: June 2007 26

Automatic time series forecasting: the forecast package for R

References
Anderson, B. D. O. and J. B. Moore (1979) Optimal Filtering, Prentice-Hall, Englewood
Cliffs.

Aoki, M. (1987) State space modeling of time series, Springer-Verlag, Berlin.

Archibald, B. C. (1990) Parameter space of the Holt-Winters’ model, International Journal

of Forecasting, 6, 199–229.

Assimakopoulos, V. and K. Nikolopoulos (2000) The theta model: a decomposition ap-

proach to forecasting, International Journal of Forecasting, 16, 521–530.

Bowerman, B. L., R. T. O’Connell and A. B. Koehler (2005) Forecasting, time series and
regression: an applied approach, Thomson Brooks/Cole, Belmont CA.

Brockwell, P. J. and R. A. Davis (1991) Time series: theory and methods, Springer-Verlag,
New York, 2nd ed.

Canova, F. and B. E. Hansen (1995) Are seasonal patterns constant over time? a test for
seasonal stability, Journal of Business and Economic Statistics, 13, 237–252.

Chatfield, C. and M. Yar (1991) Prediction intervals for multiplicative Holt-Winters, In-
ternational Journal of Forecasting, 7, 31–37.

Dickey, D. A. and W. A. Fuller (1981) Likelihood ratio statistics for autoregressive time
series with a unit root, Econometrica, 49, 1057–1071.

Durbin, J. and S. J. Koopman (2001) Time series analysis by state space methods, Oxford
University Press, Oxford.

Gardner, Jr, E. S. (1985) Exponential smoothing: The state of the art, Journal of Forecasting,
4, 1–28.

Gardner, Jr, E. S. and E. McKenzie (1985) Forecasting trends in time series, Management
Science, 31(10), 1237–1246.

Gómez, V. (1998) Automatic model identification in the presence of missing observa-

tions and outliers, Tech. rep., Ministerio de Economı́a y Hacienda, Dirección General
de Análisis y Programación Presupuestaria, working paper D-98009.

Hyndman and Khandakar: June 2007 27

Automatic time series forecasting: the forecast package for R

Gómez, V. and A. Maravall (1998) Programs TRAMO and SEATS, instructions for the
users, Tech. rep., Direcciòn General de Anilisis y Programaciòn Presupuestaria, Minis-
terio de Economia y Hacienda, working paper 97001.

Goodrich, R. L. (2000) The Forecast Pro methodology, International Journal of Forecasting,

16(4), 533–535.

Hannan, E. J. and J. Rissanen (1982) Recursive estimation of mixed autoregressive-

moving average order, Biometrika, 69(1), 81–94.

Hendry, D. F. (1997) The econometrics of macroeconomic forecasting, The Economic Jour-

nal, 107(444), 1330–1357.

Hylleberg, S., R. Engle, C. Granger and B. Yoo (1990) Seasonal integration and cointegra-
tion, Journal of Econometrics, 44, 215–238.

Hyndman, R. J. (2001) It’s time to move from ‘what’ to ‘why’—comments on the M3-
competition, International Journal of Forecasting, 17(4), 567–570.

Hyndman, R. J. (2007) forecast: Forecasting functions for time series, R package version 1.05.
URL: http://www.robhyndman.info/Rlibrary/forecast/

Hyndman, R. J., M. Akram and B. C. Archibald (2007) The admissible parameter space
for exponential smoothing models, Annals of the Institute of Statistical Mathematics, to
appear.

Hyndman, R. J., M. L. King, I. Pitrun and B. Billah (2005a) Local linear forecasts using
cubic smoothing splines, Australian & New Zealand Journal of Statistics, 47(1), 87–99.

Hyndman, R. J. and A. B. Koehler (2006) Another look at measures of forecast accuracy,

International Journal of Forecasting, 22, 679–688.

Hyndman, R. J., A. B. Koehler, J. K. Ord and R. D. Snyder (2005b) Prediction intervals for
exponential smoothing using two new classes of state space models, Journal of Forecast-
ing, 24, 17–37.

Hyndman, R. J., A. B. Koehler, R. D. Snyder and S. Grose (2002) A state space framework
for automatic forecasting using exponential smoothing methods, International Journal
of Forecasting, 18(3), 439–454.

Hyndman and Khandakar: June 2007 28

Automatic time series forecasting: the forecast package for R

Hyndman, R. J. and A. V. Kostenko (2007) Minimum sample size requirements for sea-
sonal forecasting models, Foresight: the International Journal of Applied Forecasting, 6,
12–15.

Kwiatkowski, D., P. C. Phillips, P. Schmidt and Y. Shin (1992) Testing the null hypothesis
of statioanrity against the alternative of a unit root, Journal of Econometrics, 54, 159–178.

Liu, L. M. (1989) Identification of seasonal ARIMA models using a filtering method, Com-
munications in Statistics, Part A — Theory and Methods, 18, 2279–2288.

Makridakis, S., A. Anderson, R. Carbone, R. Fildes, M. Hibon, R. Lewandowski, J. New-

ton, E. Parzen and R. Winkler (1982) The accuracy of extrapolation (time series) meth-
ods: Results of a forecasting competition, Journal of Forecasting, 1, 111–153.

Makridakis, S. and M. Hibon (2000) The M3-competition: Results, conclusions and im-
plications, International Journal of Forecasting, 16, 451–476.

Makridakis, S. G., S. C. Wheelwright and R. J. Hyndman (1998) Forecasting: methods and

applications, John Wiley & Sons, New York, 3rd ed.

Mélard, G. and J.-M. Pasteels (2000) Automatic ARIMA modeling including intervention,
using time series expert software, International Journal of Forecasting, 16, 497–508.

Ord, J. K., A. B. Koehler and R. D. Snyder (1997) Estimation and prediction for a class of
dynamic nonlinear statistical models, Journal of the American Statistical Association, 92,
1621–1629.

Ord, K. and S. Lowe (1996) Automatic forecasting, The American Statistician, 50(1), 88–94.

Pegels, C. C. (1969) Exponential forecasting: Some new variations, Management Science,

15(5), 311–315.

Reilly, D. (2000) The AUTOBOX system, International Journal of Forecasting, 16(4), 531–533.

Smith, J. and S. Yadav (1994) Forecasting costs incurred from unit differencing fraction-
ally integrated processes, International Journal of Forecasting, 10(4), 507–514.

Taylor, J. W. (2003) Exponential smoothing with a damped multiplicative trend, Interna-

tional Journal of Forecasting, 19, 715–725.

Wallis, K. F. (1999) Asymmetric density forecasts of inflation and the Bank of England’s
fan chart, National Institute Economic Review, 167(1), 106–112.

Hyndman and Khandakar: June 2007 29

View publication stats

Calculus For Iit Jee PDF
77% (96)
Calculus For Iit Jee PDF
510 pages
Time Series For Data Science Analysis and Forecasting (Wayne A. Woodward, Bivin Philip Sadler Etc.) (Z-Library)
100% (1)
Time Series For Data Science Analysis and Forecasting (Wayne A. Woodward, Bivin Philip Sadler Etc.) (Z-Library)
529 pages
Time Series Cheat Sheet
No ratings yet
Time Series Cheat Sheet
2 pages
(Ebook PDF) The Analysis of Time Series: An Introduction With R 7th Edition Instant Download
100% (4)
(Ebook PDF) The Analysis of Time Series: An Introduction With R 7th Edition Instant Download
45 pages
Forecasting Using R: Rob J Hyndman
No ratings yet
Forecasting Using R: Rob J Hyndman
74 pages
(Ebook PDF) The Analysis of Time Series: An Introduction With R 7th Edition PDF Download
100% (2)
(Ebook PDF) The Analysis of Time Series: An Introduction With R 7th Edition PDF Download
45 pages
Sarima Group 11
No ratings yet
Sarima Group 11
21 pages
ARIMA Modelling and Forecasting: By: Amar Kumar
100% (1)
ARIMA Modelling and Forecasting: By: Amar Kumar
22 pages
TES Processes and ARIMA Models: A Comparison of Forecasting Performance
No ratings yet
TES Processes and ARIMA Models: A Comparison of Forecasting Performance
21 pages
UNIVAR4
No ratings yet
UNIVAR4
56 pages
MIS410 Chapter7
No ratings yet
MIS410 Chapter7
49 pages
Da Lab Exp 7,8,9,10,11,12
No ratings yet
Da Lab Exp 7,8,9,10,11,12
32 pages
Package Forecast': R Topics Documented
No ratings yet
Package Forecast': R Topics Documented
55 pages
7 Transformations
No ratings yet
7 Transformations
68 pages
Modeling and Analysis of Time Series Data: Chapter 1: Introduction
No ratings yet
Modeling and Analysis of Time Series Data: Chapter 1: Introduction
26 pages
CH 7 Forecasting
No ratings yet
CH 7 Forecasting
9 pages
08 ASAP TimeSeriesForcasting - Day 8-11
No ratings yet
08 ASAP TimeSeriesForcasting - Day 8-11
62 pages
Basic Time Series With Python Code
No ratings yet
Basic Time Series With Python Code
33 pages
10 The Forward Premium in Electricity Markets An Experimental Study
No ratings yet
10 The Forward Premium in Electricity Markets An Experimental Study
14 pages
Module 3.1 Time Series Forecasting ARIMA Model
No ratings yet
Module 3.1 Time Series Forecasting ARIMA Model
19 pages
FM - Resumes
No ratings yet
FM - Resumes
18 pages
Stationarity & AR, MA, ARIMA, SARIMA
100% (1)
Stationarity & AR, MA, ARIMA, SARIMA
6 pages
Homework Assignment #5
No ratings yet
Homework Assignment #5
8 pages
Time Series Components:: The Long-Term Direction.: The Periodic Behavior.: The Irregular Fluctuations
No ratings yet
Time Series Components:: The Long-Term Direction.: The Periodic Behavior.: The Irregular Fluctuations
19 pages
JSS2008
No ratings yet
JSS2008
23 pages
DSS16-Time Series
No ratings yet
DSS16-Time Series
65 pages
MATH545-Time Series
No ratings yet
MATH545-Time Series
79 pages
Orbit Forecasting Model
No ratings yet
Orbit Forecasting Model
6 pages
Short-Run Electricity Demand Forecast in Maharashtra
No ratings yet
Short-Run Electricity Demand Forecast in Maharashtra
6 pages
Time Series Cheat Sheet
No ratings yet
Time Series Cheat Sheet
1 page
From News To Forecast
No ratings yet
From News To Forecast
18 pages
Journal of Statistical Software: Automatic Time Series Forecasting: The Forecast Package For R
No ratings yet
Journal of Statistical Software: Automatic Time Series Forecasting: The Forecast Package For R
23 pages
WQU Econometrics M3 Compiled Content PDF
No ratings yet
WQU Econometrics M3 Compiled Content PDF
44 pages
Time Series Analysis Handbook 03
No ratings yet
Time Series Analysis Handbook 03
12 pages
Comparison of Trend Forecast Using ARIMA and ETS Models For S&P500 Close Price
No ratings yet
Comparison of Trend Forecast Using ARIMA and ETS Models For S&P500 Close Price
4 pages
cheatsheet的副本
No ratings yet
cheatsheet的副本
8 pages
ARIMAKASYOKI
No ratings yet
ARIMAKASYOKI
5 pages
Arima Word
No ratings yet
Arima Word
13 pages
Auto-Regressive Integrated Moving Average Models I
No ratings yet
Auto-Regressive Integrated Moving Average Models I
12 pages
2024 Ceed Mathematics - Paper I
No ratings yet
2024 Ceed Mathematics - Paper I
14 pages
Forecast2 PDF
No ratings yet
Forecast2 PDF
22 pages
Advanced Multivariate Time Series Forecasting Mode
No ratings yet
Advanced Multivariate Time Series Forecasting Mode
8 pages
8.7 ARIMA Modelling in R - Forecasting - Principles and Practice
No ratings yet
8.7 ARIMA Modelling in R - Forecasting - Principles and Practice
10 pages
Intro of Time Series
No ratings yet
Intro of Time Series
18 pages
Arima Notes
No ratings yet
Arima Notes
4 pages
Business Analytis C4
No ratings yet
Business Analytis C4
10 pages
End Term Project (BA)
No ratings yet
End Term Project (BA)
19 pages
Time Series Forecasting With Python Cheat Sheet
No ratings yet
Time Series Forecasting With Python Cheat Sheet
7 pages
Meow Meow Hehe Cute
No ratings yet
Meow Meow Hehe Cute
1 page
Design For Test Scan Test
100% (1)
Design For Test Scan Test
31 pages
Resumos Forecasting
No ratings yet
Resumos Forecasting
17 pages
Time - Series - in - Brief
No ratings yet
Time - Series - in - Brief
11 pages
ARIMA Forecasting
No ratings yet
ARIMA Forecasting
9 pages
Forecasting
No ratings yet
Forecasting
75 pages
Time Series
No ratings yet
Time Series
1 page
BS EN 12524-2000 Hygrothermal Properties
No ratings yet
BS EN 12524-2000 Hygrothermal Properties
14 pages
Time Series Analysis and Forecasting Using ARIMA Modeling, Neural Network and Hybrid Model Using ELM
No ratings yet
Time Series Analysis and Forecasting Using ARIMA Modeling, Neural Network and Hybrid Model Using ELM
14 pages
Huawei SinlgeSDB HSS9860-BE Feature Description
No ratings yet
Huawei SinlgeSDB HSS9860-BE Feature Description
26 pages
Introduction To Time Series Analysis and Forecasti
No ratings yet
Introduction To Time Series Analysis and Forecasti
10 pages
Iare DS Lecture Notes 2
No ratings yet
Iare DS Lecture Notes 2
135 pages
Chapter 12 Biology 11
No ratings yet
Chapter 12 Biology 11
52 pages
An Integer Grey Goal Programming For Project Time, Cost and Quality Trade-Off
No ratings yet
An Integer Grey Goal Programming For Project Time, Cost and Quality Trade-Off
9 pages
SAFECode Dev Practices0211
No ratings yet
SAFECode Dev Practices0211
56 pages
BCS 040 Previous Year Question Papers by Ignouassignmentguru 2
No ratings yet
BCS 040 Previous Year Question Papers by Ignouassignmentguru 2
66 pages
Haque 2008 - Durability Design in The African Concrete Code
No ratings yet
Haque 2008 - Durability Design in The African Concrete Code
17 pages
Ch2. Basics of Python Programming: Dr. Tulika Assistant Professor Department of Computer Science Miranda House
No ratings yet
Ch2. Basics of Python Programming: Dr. Tulika Assistant Professor Department of Computer Science Miranda House
47 pages
Computer Ebook English RBE
No ratings yet
Computer Ebook English RBE
69 pages
BES - Lecture 10 - Simple Linear Regression
No ratings yet
BES - Lecture 10 - Simple Linear Regression
15 pages
Grammar Jeopardy: Modal Auxiliaries, Relative Adverbs, & Relative Pronouns
No ratings yet
Grammar Jeopardy: Modal Auxiliaries, Relative Adverbs, & Relative Pronouns
18 pages
Measurement Instrumentation and Sensors Handbook Two Volume Set 2nd Edition John G. Webster (Editor) Instant Download
No ratings yet
Measurement Instrumentation and Sensors Handbook Two Volume Set 2nd Edition John G. Webster (Editor) Instant Download
42 pages
Trojan Port List
No ratings yet
Trojan Port List
13 pages
HKLS Valid Reabilit
No ratings yet
HKLS Valid Reabilit
8 pages
CS211 Flow Control Structures
No ratings yet
CS211 Flow Control Structures
29 pages
Quantitative Research Method: Olasile Babatunde Adedoyin
No ratings yet
Quantitative Research Method: Olasile Babatunde Adedoyin
8 pages
Advanced Micro Controller: Unit I - AVR Microcontroller
No ratings yet
Advanced Micro Controller: Unit I - AVR Microcontroller
52 pages
User Maual For Operation and PC Software and APP of TC66 (C) Type-C USB PD Trigger Meter 2019.6.5
No ratings yet
User Maual For Operation and PC Software and APP of TC66 (C) Type-C USB PD Trigger Meter 2019.6.5
12 pages
Solutions: Homework 5 Partial Differential Equations M 104228
No ratings yet
Solutions: Homework 5 Partial Differential Equations M 104228
4 pages
WF4 Pre Production HoW
No ratings yet
WF4 Pre Production HoW
142 pages
Optimal Techniques For Cost Reduction and Control in Construction Sites
No ratings yet
Optimal Techniques For Cost Reduction and Control in Construction Sites
10 pages
Summary of Assignment Grouped and Ungrouped Data
No ratings yet
Summary of Assignment Grouped and Ungrouped Data
8 pages
Determine and Describe The Intersection of Sets Using Various Representations and B
No ratings yet
Determine and Describe The Intersection of Sets Using Various Representations and B
18 pages
Lampiran Diah Ayu BLM Fix
No ratings yet
Lampiran Diah Ayu BLM Fix
22 pages
LTspice Tutorial Part 4 - Intermediate Circuits
No ratings yet
LTspice Tutorial Part 4 - Intermediate Circuits
23 pages
12-Article Text-45-1-10-20191005 PDF
No ratings yet
12-Article Text-45-1-10-20191005 PDF
16 pages
DUAL NATURE Test
No ratings yet
DUAL NATURE Test
2 pages
Cusps: Akshuz 09-Nov-1984 09:55:15 PM Ernakulam 76:17:0 E, 9:59:0 N Tzone: 5.5 KP (Original) Ayanamsha 23:33:6
No ratings yet
Cusps: Akshuz 09-Nov-1984 09:55:15 PM Ernakulam 76:17:0 E, 9:59:0 N Tzone: 5.5 KP (Original) Ayanamsha 23:33:6
1 page
Homomorphism
No ratings yet
Homomorphism
10 pages
10 11648 J Ijefm 20140202 12 PDF
No ratings yet
10 11648 J Ijefm 20140202 12 PDF
9 pages
CSP2101 Scripting Languages Assignment 3 - Software Based Solution
No ratings yet
CSP2101 Scripting Languages Assignment 3 - Software Based Solution
8 pages
قوانين الفصول بملف واحد فيزياء السادس علمي للاستاذ سعيد محي تومان PDF PDF Mathematical Analysis Teaching Mathematics
No ratings yet
قوانين الفصول بملف واحد فيزياء السادس علمي للاستاذ سعيد محي تومان PDF PDF Mathematical Analysis Teaching Mathematics
1 page
Portal of Research Methods and Methodologies For Research Projects and Degree Projects
No ratings yet
Portal of Research Methods and Methodologies For Research Projects and Degree Projects
7 pages
A-level Maths Revision: Cheeky Revision Shortcuts
From Everand
A-level Maths Revision: Cheeky Revision Shortcuts
Scool Revision
3.5/5 (8)
Nonlinear Transformations of Random Processes
From Everand
Nonlinear Transformations of Random Processes
Ralph Deutsch
No ratings yet
Digital Signal Processing (DSP) with Python Programming
From Everand
Digital Signal Processing (DSP) with Python Programming
Maurice Charbit
No ratings yet
Worked Examples in Mathematics for Scientists and Engineers
From Everand
Worked Examples in Mathematics for Scientists and Engineers
G. Stephenson
No ratings yet
Finite Element Methods
From Everand
Finite Element Methods
Rahul Basu
No ratings yet