Lecture notes 3
3 Basics of forecasting
3.1 First steps
3.2 Decompositions
3.3 Deterministic prediction
3.4 Probabilistic prediction
3.5 Statistical models
3 Basics of forecasting
Here we delve into the heart of the matter. Throughout this chapter we jointly consider two tasks:
task 1 Predicting the next value yn+1 after observing a sequence y1 , . . . , yn . This is the “time
series forecasting” problem.
task 2 After observing pairs (x1 , y1 ), . . . , (xn , yn ), we observe a new xn+1 and we want to predict
the new yn+1 . This is the “regression” problem. We can think of y as the quantity of interest,
and of x as information that we find relevant for the prediction of y.
We could start with task 1 only, but task 2 is fundamental and both tasks are deeply intertwined.
An introduction to time series forecasting is Forecasting: Principles and Practice by Hyndman &
Athanasopoulos. An introductory resource on regression is the book An Introduction to Statistical
Learning by James, Witten, Hastie & Tibshirani, Chapter 3 in particular. Both books are free
online and include R code.
3.1 First steps

Adjustments. Visualizing data might suggest simple transformations that will simplify subsequent analyses. The term "adjustment" is used to designate common-sense modifications of raw series, such as calendar adjustments (months can have 28, 29, 30 or 31 days, which affects data that represent monthly counts), population adjustments, inflation adjustments, etc. See Figure 3.2.

Figure 3.1: Sales of lemonade and outside temperature over one year. Panel (a): the two series over time; panel (b): sales against temperature. Synthetic data downloaded from https://github.com/WHPAN0108/DurstExpress_exercise.
Transformations. It is common to transform each element in the series through some function,
such as the logarithm if the series is made of positive values, which helps to identify whether a trend
is polynomial or exponential, and to make the variations around a trend more stable over time.
Another simple transformation is “differencing”, which refers to ∇yt = yt − yt−1 for all t ≥ 2.
The ∇ “nabla” sign before a variable indexed by t indicates differencing. Differencing can be
iterated, for example ∇2 yt = (yt − yt−1 ) − (yt−1 − yt−2 ), for all t ≥ 3. Seasonal differencing refers
to the computation of yt − yt−m , for a period m. Differencing can be used to remove trends. For
positive series it is common to apply the logarithmic transform and then the differencing operator,
i.e. to consider ∇ log yt = log yt − log yt−1 instead of yt . See Figure 3.2.
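As an illustration, these transformations are one-liners in R; a minimal sketch, assuming y is a positive numeric vector or ts object, and taking 12 as an illustrative seasonal period:

log_y   <- log(y)                    # log transform, for positive series
d_y     <- diff(y)                   # first differences y_t - y_{t-1}
d2_y    <- diff(y, differences = 2)  # iterated (second-order) differencing
d_log_y <- diff(log(y))              # log then difference
ds_y    <- diff(y, lag = 12)         # seasonal differencing y_t - y_{t-12}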
3.2 Decompositions
We can consider a variety of manipulations to gain insight into the features of the data.
Figure 3.2: Quarterly US GDP. The top panel shows the nominal values, and those adjusted for
inflation (“real”). The middle panel shows the log transforms, with fitted linear trends. The bottom
panel shows the first differences. Note that the 1970s were a period of high inflation in the US; this
is particularly apparent in the bottom panel.
Moving averages. Also called "rolling window" averages, they refer to the construction

ỹt = (1/m) ∑_{j=−k}^{k} yt+j ,    (3.1)

where m = 2k + 1 is the number of terms being averaged.
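In R, such a centered moving average can be computed with stats::filter; a minimal sketch, assuming y is a numeric vector or ts object, with the half-width k = 2 as an illustrative choice:

k <- 2
m <- 2 * k + 1
y_tilde <- stats::filter(y, filter = rep(1 / m, m), sides = 2)  # NAs at both ends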
Trend, seasonal and residual components. We might want to find the decomposition yt = ỹt + st + rt , where
• ỹt refers to a trend component, for example estimated by a moving average as in (3.1),
• st refers to a seasonal component, typically made of m values repeated across periods, where the periodicity is denoted by m and chosen by the analyst. In math notation: st = st+m for all t,
• rt refers to a residual component, i.e. whatever remains once the trend and seasonal components are removed.
See Figure 3.3. The series ỹt + rt , or equivalently yt − st , is the "seasonally adjusted" series.
Figure 3.3: Decomposition of the classic Box & Jenkins airline data: monthly totals of international
airline passengers, 1949 to 1960. On the left, the additive decomposition is done on the original
series, while it is applied to the log on the right.
The word “typically” appears in the above description because many variations exist, under the
names of X-11, SEATS, or STL (stl function in R). To forecast yn+1 , we can separately forecast
the trend, seasonal and residual components, and then combine the three predictions.
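As an illustration, a decomposition in the spirit of Figure 3.3 can be obtained with base R; a minimal sketch using the built-in AirPassengers series (the Box & Jenkins airline data):

dec1 <- decompose(AirPassengers)        # additive decomposition of the original series
dec2 <- decompose(log(AirPassengers))   # same decomposition applied to the log
plot(dec2)                              # panels: observed, trend, seasonal, random
fit <- stl(log(AirPassengers), s.window = "periodic")   # STL as an alternative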
3.3 Deterministic prediction

Ask an expert. Experts can be people with domain expertise, and we can gather multiple experts, hoping that a group would be more effective than individuals. The "Delphi method" is a structured procedure to elicit a consensus from various experts. In recent times, prediction markets, where many anonymous individuals bet on future outcomes, have sometimes been interpreted as a way of aggregating the opinions of many people.
Baseline strategies. How might we forecast yn+1 using y1:n ? Two basic strategies are:
average: we report the empirical mean n−1 ∑_{t=1}^{n} yt as a prediction for yn+1 .
exponential smoothing: we report a weighted average of the past values, with geometrically decaying weights,

ŷn+1 = α yn + α(1 − α) yn−1 + α(1 − α)² yn−2 + · · · ,    (3.2)

where the value α ∈ [0, 1] is a tuning parameter to be chosen. Since α ∑_{j≥0} (1 − α)^j = 1, the forecast can be interpreted as a weighted average of all past values. We can re-express the forecast in a recursive form,

ŷt+1 = αyt + (1 − α)ŷt ,    (3.3)

for t = 1, 2, . . . , n, where ŷ1 has to be set somehow. Both ŷ1 and α need to be chosen by the analyst.
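A minimal sketch of the recursion (3.3) in R, assuming y is a numeric vector; setting ŷ1 to the first observation is one possible choice among others:

ses_forecast <- function(y, alpha, yhat1 = y[1]) {
  n <- length(y)
  yhat <- numeric(n + 1)
  yhat[1] <- yhat1                                        # initial value, chosen by the analyst
  for (t in 1:n) {
    yhat[t + 1] <- alpha * y[t] + (1 - alpha) * yhat[t]   # recursion (3.3)
  }
  yhat[n + 1]                                             # one-step-ahead forecast of y_{n+1}
}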
Adding trends and seasonalities. If we want to forecast yn+2 and plug ŷn+1 in place of yn+1
in the exponential smoothing recipe, we obtain ŷn+2 = ŷn+1 . Similarly, the forecast of all future
yn+h is equal to the forecast of yn+1 . In other words, our prediction of the future looks like a flat
line. To come up with a more plausible forecast, we can go beyond the above “simple exponential
smoothing” technique, and consider the inclusion of trend and seasonal components, as follows.
We first write the above model as ŷt+h = ℓt , and ℓt = αyt + (1 − α)ℓt−1 , where ℓt is called the "level" at time t, and h is the horizon we want to predict over. We can then include a trend, with
ŷt+h = ℓt + h bt ,
ℓt = αyt + (1 − α)(ℓt−1 + bt−1 ),
bt = γ(ℓt − ℓt−1 ) + (1 − γ)bt−1 ,

with two parameters, α, γ ∈ [0, 1]. Here bt represents the slope of a linear trend at time t. Similarly we can add equations and parameters to represent seasonality.

Figure 3.4: Atmospheric concentrations (monthly) of CO2 at Mauna Loa, expressed in parts per million (ppm), with predictions obtained with the HoltWinters function in R.
This strategy, generally known as exponential smoothing or “Holt–Winters”, introduced in two
articles in the late 1950s, remains widely used today. A comprehensive treatment of exponential
smoothing is the book “Forecasting with Exponential Smoothing” by Hyndman, Koehler, Ord and
Snyder, 2008. Figure 3.4 represents the forecast provided by the HoltWinters function in R, a
prediction made on a series of concentrations of CO2 in Hawaii. The figure shows a convincing
extrapolation of the monthly series onto two future years.
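As an illustration, a forecast in the spirit of Figure 3.4 can be obtained in a few lines of R; a minimal sketch using the built-in co2 series (monthly Mauna Loa concentrations):

fit   <- HoltWinters(co2)            # estimates level, trend and seasonal components
preds <- predict(fit, n.ahead = 24)  # point forecasts for the next two years
plot(fit, preds)                     # fitted values and forecasts, similar to Figure 3.4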
Those are deterministic recipes: you plug the observed series as an input, and some calculation
yields a series of predicted values. There is no prediction interval, no probability distribution of
the future values, no quantified uncertainty. We will see later in the course that we can revisit
exponential smoothing in the paradigm of “ARIMA” models and “state space models” and that it
enables the construction of prediction intervals (e.g. as implemented in HoltWinters).
Basic linear regression. Next consider the case where we want to predict y using x ("task 2"), see Figure 3.1(b). Given the pairs (x1 , y1 ), . . . , (xn , yn ), and given a new xn+1 , how would we predict yn+1 ? We can try to learn the relationship between yt and xt , by finding a function f such that yt is approximately f (xt ). To make things simpler, we can restrict the search to the family of linear functions: we want to minimize

∑_{t=1}^{n} (yt − (α + βxt ))² ,    (3.4)

with respect to α, β. Indeed "yt ≈ α + βxt " is equivalent to "(yt − (α + βxt ))² is small". Denote by α̂, β̂ the minimizers of (3.4). Given xn+1 , our forecast of yn+1 is then given by α̂ + β̂xn+1 .
With a bit of calculus (differentiating (3.4) with respect to α, β and equating the derivatives to zero), we find

α̂ = ȳn − β̂ x̄n ,    β̂ = ∑_{t=1}^{n} (xt − x̄n )(yt − ȳn ) / ∑_{t=1}^{n} (xt − x̄n )² .    (3.5)

Recall that x̄n and ȳn refer to the empirical means n−1 ∑_{t=1}^{n} xt and n−1 ∑_{t=1}^{n} yt . The estimates α̂ and β̂ are often called "ordinary least squares" (OLS). Compare β̂ with the correlation coefficient Ĉov(x1:n , y1:n ) defined in Chapter 2: they are not identical but still very similar.
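A minimal sketch in R, assuming numeric vectors x and y (for instance temperature and sales, as in Figure 3.5) and a hypothetical new value x_new at which to predict:

beta_hat  <- sum((x - mean(x)) * (y - mean(y))) / sum((x - mean(x))^2)  # as in (3.5)
alpha_hat <- mean(y) - beta_hat * mean(x)
fit <- lm(y ~ x)                                 # same estimates via lm()
predict(fit, newdata = data.frame(x = x_new))    # forecast of y given x = x_new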
Figure 3.5: Linear regression of sales of lemonade on the outside temperature. Left: regression
line obtained by minimizing (3.4). Right: various regression lines obtained by “bootstrap”.
The basic linear regression described above is a deterministic way of addressing task 2. How do
we know whether it works? How do we assess the error, and construct prediction intervals? Figure 3.5
illustrates linear regression using the data shown in Figure 3.1, as well as the uncertainty about the
regression line (on the right), obtained through a probabilistic approach.
3.4 Probabilistic prediction

Guessing a random variable. Consider the task of predicting a real-valued random variable
Y , with a guess c ∈ R. To evaluate our guess, we use a loss function (c, y) ↦ L(c, y). The loss
function, a central concept in decision theory, takes as arguments the guess c and a realization y of
the object to be predicted Y . The loss is larger if c is further away from y. A commonly-used loss
is L(c, y) = (y − c)2 , called the squared loss.
Our objective is to formulate a guess c that makes L(c, y) small on average: we minimize
E[L(c, Y )] with respect to c. With the squared loss this is E[(Y − c)2 ], called the mean squared error
(MSE). Some calculations show that c = E[Y ] minimizes the MSE: indeed E[(Y − c)²] = V[Y ] + (E[Y ] − c)², which is smallest when c = E[Y ]. This provides a justification for the first baseline strategy mentioned in Section 3.3.
Conditioning on observed data. Next, suppose that we observe X and we want to predict
Y given X. Any function of X, denoted by c(X), could be used to predict Y . By the “tower
property” (Equation (2.8)), we can write E[(Y − c(X))2 ] = E[E[(Y − c(X))2 |X]]. We can always
write Y − c(X) as Y − E[Y |X] + E[Y |X] − c(X), and then we can expand the square to obtain,
E[(Y − c(X))2 ] = E[E[(Y − E[Y |X])2 |X]] + E[(c(X) − E[Y |X])2 ]. (3.6)
We want to minimize that error. The first term is constant in c. The second term is minimized by
c(X) = E[Y |X]. Thus, the optimal prediction of Y given X is the conditional expectation E[Y |X].
Again, this is useful if we can approximate such quantity, using observed data. See Figure 3.6.
Linear regression, again. It is quite difficult to precisely estimate the conditional expectation
E[Y |X], even if we have data (x1 , y1 ), . . . , (xn , yn ). This is the topic of "nonparametric regression". We can
look at a simpler task: the best linear approximation. That is, we restrict the function x 7→ c(x)
to be a linear function of x, so c(x) = α + βx, and we find coefficients α, β ∈ R that minimize
the expected error E[(Y − (α + βX))2 ]. By differentiating with respect to α and β, we obtain two
equations:
E [(Y − (α + βX))] = 0 and E [X (Y − (α + βX))] = 0. (3.7)
Figure 3.6: Left: joint density of some pair of variables (X, Y ), and conditional mean E[Y |X = x] as a function of x in dashed line. Right: scatter plot of independent samples (x1 , y1 ), . . . , (xn , yn ) following that distribution, and regression line of Y onto X.
Two unknowns (α, β), two equations: we can solve them and find

β⋆ = Cov(X, Y ) / V[X] ,    α⋆ = E[Y ] − β⋆ E[X] .    (3.8)
This provides practical guidance, as we might be able to estimate these quantities using observed
data: we can estimate E[X] with the “empirical mean” x̄n , and likewise we can estimate variances
and covariances. Doing so, we retrieve the expressions obtained in (3.5). But now we know that α̂, β̂
might be approximations of α⋆ , β⋆ , and we might be interested in the approximation error. We also
note that, starting from the problem of predicting Y given X, and focusing on linear predictions
for simplicity, the concept of covariance between X and Y naturally appears.
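A minimal check in R, assuming numeric vectors x and y: the plug-in estimates of β⋆ and α⋆ coincide with the OLS estimates returned by lm().

beta_star_hat  <- cov(x, y) / var(x)                # estimate of Cov(X, Y) / V[X]
alpha_star_hat <- mean(y) - beta_star_hat * mean(x)
coef(lm(y ~ x))                                     # same values: intercept and slope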
Deriving methods from objectives. In passing, the above derivations show that one can define
an objective (minimize some expected prediction error) to arrive at a method (linear regression).
By modifying the objective we can derive other methods. It is very satisfying: we can specify
what we want to achieve, and derive a method that achieves that goal. If we are interested in
probabilistic prediction instead of point prediction, we can use a “scoring rule” as a loss function,
instead of the mean squared error. We’ll see more about scoring rules later in the course.
3.5 Statistical models

Linear regression as a model. Here we think of the sequence x1:n as given, fixed, constant.
Consider the following equation,
Yt = α + βxt + εt . (3.9)
It relates xt to Yt , it involves α, β which are called coefficients or parameters, and there is a term
εt called the residual. By re-writing εt = Yt − (α + βxt ) we see that the residual represents the
difference between Yt and a linear function of xt . Compared to basic linear regression where we
wrote yt ≈ α + βxt , here we define εt to be the discrepancy between Yt (the quantity of interest, a
random variable) and α + βxt (a linear function of xt ). The residual εt is seen as a random variable.
In linear regression we often assume that εt has mean zero (E[εt ] = 0), and that εt and εs are
uncorrelated for any times t ≠ s (Cov(εt , εs ) = 0). Such assumptions are required to validate the
construction of confidence intervals on the parameters, and in turn the construction of prediction
intervals for future values of Y . There are many distributions for the sequence (εt )t≥1 that would
satisfy the two conditions: E[εt ] = 0 and Cov(εt , εs ) = 0 for t ≠ s. We can be more explicit
about the distribution of εt , for example by assuming that (εt )t≥1 are independent Normal(0, σ 2 )
variables, which leads to the alternative model specification:
Yt ∼ N (α + βxt , σ 2 ), (3.10)
in words: Yt is Normal, centered at α + βxt , with variance σ 2 , and Y1:n are independent of one
another. The variance σ 2 is now included in the model parameters.
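As an illustration, we can simulate synthetic data from model (3.10) in R, with illustrative parameter values:

set.seed(2)
n <- 100; alpha <- 1; beta <- 2; sigma <- 0.5        # illustrative values
x <- runif(n, min = -1, max = 1)                     # fixed covariates
y <- rnorm(n, mean = alpha + beta * x, sd = sigma)   # draws from (3.10)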
Likelihood associated with a model. If the residuals are given a specific distribution, we can
write the likelihood function associated with the data (xt , yt )nt=1 and with the parameters α, β, σ 2 .
By definition the likelihood is the probability density function of the model evaluated at the observed
data. It is then viewed as a function of the parameters. Here, using R notation, and factorizing the joint density over the independent observations, the likelihood reads

∏_{t=1}^{n} dnorm(yt , mean = α + βxt , sd = σ),

with dnorm(x, mean = µ, sd = σ) = (2πσ²)^{−1/2} exp(−(x − µ)²/(2σ²)). Here maximizing the likeli-
hood with respect to α, β, σ 2 corresponds exactly to the ordinary least squares estimates in (3.5)
for α and β. On top of that, we have an estimate of σ 2 which will be useful for prediction. The
approach is flexible: if we specify another distribution for the residuals (for example a Laplace or a
Student distribution), we can still maximize the likelihood to obtain parameter estimates. The esti-
mates are called “maximum likelihood estimates”, and they are shown to have appealing statistical
properties in general.
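A minimal sketch of maximum likelihood for this model in R, assuming the numeric vectors x and y from above; the standard deviation is parametrized on the log scale to keep it positive:

loglik <- function(theta, x, y) {
  alpha <- theta[1]; beta <- theta[2]; sigma <- exp(theta[3])     # sigma > 0
  sum(dnorm(y, mean = alpha + beta * x, sd = sigma, log = TRUE))  # log-likelihood
}
fit <- optim(c(0, 0, 0), function(th) -loglik(th, x, y))          # minimize -loglik
c(alpha = fit$par[1], beta = fit$par[2], sigma = exp(fit$par[3]))

The estimates of α and β should match coef(lm(y ~ x)) up to numerical error.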
Time series modeling. We have just introduced a model to address task 2. Let’s return to
task 1: the prediction of yn+1 using y1:n . We have covered deterministic methods in Section 3.3.
Let’s take a probabilistic perspective. Denote by (Wt ) a sequence of independent Normal(0, σ 2 )
variables, called the “noise terms”.
Consider first the random walk (RW) model,
Yt = δ + Yt−1 + Wt . (3.13)
The parameter δ is called the drift. The initial condition can be set as Y1 = W1 . Here the variables
Y1:n are not independent; but we can still write down the likelihood function. We can check that
Yt = (t − 1)δ + ∑_{s=1}^{t} Ws . We can then compute E[Yt ] = (t − 1)δ, by linearity of expectation, and
V [Yt ] = tσ 2 , using the independence between the noise terms. Prediction intervals constructed
using the random walk model have a width that increases as we predict further into the future.
Consider next the autoregressive model (AR),
Yt = δ + ρYt−1 + Wt . (3.14)
Here ρ is called the autoregressive coefficient. We recover the random walk model if ρ = 1. Other-
wise, it looks like a linear regression model where Yt is predicted using Xt = Yt−1 . The properties
of the process are very different if |ρ| < 1, compared to ρ = 1 or |ρ| > 1, as can be seen with simple
simulations. In the next chapter we will find that the AR model is a building block for many time
series models.
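A minimal sketch of such simulations in R, with illustrative values δ = 0.1, ρ = 0.8, σ = 1:

set.seed(1)
n <- 200; delta <- 0.1; rho <- 0.8; sigma <- 1
w  <- rnorm(n, mean = 0, sd = sigma)                     # noise terms W_t
rw <- cumsum(delta + w) - delta                          # random walk (3.13), with Y_1 = W_1
ar <- numeric(n); ar[1] <- w[1]
for (t in 2:n) ar[t] <- delta + rho * ar[t - 1] + w[t]   # AR model (3.14)
matplot(cbind(rw, ar), type = "l", lty = 1, ylab = "Y_t")

Re-running the simulation with ρ close to 1, equal to 1, or larger than 1 illustrates the different behaviours mentioned above.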