Chapter 1 - Lecture Slides
Chapter 1 - Lecture Slides
Chapter 1 - Lecture Slides
Definition
A time series is a sequence of observations over time.
Time series in different fields
Example
Exhibit 1 displays a time series plot of the annual rainfall amounts
recorded in Los Angeles, California, over more than 100 years. The plot
shows considerable variation in rainfall amount over the year −some years
are low, some high, and many are in-between in value. The year 1883
was an exceptionally wet year for Los Angeles, while 1983 was quite dry.
For analysis and modeling purposes we are interested in whether or not
consecutive years are related in some way. If so, we might be able to use
one year’s rainfall value to help forecast next year’s rainfall amount.
The annual abundance of Canadian hare
Example
Our second example concerns the annual abundance of Canadian hare.
Fig 2 gives the time series plot of this abundance over about 30 years.
Neighboring values here are very closely related. Large changes in
abundance do not occur from one year to the next. We see an upward
trend in the plot−−low values tend to be followed by low values in the
next year, middle-sized values by middle-sized values, and high values by
high values.
Time plot for the annual rainfall
Definition
A time series is said to be discrete when the set T0 of times at which
observations are taken is a discrete set. A time series is said to be
continuous when observations are recorded continuously over some time
interval,e.g. when T0 = [0, 1].
Remarks: We are mainly interested in discrete-time time series with
equally fixed time intervals. e.g. observations made monthly, daily,
weekly, etc.
Some Representative Time Series (I)
Example
Unemployment Rate (%) in Singapore
1
1970 1975 1980 1985 1990 1995 2000 2005
7000
6000
number of lynx
5000
4000
3000
2000
1000
0
0 20 40 60 80 100 120
time (year)
35
30
daily temperature in HK
25
20
15
10
5
0 200 400 600 800 1000 1200
300
250
no. of patients
200
150
100
0 50 100 150 200 250 300 350 400
time (daily)
10000
8000
cases of measles
6000
4000
2000
0
0 100 200 300 400 500 600 700 800 900
time (week)
1600
1400
1200
SP500 index
1000
800
600
400
200
0 500 1000 1500 2000 2500 3000 3500
time (daily)
600
number of passengers
500
400
300
200
100
1948 1950 1952 1954 1956 1958 1960 1
time: month
Cycle
150
100
50
0
1770 1780 1790 1800 1810 1820 1830 1840 1850 1860 1870
time: year
Seasonality
35
30
daily temperature in HK
25
20
15
10
5
0 200 400 600 800 1000 1200
◮ Additive
Yt = Tt + Ct + St + Rt
◮ Multiplicative
Yt = Tt × Ct × St × Rt
where Tt is the trend component (or factor) in time period (or point) t;
St is the seasonal component (or factor) in time period (or point) t; Ct is
the cyclical component (or factor) in time period (or point) t; Rt is the
irregular component (or factor) in time period (or point) t;
The second type can be changed into an additive model by taking the
logarithms.
Errors in Forecasting
4 *
* *
3
1
1990 1992 1994 1996 1998 2000 2002 2004 2006
Prediction of unemployment(II)
1
1990 1992 1994 1996 1998 2000 2002 2004 2006
Measuring Forecasting error
Denote the actual value of the variable of interest in time t as yt and the
predicted value of yt by ŷt . We then introduce the following concepts.
Definition
Example
{yt : 0.1001, 1.6900, 2.3156, 2.7119, 3.7902, 3.6686, 4.6908, 2.7975,
4.4802, 4.8433, 3.8959, 6.2573, 5.4435, 8.4151, 6.6949, 8.5287, 8.7193,
8.0781, 7.3293, 9.9408}
2
12
random forecast errors
10
e : prediction errors
1
8
yt
6 0
4
−1
t
2
0
−2
0 5 10 15 20 0 5 10 15 20
times times
General Philosophy
X
n
n−1 |et |;
t=1
X
n
n−1 et2 .
t=1
An example demonstrating the power of the above three
measures
we have
and
We now describe how these two measures differ. The extreme prediction
errors have bigger influence on MSE than MAD.
Example
Suppose we have other two methods and their predict are
1X
7
MAD1 = |e1,t | = 2.14;
7 t=1
1X 2
7
MSE1 = e = 6.14;
7 t=1 1,t
For method 2:
1X
7
MAD2 = |e2,t | = 1.86;
7 t=1
1X 2
7
MSE2 = e = 8.14.
7 t=1 2,t
Denote them by y1 , ..., y10 . Suppose that it follows the following model
yt = β + εt .
X
n
β̂ = n−1 yt = 2.23.
t=1
A Scientific view towards statistical forecasting