Unit 14 - Time Series Analysis
Unit 14 - Time Series Analysis
14.1 Introduction
In the previous unit ‘Business Forecasting’, you have studied about the
ways of forecasting business events successfully. You also studied about
the different methods available for forecasting. In this unit, you will study
about the time series analysis and different components of time series. You
will also study about the forecasting methods using time series.
A time series is a set of numerical values of a given variable listed at
successive intervals of time, which means that, data regarding the variable
is listed in chronological order. Usually, the interval of time is taken as
uniform.
Yearly production of wheat in the country, hourly temperature of a city,
bimonthly electricity bills are all examples of time series. Almost all data like
industrial production, agricultural production, exports, imports, dairy
products can be arranged in chronological order.
Objectives:
After studying this unit, you should be able to
analyse the time series
describe different components of time series
describe the forecasting methods
apply time series analysis in business scenarios
14.1.1 Relevance
The 1990s brought a heightened awareness of an increased concern over
pollution in various forms in the United States. Air pollution is one of the
main areas of environmental concern. The U.S Environmental Protection
Agency (EPA) monitors the quality of air around the country. Some of the air
pollutants monitored include carbon monoxide emissions, nitrogen oxide
emissions, volatile organic compounds, sulphur dioxide emissions, etc. The
substances in these pollutants cause cancer and respiratory problems. If the
data is given for 15 years of period (1985-1999), then the question is to find
whether the air quality in U.S has been improving or deteriorating over time.
Managerial and statistical questions:
1. Is it possible to forecast the emissions of carbon monoxide or nitrogen
oxides for the year 2004-2007, or 2020 using the available data?
2. What techniques best forecast the emissions of carbon monoxide or
nitrogen oxides in the future?
Manipal University Jaipur Page No. 496
Statistics for Management Unit 14
Let us analyse the above data and give some trends regarding the sales.
For example, the company would like to know why sales dropped in 1998
and 1999 and why did it increase. In other words, the company would like to
analyse the various forces that affect the sales.
There can be changes in the values of the variable recorder over different
points of time due to various forces. Analysing the effect of all such forces
on the values of the variable is generally known as the analysis of time
series. Broadly, the following are the four types of changes in the values of
the variable:
i) Changes which generally occur due to general tendency of the data to
increase or decrease
ii) Changes which occur due to change in climate, weather conditions
and festivals
iii) Changes which occur due to booms and depressions
iv) Changes which occur due to some unpredictable forces like floods,
famines and earthquakes
ii. Forecasting can be done using the time series. By studying the
variations and other behaviour of the variables over a sufficiently long
period of time, it may be possible to forecast the future behaviour of the
variables. However, such a forecast has meaning only if the period of
forecast is a normal period. For example, various five-year plans by the
government of India are formulated by studying the time series and
forecasting.
iii. Study of the time series helps in analysing the post behaviour of the
variables. This helps in identifying the various forces that affect its
behaviour.
ii. The prices of cooking oils reduce after the harvesting of oil seeds and
go up after some time.
Solved problem 1
Find the trend with the help of free hand curve method for the data depicted
in table 14.2
Table 14.2: Production Data from 1991 to 2001
Year Production Data (in Lakh ton)
1991 15
1992 18
1993 16
1994 22
1995 19
1996 24
1997 20
1998 28
1999 22
2000 30
2001 26
Solution: Figure 14.1 depicts free hand curve of the production data versus
the time period. In the graph, we have taken production data values on
Y-axis and values of time on X-axis.
The trend of the export of sugar from India can be found by the semi
averages method as follows:
Here the number of years is 10 i.e., it is even. The series is divided into two
halves consisting of the first five years and last five years.
The semi average method is The semi average method assumes a straight
simple. line relationship between the plotted points,
regardless of the fact whether such
relationship exists or not.
The trend line can be extended This method has an in built limitation of
on either side in order to obtain arithmetic mean. This method is not suitable in
past or future estimates. case of very low or very large extreme values.
Fig. 14.2: Procedure for Determining the Trend when Moving Average is Odd
Solved Problem 3
Calculate the 3 yearly Moving Averages of the data depicted in table 14.5.
Table 14.5: Production Data from 1988 to 1996
Year 1988 1989 1990 1991 1992 1993 1994 1995 1996
Production
21 22 23 25 24 22 25 27 26
(in Lakh ton)
Fig. 14.3: Procedure for Determining the Trend when Moving Average is Even
Table 14.6 depicts the merits and demerits of the moving averages method.
Merits Demerits
This is a simple method. No functional relationship between the
values and time. Thus, this method is not
helpful in forecasting and predicting the
values on the basis of time.
This method is objective in the No trend values for some years in the
sense that anybody working on a beginning and some in the end.
problem with this method will get
the same results.
This method is used for determining In case of non–linear trend, the values
seasonal, cyclic and irregular obtained by this method are biased in one
variations besides the trend values. or the other direction.
This method is flexible enough to The period selection of moving average is
add more figures to the data a difficult task. Hence, great care has to be
because the entire calculations are taken in period selection, particularly when
not changed. there is no business cycle during that time.
If the period of moving averages
coincides with the period of cyclic
fluctuations in the data, such
fluctuations are automatically
eliminated.
Solved problem 4
The following table gives the average monthly production
(in thousands) of new passenger cars in the period 1976-1985. Calculate
four yearly moving averages.
Table 14.6: Average Monthly Production of Cars
Year Average monthly production of new cars
1976 708
1977 767
1978 764
1979 702
1980 533
1981 521
1982 421
1983 562
1984 635
1985 667
Manipal University Jaipur Page No. 506
Statistics for Management Unit 14
Solution:
The following table depicts four yearly moving averages.
Table 14.6a: Four yearly moving averages
1977 767
2941 735.25
1978 764 713.375
2766 691.5 1426.75
1979 702 660.75
2520 630 1321.5
1980 533 587.125
1174.25
2177 544.25
1981 521 1053.5 526.75
2037 509.25
1982 421 1044 522
2139 534.75
1983 562 1106 553
2285 571.25
1984 635
1985 667
Key Statistic
Let ‘Y’ be the actual values of ‘Y’ and ‘Yc’ be the computed values of ‘Y’ for
a given value of ‘X’.
Let ‘Y = a + bX’ be a straight line to be fitted for trend. To find the values of
‘a’ and ‘b’, such that the sum of squares of differences of the actual and
computed values of ‘Y’ is least, that is,
Y Y
2
c is least
Y
Y a , therefore, a
N
XY
XY b X 2
, therefore, b
X 2
Solved problem 5
The production of pig iron and ferro alloys in thousand metric tons in India is
as given below. Find the trend line by the method of least squares.
Table 14.7: Production data
Year Production
1976 672
1977 824
1978 967
1979 1204
1980 1464
1981 1758
1982 2057
Solution:
The trend line can be fitted by using the method of least squares for the
given data.
Table 14.7a: Calculation for trend line
Production
Year X= Year - 1979 XY X2
Y
1976 672 -3 -2016 9
1977 824 -2 -1648 4
1978 967 -1 -967 1
1979 1204 0 0 0
1980 1464 1 1464 1
1981 1758 2 3516 4
1982 2057 3 6171 9
Total ∑Y= 8946 ∑X=0 ∑XY= 6520 ∑X =28
2
XY 6520
XY b X 2
, therefore, b
X 2
28
232 .9
Y a bX 1278 232 .9 X
Merits Demerits
This method is a completely It requires many calculations and is tedious
objective method. and complicated.
This method gives the trend If even a single item is added to the series
values for the entire time period. a new equation has to be formed.
This method can be used to Future forecasts made by this method are
forecast future trend because based only on trend values. Seasonal,
trend line establishes a functional cyclical or irregular variations are ignored.
relationship between the value
and the time.
Non-linear trend
When the time series data do not confirm with the linear trend, then we
obtain non-linear trend. We do so by obtaining a parabolic curve or non-
linear curve in the method of least squares. For this we use the equation of
the form.
a b c 2 d 3 .......... k n which is known as a polynomial of
degree ‘n’ in ‘X’, k ≠ 0.
Let the parabolic curve be
a b c 2
The values of a, b, and c can be determined by solving the normal
equations:
ab c 2
a b 2 c 3
2 a 2 b 3 c 4
If we can change the origin at a suitable point, such that ‘X = 0’, then the
normal equations reduce to:
ac 2
b 2
2 a 2 c 4
Manipal University Jaipur Page No. 510
Statistics for Management Unit 14
Key Statistic
The additive model assumes that the observed value is the sum of four
components of time series, that is,
Y=T+S+C+I
where,
Y = original data
T = trend value
S = seasonal component
C = cyclical component
I = irregular component
The additive model for decomposition of time series assumes that all the
four components of the time series operate independently of one another. It
also assumes that the behaviour of components is additive in character. It is
to be noted that only absolute values are added or deducted from the trend
value to arrive at the observed value.
Key Statistic
The multiplicative model assumes that the observed value is obtained by
multiplying the trend (T) by the rates of three other components, that is,
Y=TxSxCxI
where,
Y = original data
T = trend value
S = seasonal component
C = cyclical component
I = irregular component
The multiplicative model assumes that the components, although due to
different causes, are not necessarily independent and they can affect one
another. It also assumes that the behaviour of components is of
multiplicative character. It is to be noted that except the value of trend, all
the other values on the right hand side are rates or index numbers.
3. Price changes
Adjustment for price changes becomes necessary wherever we have real
value changes. Current values are to be deflated by the ratio of current
prices to base year prices.
4. Comparability
The data which gets analysed should be comparable in order to have a valid
conclusion. When we deal with the analysis of time series it involves data
relating to the past which must be homogeneous and comparable.
Therefore, effects should be there to make the data as homogeneous and
comparable as possible.
S
Symbolically, seasonal index for first term is given by: I 1
100
1 S
Where, S1 = Average of first term
S = Average of all terms Sj / k where j = 1, 2, 3, 4……..k
k = 12 for monthly data
k = 4 for quarterly data
The merits and demerits of Simple average method are depicted in
table 14.8.
Table 14.9: Merits and Demerits of Simple Average Method
Merits Demerits
This method is the Most economic time series have trends and
simplest one. therefore, the seasonal index computed by this
method is really an index of trends and seasons.
ii) Under additive model, from each original value, the corresponding
moving average is deducted to find out short time fluctuations, which is
given as:
Y–T=S+C+I
iii) By preparing a separate table, monthly (or quarterly) short time
fluctuations are added for each month (or quarter) over all the years
and their average is obtained. These averages are known as seasonal
variations for each month or quarter.
iv) If we want to isolate / measure irregular variations, the mean of the
respective month or quarter is deducted from the short time
fluctuations.
14.8.3 Chain or link relative method
The steps involved in the chain or link relative method are described below.
i) Each quarterly or monthly value is divided by the preceding quarterly
or monthly value and the result is multiplied by 100. These
percentages are known as ‘Link Relatives’ of the seasonal values.
Thus:
Average Link Re lative of current year Chain Re lative of previous year
100
iv) The second chain relative of first is computed on the basis of the chain
relative for the last:
The Chain Re lative of first quarter
Average Link Re lative of the first quarter Chain Re lative of the last
100
This chain relative may or may not be 100. It is not equal to 100 due
to secular trend. If it is 100, go to ‘step vi’, if it is not 100, go to ‘step
v’ and then go to ‘step vi’.
v) Compute the difference ‘d’ between the new chain relatives first
obtained in ‘step iv’ and chain relative assumed as 100. ‘d’ is divided
by the number of seasons and the resulting figure is multiplied by
1, 2, 3 and the product is deducted respectively from the chain
relatives of 2nd, 3rd, and 4th quarters. These are called corrected
relatives.
vi) The seasonal indices are obtained when the corrected chain relatives
are expressed as percentage of their relative averages.
14.8.4 Ratio to trend method
The following steps are considered to determine seasonal indices by this
method:
i) Determine the trend values by the method of least squares.
ii) To find ratio to trend, divide the original data by the corresponding
trend values and multiply these ratios by 100, that is,
Original Data
Ratio to Trend 100
Trend Value
iii) Calculate the arithmetic mean of the trend ratios obtained in ‘step ii’.
iv) Finally, all the trend ratios will be converted into seasonal indices. Add
all averages obtained in ‘step iii’ and find their general average.
Seasonal indices are calculated by using the following formula:
Quarterly Averages
Seasonal Indices 100
General Averages
Y Y
t
In this method the trend effect and cyclic effects do not come into account.
14.9.2 Naive forecast
In this method we forecast the value, for the time period t, to be equal to the
actual value observed in the previous period, that is, time period (t-1). This
is given as:
Y Y
t t 1
Activity
Find seasonal variations by the ratio to trend method from the data given
below:
Activity Solution
Quarterly
Yearly Trend
Year Average X X2 XY
Total Values
Y
1994 280 70 -2 4 140 64
1995 360 90 -1 1 -90 88
1996 400 100 0 0 0 112
1997 520 130 1 1 130 136
1998 680 170 2 4 340 160
∑Y = 560 ∑X= 0 ∑ X =10
2
∑XY= 240
XY 240
XY b X 2
, therefore, b
X 2
10
24
14.10 Summary
Let us recapitulate the important concepts discussed in this unit:
A time series is a set of numerical values of a given variable listed at
successive intervals of time.
The time series is classified into the following four components: Long
term trend or secular trend, Seasonal variations, Cyclic variations and
Random variations.
The methods of measuring the trend of a time series are: Free hand or
graphic methods, Semi averages method, Moving average method and
Method of least squares.
The forecasting methods using time series are: Mean forecast, Naïve
forecast, Linear trend forecast, Non-linear trend forecast and Forecasting
with exponential smoothing.
14.11 Glossary
Chain Relative: Ratio of seasonal index for the quarter to the seasonal
index of the previous quarter
Link Relative: Ratio of value of the variable for the quarter to the value of
the variable of the previous quarter.
Random: Changes in data value due to the factors other than trend and
seasonal.
Seasonal: Periodical changes in data values over regular intervals of time.
Time series: A set of observations recorded over a period of time.
Trend: The tendency in the data values either to increase or decrease.
14.13 Answers
Terminal Questions
1. Refer section 14.2
2. Refer section 14.4.2 and section 14.4.3
3. Refer section 14.5
4. Refer section 14.5.3
5. Refer section 14.4 and section 14.5
6. Refer section 14.4
7. Refer section 14.8
8. The equation of the straight line is given as: Y = 90 + 2X
The trend values are 84, 86, 88, 90, 92, 94, 96.
9. The seasonal values obtained are 98.66, 110.74, 95.30, 95.30.
Table 14.12: The number of people visiting a hotel’s webpage per month
Jan Feb Mar Apr May Jun July Aug Sep Oct Nov Dec
2006-07 420 100 300 344 300 200 344 766 899 900 788 455
2007-08 620 399 345 455 677 355 766 500 799 800 880 555
2008-09 520 289 400 644 566 677 500 800 899 900 680 666
Using the suitable methods with justification, forecast the number of visitors
to web page for all the month in the academic year Jan 2010-2011.
References:
Agarwal B.L. (2006) Basic Statistics. 4th Ed., New Age International
Publishers.
Anderson, David R., Sweeney. Dennis J. & Williams, Thomas A.
5th ed, Thomson Business Information Pvt Ltd.
Bowerman, B. L. & O Connel, R.T. Applied Statistics: Improving
Business Processes. (1996) Irwin.
Levin, Richard I. , Rubin, David S. (2008) Statistics for Management.
7th Ed. PHI Learning Private Limited.
Manipal University Jaipur Page No. 522
Statistics for Management Unit 14
Pisani, Freedman D.R. & Purves, R. Statistics. (1997) 3rd Ed. W.W
Norton.
Srivastava & Shailaja Rejo, T.N. (2008) Statistics for Management
5th Ed.TMH.
Tanur, J.M. (2002) Statistics: A Guide to the unknown. 4th Ed. Brooks
/cole..
Tukey, J.W. (1977) Exploratory Data Analysis. Addison –Wesley.
Wilcox, Rand R. (2009) Basic Statistics – Understanding Conventional
Methods and Modern Insights. Oxford University Press.
E-References:
http://www.textbooksonline.tn.nic.in/Books/11/Stat-EM/Chapter-1.pdf