Conf1 Ieee Icaesm
Conf1 Ieee Icaesm
A. Making the data stationary In (3), rt represents residual data. To each of the components,
The time series data need not be stationary always. In this case trend and residual data, basic ARIMA model, described in the
an ARMA model cannot be fit. So initially, the data is made previous section is applied and each component is predicted.
stationary by performing differencing operation. The number Then the raw data predictions are obtained by summing up the
of times differencing is performed is denoted by ‘d’. predicted trend and predicted residual data as given in (4):
B. Identifying suitable values for model order y t , pre st , pre rt , pre (4)
To find out the values of p and q, the autocorrelation function
(ACF) and the partial autocorrelation function (PACF) are In (4), y t , pre represents predicted raw data, st , pre represents
applied on the differenced data. If the ACF shows a sinusoidal
decay and simultaneously, PACF becomes zero after a lag p, it
the predicted trend component, and rt , pre represents the
is a pure AR process of order p. If the ACF becomes zero after predicted residual component. The thus obtained predictions
a lag q, and PACF has sinusoidal decay, then it becomes an have greater accuracy compared to predictions from basic
MA process of order q. If both ACF and PACF have ARIMA. This has been confirmed as per the results obtained
sinusoidal decay, and become zero after lags q and p in section V.
respectively, then it becomes an ARMA process of order (p, q)
and correspondingly ARIMA process of order (p, d, q).
C. Prediction or forecasting the time series data IV. WAVELET-BASED ARIMA
After the model order (p, d, q) is obtained, using the estimation This is also a composite technique, where, the raw data is first
techniques, the parameters, a1 , a2 .. a p , b1 , b2 .. bq are found pre-processed using Wavelet decomposition technique and
then predictive data mining techniques are applied. The raw
out. Also, the variance of et can be obtained. Using these time series data is decomposed into several time series data
parameter estimates and error variance, the time series data components using wavelet transform. These components are
can be forecasted using methods like Least Squares, Maximum called as approximate and detailed components. This wavelet
Likelihood etc. transform can be applied in multiple levels.
2, ydt 2 . This decomposition can be further continued to a identification and estimation is accomplished with the help of
R software, which is a freeware. This can also be done using
required number of levels. This decomposition strategy is
SPSS or SAS softwares. The results of basic ARIMA are
shown in (5).
shown in figure 2.
(5)
Thus, the raw data is decomposed into three different time
series data yat 2 (approximate), ydt 2 (detail 2) and
ydt1 (detail 1). The approximate component resembles the
smoothened version of raw data (similar to trend of section
III). Detail 2 and Detail 1 are the high frequency components.
If wavelets are applied again on yat 2 , then yt is decomposed
into one approximate and three detail time series data. The
detailed outline of this method is described in [1].
Each decomposed time series data is predicted using the basic
ARIMA model described in section II. Then the raw data
predictions are obtained by adding the predicted approximate Figure 1. Original Time Series data (Average Global Temperature)
time series data and all the predicted detailed time series data,
as given in (6): The trend-based ARIMA has been applied on the raw data
and the model for trend has been identified as ARIMA(11, 0,
y t , pre yat 2, pre yd t 2, pre yd t1, pre (6) 11). The Residual model is obtained as ARIMA(5 ,0, 5). The
predicted data using this method is compared with the original
This method is an extension of the trend-based ARIMA
data as shown in figure 3. The smoothing filter can be
described in section III. Trend and residual data are like
implemented using MATLAB or SciLab.
decompositions of raw data. As trend is smooth version, it is
similar to the low frequency component of raw data and the
residual data comprises the high frequency component of raw
data. Trend and residual data are like first level
decompositions of the raw data. Trend can be further
decomposed and the method can be extended.
The difference between this method and the Trend-based
ARIMA method is that the filter used here is not a simple
moving average filter but a db5 filter. So this method has a
greater accuracy in predicting than trend-based ARIMA.
Nevertheless, trend based ARIMA has a better accuracy than
basic ARIMA. These have been confirmed from the results
shown in section V.
1 N
original i forecast i
MAPE
N
i 1 forecast i
(7)
original i forecast i
MaxAPE max
(8)
forecast i
N
1
MAE
N
original
i 1
i forecast i (9)
VI. CONCLUSION
This paper predicts average global temperature for the
forecasting period 2001 to 2010. This prediction is
accomplished using three variants of ARIMA models, basic-
ARIMA, Trend-Based ARIMA and Wavelet-Based ARIMA
models. The performance of each of these methods is
compared using MAPE. It is concluded that Wavelet–Based
ARIMA performs the best out of the three. Trend-Based
Figure 3. Raw data and Trend-based ARIMA predicted data
ARIMA is better than Basic-ARIMA but is inferior to Wavelet
–Based ARIMA. All the methods used here are linear in
nature. Using ANN, Fuzzy, HMM and Rough Set and Hybrid
models we can devise even better prediction method.
REFERENCES
[1] Ni Lihua, “ARIMA traffic flow using wavelets”, 2nd International
Conference ICISE, 2010.
[2] Dr. PDKV, “Use of the ARIMA Model for Forecasting Pigeon Pea
Production in India”, International Review of Business and Finance,
ISSN 0976-5891 Volume 2 Number 1, pp. 97–102, 2010.
[3] Xu Ye, “The Application of ARIMA Model in Chinese Mobile User
Prediction”, 2010 IEEE International Conference on Granular
Computing , 2010.
[4] Xiaoguo Wang, “ARIMA Time Series Application to Employment
Forecasting”, Proceedings of 4th International Conference on
Computer Science & Education, 2009.
[5] Yohanes Budiman Wijaya ,“Stock price prediction comparison of
Arima and artificial neural network Methods”, 2010 Second
Figure 4. Raw data and Wavelet-based ARIMA predicted data International Conference on Advances in Computing, Control, and
Telecommunication Technologies, 2010.
[6] Jinsuo Lu, “Data Mining on Algae Concentrations -Chlorophyll Time He has eleven years of teaching experience in various engineering
Series in Source Water Based on Wavelet”, Fifth International colleges. His research interests include Data Mining, Prediction on time
Conference on Fuzzy Systems and Knowledge Discovery, 2008. series data, Neural Networks, optimization techniques etc.
[7] Mehdi Khashei, “An artificial neural network (p, d,q) model for time
series forecasting”, Expert Systems with Applications pp. 479–489,
Elsevier, 2010.
[8] Chun-Hao Chen a, “Mining fuzzy frequent trends from time series”,
Expert Systems with Applications, pp. 4147–4153, Elsevier 2009.
[9] JingTao Yao, “Financial time-series analysis with rough sets”, Applied Dr. B. Eswara Reddy Graduated in B.Tech.(CSE) from Sri Krishna
Soft Computing , pp. 1000–1007, Elsevier 2009. Devaraya University in 1995. He received Masters Degree in
M.Tech.(Software Engineering), from JNT University, Hyderabad, in 1999.
[10] Jie Yin, “Integrating Hidden Markov Models and Spectral Analysis for He received Ph.D in Computer Science & Engineering from JNT University,
Sensory Time Series Clustering”, Proceedings of the Fifth IEEE Hyderabad, in 2008. He served as Assistant Professor from 1996 to 2006. He
International Conference on Data Mining (ICDM’05), 2005. is working as Associate Professor in CSE Dept., since 2006 and currently
[11] Brockwell ,“Time Series Theory and Methods ”, 1987. acting as Head of CSE Dept. at JNTUACE, Anantapur.
[12] http://www.earth-policy.org/data_center/C26 He has more than 10 Publications in various International Journals
and more than 15 Publications in various National and International
Conferences. He is one of the authors for the text book titled Programming
with Java published by Pearson/Sanguine Publishers. His research interests
include Pattern Recognition & Image Analysis, Data Warehousing & Mining
Narendra Babu C (M’10) became the Member(M) of IEEE in 2010. He was and Software Engineering.
born in 14 Jul 1978. He is presently working as Senior Assistant Professor , in He is a life member of ISTE, IE, ISCA and CSI.
the REVA Institute of Technology, Bangalore and also pursuing his Ph.D
from JNT-University Anantapur, AP, under the guidance of Dr. B. Eswara
Reddy. He completed his M.Tech degree from MSRIT, Bangalore in 2004.
He completed his BE from Adichunchanagiri Institutie of Technology,
Chikmagalur, Karnataka in 2000.