Evaluation of Artificial Neural Networks in Foreign 2013
Evaluation of Artificial Neural Networks in Foreign 2013
Email address:
[email protected]
Abstract: This study investigates the modeling, description and forecasting of exchange rates of four countries (Great
Britain Pound, Japanese Yen, Nigerian Naira and Batswana Pula) using Artificial Neural Network, the objective of this
paper is to use ANN to predict the trend of these four currencies. ANN was used in training and learning processes and
thereafter the forecast performance was evaluated or measured making use of various loss functions such as root mean
square error (RMSE), mean absolute error (MAE), mean absolute error (MAE), mean absolute precision error (MAPE) and
Theill inequality coefficient (TIC). The loss functions used are good indicator of measuring the forecast performance of
these series, the series with the lowest function gave a best forecast performance. Results show that the ANN is a very
effective tool for exchange rate forecasting. Classical statistical methods are unable to efficiently handle the prediction of
financial time series due to non-linearity, non-stationarity and high degree of noise. Advanced intelligence techniques have
been used in many financial markets to forecast future development of different capital markets. Artificial neural network is
a well tested method for financial markets analysis.
Keywords: Artificial Neural Networks, Foreign Exchange, Loss Functions, Training and Learning Processes
STEP II: Evaluation of the j th node in the hidden layer δ k is a weighted sum of all nodes and f h1 is the local
and the output of the k th node in the output layer as: slope of the node activation for hidden nodes.
STEP IV: The adjustment of weights and thresholds in the
n
h j = f h ∑ wij xi − θ j , output layer and hidden layer is obtained as follows:
i =1
(
wkjt +1 = wkjt + αδ k h j + η wkjt − wkj(
t −1)
),
n
yk = f k ∑ w jk x j − θ k
j =1 (
wtji+1 = wtji + αδ j hi + η wtji − w(ji
t −1)
),
Where
θ kt +1 = θ kt + αδ k ,θ tj +1 = θ tj + αδ j
1
and f k ( x ) = , Where α is the learning rate, η is the momentum factor,
(1 + exp ( −λ x ) )
k
and t the time period.
96 Akintunde Mutairu Oyewale: Evaluation of Artificial Neural Networks in Foreign Exchange Forecasting
Where
3. Train the Network and the Number of
Iteration for Training γ ij = the weight from hidden node I or from an input to
/ARCHITECTURE AUTOMATIC=YES
d j = the desired output of node j . (MINUNITS=1 MAXUNITS=50)
/CRITERIA TRAINING=BATCH
y j = the actual output of the node j . OPTIMIZATION=SCALEDCONJUGATE
LAMBDAINITIAL=0.0000005 SIGMAINITIAL=0.00005
The node biases are adapted through the similar iterative INTERVALCENTER=0 INTERVALOFFSET=0.5
algorithm by assuming they are connection weights on links MEMSIZE=1000
from constant-valued input /PRINT CPS NETWORKINFO SUMMARY
CLASSIFICATION
/PLOT NETWORK
4. Model Selection and Evaluation /STOPPINGRULES ERRORSTEPS= 1 (DATA=AUTO)
TRAININGTIMER=ON (MAXTIME=15)
Having obtained the outputs which are the forecast and
MAXEPOCHS=AUTO ERRORCHANGE=1.0E-4
the predicted values, these values are evaluated by
ERRORRATIO=0.0010
comparing the predicted values with the targets values
/MISSING USERMISSING=EXCLUDE.
using the mean square error and for the fewer pipes
deviation (the difference between the predicted from the 5.1. Case Processing Summary
actual in pipes).
The following criteria will be used to evaluate the Table 1 below shows that 320 cases were assigned to the
forecast performance of this model training sample and 124 to the testing sample (hold out),
this is in conformity with the standard rules that at least 70%
( )
T
of the sample must be set for training and the remaining
RMSE = T −1 ∑ Yt − Yˆt sample left is for testing.
t =1
T
Table 1. Case Processing Summary
MAE = T ∑ Yt − Yˆt
−1
t =1
N Percent
Training 320 72.1%
∑ (Y − Y )
T
−1 Sample
MAD = T t t Testing 124 27.9%
t =1
Valid 444 100.0%
T
Yt − Yˆt Excluded 0
MAPE = T −1 ∑ X 100
t =1 Yt Total 444
The smaller the values of these criteria the better the Table 2. Network Information
forecast performance; If the values are close to zero we say
it is better, if the results are not consistent among the first Input Factors 1 TIME
three we choose the MAPE to be the benchmark. Layer Number of Units a
37
1 consecutive
Stopping Rule Used step(s) with no
decrease in errora
5.4. Parameter Estimates of the Series Used in the Study the data for the study was collected is 37. The predicted
values for the ninth hidden layers are seen in the column
Table 4 show the parameters estimates and the predicted titled output layer. This prediction we shall later used in
values as shown in the table below, the total number of chapter to determine if this model has actually predicted
hidden layers used for the architecture of the model under very well.
study are nine and the number of years (Time) upon which
Table 4. Parameter Estimates of the series used in the study
Predictor Predicted
6. Prediction by Observed Chart 0n the x-axis for the combined training and testing samples.
The network shows that all series appears to be reasonably
Figures 2-4 below, shows that for scale dependent good at predicting as they all shows show how upward
variables, the predicted –by-observed chart displays scatter movement of each series.
plots of predicted values on the y-axis by observed values
109-114.
[2] Erims, K., Midilli, A., Dincer, I. and Rosen, M. A. (2007). [12] Shadboth, J. and Taylor, J. (2002). neural networks and the
Artifical Neural Network analysis of World Green Energy financial markets. Springer publishing company Limited.
Use. Energy Policy, 35, no. 3, p. 1731-1743. [13] Trippi, R. R. and Turban, E. (1993). Neural Networks in
[3] Franses, P.H and Van Griensen, K. (1998). Forecasting finance and investing: using artificial intelligence to
exchange rates using neural networks for technical trading improve real world performances. Probus publishing
rules. Studies in non-linear dynamics and Econometrics, 4, company (Chicago III).