The Weather Forecast Using Data Mining Research Based On Cloud Computing
The Weather Forecast Using Data Mining Research Based On Cloud Computing
Abstract. Weather forecasting has been an important application in meteorology and one of
the most scientifically and technologically challenging problem around the world. In my study,
we have analyzed the use of data mining techniques in forecasting weather. This paper
proposes a modern method to develop a service oriented architecture for the weather
information systems which forecast weather using these data mining techniques. This can be
carried out by using Artificial Neural Network and Decision tree Algorithms and
meteorological data collected in Specific time. Algorithm has presented the best results to
generate classification rules for the mean weather variables. The results showed that these data
mining techniques can be enough for weather forecasting.
1. Introduction
Weather Prediction is the application of science and technology to predict atmospheric conditions ahead
of time for a particular region. Prediction is one of the basic goals of Data Mining. Data Mining is to
dig out knowledge and rules, which are hidden and unknown. User may be interested in or has potential
value for decision-making from the large amounts of data. Such potential knowledge and rules can
reveal the laws between the data. There are many kinds of technical methods of data mining, which
mainly include: association rule mining algorithm, decision tree classification algorithm, clustering
algorithm and time series mining algorithm, etc. [1]. How to store, manage and use these massive
meteorological data, discover and understand the law and knowledge of the data, to contribute to
weather forecasting completely and effectively has attracted more and more Data Mining researcher’s
attention[2]. This article constructs the Weather Forecasting platform, using data mining for
meteorological forecast and the forecast results are analyzed.
2. Weather Forecasting
Weather forecasting plays a significant role in meteorology [3]. Weather forecasting has remained a
formidable challenge because of its data intensive and frenzied nature. Generally, two methods are used
to forecast weather: a) the empirical approach and b) the dynamical approach. The first approach is
based on the occurrence of analogues and often referred to as analogue forecasting. This approach is
useful in predicting local scale weather if recorded cases are plentiful. The second case is based upon
equations and forward simulations of the atmosphere and often referred to as computer modeling. Most
weather prediction systems use a combination of both of these techniques.
Content from this work may be used under the terms of the Creative Commons Attribution 3.0 licence. Any further distribution
of this work must maintain attribution to the author(s) and the title of the work, journal citation and DOI.
Published under licence by IOP Publishing Ltd 1
CTCE2017 IOP Publishing
IOP Conf. Series: Journal of Physics: Conf. Series 1234567890
910 (2017) 012020 doi:10.1088/1742-6596/910/1/012020
This framework as a service (FAAS) has selected seven common forecasting methods. These are
Regression (R), Logistic Regression, Time Series, Artificial Neural Network, Random Forest, Support
Vector Machine and Multivariate Adaptive Regression Splines (MARS). For instance, Regression may
encounter them collinearity among variables. Logistic Regression could only deal with the dataset
where the dependent variable is nominal.
There are three basic elements of a neuron model. Figure.3 shows the basic elements of neuron
model with the help of a perceptron model, which are, (i) a set of synapses, connecting links, each of
which is considered by a weight/strength of its own (ii) an adder, for summing the input signals,
weighted by respective neuron’s synapses (iii) an activation function, for limiting the amplitude of
neuron’s output. A typical input-output relation can be expressed as shown in Equation 1.
2
CTCE2017 IOP Publishing
IOP Conf. Series: Journal of Physics: Conf. Series 1234567890
910 (2017) 012020 doi:10.1088/1742-6596/910/1/012020
n e tj W ijX i b j
j 1
O i f i ( n e ti )
(1)
Where = inputs to node in input, = weight between input node and hidden node, b – bias at node, net =
adder, f = activation function.
The type of transfer/activation function affects the size of the steps taken in weight and space [12].
ANN’s architecture needs determination of number of the connecting weights and the way in which the
information flows through this network is carried out via the number of layers, nodes number in each
layer, and their connectivity. The output nodes numbers are fixed, according to the estimated quantities.
The input nodes numbers are dependent on the existing problem under consideration, and the modeler’s
choice to utilize knowledge of domain. The neurons in the hidden layer are enhanced gradually, and the
network performance in the form an error is examined.
the .
3
CTCE2017 IOP Publishing
IOP Conf. Series: Journal of Physics: Conf. Series 1234567890
910 (2017) 012020 doi:10.1088/1742-6596/910/1/012020
The MLP network is trained through the back-propagation learning algorithm. This training
algorithm is to minimize the cost function (E) of the network, expressed as mean square error amongst
all the outputs of the network and the respective desired values defined in the training set. The mean
square error is calculated by using the following expression:
Figure 4 Weather Forecasting Temperature for Last Five Years (2011-2015) Highest and Lowest
Value
Figure 5 Results of wind speed prediction Value in Last Five Years. (2011-2015)
4
CTCE2017 IOP Publishing
IOP Conf. Series: Journal of Physics: Conf. Series 1234567890
910 (2017) 012020 doi:10.1088/1742-6596/910/1/012020
The results have shown that concerning Wind Speed and Temperature highest and lowest Value
displayed the simulation results (figure 4 and 5) in Intervals of last five years. While using more input
data values, the execution time can be as longer as expected.
7. Acknowlegment
This research was supported by the National Natural Science Foundation of China (Grant No.
61672126) and (Grant No. 51579041), We would like to thank the anonymous reviewers for their
attentive reading and for their constructive comments that have helped to further strengthen this paper.
8. References
[1] M. Zhang, “Application of Data Mining Technology in Digital Library”, Journal of Computers,
vol. 6, no. 4, (2011) April, pp. 761-768.
[2] Z. Danping and D. Jin, “The Data Mining of the Human Resources Data Warehouse in University
Based on Association Rule”, Journal of Computers, vol. 6, no. 1, (2011) January, pp. 139-146.
[3] Introduction to Data Mining and Knowledge Discovery, Third Edition, Two Crowds Corporation,
http://www.twocrows.com/introdm.pdf, accessed on 12 April 2009.
[4] L. M. Saini and M. K. Soni, “Artificial neural network-based peak load forecasting using
conjugate gradient methods,” IEEE Transactions on Power Systems, vol. 12, no. 3, pp. 907–
912, . 2002.
[5] S. Fan, C. X. Mao, and L. N. Chen, “Peak load forecasting using the self-organizing map,” in
Advances in Neural Network-ISNN 2005. New York: Springer-Verlag, 2005, pt. III, pp. 640–
649.
[6] Kourentzes, N., “Intermittent demand forecasts with neural networks”, International
Journal of Production Economics, Volume 143, Number 1, pages 198-206, 2013.
[7] Elia G. P., 2009, “A Decision Tree for Weather Prediction”, Universitatea Petrol-Gaze din Ploiesti,
Bd. Bucuresti 39, Ploiesti, Catedra de Informatică, Vol. LXI, No. 1.
[8] A. R. Finamore; V. Calderaro; V. Galdi; A. Piccolo; G. Conio; S. Grasso, “A day-ahead wind
speed forecasting using data-mining model– a feed-forward NN algorithm", IEEE International
Conference on Renewable Energy Research and Applications, 2015, pp. 1230-1235.
[9] E. Erdem, J. Shi, “ARMA based approaches for forecasting the tuple of wind speed and direction”,
Applied Energy 88, ELSEVIER, 2011, pp. 1405–1414.
5
CTCE2017 IOP Publishing
IOP Conf. Series: Journal of Physics: Conf. Series 1234567890
910 (2017) 012020 doi:10.1088/1742-6596/910/1/012020
[10] N. Chen, Z. Qian, I. T. Nabney, and X. Meng, “Wind Power Forecasts Using Gaussian Processes
and Numerical Weather Prediction,” IEEE Transaction on Power Systems, vol. 29, no. 2, 2014.
[11] EWEA Wind in power: European Statistics. 2013.
[12] Martin T. H., Howard B. D, Mark B., 2002, Neural Network Design, Shanghai: Thomson Asia
PTE LTD and China Machine Press.
[13] N. Chen, Z. Qian, I. T. Nabney, and X. Meng, “Wind Power Forecasts Using Gaussian Processes
and Numerical Weather Prediction,” IEEE Transaction on Power Systems, vol. 29, no. 2, 2014.
[14] R. R. B. de Aquino, H. T. V. Gouveia, M. M. S. Lira, A. A. Ferreira, O.N. Neto, M. A. Carvalho
Jr., “Wind Forecasting and Wind Power Generation: Looking for the Best Model Based on
Artificial Intelligence”, IEEE World Congress on Computational Intelligence, 2012.
[15] Roger A. Pielke Sr., “Mesoscale Meteorological Modelling”, International Geophysics Series,
Volume 98, oct, 2013.
[16] David J. Stensrud ,“Parameterization schemes – Keys To Understanding Numerical Weather
Prediction Models”, Cambridge University Press, 2007.
[17] Renewables – Global Status Report- REN21, 2015.