MATLAB Based Graphical User Interface GUI For Data Mining As A Tool For Environment Management Libre
MATLAB Based Graphical User Interface GUI For Data Mining As A Tool For Environment Management Libre
+
= + , where
w: The weight, J . Jacobian, . Comb
which is alwavs positive, I . The identitv ma
As the combination oI steepest descent a
Gauss-Newton algorithm, The Leve
algorithm switches between the two algor
training process. (See |10|).
The design process Ior data mining alg
neural network has been published to Iit
opportunity oI developing the interIace
soItware.
ThereIore, we use again the design oI |8
oI data mining method (see Fig. 12).
Example 4, this example shows the regre
dataset (see Figs. 13 and 14), which has be
measurements Irom thermometer sensor d
over observing each 60 minutes. The inp
imported as DBASE1`, which is a |3x25|
static data: 25 samples oI 3 elements, with
selections: 70 oI samples Ior training, 15
and 15 Ior testing.
ing algorithms Ior FeedIorward
arquardt algorithms can be
, where
. Combination coefficient
ntitv matrix.
est descent algorithm and the
The Levenberg-Marquardt
e two algorithms during the
a mining algorithms Ior our
to Iit with the Iuture
e interIace as open source
8| to show the design
ws the regression Ior a small
hich has been constructed as
ter sensor during three days
tes. The input Iile has been
| matrix, representing
ments, with these percentage
training, 15 Ior validation,
Fig. 13 Regression, d
Fig. 14 Neural network training pe
Fig. 15 InterIaces output and progra
mentions our interIace and which it
pane
egression, dataset oI example 4
rk training perIormance, dataset oI example 4
ut and programming based, 'Get Observation
and which it includes the other interIaces in its
panels
World Academy of Science, Engineering and Technology
International Journal of Computer, Information Science and Engineering Vol:8 No:1, 2014
135
I
n
t
e
r
n
a
t
i
o
n
a
l
S
c
i
e
n
c
e
I
n
d
e
x
8
5
,
2
0
1
4
w
a
s
e
t
.
o
r
g
/
p
u
b
l
i
c
a
t
i
o
n
s
/
9
9
9
7
4
9
4
IV. DATA MINING ALGORITHMS AND REVIEW
There are three primary components in any data-mining
algorithm: (i) model representation (ii) model evaluation, and
(iii) search. |2|,|12|-|16|.
Model representation our interIace is operated by a
graphical user interIace based on Matlab. This representation
can be divided into two Iaces: (i) Hidden programming Ior
data collecting which based on X-DROPS code, at the end oI
this stage the system has been initialized and the discoverable
patterns are described into graphical user interIace based on
Matlab. (ii) Matlab programming, which the interIace has
been built and executed over Matlab code using (GUI) tools.
Interactive graphical user interIace describe the discoverable
patterns as graphs, tables, analysis, numerical results, etc. As
mentioned beIore we can summarize the representation Ior our
interIace and interIaces inside as shown in Fig. 15.
Another component oI data mining algorithms is Model-
evaluation criteria, as mention in |2|, this component are
quantitative statements oI how well a particular pattern (a
model and its parameters) meets the goal oI knowledge
discovery in database (KDD) process. KDD is the 'nontrivial
process of identifving valid, novel, potentiallv useful, and
ultimatelv understandable patterns in data |2|. Our interIace
meets this deIinition by its application. The understandable-
constrains plot oI measurements, which have been extracted
corresponding to speciIic user criteria and have been exported
to the data models (predictive models, data training pattern,
clustering, and data Iitting) give a clear idea Ior the observer
about the desired data with their analysis. All oI models have
been tested under a randomly user input choices; test set has
been used to examine the predictive model accuracy with
other multi-input levels.
Search method, the third components oI data mining
algorithm, which it consists oI two components: (1) parameter
search and (2) model search. In this stage data mining task is
reduced to purely an optimization task: Iind the parameters
and models Irom selected Iamily that optimized the evaluation
criteria |2|, |11|. (See Fig. 16).
Fig. 16 InterIace data Ilow and process Irom the search model up to
extracted data. More details about the search model have been
discussed in the previous sections oI this paper
V. CONCLUSION
The aim oI these tools is to provide an interIace Ior
applying data mining methods in the environmental
applications. This interIace plays an intermediate-cooperated
role between two Iields: environmental monitoring and data
mining. We use this interIace to collect data Irom sensor
(extract Iorm database) and applying the data mining methods
on these sets oI data, Iinally Ior extracting theses data in
understandable and Ilexible model to be used in the decision
support system. The system has been designed to Iit with real
on-line observing and the algorithms have been constructed
Ilexibly to meet with Iuture needs. This interIace will be
developed in an upgrading-phase to be as an open source
software.
ACKNOWLEDGMENT
Authors would like to extend grateIul thanks to proIessor
Angelo Alessandri and proIessor Patrizia Bagnerini Irom the
department oI mathematical engineering and simulation in the
university oI Genova, Cosimo Versaci Irom ACROTEC
Company, Ior there guidance and their appreciated eIIorts. In
special way, the author wishes to acknowledge all members oI
the project (N.I.E), in providing the data on which this toolwas
based.
REFERENCES
|1| Jessica Spate, Karina Gibert, Miquel S`anchez-Marr`e, Eibe Frank,
Joaquim Comas, Ioannis Athanasiadis, Rebecca Letcher, 'Data Mining
as a Tool Ior Environmental Scientists. Al Maga:ine Vol.17.1996.
|2| Usama Fayyad, Gregory Piatetsky-Shapiro, and Padhraic Smyth, From
Data Mining to Knowledge Discovery in Databases, Article.
|3| Richard Johnson and Dean Wichern, "Applied Multivariate Statistical
Analysis", 1992.
|4| John W. Tukey, 'Exploratory data analysis, 1977.
|5| Weiss, Sholom M. and Kulikowski, Casimir A. "ClassiIication and
prediction methods Irom statistics, neural nets, machine learning, and
expert systems", 1991.
|6| Anil K. Jain and Richard C. Dubes. 'Algorithms Ior data clustering",
1988.
|7| Math Works, http://www.mathworks.it/index.html
|8| Mikut, R.; Reischl, M.; Burmeister, O.; Loose, T.: Data Mining in
Medical Time Series. Biomedizinische Technik, vol. 51, pp. 288293,
2006.J.
|9| R. Mikut, O. Burmeister, S. Braun, M. Reischl, 'the open source matlab
toolbox gait-cad and its application to bioelectric signal processing,
Paper, Institute Ior Applied Computer Science, Forschungszentrum
Karlsruhe GmbH, Germany.
|10| Hao Yu and B. M. Wilamowski, 'LevenbergMarquardt Training
Industrial Electronics Handbook, vol. 5 Intelligent Systems, 2nd
Edition, chapter 12, pp. 12-1 to 12-15, CRC Press 2011.
|11| NIE project documents, ACROTEC Company. http://www.acrotec.it
|12| Muhammad Aqil, Ichiro Kita, Akira Yano, and NishiyamaSoichi.
'Decision Support System Ior Flood Crisis Management using ArtiIicial
Neural Network, International Journal oI Electrical and Computer
Engineering 1:5 2006
|13| David Hand, Principle oI Data Mining. Massachusetts Institute oI
Technology, 2001.
|14| Feng Jiansheng. KDD and its applications, BaoGangtechniqyes. 1993.
|15| Xianjun NI, Research oI Data Mining Based on Neural Networks. World
Academy oI Science, Engineering and Technology, 39 2008.
|16| Ben-Gal, Data Mining and Knowledge Discovery Handbook, chapter
one.'Kluwer Academic Publisher, 2005.
World Academy of Science, Engineering and Technology
International Journal of Computer, Information Science and Engineering Vol:8 No:1, 2014
136
I
n
t
e
r
n
a
t
i
o
n
a
l
S
c
i
e
n
c
e
I
n
d
e
x
8
5
,
2
0
1
4
w
a
s
e
t
.
o
r
g
/
p
u
b
l
i
c
a
t
i
o
n
s
/
9
9
9
7
4
9
4