Development of An Optimal Neural Network For Avala
Development of An Optimal Neural Network For Avala
net/publication/220705259
CITATION READS
1 135
4 authors, including:
Virender Rihani
Shoolini University
12 PUBLICATIONS 61 CITATIONS
SEE PROFILE
All content following this page was uploaded by Virender Rihani on 10 June 2016.
Abstract - This paper deals with the application temperatures. This data has secondary relevance in
of a well-known data mining technique, multi-layer avalanche formation.
back-propagation neural network, for forecasting of Class III: Meteorological factors, such as the
an avalanche in Himalayan region. Metrological and temperature, winds, precipitation and humidity which
snow data for Himalayan region has been used for bears indirect relevance to avalanche formation.
training the neural network. EasyNN-plus 6.0g, neural Primarily avalanche forecasting [4] is done by an
network software for Microsoft windows, is used for expert, acting on their knowledge and experience with
the development of an optimal neural network- snow stability situations. The physical [5] principles
PUSHPDEV. The system tries to model the decision underlying avalanche formation remain an area of
making process of a pragmatic expert. PUSHPDEV active research as they are not yet fully understood. A
can forecast whether an avalanche will trigger on a forecasting tool can therefore be used as an aid to
particular day from November to April. The network expert in decision-making for prediction of an
accepts eighteen inputs and produces an output whose avalanche. Different types of techniques are used in
value is zero or one, zero for no avalanche and one for avalanche forecast. They can be categorized as non-
avalanche on that day. computer assisted [6] and computer assisted
techniques [4-5][7-17].
Keywords: Avalanche, data mining, data warehouse Avalanches in Himalayan are a serious threat to
both human as well as to property. Avalanches results
1 Introduction in closure of roads which affects the necessary
supplies and services to both army and humans. Fuzzy
An avalanche is a mass of snow or ice that slides rule bases system [14] and nearest neighbour method
down the side of a mountain rapidly. Avalanches [1] [17] are two computer assisted tools, used for
are generated by structural weakness in the snowcover. avalanche forecasting in Himalayan region. The need
Avalanches occur due to the failure of the snowpack to for development of another tool is to use it along with
withstand the stresses placed upon it by its self weight already existing tools so as to have a better avalanche
and other imposed forces. The forecasting factors of warning system for Himalayan region.
avalanche are divided into three classes [2-3]. The Data mining [18] is known as “knowledge
higher the class number less relevance is the data for discovery in databases (KDD)”. It is the process of
avalanche formation. These classes are: extraction of patterns representing knowledge stored
Class I: Stability factors, including current avalanche in either data warehouse or databases. Data mining is a
activity, stability tests (e.g. Rutschblock, Shovel test) multidisciplinary field, drawing work from area
as well as other signs of instability, such as cracking of including database technology, artificial intelligence,
the snowcover. This is most relevant data for neural network, machine learning, statistics,
avalanche formation. knowledge acquisition, high performance computing,
Class II: Snowpack factors, such as the structure of information retrieval, and data visualization. C4.5, C5,
the snowpack, including the existence of weak layers, CART and feedforward backpropagation are different
crystal forms and sizes, densities and snow types of data mining algorithm used. Artificial neural
network (feedforward backpropagation), a form of speed, fresh snow amount, fresh duration and standing
data mining, has the advantage that it can handle snow etc. These parameters are taken twice in a day, at
inconsistent data. Artificial neural network can 8:30 and 17:30 hours.
therefore is a good choice for forecasting problems 2) Avalanche occurrence data, which specify when
like avalanche where data collected over years is an avalanche occurred.
inconsistent. Artificial neural network is an attempt to
mimic the biological decision making process of the 3 Method
human. Artificial neural network [18] development is
a two-step process. In the first step, supervised type of The development of an optimal neural network
learning is used to build a model. In step two, the has three main processes: objective determination,
model is used for the classification and the accuracy is data preparation and neural network modeling. Detail
estimated. If estimated accuracy of the model is descriptions of these processes are
acceptable then the model is used for prediction.
Artificial neural networks have been developed in 3.1 Objective Determination
south central Colorado of United States [4]. They uses The basic aim of this process is to obtain clear
fifteen metrological and snow parameters, and output understanding of the model to be developed. Objective
is an index whose value ranged from zero to one, with determination leads to the actual specification to be
higher values corresponding to increased likelihood's used for the development of the model. The output of
of avalanche activity. These models have the accuracy this phase is the problem definition and expected
of predicting avalanche activity up to 78 to 91 percent result. Problem definition for our work is “To develop
at different learning rate. an optimal neural network model to predict whether
EasyNN-plus 6.0g, is neural network software for an avalanche will trigger on a particular day or not in
Microsoft windows, is used for the development of an Himalayan region.” The expected output of the model
optimal neural network. All the component parts of is a value, which is zero or one, zero for no avalanche
EasyNN-plus are implemented as C++ reusable and one for avalanche on that day.
classes. The inspiration to use EasyNN-plus was based
on the fact that it has a graphical user interface and
Data Pre-Analysis
backpropagation algorithm. EasyNN-plus uses double
linked lists for implementation to store the examples,
the nodes and the connections. The lists can be
processed quickly in both directions simultaneously, Data-Preprocessing
and it can also be extended and contracted
dynamically.
Table 1 Data used in PUSHPDEV Data Post-Analysis
1999 2001 2002 2003 2004 2005
Figure 1 Data preparation scheme
Jan √ √ √ √ √ *
Feb √ √ √ √ √ * 3.2 Data Preparation
Mar √ √ √ √ √ * Data preparation is an important and critical step
April √ √ √ √ √ * in neural network modeling. Data preparation
Nov √ √ √ √ * includes all operation that is carried out on data for
Dec √ √ √ √ * development of the model. Data preparation [19]
√ : Data used for training * : Data used for testing includes data pre-analysis, data preprocessing and data
post-analysis. Data preparation scheme is shown in
Figure 1.
2 Data
The present study lies in Lower Himalayan Zone 3.2.1 Data Pre-Analysis
and data for a particular site, is provided by SASE Data pre-analysis phase includes collection of
(Snow & Avalanche Study Establishment). It includes metrological and snow data for last few years. It also
metrological and snow data, which falls in Class III, identifies main data variables from the collected data,
least relevant data for avalanche forecast. Data which was used later for the development of the model.
available for different years is shown in Table 1. The selections of variables were based on their
Following is the description of data. importance to problem and quantity of data available.
1) Daily weather data, which records meteorological The parameters identified and used as input to
and snow parameter like minimum temperature, PUSHPDEV are shown in Table 2.
maximum temperature, dry temperature, average wind
Table 2 Eighteen input to the PUSHPDEV 3.2.3 Post Analysis
Sr.
No.
Input Last step in data preparation is post analysis,
1 Max temperature which include dividing data into two parts, one for
2 Min temperature training and other for testing to verify the network
3 Dry temperature performance. Data divided for training (including 99
4 Dry temperature difference records randomly selected for validation) and testing
5 Average maximum(Average of last three reading) (for an entire season) is shown in Table 1.
6 Average minimum(Average of last three reading)
7 Difference of average maximum(Difference of current
and last average maximum temperature) 3.3 Neural Network Modeling
8 Difference of average minimum(Difference of current Neural network modeling is an iterative process.
and last average minimum temperature)
9 Pressure difference
It includes selecting an initial configuration. Then a
10 Average speed number of experiments with each configuration are
11 Difference of average speed performed and the best network is retained. On each
12 Fresh snow continuous (sum of current and last fresh experiment, if under-learning occurs, then new
snow amount) network is tried by adding more neurons to the hidden
13 Fresh snow duration layer/layers or by adding an extra hidden layer. If
14 Fresh snow water equivalent (log (1+x)/log181, where over-learning occurs, then new network is tried by
x is the value of Fresh snow water equivalent).
removing hidden units. After training the network
15 Standing Snow
16 Difference in snow temperature
under predefined average error, network performance
17 Snow Penetration below Surface(Is the depth of is verified against data kept for testing. If network
snowpack up to which the ram penetrometer penetrates performance against test data is satisfied, then the
with its own weight) network is selected.
18 Characteristic of weather during proceeding one hour Table 3 Specification of PUSHPDEV
(new code developed by Code conversion method Number of input neurons 18
describes later.)
Number of output neuron 1