Learning Rules in ANN
The Hebbian rule was the first learning rule. Donald Hebb developed it in 1949 as a
learning algorithm for unsupervised neural networks. We can use it to determine how to
update the weights of the nodes of a network. The Hebb learning rule assumes that if two
neighboring neurons are activated and deactivated at the same time, then the weight connecting
these neurons should increase. At the start, the values of all weights are set to zero. This learning
rule can be used with both soft and hard activation functions. Since the desired responses of the
neurons are not used in the learning procedure, it is an unsupervised learning rule. A drawback is that the
absolute values of the weights tend to grow in proportion to the learning time, which is
undesirable.
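A minimal sketch of a single Hebbian step, assuming the common rate-based form Δwij = η · yj · xi (the learning rate η, the array names, and the example values are illustrative, not from the original text; in the unsupervised setting y is the neuron's own response, not a teacher signal):

import numpy as np

def hebbian_update(weights, x, y, learning_rate=0.1):
    """One Hebbian step: delta_w[j, i] = eta * y[j] * x[i].
    Weights between units that are active together grow stronger."""
    return weights + learning_rate * np.outer(y, x)

w = np.zeros((2, 3))            # weights start at zero, as the rule assumes
x = np.array([1.0, 0.0, 1.0])   # pre-synaptic activations
y = np.array([1.0, -1.0])       # post-synaptic activations
w = hebbian_update(w, x, y)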
Perceptron Learning Rule
Each connection in a neural network has an associated weight, which changes in the course of
learning. The perceptron learning rule is an example of supervised learning. The network starts its learning by
assigning a random value to each weight. It then calculates the output values for a set of records whose
expected output values are known; because this set defines what is to be learned, it is called the
learning sample. The network compares each calculated output value with the
expected value and computes an error function E, which can be the sum of the squared errors
over the learning sample:

E = Σi Σj (Eij − Oij)²

The first summation runs over the individuals of the learning set and the second over the
output units; Eij and Oij are the expected and obtained values of the jth unit for the ith
individual. The network then adjusts the weights of the different units, checking each time whether the
error function has increased or decreased. As in conventional regression, this amounts to solving a
least-squares problem. Since the expected outputs are assigned by the user, this is an example of
supervised learning.
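A minimal sketch of this procedure for a single-layer linear network, assuming plain gradient descent on the squared-error function E (the data, learning rate, and function name below are illustrative):

import numpy as np

def train_least_squares(X, T, learning_rate=0.01, epochs=100):
    """Adjust the weights to reduce E = sum_i sum_j (T[i, j] - O[i, j])**2
    for a single-layer linear network with random initial weights."""
    rng = np.random.default_rng(0)
    W = rng.normal(scale=0.1, size=(X.shape[1], T.shape[1]))
    for _ in range(epochs):
        O = X @ W                          # obtained outputs O[i, j]
        error = T - O                      # expected minus obtained values
        W += learning_rate * X.T @ error   # step that lowers E
    return W

# Learning sample: inputs X with known expected outputs T.
X = np.array([[0.0, 1.0], [1.0, 0.0], [1.0, 1.0]])
T = np.array([[1.0], [1.0], [0.0]])
W = train_least_squares(X, T)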
Delta Learning Rule
• Developed by Widrow and Hoff, the delta rule is one of the most common learning rules. It depends on supervised
learning. The rule states that the modification of the synaptic weight of a node is equal to the product of the error and the
input. In mathematical form the delta rule is as follows:

Δwij = r · ai · ej

• For a given input vector, the output vector is compared with the correct answer. If the difference is zero, no learning takes place;
otherwise, the network adjusts its weights to reduce this difference. The change in weight from ui to uj is Δwij = r · ai · ej, where r is the
learning rate, ai is the activation of ui, and ej is the difference between the expected output and the actual output
of uj. If the input patterns form a linearly independent set, the delta rule can learn arbitrary associations. It can be shown
that for networks with linear activation functions and no hidden units, the graph of the squared error versus the weights is a
paraboloid in n-space. Because this paraboloid is concave upward, it has a minimum value. The vertex of the paraboloid is the
point where the error is minimized, and the weight vector corresponding to this point is the ideal weight vector. We can use the
delta learning rule with a single output unit or with several output units. When applying the delta rule we assume that the error
can be measured directly. The aim of applying the delta rule is to reduce the difference between the actual and the expected
output, that is, the error.
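A minimal per-sample sketch of the update Δwij = r · ai · ej for a linear unit with several outputs (the names, shapes, and learning rate are illustrative):

import numpy as np

def delta_rule_step(W, a, expected, learning_rate=0.05):
    """One delta-rule update: dw[i, j] = r * a[i] * e[j], where e[j] is the
    expected output of unit j minus its actual (linear) output."""
    actual = a @ W                 # linear activations of the output units
    e = expected - actual          # error for each output unit
    W += learning_rate * np.outer(a, e)
    return W, e

W = np.zeros((3, 2))               # weights from 3 inputs to 2 output units
a = np.array([0.5, -1.0, 1.0])     # activations of the input units
t = np.array([1.0, 0.0])           # expected outputs
W, e = delta_rule_step(W, a, t)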
Correlation Learning Rule
• The correlation learning rule is based on a similar principle to the Hebbian learning rule. It assumes that weights between
neurons that respond in the same way should become more positive, while weights between neurons with opposite reactions
should become more negative. Contrary to the Hebbian rule, the correlation rule is a supervised learning rule: instead of the
actual response oj, the desired response dj is used in the weight-change calculation. In mathematical form the correlation
learning rule is as follows:

Δwij = r · xi · dj

• Here dj is the desired value of the output signal, xi is the corresponding input signal, and r is the learning rate. This training
algorithm usually starts with the initialization of the weights to zero. Since the desired response is assigned by the user, the
correlation learning rule is an example of supervised learning.
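A minimal sketch of this supervised, Hebbian-like update, assuming the form Δwij = r · xi · dj given above (array names and values are illustrative):

import numpy as np

def correlation_update(W, x, d, learning_rate=0.1):
    """Correlation rule: like the Hebbian update, but the desired
    response d is used in place of the actual response o."""
    return W + learning_rate * np.outer(x, d)

W = np.zeros((3, 2))               # weights start at zero
x = np.array([1.0, -1.0, 0.5])     # input signals
d = np.array([1.0, -1.0])          # desired output signals
W = correlation_update(W, x, d)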
Out Star Learning Rule
• We use the out star learning rule when we assume that the nodes or neurons in a network are arranged in a layer. Here the
weights connected to a certain node should become equal to the desired outputs of the neurons connected through those
weights. The out star rule produces the desired response t for a layer of n nodes. This type of learning is applied to all nodes
in a particular layer, and the weights for the nodes are updated as in Kohonen neural networks. In mathematical form, the
out star learning rule can be expressed as follows:

Δwk = α · (tk − wk)

where tk is the desired output of the kth node in the layer and α is the learning rate.
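A minimal sketch, assuming the update Δwk = α · (tk − wk) given above, which moves each outgoing weight toward the desired output of the node it feeds (the learning rate and values are illustrative):

import numpy as np

def out_star_update(w, t, learning_rate=0.2):
    """Out star rule: each weight drifts toward the desired response of the
    node it projects to (w_new = w_old + alpha * (t - w_old))."""
    return w + learning_rate * (t - w)

w = np.zeros(4)                       # weights fanning out to a layer of n = 4 nodes
t = np.array([1.0, 0.0, 1.0, 0.5])    # desired responses t for that layer
for _ in range(20):                   # repeated updates converge toward t
    w = out_star_update(w, t)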
The delta rule leads to the following weight-update steps in the training algorithm of a single-layer network, where the net
input is computed as yin = b + Σ(i = 1..n) xi wi; here ‘b’ is the bias and ‘n’ is the total number of input neurons.
Step 6 − Apply the activation function to the net input yin to obtain the final output y.
Step 7 − Adjust the weights and bias as follows:
Case 1 − if y ≠ t then
wi(new) = wi(old) + α(t − yin)xi
b(new) = b(old) + α(t − yin)
Case 2 − if y = t then
wi(new) = wi(old)
b(new) = b(old)
Here ‘y’ is the actual output and ‘t’ is the desired/target output, so (t − yin) is the computed error.
Step 8 − Test for the stopping condition, which is satisfied when there is no change in the weights or when the largest
weight change that occurred during training is smaller than the specified tolerance.
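A minimal sketch of steps 6–8 as a training loop, assuming a bipolar step activation (an assumption, since the original does not specify the function) and the update rules above; the data, learning rate, and tolerance are illustrative:

import numpy as np

def train(X, T, learning_rate=0.1, tolerance=1e-4, max_epochs=100):
    """Steps 6-8: compute y_in = b + sum_i x_i * w_i, apply a step activation,
    update w and b with alpha * (t - y_in), and stop once the largest weight
    change in an epoch falls below the tolerance."""
    w = np.zeros(X.shape[1])
    b = 0.0
    for _ in range(max_epochs):
        max_change = 0.0
        for x, t in zip(X, T):
            y_in = b + x @ w                   # net input
            y = 1.0 if y_in >= 0 else -1.0     # step activation (assumed)
            if y != t:                         # Case 1: adjust weights and bias
                delta = learning_rate * (t - y_in)
                w += delta * x
                b += delta
                max_change = max(max_change, float(np.max(np.abs(delta * x))))
        if max_change < tolerance:             # Step 8: stopping condition
            break
    return w, b

# Bipolar AND-style data as an illustrative learning sample.
X = np.array([[1, 1], [1, -1], [-1, 1], [-1, -1]], dtype=float)
T = np.array([1, -1, -1, -1], dtype=float)
w, b = train(X, T)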