
Neural Networks: Single Neurons (continued)

G. Extension of the Delta Rule: smooth f(z)

1. The delta rule is easily extended to cases where a step output function is not sufficient, i.e. if you want to model a real neuron more closely with a sigmoidal f(z).

2. Recall that for a given training vector, the output is
n " % y = f ( z ) = f $ wo + ! w jT j ' # & j =1

Now, for a non-step activation function, we define the error using the true output:
$E = \tfrac{1}{2}(y - t)^2$

3. Again, the direction of steepest decrease of $E$ is given by $-\dfrac{\partial E}{\partial w_i}$, so

$\Delta w_i = -\alpha \dfrac{\partial E}{\partial w_i}$

4. Differentiating,

$\dfrac{\partial E}{\partial w_i} = (y - t)\dfrac{\partial y}{\partial w_i} = (y - t)\dfrac{\partial f(z)}{\partial w_i} = (y - t)\, f'(z)\,\dfrac{\partial z}{\partial w_i} = (y - t)\, f'(z)\, T_i$


where $f'(z)$ is the derivative of $f(z)$ with respect to $z$. Hence, the weights are modified by

! wi = "# ( y " t ) f $ (z)Ti = # (t " y ) f $( z)Ti


The main differences from the original delta rule are the presence of $y$ (rather than $z$) and the factor of $f'(z)$. The same equation can be used for updating the bias weight, but the factor of $T_i$ is replaced by 1.

5. Note that the step function is no longer a possibility for $f(z)$, since its derivative is either 0 or undefined at the step (explaining why $z$, rather than $y$, was used in the original delta rule error function). The function $f$ must now be differentiable, like the sigmoid functions described earlier. Here are some typical examples:

Binary sigmoid: $f(z) = \dfrac{1}{1 + e^{-\sigma z}}$ (asymptotes are $f(z) = 0$ and $f(z) = 1$). For this case $f'(z) = \sigma f(z)[1 - f(z)]$, so the derivative is easily calculable from $f(z)$ itself.

Bipolar sigmoid: $f(z) = \dfrac{2}{1 + e^{-\sigma z}} - 1$ (asymptotes are $f(z) = -1$ and $f(z) = 1$). For this case $f'(z) = \tfrac{\sigma}{2}[1 + f(z)][1 - f(z)]$, so the derivative is again easily calculable.

Hyperbolic tangent: $f(z) = \tanh(z)$ (asymptotes are $f(z) = -1$ and $f(z) = 1$). The derivative is $f'(z) = \operatorname{sech}^2(z) = 1 - f^2(z)$.
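To make the update concrete, here is a minimal Python sketch of one delta-rule step for a single neuron with a smooth activation function. The names (sigmoid, delta_rule_step, the learning rate alpha) are illustrative rather than taken from the notes, and the slope parameter is fixed at $\sigma = 1$.

```python
import numpy as np

def sigmoid(z):
    """Binary sigmoid f(z) = 1 / (1 + exp(-z))."""
    return 1.0 / (1.0 + np.exp(-z))

def sigmoid_prime(z):
    """Derivative f'(z) = f(z) * (1 - f(z)), computed from f(z) itself."""
    f = sigmoid(z)
    return f * (1.0 - f)

def delta_rule_step(w, w0, T, t, alpha=0.1):
    """One delta-rule update for a single neuron with smooth f(z).

    w  : weight vector (length n)
    w0 : bias weight
    T  : training (input) vector (length n)
    t  : target output
    """
    z = w0 + np.dot(w, T)          # activation z = w0 + sum_j w_j T_j
    y = sigmoid(z)                 # smooth output y = f(z)
    # Delta w_i = alpha * (t - y) * f'(z) * T_i ; the bias uses T_i = 1
    w_new = w + alpha * (t - y) * sigmoid_prime(z) * T
    w0_new = w0 + alpha * (t - y) * sigmoid_prime(z)
    return w_new, w0_new

# Example: one update toward target t = 1
w, w0 = np.zeros(2), 0.0
T, t = np.array([1.0, -1.0]), 1.0
w, w0 = delta_rule_step(w, w0, T, t)
```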

2. Multiple Neuron Networks


I. Madalines (Multiple Adalines)

A. A Single Layer of Adalines

1. Let there be n inputs into m output neurons. Assume each input is connected to each output unit, so we'll have an $n \times m$ array of weights $w_{ij}$, $i = 1, \ldots, n$; $j = 1, \ldots, m$. Then the outputs are given by
n " % y j = f ( z j ) = f $ woj + ! Tk wkj ' # & k =1

Here's an example with n = 2 and m = 2:

[Figure: two inputs $x_1, x_2$ feed two output units $y_1, y_2$ through weights $w_{11}, w_{12}, w_{21}, w_{22}$; bias inputs $b_1, b_2$ supply the bias weights $w_{01}, w_{02}$.]
2. The error function to be minimized should now include all the outputs. For a step-function activation function:

$E = \tfrac{1}{2} \sum_{k=1}^{m} (z_k - t_k)^2$

The derivation of the weight changes is basically the same as for a single neuron, since

m m "E 1 " m "z " n 2 =2 $ ( zk # tk ) = $ (zk # t k ) k = $ (zk # t k ) $ Tl w lk "wij "wij k =1 "wij k =1 "wij l =1 k =1 m k =1 n l =1

= $ ( zk # t k ) $ Tl

m n " w lk = $ ( zk # t k ) $ Tl%li%kj = ( z j # t j )Ti "wij k =1 l =1

so the weights are modified by


$\Delta w_{ij} = -\alpha (z_j - t_j)\, T_i = \alpha (t_j - z_j)\, T_i$

The same equation holds for updating the bias weights, if we take $i = 0$ and $T_0 = 1$.
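A minimal Python sketch of this single-layer update, applied one training vector at a time, is given below; the names layer_update and alpha are assumptions. This is the step-function form, which compares $z_j$ with $t_j$; the smooth form in item 3 below would use $y_j$ and an extra factor of $f'(z_j)$.

```python
import numpy as np

def layer_update(T, t, W, w0, alpha=0.05):
    """One delta-rule update for a single layer of Adalines (step-function form).

    T  : input vector, shape (n,)
    t  : target vector, shape (m,)
    W  : weights, shape (n, m); w0 : bias weights, shape (m,)
    Update: Delta w_ij = alpha * (t_j - z_j) * T_i, with T_0 = 1 for the biases.
    """
    z = w0 + T @ W                        # activations z_j
    W_new = W + alpha * np.outer(T, t - z)
    w0_new = w0 + alpha * (t - z)         # bias update uses T_i = 1
    return W_new, w0_new
```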
3. For smooth activation functions, the error function to be minimized is based on the outputs:

$E = \tfrac{1}{2} \sum_{k=1}^{m} (y_k - t_k)^2$

A similar calculation to that above yields


! wi j = "# y j " t j f $(z j )Ti = # t j " y j f $(z j )Ti

Again, the same equation holds for updating the bias weights, if we take $i = 0$ and $T_0 = 1$.

B. Madaline Networks with one hidden layer and one output layer

1. Begin with the simple case of a single output neuron (m = 1). Let there be n inputs and l hidden neurons. We assume each input is connected to each hidden unit, and the outputs of the hidden units are the inputs to the output unit. Thus, we'll have an $n \times l$ array of input-hidden weights $w_{ij}$, $i = 1, \ldots, n$; $j = 1, \ldots, l$, and $l$ hidden-output weights $v_j$, plus bias weights for each neuron. Here's an example with n = l = 2:

[Figure: a Madaline with two inputs $x_1, x_2$, two hidden units $H_1, H_2$ (outputs $h_1, h_2$), input-hidden weights $w_{11}, w_{12}, w_{21}, w_{22}$ with bias weights $w_{01}, w_{02}$, and a single output unit $Y$ with hidden-output weights $v_1, v_2$ and bias $v_0$.]
The intermediate neurons, labeled with upper-case H's in the figure, are often called hidden units since they're not visible at input or output, but only play an internal processing role. Nonetheless, these are what make it possible to solve non-linearly-separable problems and get around Minsky and Papert's theorem. The hidden unit activations $z_j$ and outputs $h_j$ are given by
$z_j = w_{0j} + \sum_{q=1}^{n} w_{qj} T_q \quad$ and $\quad h_j = f(z_j)$

The output neuron satisfies equations similar to the single-neuron case we studied in chapter 1. The output unit activation $g$ and output $y$ are given by
$g = v_0 + \sum_{p=1}^{l} v_p h_p \quad$ and $\quad y = f(g)$
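Here is a brief Python sketch of this forward pass, under the same assumptions as the earlier sketches (the names madaline_forward, W, w0, v, v0 are illustrative); it uses a bipolar step activation, as in the original Madaline.

```python
import numpy as np

def madaline_forward(T, W, w0, v, v0,
                     f=lambda z: np.where(z >= 0, 1.0, -1.0)):
    """Forward pass through a Madaline with one hidden layer and one output unit.

    T : input vector, shape (n,)
    W : input-hidden weights, shape (n, l); w0 : hidden biases, shape (l,)
    v : hidden-output weights, shape (l,); v0 : output bias (scalar)
    f : activation function (bipolar step by default)
    """
    z = w0 + T @ W        # hidden activations z_j
    h = f(z)              # hidden outputs h_j = f(z_j)
    g = v0 + v @ h        # output unit activation
    y = f(g)              # overall output
    return z, h, g, y
```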

2. In the original form of Madaline, the output unit had fixed weights $v_0, v_1, v_2$ (usually implementing a "majority rules" vote, or an OR for 2 inputs). Hence, these weights did not need to be trained. In addition, the activation function f was taken to be a step function.

3. The original delta rule for weight update can be generalized to this single-hidden-layer case, using the hidden unit activities:

"w ij = # ( t $ z j )Ti

where the activities $z_j$ are now even further removed from the target output $t$. Nonetheless, this method can succeed if the parameters and algorithm are chosen carefully; getting it to work requires some experimentation, and there may be different best methods for different problems.

4. Before the backpropagation learning rule was devised, there were many attempts to improve learning using the delta rule. One variation that can be reasonably efficient is the following algorithm: instead of updating all the weights at each iteration, try to short-circuit the process like so (a code sketch follows the outline):

Epoch loop: while the stopping criterion is false, do the following:

   Training vector loop: for each training vector $(T_1, \ldots, T_n)$:

      Compute $z_j$ and $h_j$ for each hidden unit, the output activation $g$, and the overall output $y$, then update weights as follows:

      1. If $y = t$, no update is performed.

      2. If $y \ne t$ and $t = 1$, then update weights only for the hidden unit $H_c$ whose input sum is closest to zero:

         $\Delta w_{ic} = \alpha (1 - z_c)\, T_i$

      3. If $y \ne t$ and $t = -1$, then update weights for all hidden units $H_s$ whose input sums are positive:

         $\Delta w_{is} = \alpha (-1 - z_s)\, T_i$, for all $s$ such that $z_s > 0$

   End the training vector loop when all training vectors have been used.

Check the stopping condition, using the updated weights, after each epoch. If the stopping criterion is satisfied then terminate; otherwise do a new epoch.

This rather ad hoc method was no doubt the product of some experimentation as well as theory, and is typical of the efforts to make the delta rule work for complex networks in the era before backpropagation. In fact, one of the problems in the 1970's slow period of research was that there was no uniformly good method of optimally modifying the weights, especially for multi-layer Madalines. It became a bit of an art to find rules that would converge for a given problem in a reasonable amount of time. Although many other rules were suggested, we will not cover most of them, since the delta rule leads directly into the backpropagation method, which is quite general.
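As a rough illustration only, here is a Python sketch of one epoch of this update scheme, using a bipolar step activation and fixed hidden-output weights; the names (madaline_epoch, alpha) and the error count used as a stopping aid are assumptions, and as noted above, real uses of this rule involved considerable experimentation.

```python
import numpy as np

def step(z):
    """Bipolar step activation: +1 if z >= 0, else -1."""
    return np.where(z >= 0, 1.0, -1.0)

def madaline_epoch(X, targets, W, w0, v, v0, alpha=0.05):
    """One epoch of the ad hoc Madaline update described above.

    X       : training vectors, shape (P, n); targets : shape (P,), values +/- 1
    W, w0   : input-hidden weights (n, l) and hidden biases (l,)
    v, v0   : fixed hidden-output weights (l,) and bias (scalar)
    Returns updated (W, w0) and the number of misclassified vectors.
    """
    errors = 0
    for T, t in zip(X, targets):
        z = w0 + T @ W                 # hidden activations z_j
        h = step(z)                    # hidden outputs h_j
        y = step(v0 + v @ h)           # overall output
        if y == t:
            continue                   # case 1: correct output, no update
        errors += 1
        if t == 1:
            # case 2: update only the hidden unit whose input sum is closest to zero
            c = np.argmin(np.abs(z))
            W[:, c] += alpha * (1.0 - z[c]) * T
            w0[c] += alpha * (1.0 - z[c])
        else:
            # case 3: update all hidden units with positive input sums
            for s in np.where(z > 0)[0]:
                W[:, s] += alpha * (-1.0 - z[s]) * T
                w0[s] += alpha * (-1.0 - z[s])
    return W, w0, errors
```

An outer loop would repeat madaline_epoch until errors reaches 0 or a maximum number of epochs is exceeded, playing the role of the stopping criterion in the outline above.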
