
Lecture 3: Delta Rule

Kevin Swingler
[email protected]

Mathematical Preliminaries: Vector Notation

Vectors appear in lowercase bold font


e.g. input vector: x = [x0 x1 x2 … xn]

Dot product of two vectors:

w · x = w0 x0 + w1 x1 + … + wn xn = ∑_{i=0}^{n} wi xi

E.g.: x = [1,2,3], y = [4,5,6], so x · y = (1×4) + (2×5) + (3×6) = 4 + 10 + 18 = 32
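A minimal sketch of this computation in Python (plain lists, no libraries; the function name dot is illustrative only):

```python
# Dot product: sum of the element-wise products of two equal-length vectors.
def dot(w, x):
    return sum(wi * xi for wi, xi in zip(w, x))

x = [1, 2, 3]
y = [4, 5, 6]
print(dot(x, y))  # (1*4) + (2*5) + (3*6) = 32
```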

Review of the McCulloch-Pitts/Perceptron Model

[Diagram: perceptron unit j with inputs I1, I2, I3, …, In, weights Wj1 … Wjn, summing/activation node Aj, and output Yj]

Neuron sums its weighted inputs:

w0 x0 + w1 x1 + … + wn xn = ∑_{i=0}^{n} wi xi = w · x

Neuron applies threshold activation function:

y = f(w · x), where, e.g., f(w · x) = +1 if w · x > 0 and f(w · x) = −1 if w · x ≤ 0
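A rough sketch of one such unit in Python, assuming the ±1 threshold shown above (the names perceptron_output, w and x are illustrative, not from the lecture):

```python
# Weighted sum followed by a hard threshold, as in the McCulloch-Pitts/Perceptron unit.
def perceptron_output(w, x):
    s = sum(wi * xi for wi, xi in zip(w, x))  # s = w . x
    return 1 if s > 0 else -1                 # threshold activation

# Example: w includes a bias weight w0 paired with a fixed input x0 = 1.
w = [-0.5, 1.0, 1.0]
x = [1, 0.2, 0.6]
print(perceptron_output(w, x))  # w . x = 0.3 > 0, so the output is +1
```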

Review of Geometrical Interpretation
[Figure: input space with axes x1 and x2; the line w · x = 0 separates the region where Y = +1 from the region where Y = −1]

The neuron defines two regions in input space, in which it outputs −1 and +1 respectively. The regions are separated by the hyperplane w · x = 0 (i.e. the decision boundary).
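For a neuron with two real inputs and a bias, the boundary is a straight line; a small illustrative check (the weights here are arbitrary, chosen only to show the idea):

```python
# With w = [w0, w1, w2] and x = [1, x1, x2], the boundary w . x = 0 is the
# line w0 + w1*x1 + w2*x2 = 0, i.e. x2 = -(w0 + w1*x1) / w2.
w = [-1.0, 1.0, 1.0]  # arbitrary example weights; w0 = -1 is the bias weight

def side(x1, x2):
    s = w[0] + w[1] * x1 + w[2] * x2
    return 1 if s > 0 else -1

print(side(1.0, 1.0))   # +1: the point lies above the line x2 = 1 - x1
print(side(0.0, 0.0))   # -1: the point lies below the line
```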

Review of Supervised Learning
[Diagram: a Generator produces inputs x; a Supervisor provides the target output ytarget; the Learning Machine produces the output y]

Training: Learn from training pairs (x, ytarget)


Testing: Given x, output a value y close to the supervisor’s output ytarget

Learning by Error Minimization

The Perceptron Learning Rule is an algorithm for adjusting the network weights w to minimize the difference between the actual and the desired outputs.

We can define a Cost Function to quantify this difference:

E(w) = ½ ∑p ∑j (ydesired − y)², summed over the training patterns p and the output units j

Intuition:
• Square makes error positive and penalises large errors more
• ½ just makes the math easier
• Need to change the weights to minimize the error – How?
• Use principle of Gradient Descent
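As a sketch of how the Cost Function above could be evaluated over a training set (the names are illustrative; each target and output is a list over the output units j):

```python
# E(w) = 1/2 * sum over patterns p and output units j of (ydesired - y)^2
def cost(targets, outputs):
    total = 0.0
    for y_desired, y in zip(targets, outputs):   # loop over training patterns p
        for t_j, y_j in zip(y_desired, y):       # loop over output units j
            total += (t_j - y_j) ** 2
    return 0.5 * total

targets = [[1.0], [-1.0]]
outputs = [[0.5], [-1.0]]
print(cost(targets, outputs))  # 0.5 * (0.5**2 + 0.0**2) = 0.125
```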

Principle of Gradient Descent

Gradient descent is an optimization algorithm that approaches a local minimum of a function by taking steps proportional to the negative of the gradient of the function at the current point.

So, calculate the derivative (gradient) of the Cost Function with respect to the weights, and then change each weight by a small increment in the negative (opposite) direction to the gradient:

∂E/∂w = (∂E/∂y) · (∂y/∂w) = −(ydesired − y) x = −δ x

To reduce E by gradient descent, move/increment the weights in the negative direction to the gradient: −(−δ x) = +δ x
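A sketch of this gradient for a single pattern, assuming a linear output y = w · x so that ∂y/∂w = x (the derivation above requires a differentiable output; the names are illustrative):

```python
# For y = w . x:  dE/dw = -(ydesired - y) * x = -delta * x,
# so the descent direction is +delta * x.
def gradient(w, x, y_desired):
    y = sum(wi * xi for wi, xi in zip(w, x))   # linear output y = w . x
    delta = y_desired - y
    return [-delta * xi for xi in x]           # dE/dw for this pattern

w = [0.0, 0.0]
x = [1.0, 2.0]
print(gradient(w, x, 1.0))  # delta = 1, so dE/dw = [-1.0, -2.0]
```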

Graphical Representation of Gradient Descent

Widrow-Hoff Learning Rule
(Delta Rule)

∆w = w − wold = −η ∂E/∂w = +η δ x,  or  w = wold + η δ x

where δ = ytarget − y and η is a constant that controls the learning rate (the amount of increment/update ∆w at each training step).
Note: the Delta Rule (DR) is similar to the Perceptron Learning Rule (PLR), with some differences:
1. The error δ in the DR is not restricted to the values 0, 1, or −1 (as in the PLR), but may take any value
2. The DR can be derived for any differentiable output/activation function f, whereas the PLR only works for a threshold output function
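A minimal sketch of one Widrow-Hoff update, w = wold + η δ x, applied to the rule above (the learning rate and all names are illustrative; a linear output is assumed when computing y):

```python
# One Delta Rule step: delta = ytarget - y, then w = w_old + eta * delta * x.
def delta_rule_step(w, x, y_target, y, eta=0.1):
    delta = y_target - y
    return [wi + eta * delta * xi for wi, xi in zip(w, x)]

w = [0.2, -0.1]
x = [1.0, 0.5]
y = sum(wi * xi for wi, xi in zip(w, x))       # current output (linear unit assumed)
w = delta_rule_step(w, x, y_target=1.0, y=y)   # move the weights by +eta*delta*x
print(w)                                       # approximately [0.285, -0.0575]
```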

Convergence of PLR/DR

The weight changes ∆wij need to be applied repeatedly for each weight wij in
the network and for each training pattern in the training set.

One pass through all the weights for the whole training set is called an epoch
of training.

After many epochs, the network outputs match the targets for all the training
patterns, all the ∆wij are zero and the training process ceases. We then say
that the training process has converged to a solution.

It has been shown that if a set of weights exists that solves the problem correctly, then the Perceptron Learning Rule/Delta Rule (PLR/DR) will find it in a finite number of iterations.

Furthermore, if the problem is linearly separable, then the PLR/DR will find, in a finite number of iterations, a set of weights that solves the problem correctly.
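A sketch of this repeated application over epochs, here for a ±1 threshold unit on the logical OR problem, which is linearly separable (the data encoding, learning rate, and epoch limit are all illustrative choices):

```python
# Train a threshold unit with the PLR/DR update until an epoch makes no change.
def train(patterns, eta=0.1, max_epochs=100):
    w = [0.0, 0.0, 0.0]                        # weights; w[0] is the bias weight
    for epoch in range(max_epochs):
        changed = False
        for x, y_target in patterns:           # one epoch = one pass over the set
            y = 1 if sum(wi * xi for wi, xi in zip(w, x)) > 0 else -1
            delta = y_target - y
            if delta != 0:
                w = [wi + eta * delta * xi for wi, xi in zip(w, x)]
                changed = True
        if not changed:                        # all the delta-w are zero: converged
            return w, epoch
    return w, max_epochs

# Logical OR with inputs/outputs in {-1, +1}; x[0] = 1 is the fixed bias input.
data = [([1, -1, -1], -1), ([1, -1, 1], 1), ([1, 1, -1], 1), ([1, 1, 1], 1)]
print(train(data))  # converges to a separating weight vector within a few epochs
```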
