0% found this document useful (0 votes)

179 views52 pages

P1 - Single Layer Feed Forward Networks

The document discusses a lecture on single layer perceptron classifiers, including an overview of what a perceptron and single layer perceptron are, the limitations of a single perceptron, and training and classification using the discrete perceptron algorithm, which iteratively adjusts the perceptron weights based on whether the current training pattern is correctly or incorrectly classified.

Uploaded by

Yashaswini

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

179 views52 pages

P1 - Single Layer Feed Forward Networks

Uploaded by

Yashaswini

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 52

CS407 Neural Computation

Lecture 4:
Single Layer Perceptron (SLP)
Classifiers

Lecturer: A/Prof. M. Bennamoun

Outline
What’s a SLP and what’s classification?
Limitation of a single perceptron.
Foundations of classification and Bayes Decision making theory
Discriminant functions, linear machine and minimum distance
classification
Training and classification using the Discrete perceptron
Single-Layer Continuous perceptron Networks for linearly
separable classifications
Appendix A: Unconstrained optimization techniques
Appendix B: Perceptron Convergence proof
Suggested reading and references
What is a perceptron and what is
a Single Layer Perceptron (SLP)?
Perceptron
The simplest form of a neural network
consists of a single neuron with adjustable
synaptic weights and bias
performs pattern classification with only two
classes
perceptron convergence theorem :
– Patterns (vectors) are drawn from two
linearly separable classes
– During training, the perceptron algorithm
converges and positions the decision
surface in the form of hyperplane between
two classes by adjusting synaptic weights
What is a perceptron?
m

Bias v = ∑w x +b
k
j =1
kj j k
bk
x1 wk1 Activation

x2 wk2
function
y = ϕ (v )
k k

vk Output
Σ ϕ(.) yk
...

...

xm wkm Summing
junction Discrete Perceptron:
Input Synaptic
weights
ϕ (⋅) = sign (⋅)
signal

Continous Perceptron:
ϕ (⋅) = S − shape
Activation Function of a perceptron

+1
+1

vi vi

-1

Signum Function
(sign) Continous Perceptron:
Discrete Perceptron: ϕ (v) = s − shape
ϕ (⋅) = sign (⋅)
SLP Architecture
Single layer perceptron

Input layer Output layer

Where are we heading? Different
Non-Linearly Separable Problems
http://www.zsolutions.com/light.htm

Types of Exclusive-OR Classes with Most General

Structure Decision Regions Problem Meshed regionsRegion Shapes

Single-Layer Half Plane A B

Bounded By B
Hyperplane A
B A

Two-Layer Convex Open A B

Or B
Closed Regions A
B A

Three-Layer Arbitrary
(Complexity A B
Limited by No. B
A
of Nodes) B A
Review from last lectures:
Implementing Logic Gates with
Perceptrons http://www.cs.bham.ac.uk/~jxb/NN/l3.pdf

We can use the perceptron to implement the basic logic gates (AND, OR
and NOT).
All we need to do is find the appropriate connection weights and neuron
thresholds to produce the right outputs for each set of inputs.
We saw how we can construct simple networks that perform NOT, AND,
and OR.
It is then a well known result from logic that we can construct any logical
function from these three operations.
The resulting networks, however, will usually have a much more complex
architecture than a simple Perceptron.
We generally want to avoid decomposing complex problems into simple
logic gates, by finding the weights and thresholds that work directly in a
Perceptron architecture.
Implementation of Logical NOT, AND, and OR
In each case we have inputs ini and outputs out, and need to determine
the weights and thresholds. It is easy to find solutions by inspection:
The Need to Find Weights Analytically
Constructing simple networks by hand is one thing. But what about
harder problems? For example, what about:

How long do we keep looking for a solution? We need to be able to

calculate appropriate parameters rather than looking for solutions by trial
and error.
Each training pattern produces a linear inequality for the output in terms
of the inputs and the network parameters. These can be used to compute
the weights and thresholds.
Finding Weights Analytically for the AND Network

We have two weights w1 and w2 and the threshold θ, and for each
training pattern we need to satisfy

So the training data lead to four inequalities:

It is easy to see that there are an infinite number of solutions. Similarly,

there are an infinite number of solutions for the NOT and OR networks.
Limitations of Simple Perceptrons

We can follow the same procedure for the XOR network:

Clearly the second and third inequalities are incompatible with the
fourth, so there is in fact no solution. We need more complex networks,
e.g. that combine together many simple networks, or use different
activation/thresholding/transfer functions.
It then becomes much more difficult to determine all the weights and
thresholds by hand.
These weights instead are adapted using learning rules. Hence, need to
consider learning rules (see previous lecture), and more complex
architectures.
E.g. Decision Surface of a Perceptron
x2
x2
+
+ + -
+ -
- x1
x1
+ - +
-
-
Linearly separable Non-Linearly separable

• Perceptron is able to represent some useful functions

• But functions that are not linearly separable (e.g. XOR)
are not representable
The Discrete Perceptron
Discrete Perceptron Training Algorithm
• So far, we have shown that coefficients of linear
discriminant functions called weights can be
determined based on a priori information about sets of
patterns and their class membership.
•In what follows, we will begin to examine neural
network classifiers that derive their weights during the
learning cycle.
•The sample pattern vectors x1, x2, …, xp, called the
training sequence, are presented to the machine along
with the correct response.
Discrete Perceptron Training Algorithm
- Geometrical Representations http://140.122.185.120
Zurada, Chapter 3

(Intersects the origin

point w=0)
5 prototype patterns in this case: y1, y2, …y5
If dim of augmented pattern vector is > 3, our power of visualization are no longer of assistance. In this case,
the only recourse is to use the analytical approach.
Discrete Perceptron Training Algorithm
- Geometrical Representations…
•Devise an analytic approach based on the geometrical
representations
– E.g. the decision surface for the training pattern y1
( )
∇ w w t y1 = y1 Gradient
(the direction of
y1 in Class 1 If y1 in steepest increase)
(see previous slide) Class 1:
Weight w ′ = w1 + cy1
Space
If y1 in c controls the
Class 2: size of adjustment

y1 in Class 2 w ′ = w1 − cy1
c (>0) is the correction
Weight increment (is two times the
Space learning constant ρ
introduced before)
(correction in negative gradient direction)
Discrete Perceptron Training Algorithm
- Geometrical Representations…
Discrete Perceptron Training Algorithm
- Geometrical Representations…

w1t y
cy = y =p
yt y

Note 1: p=distance so >0

Note 2: c is not constant and depends on the current training pattern as expressed by eq. Above.
Discrete Perceptron Training Algorithm
- Geometrical Representations…
•For fixed correction rule: c=constant, the correction of
weights is always the same fixed portion of the current
training vector
– The weight can be initialised at any value

•For dynamic correction rule: c depends on the distance

from the weight (i.e. the weight vector) to the decision
surface in the weight space. Hence
Current weight Current input
pattern

– The initial weight should be different from 0.

(if w1=0, then cy =0 and w’=w1+cy=0, therefore no possible adjustments).
Discrete Perceptron Training Algorithm
- Geometrical Representations…
•Dynamic correction rule: Using the value of c from previous slide as
a reference, we devise an adjustment technique which depends on
the length w2-w1 λ=2: Symmetrical reflection w.r.t decision plane

λ=0: No weight adjustment

Νote: λ is the ratio of the distance

between the old weight vector w1
and the new w2, to the distance
from w1 to the pattern hyperplane
Discrete Perceptron Training Algorithm
- Geometrical Representations…
•Example:

x1 = 1, x3 = 3, d1 = d 3 = 1 : class 1
x2 = −0.5, x4 = −2, d 2 = d 4 = −1 : class 2
•The augmented input vectors are:

1 − 0.5 3  − 2

y1 =  , y 2 =   , y3 =   y4 =  
1  1  1 1
•The decision lines wtyi=0, for i=1, 2, 3, 4 are sketched
on the augmented weight space as follows:
Discrete Perceptron Training Algorithm
- Geometrical Representations…
Discrete Perceptron Training Algorithm
- Geometrical Representations…
For c = 1 and w1 = [− 2.5 1.75]
t

•Using w ' = w ± cy the weight training with each step can

be summarized as follows:
c
∆w = [d k − sgn(w kt y k )]y k
k

2
•We obtain the following outputs and weight updates:
•Step 1: Pattern y1 is input
 1 
o1 = sgn [− 2.5 1.75]    = −1
 1 
d1 − o1 = 2
− 1.5
w =w +y = 
2 1 1

 2.75 
Discrete Perceptron Training Algorithm
- Geometrical Representations…
•Step 2: Pattern y2 is input
 − 0.5 

o2 = sgn  [− 1.5 2.75]    =1

  1 
d 2 − o2 = −2
 −1 
w = w −y = 
3 2 2

1.75
•Step 3: Pattern y3 is input
 3 
o3 = sgn [− 1 1.75]    = −1
 1 
d 3 − o3 = 2
 2 
w =w +y = 
4 3 3

 2.75
Discrete Perceptron Training Algorithm
- Geometrical Representations…
• Since we have no evidence of correct classification of
weight w4 the training set consisting of an ordered
sequence of patterns y1 ,y2 and y3 needs to be recycled.
We thus have y4= y1 , y5= y2, etc (the superscript is used
to denote the following training step number).
•Step 4, 5: w6 = w5 = w4 (no misclassification, thus no
weight adjustments).
•You can check that the adjustment following in steps 6
through 10 are as follows:
w 7 = [2.5 1.75]
t

w10 = w 9 = w 8 = w 7
w11 = [3 0.75]
t

w11 is in solution area.

The Continuous Perceptron
Continuous Perceptron Training Algorithm
http://140.122.185.120
Zurada, Chapter 3

•Replace the TLU (Threshold Logic Unit) with the

sigmoid activation function for two reasons:
– Gain finer control over the training procedure
– Facilitate the differential characteristics to enable
computation of the error gradient

(of current
error function)

The factor ½ does not affect the location of

the error minimum
Continuous Perceptron Training Algorithm…

•The new weights is obtained by moving in the direction

of the negative gradient along the multidimensional error
surface

By definition of the steepest descent concept,

each elementary move should be
perpendicular to the current error contour.
Continuous Perceptron Training Algorithm…
•Define the error as the squared difference between the
desired output and the actual output

Training rule of
continous perceptron
∂ (net ) (equivalent to delta
Since net = w t y, we have = yi i = 1,2,..., n + 1 training rule)
∂wi
Continuous Perceptron Training Algorithm…
Continuous Perceptron Training Algorithm…
Same as previous example (of discrete perceptron) but with a
continuous activation function and using the delta rule.

Same training pattern set as

discrete perceptron example
Continuous Perceptron Training Algorithm…
2
1  2 
E k = d k −  − 1
1 + exp(−λ net )  
k
2

2
1  2 
E1 (w ) = 1 −  − 1 
2  1 + exp[− λ ( w1 + w2 )]  

λ = 1 and reducing the terms simplifies this expression to the following form
2
E1 (w ) =
[1 + exp(w1 + w2 )]2
similarly
2
E2 ( w ) =
[1 + exp(0.5w1 − w2 )]2
2 2
E3 (w ) = E4 ( w ) =
[1 + exp(3w1 + w2 )]2 [1 + exp(2w1 − w2 )]2
These error surfaces are as shown on the previous slide.
Continuous Perceptron Training Algorithm…

minimum
Mutlicategory SLP
Multi-category Single layer Perceptron nets
•Treat the last fixed component of input pattern vector as
the neuron activation threshold…. T=wn+1

yn+1= -1 (irrelevant wheter it

is equal to +1 or –1)
Multi-category Single layer Perceptron nets…
• R-category linear classifier using R discrete bipolar
perceptrons
– Goal: The i-th TLU response of +1 is indicative of
class i and all other TLU respond with -1
Multi-category Single layer Perceptron nets…
•Example 3.5

Indecision regions = regions

should be where no class membership of
(-1, - 1, 1) t an input pattern can be
uniquely determined based on
the response of the classifier
(patterns in shaded areas are
not assigned any reasonable
classification. E.g. point Q for
which o=[1 1 –1]t => indecisive
response). However no
patterns such as Q have been
used for training in the
example.
Multi-category Single layer Perceptron nets…
For c = 1 and w11 = [1 − 2 0] w12 = [0 − 1 2] and w13 = [1 3 − 1]
t t t

•Step 1: Pattern y1 is input

 10  
   
sgn [1 − 2 0] 2   = 1 Since the
  only w12 = w11
  −
 1 incorrect
response is w 22 = w12
 10   provided
    by TLU3,  1  10  − 9
sgn [0 − 1 2] 2   = −1
w 32 =  3  −  2  =  1 
we have
  − 1 
   − 1 − 1  0 
 10  
   
sgn [1 3 − 1] 2   = 1*
  − 1 
  
Multi-category Single layer Perceptron nets…
•Step 2: Pattern y2 is input

  2 
   
sgn [1 − 2 0]− 5  = 1*
  − 1  
   1  2  − 1
  2  w13 = 2 − − 5 =  3 
   
sgn [0 − 1 2]− 5  = 1 0  − 1  1 
  − 1  
   w 32 = w 22
  2  w 33 = w 32
   
sgn [− 9 1 0]− 5  = −1
  − 1  
  
Multi-category Single layer Perceptron nets…
•Step 3: Pattern y3 is input 4 One can
w14 = − 2
( )
verify that
sgn w13t y 3 = 1* the only
 2 
sgn (w y ) = −1
adjusted
3t
weights
2 3
w 42 = w 32
sgn (w y ) = 1
from now
3t
on are those
3 3
w 34 = w 33 of TLU1

• During the second cycle:

w15 = w14
 2 w18 = w17
w16 = 3 5
3 w19 = 3
7 5
w17 = − 2
 4 
Multi-category Single layer Perceptron nets…
•R-category linear classifier using R continuous bipolar
perceptrons
APPENDIX B

Perceptron Convergence Proof

Perceptron Convergence Proof Haykin, Chapter 3

Consider the following perceptron:

m
v(n) = ∑ wi (n) xi (n)
i =0

= w T ( n) x( n)

w T x > 0 for every input vector x belonging to class C1

w T x ≤ 0 for every input vector x belonging to class C 2
Perceptron Convergence Proof…
The algorithm for the weight adjustment for the
perceptron
– if x(n) is correctly classified no adjustments to w
w (n + 1) = w (n) if w T x(n) ≤ 0 and x(n) belongs to class C 2

w (n + 1) = w (n) if w T x(n) > 0 and x(n) belongs to class C1

– otherwise

w(n + 1) = w(n) − η (n)x(n) if wT x(n) > 0 and x(n) belongs to class C2

w(n + 1) = w(n) + η (n)x(n) if wT x(n) ≤ 0 and x(n) belongs to class C1

– learning rate parameter η (n) controls adjustment

applied to weight vector
Perceptron Convergence Proof
For η (n) = 1 and w (0) = 0
Suppose the perceptron incorrectly classifies the vectors
x(1), x(2),... such that

wT x(n) ≤ 0 so that : w(n + 1) = w(n) + η (n)x(n)

But sinceη = 1 ⇒
w(n + 1) = w(n) + x(n) for x(n) belonging to C1
Since w(0) = 0, iteratively we find w(n + 1)
w(n + 1) = x(1) + x(2) + ... + x(n) (B1)
Since the classes C1 and C2 are assumed to be linearly
separable, there exists a solution w0 for which wTx(n)>0 for
the vectors x(1), …x(n) belonging to the subset H1(subset of
training vectors that belong to class C1).
Perceptron Convergence Proof
For a fixed solution w0, we may then define a positive number
α as
α = min w x(n) T
0 ( B 2)
x ( n )∈H1

Hence equation (B1) above implies

w T0 w(n + 1) = w T0 x(1) + w T0 x(2) + ... + w T0 x(n)
Using equation B2 above, (since each term is greater or equal
than α), we have T
w 0 w(n + 1) ≥ nα
Now we use the Cauchy-Schwartz inequality:
2 2
(a.b) ≤ a b
2
or
2 (a.b) 2 2
a ≥ 2
for b ≠ 0
b
Perceptron Convergence Proof
This implies that:
2 n 2α 2
w(n + 1) ≥ 2
( B3)
w0
Now let’s follow another development route (notice index k)
w(k + 1) = w(k ) + x(k ) for k = 1, ..., n and x(k) ∈ H1
By taking the squared Euclidean norm of both sides, we get:
2 2 2
w(k + 1) = w(k ) + x(k ) + 2wT (k )x(k )
But under the assumption the the perceptron incorrectly
classifies an input vector x(k) belonging to the subset H1, we
have wT (k )x(k ) < 0 and hence :
2 2 2
w(k + 1) ≤ w(k ) + x(k )
Perceptron Convergence Proof
Or equivalently,
2 2 2
w(k + 1) − w(k ) ≤ x(k ) ; k = 1,...n

Adding these inequalities for k=1,…n, and invoking the initial

condition w(0)=0, we get the following inequality:
n
w(n + 1) ≤ ∑ x(k ) ≤ nβ
2 2
( B4)
k =1

Where β is a positive number defined by;

n
β = max ∑ x(k )
2

x ( k )∈H1
k =1
Eq. B4 states that the squared Euclidean norm of w(n+1)
grows at most linearly with the number of iterations n.
Perceptron Convergence Proof
The second result of B4 is clearly in conflict with Eq. B3.
•Indeed, we can state that n cannot be larger than some
value nmax for which Eq. B3 and B4 are both satisfied with
the equality sign. That is nmax is the solution of the eq.
2
nmaxα2
2
= nmax β
w0
•Solving for nmax given a solution w0, we find that
2
β w0
nmax =
α2
We have thus proved that for η(n)=1 for all n, and for w(0)=0,
given that a sol’ vector w0 exists, the rule for adapting the
synaptic weights of the perceptron must terminate after at most
nmax iterations.

ML Unit-Iv
No ratings yet
ML Unit-Iv
18 pages
Deep Learning - Unit-III Two Marks
100% (1)
Deep Learning - Unit-III Two Marks
3 pages
ANN Unit-2 Chapter-2
No ratings yet
ANN Unit-2 Chapter-2
56 pages
Perceptons Neural Networks
No ratings yet
Perceptons Neural Networks
33 pages
Single Layer Perceptron
No ratings yet
Single Layer Perceptron
6 pages
Learning Rules of ANN
No ratings yet
Learning Rules of ANN
25 pages
Instructions To Install CentOS Linux 9 On VMWare-2
No ratings yet
Instructions To Install CentOS Linux 9 On VMWare-2
22 pages
Topic 5 - Part1 Multilayer Perceptron
No ratings yet
Topic 5 - Part1 Multilayer Perceptron
28 pages
Unit 2 Convolutional Neural Network
No ratings yet
Unit 2 Convolutional Neural Network
16 pages
Pattern Recognition - Unit - 1&2
100% (1)
Pattern Recognition - Unit - 1&2
41 pages
Biological Neuron and Memory: Understanding The Basics of Neural Function and Memory Mechanisms
No ratings yet
Biological Neuron and Memory: Understanding The Basics of Neural Function and Memory Mechanisms
455 pages
Instructions To Install CentOS Linux 9 On VMWare-2
No ratings yet
Instructions To Install CentOS Linux 9 On VMWare-2
22 pages
Ann Assignmeent 1,2,3
No ratings yet
Ann Assignmeent 1,2,3
23 pages
Process Redesign
No ratings yet
Process Redesign
43 pages
Unit 2ANNs
No ratings yet
Unit 2ANNs
169 pages
Introduction To Soft Computing: Practice Sheet: NN-1
No ratings yet
Introduction To Soft Computing: Practice Sheet: NN-1
2 pages
Unit IV Artificial Neural Networks
No ratings yet
Unit IV Artificial Neural Networks
25 pages
Rajesh (DL Unit1) 04dec2024
No ratings yet
Rajesh (DL Unit1) 04dec2024
125 pages
Deep Learning
No ratings yet
Deep Learning
127 pages
CP5261 Data Analytics Laboratory LTPC0042 Objectives
No ratings yet
CP5261 Data Analytics Laboratory LTPC0042 Objectives
80 pages
4-Hebbian Net-25-Jul-2018 - Reference Material I - Learning Rules in Neural Network PDF
No ratings yet
4-Hebbian Net-25-Jul-2018 - Reference Material I - Learning Rules in Neural Network PDF
19 pages
Unit 5 Neural Network
No ratings yet
Unit 5 Neural Network
31 pages
Artificial Neural Networks Video Tutorial: Machine Learning 17CS73
No ratings yet
Artificial Neural Networks Video Tutorial: Machine Learning 17CS73
23 pages
AI-Lecture 12 - Simple Perceptron
100% (1)
AI-Lecture 12 - Simple Perceptron
24 pages
Single Layer Perceptron Classifier
No ratings yet
Single Layer Perceptron Classifier
62 pages
Iv. Single Layer Structures: 4.1. Perceptrons
No ratings yet
Iv. Single Layer Structures: 4.1. Perceptrons
26 pages
DL Unit-2
No ratings yet
DL Unit-2
31 pages
DL Question Bank Answers
No ratings yet
DL Question Bank Answers
55 pages
05 ANN Artificial Neural Networks
No ratings yet
05 ANN Artificial Neural Networks
221 pages
Chapter 3
No ratings yet
Chapter 3
12 pages
CS632 Neural Networks
No ratings yet
CS632 Neural Networks
1 page
Artificial Intelligence & Neural Networks Unit-5 Basics of NN
50% (2)
Artificial Intelligence & Neural Networks Unit-5 Basics of NN
16 pages
ML LAB Mannual-1
No ratings yet
ML LAB Mannual-1
79 pages
Multiple-Layer Networks Backpropagation Algorithms
No ratings yet
Multiple-Layer Networks Backpropagation Algorithms
46 pages
Types of Neural Networks
No ratings yet
Types of Neural Networks
7 pages
Ann Rec054
No ratings yet
Ann Rec054
1 page
Robotics and Machine Vision Internal 3 Important Questions
No ratings yet
Robotics and Machine Vision Internal 3 Important Questions
1 page
9.deep Feedforward Networks
100% (1)
9.deep Feedforward Networks
13 pages
02 Fundamentals of Neural Network
No ratings yet
02 Fundamentals of Neural Network
40 pages
Unit 2
No ratings yet
Unit 2
112 pages
Artificial Neural Networks
No ratings yet
Artificial Neural Networks
55 pages
All Pairs Shortest Path
No ratings yet
All Pairs Shortest Path
28 pages
Activation Functions - Ipynb - Colaboratory
No ratings yet
Activation Functions - Ipynb - Colaboratory
10 pages
Unit 4
No ratings yet
Unit 4
24 pages
Soft Computing Assignment
100% (1)
Soft Computing Assignment
13 pages
The Multilayer Perceptron
No ratings yet
The Multilayer Perceptron
11 pages
Unit 5
No ratings yet
Unit 5
23 pages
Advanced Information Retreival: Chapter 02: Modeling - Neural Network Model
No ratings yet
Advanced Information Retreival: Chapter 02: Modeling - Neural Network Model
31 pages
Neuro Fuzzy Systems
100% (1)
Neuro Fuzzy Systems
27 pages
Artificial Neural Network
No ratings yet
Artificial Neural Network
21 pages
Perceptons Neural Networks
No ratings yet
Perceptons Neural Networks
33 pages
Artificial Intelligence Module 5
No ratings yet
Artificial Intelligence Module 5
23 pages
82-P01.91.300096-07 GE300 GE320 Operation Manual
No ratings yet
82-P01.91.300096-07 GE300 GE320 Operation Manual
126 pages
Unit 5
No ratings yet
Unit 5
61 pages
Seminar Report Machine Learning
No ratings yet
Seminar Report Machine Learning
20 pages
Artificial Neural Network
No ratings yet
Artificial Neural Network
8 pages
Dave Reed: Connectionist Approach To AI
No ratings yet
Dave Reed: Connectionist Approach To AI
26 pages
Unit - 3
No ratings yet
Unit - 3
42 pages
Supervised Learning Neural Networks
No ratings yet
Supervised Learning Neural Networks
34 pages
White Paper: MPO Connector Basics and Best Practices
No ratings yet
White Paper: MPO Connector Basics and Best Practices
9 pages
Associative Memory Neural Networks
100% (1)
Associative Memory Neural Networks
26 pages
1911 Encyclopædia Britannica
No ratings yet
1911 Encyclopædia Britannica
301 pages
Solicitation Letter
No ratings yet
Solicitation Letter
5 pages
ML Unit-Iv
No ratings yet
ML Unit-Iv
19 pages
Mazda Engineering Standard: Teruhisa Morishige
No ratings yet
Mazda Engineering Standard: Teruhisa Morishige
10 pages
Fluorescence Micros
No ratings yet
Fluorescence Micros
22 pages
Psychometric Assessment
0% (1)
Psychometric Assessment
10 pages
Ud Module 4
No ratings yet
Ud Module 4
105 pages
FTS - Test-02 (Code-B) - 24-03-2023
No ratings yet
FTS - Test-02 (Code-B) - 24-03-2023
32 pages
Deep Learning
No ratings yet
Deep Learning
2 pages
Director's Concept & Vision Slides
100% (2)
Director's Concept & Vision Slides
14 pages
Unit - III: Neurological Instrumentation
No ratings yet
Unit - III: Neurological Instrumentation
62 pages
Handbook of Econometrics Volume 3
No ratings yet
Handbook of Econometrics Volume 3
620 pages
15-Nguyen Van Thin-Bai Bao28!3!2007
No ratings yet
15-Nguyen Van Thin-Bai Bao28!3!2007
8 pages
IECEx PRE 19.0093U 000
No ratings yet
IECEx PRE 19.0093U 000
5 pages
Unit - Iv Equipment For Critical Care
No ratings yet
Unit - Iv Equipment For Critical Care
89 pages
Assessment Task 1.2
No ratings yet
Assessment Task 1.2
14 pages
Learning Area Grade Level 7 Quarter Date: English 4
No ratings yet
Learning Area Grade Level 7 Quarter Date: English 4
4 pages
GPT-9000 User Manual - EN Rev G 201712
No ratings yet
GPT-9000 User Manual - EN Rev G 201712
183 pages
Jayson Dr. Palisoc Domain 3 Diversity of Learners
No ratings yet
Jayson Dr. Palisoc Domain 3 Diversity of Learners
7 pages
Lab Manual-Chem 203 Practical General Chemistry - 5 Sept 2020
No ratings yet
Lab Manual-Chem 203 Practical General Chemistry - 5 Sept 2020
151 pages
Class X Prep Ideas
No ratings yet
Class X Prep Ideas
6 pages
PPTPTPTPTTP
No ratings yet
PPTPTPTPTTP
13 pages
Leaders Are Born Not Made
No ratings yet
Leaders Are Born Not Made
6 pages
Avalanche Formation and Characteristics
No ratings yet
Avalanche Formation and Characteristics
13 pages
Data Science For Civil Engineering Unit 3 Notes-1
No ratings yet
Data Science For Civil Engineering Unit 3 Notes-1
29 pages
Beta Catalog Et b1 2005
No ratings yet
Beta Catalog Et b1 2005
317 pages
Chapter 9&10 Prepraration of Consumer Behavior
No ratings yet
Chapter 9&10 Prepraration of Consumer Behavior
90 pages
310-A STO FY 2024 TIER 1
No ratings yet
310-A STO FY 2024 TIER 1
12 pages
Business and New Economic Environment: UNIT-1
No ratings yet
Business and New Economic Environment: UNIT-1
36 pages
Fig: Dual Beam CRO With Separate Time Bases
No ratings yet
Fig: Dual Beam CRO With Separate Time Bases
27 pages
Spanos - Past-Life Ids Ufos Satanic Abuse
No ratings yet
Spanos - Past-Life Ids Ufos Satanic Abuse
8 pages
Locally GAN-generated Face Detection Based On An Improved Xception
No ratings yet
Locally GAN-generated Face Detection Based On An Improved Xception
13 pages
137-E Blank Form
No ratings yet
137-E Blank Form
3 pages
Verbal Classfication
No ratings yet
Verbal Classfication
2 pages
Country Frost King Creek Cowboys Book 8 Cheyenne Mccray PDF Download
No ratings yet
Country Frost King Creek Cowboys Book 8 Cheyenne Mccray PDF Download
29 pages

P1 - Single Layer Feed Forward Networks

Uploaded by

P1 - Single Layer Feed Forward Networks

Uploaded by

CS407 Neural Computation

Lecturer: A/Prof. M. Bennamoun

Input layer Output layer

Types of Exclusive-OR Classes with Most General

Single-Layer Half Plane A B

Two-Layer Convex Open A B

 How long do we keep looking for a solution? We need to be able to

 So the training data lead to four inequalities:

 It is easy to see that there are an infinite number of solutions. Similarly,

 We can follow the same procedure for the XOR network:

• Perceptron is able to represent some useful functions

(Intersects the origin

Note 1: p=distance so >0

•For dynamic correction rule: c depends on the distance

– The initial weight should be different from 0.

λ=0: No weight adjustment

Νote: λ is the ratio of the distance

1 − 0.5 3  − 2

•Using w ' = w ± cy the weight training with each step can

w11 is in solution area.

•Replace the TLU (Threshold Logic Unit) with the

The factor ½ does not affect the location of

•The new weights is obtained by moving in the direction

By definition of the steepest descent concept,

Same training pattern set as

yn+1= -1 (irrelevant wheter it

Indecision regions = regions

•Step 1: Pattern y1 is input

• During the second cycle:

Perceptron Convergence Proof

Consider the following perceptron:

w T x > 0 for every input vector x belonging to class C1

w (n + 1) = w (n) if w T x(n) > 0 and x(n) belongs to class C1

w(n + 1) = w(n) − η (n)x(n) if wT x(n) > 0 and x(n) belongs to class C2

w(n + 1) = w(n) + η (n)x(n) if wT x(n) ≤ 0 and x(n) belongs to class C1

– learning rate parameter η (n) controls adjustment

wT x(n) ≤ 0 so that : w(n + 1) = w(n) + η (n)x(n)

Hence equation (B1) above implies

Adding these inequalities for k=1,…n, and invoking the initial

Where β is a positive number defined by;

You might also like

How long do we keep looking for a solution? We need to be able to

So the training data lead to four inequalities:

It is easy to see that there are an infinite number of solutions. Similarly,

We can follow the same procedure for the XOR network: