
501582-3 Neural Networks

Perceptron

Dr. Huda Hakami


Department of Computer Science, Taif University
Introduction
• Rosenblatt’s Perceptron:
• Rosenblatt (1958) proposed the perceptron as the first model for learning with a teacher
(i.e., supervised learning).
• The first algorithmically described neural network
• A bio-inspired algorithm that tries to mimic a single neuron
• Occupies a special place in the historical development of neural networks
• Considers only one training instance at a time (online learning)
• Error-driven learning: we learn only if we make a mistake when classifying with the current
weight vector. Otherwise, we make no adjustment to the weight vector.
Motivating Example
• Each day you get lunch at the cafeteria.
• Your diet consists of fish, chips, and Pepsi.
• You get several portions of each
• The cashier only tells you the total price of the meal
• After several days, you should be able to know the price of each portion.
• Each meal price gives a linear constraint on the prices (w) of the portions (x):

  price = x_fish · w_fish + x_chips · w_chips + x_pepsi · w_pepsi

• The prices of the portions are like the weights of a linear neuron.

• We will start with guesses for the weights and then adjust the guesses to give a better fit to the
prices given by the cashier.
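A minimal sketch of this idea, assuming a delta-rule-style adjustment (the slide does not name the exact rule, and the prices, meals, and learning rate below are purely illustrative): guess the portion prices, compare the predicted total with the cashier's total, and nudge each guessed price in proportion to the portions ordered.

```python
# Illustrative only: the "true" prices are unknown to the learner.
true_w = [1.50, 0.50, 1.00]                              # fish, chips, pepsi
meals = [[1, 2, 0], [0, 1, 2], [2, 0, 1], [1, 1, 1]]     # portions per meal
totals = [sum(x * w for x, w in zip(m, true_w)) for m in meals]  # cashier's totals

w = [0.5, 0.5, 0.5]    # initial guesses for the portion prices
eta = 0.05             # learning rate (assumed value)

for _ in range(1000):                      # repeat over many "days"
    for x, t in zip(meals, totals):
        y = sum(xi * wi for xi, wi in zip(x, w))              # predicted meal price
        error = t - y
        w = [wi + eta * error * xi for xi, wi in zip(x, w)]   # adjust each guess
print([round(wi, 2) for wi in w])          # settles near the true portion prices
```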
The Artificial Perceptron

a = b + Σ xᵢ wᵢ   (sum over i = 1, …, n: the weighted inputs plus the bias b)

The perceptron is an algorithm for supervised learning of binary linear
classifiers: functions that can decide whether an input (represented by
a vector of numbers) belongs to one class or another.
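A minimal sketch of this computation (the function and the numbers are illustrative, not from the slides):

```python
def perceptron_predict(x, w, b):
    """Return +1 or -1 for input vector x given weights w and bias b."""
    a = b + sum(xi * wi for xi, wi in zip(x, w))   # a = b + sum_i x_i * w_i
    return 1 if a >= 0 else -1

print(perceptron_predict([2.0, -1.0], [0.5, 0.3], b=-0.2))   # -> +1
```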
Perceptron Training
How does the perceptron learn its classification tasks?

• This is done by making small adjustments in the weights to reduce the difference between the
actual and desired outputs of the perceptron.

• The initial weights are randomly assigned, usually in the range [-0.5, 0.5], and then updated to
obtain the output consistent with the training examples.

• The perceptron learns classification tasks through multiple iterations. Each iteration includes a
weight-adjustment step.
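For instance, the random initialization described above might be written as follows (a sketch; the slides do not prescribe a particular library or number of inputs):

```python
import random

n_inputs = 2
w = [random.uniform(-0.5, 0.5) for _ in range(n_inputs)]   # initial weights in [-0.5, 0.5]
print(w)
```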
Perceptron Learning Algorithm
Perceptron Learning Algorithm (Cont.)
• Detecting error
Desired (given) label y | Predicted (actual) label ŷ | Update the weights w? | Action
+1 | sign(a) = +1 | No error -> no update |
-1 | sign(a) = -1 | No error -> no update |
+1 | sign(a) = -1 | Misclassification     | Positive error: we need to increase ŷ
-1 | sign(a) = +1 | Misclassification     | Negative error: we need to decrease ŷ

error = y − ŷ
Perceptron Learning Algorithm (Cont.)
• Update rule - Intuitive Explanation
• The update rule applies if we have a misclassification (i.e., if y·a < 0):
• Incorrectly classify a positive instance as negative:
  • We should increase the activation (wᵀx)
  • ADD the current instance to the weight vector
• Incorrectly classify a negative instance as positive:
  • We should decrease the activation (wᵀx)
  • DEDUCT the current instance from the weight vector
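A minimal sketch of this rule, assuming labels y ∈ {+1, −1} (the function name is illustrative):

```python
def perceptron_update(w, x, y):
    """Apply the perceptron update for one instance (x, y), with y in {+1, -1}."""
    a = sum(xi * wi for xi, wi in zip(x, w))       # activation w^T x
    if y * a < 0:                                  # misclassified
        # y = +1: add x to w (raises the activation); y = -1: subtract x (lowers it)
        w = [wi + y * xi for wi, xi in zip(w, x)]
    return w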
Perceptron Learning Algorithm (Cont.)
• Update rule - Math Explanation
• If the misclassified instance is a positive one, then after we update using w' = w + x,
the new activation a' is greater than the old activation a.
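The algebra behind this claim, reconstructed here since the slide's own derivation is not shown in this extract:

```latex
a' = \mathbf{w}'^{\top}\mathbf{x}
   = (\mathbf{w} + \mathbf{x})^{\top}\mathbf{x}
   = \mathbf{w}^{\top}\mathbf{x} + \mathbf{x}^{\top}\mathbf{x}
   = a + \lVert \mathbf{x} \rVert^{2} \;\ge\; a
```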
Perceptron Learning Algorithm (Cont.)
• Update rule - Math Explanation
• Show that the analysis in the previous slide holds when y = -1 (i.e., we misclassified a negative
instance).
• Order of training instances:
• Randomly shuffling the training instances within each iteration improves performance
• Presenting all the positives first and then all the negatives is a bad idea
Perceptron Algorithm (Compact)
• Variables and parameters:
• Given training data (x, y) where:
  input vector: x = [+1, x₁, x₂, …, xₙ]
  desired response y: +1 or −1
• Weight vector: w = [b, w₁, w₂, …, wₙ]
• Learning rate hyperparameter 0 ≤ η ≤ 1
Perceptron Algorithm (Compact)
• Step 1: Initialization
• Set the initial weight vector to zero or to random numbers in a range [-1,+1] or [-0.5,+0.5],
then perform the following computations for time-step n = 1, 2, ....
• Step 2: Activation
• Compute the activation a and the actual output ŷ for an instance as follows:
  ŷ = sgn(wᵀx)
• Step 3: Adaptation of weight vector
• Apply the error-correction learning rule:
  w_new = w_old + η (y − ŷ) x
• Continuation: Increment time step by one and go back to step 2.
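A minimal, self-contained sketch of this compact algorithm (the function names, toy dataset, and hyperparameter values are illustrative, not from the slides):

```python
def sgn(a):
    """Sign activation: +1 if a >= 0, else -1."""
    return 1 if a >= 0 else -1

def train_perceptron(X, Y, eta=0.1, epochs=100):
    """X: list of input vectors (without bias); Y: desired responses in {+1, -1}."""
    X = [[1.0] + list(x) for x in X]        # prepend +1 so that w[0] plays the role of the bias b
    w = [0.0] * len(X[0])                   # Step 1: initialise the weight vector
    for _ in range(epochs):
        for x, y in zip(X, Y):
            a = sum(xi * wi for xi, wi in zip(x, w))          # Step 2: activation
            y_hat = sgn(a)
            # Step 3: error-correction rule  w_new = w_old + eta * (y - y_hat) * x
            w = [wi + eta * (y - y_hat) * xi for wi, xi in zip(w, x)]
    return w

# Toy linearly separable data: two points per class
X = [[2, 1], [1, 3], [-1, -2], [-2, -1]]
Y = [+1, +1, -1, -1]
w = train_perceptron(X, Y)
print(w)   # a weight vector (bias first) that separates the two classes
```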
Linear Separability: math review
• Linear functions are those whose graph is a straight line.
• A linear function has the following form: y = f(x) = ax + b, where a and b are
constants, often real numbers.
• A linear function has one independent variable and one dependent variable
• For a function f(x₁, x₂, …, xₙ) of any finite number of independent variables, the general
formula is f(x₁, x₂, …, xₙ) = a₁x₁ + a₂x₂ + … + aₙxₙ + b
Perceptron: Linear Separability Concept
• A single perceptron can only be used to implement linearly separable functions
• For the perceptron to function properly, the two classes c1 and c2 must be linearly separable.
• That is, the patterns to be classified must be sufficiently separated from each other to ensure that
the decision surface consists of a hyperplane
Perceptron: Linear Separability Concept
• The training process involves the adjustment of the weight vector w in such a way that the two
classes c1 and c2 are linearly separable.
• That is, there exists a weight vector w such that we may state:
  wᵀx > 0 for every input vector x belonging to class c1
  wᵀx ≤ 0 for every input vector x belonging to class c2
• Thus, the decision in the perceptron is made depending on the sign of wᵀx
• Therefore, wᵀx = 0 is the decision boundary (it defines the hyperplane)
Perceptron: Linear Separability Concept
• The decision in the perceptron is made depending on the sign of wᵀx
• Therefore, wᵀx = 0 is the decision boundary (it defines the hyperplane)
• Example:
• In 2D space we have w₁x₁ + w₂x₂ = 0, a straight line through the origin (ignoring the bias)
• In N-dimensional space this is an (N−1)-dimensional hyperplane
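A tiny sketch of this decision rule in 2D (the weight values are illustrative):

```python
w = [1.0, -2.0]       # weights defining the line w1*x1 + w2*x2 = 0 (bias ignored, as above)

def side(x1, x2):
    """Report which side of the hyperplane w^T x = 0 a point lies on."""
    a = w[0] * x1 + w[1] * x2
    return "positive side" if a > 0 else ("negative side" if a < 0 else "on the boundary")

print(side(3.0, 1.0))   # 3 - 2 = 1  -> positive side
print(side(2.0, 1.0))   # 2 - 2 = 0  -> on the boundary
print(side(1.0, 1.0))   # 1 - 2 = -1 -> negative side
```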
Geometric representation of Hyperplane
• The hyperplane defined by the weight vector is perpendicular to the weight vector

[Figure: the hyperplane wᵀx = 0 is drawn perpendicular to the weight vector w. A positive instance x
is misclassified as negative because wᵀx < 0 (why?). The new weight vector w' is the addition w + x
according to the perceptron update rule; x will be classified as positive by w' (why?).]
Perceptron: Linear Separability Concept
• A perceptron can learn the logical operators AND and OR, but cannot learn Exclusive-OR (XOR). Why?
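A small illustrative sketch (not from the slides) of what this means in practice: training the sign perceptron on XOR never reaches an error-free epoch, because no hyperplane separates the two classes.

```python
def sgn(a):
    return 1 if a >= 0 else -1

# XOR with +1/-1 labels; the leading +1 in each input acts as the bias term
X = [[1, 0, 0], [1, 0, 1], [1, 1, 0], [1, 1, 1]]
Y = [-1, +1, +1, -1]

w, eta = [0.0, 0.0, 0.0], 0.1
for epoch in range(100):
    mistakes = 0
    for x, y in zip(X, Y):
        y_hat = sgn(sum(xi * wi for xi, wi in zip(x, w)))
        if y_hat != y:
            mistakes += 1
            w = [wi + eta * (y - y_hat) * xi for wi, xi in zip(w, x)]
print(mistakes)   # still at least 1 mistake per epoch: the perceptron cannot represent XOR
```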
Linear Separability: Remarks
• When a dataset is linearly separable, there can exist more than one hyperplane that separates
the dataset into positive/negative groups (the separating hyperplane is not unique)
• However, (by definition) if a dataset is not linearly separable, then there exists NO hyperplane that
separates the dataset into positive/negative groups.
• When a dataset is linearly separable, it can be proved that the perceptron will always find a
separating hyperplane!
• The final weight vector returned by the perceptron is more influenced by the last training
instances it sees.
• Averaging over all the weight vectors seen during training addresses this
(the averaged perceptron algorithm); see the sketch below.
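A minimal sketch of the averaged perceptron (illustrative; the slides only name the idea). The returned average is then used in place of the final w at prediction time.

```python
def averaged_perceptron(X, Y, eta=0.1, epochs=10):
    """X: list of inputs with a leading +1 bias term; Y: labels in {+1, -1}."""
    n = len(X[0])
    w = [0.0] * n
    w_sum = [0.0] * n          # running sum of the weight vector after every instance
    count = 0
    for _ in range(epochs):
        for x, y in zip(X, Y):
            a = sum(xi * wi for xi, wi in zip(x, w))
            y_hat = 1 if a >= 0 else -1
            if y_hat != y:
                w = [wi + eta * (y - y_hat) * xi for wi, xi in zip(w, x)]
            w_sum = [s + wi for s, wi in zip(w_sum, w)]
            count += 1
    return [s / count for s in w_sum]   # averaged weight vector
```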
Example of Perceptron Learning
• Logical AND operator:
• Suppose: the initial weights are w1=0.3 and w2 =-0.1, threshold: 𝜃=0.2 and learning rate 0.1
• Activation function: step function (1/true or zero/false)
• After the initialization, the perceptron is activated by the sequence of four input patterns
representing an epoch.

[Figure: inputs X1 (weight w1 = 0.3) and X2 (weight w2 = -0.1) feed a summation unit followed by a
step activation producing the actual/resulting output ŷ, which is compared with the desired/target output.]


Example of Perceptron Learning
Epoch | Iteration | Inputs X1 X2 | Desired output Yd | Initial weights w1 w2 | Actual output ŷ | Error e | Final weights w1 w2
  1   |     1     |    0   0     |        0          |     0.3   -0.1        |                 |         |
  1   |     2     |    0   1     |        0          |                       |                 |         |
  1   |     3     |    1   0     |        0          |                       |                 |         |
  1   |     4     |    1   1     |        1          |                       |                 |         |
  2   |     5     |    0   0     |        0          |                       |                 |         |
  2   |     6     |    0   1     |        0          |                       |                 |         |
  2   |     7     |    1   0     |        0          |                       |                 |         |
  2   |     8     |    1   1     |        1          |                       |                 |         |
  3   |     9     |    0   0     |        0          |                       |                 |         |
  3   |    10     |    0   1     |        0          |                       |                 |         |
  3   |    11     |    1   0     |        0          |                       |                 |         |
  3   |    12     |    1   1     |        1          |                       |                 |         |
  4   |    13     |    0   0     |        0          |                       |                 |         |
  4   |    14     |    0   1     |        0          |                       |                 |         |
  4   |    15     |    1   0     |        0          |                       |                 |         |
  4   |    16     |    1   1     |        1          |                       |                 |         |

Notes: An iteration is every single repetition of the process; an epoch is the presentation of the
entire training set to the ANN during the training process. Threshold: θ = 0.2; learning rate: 0.1.
Example of Perceptron Learning
Epoch | Iteration | Inputs X1 X2 | Desired output Yd | Initial weights w1 w2 | Actual output ŷ | Error e | Final weights w1 w2
  1   |     1     |    0   0     |        0          |     0.3   -0.1        |       0         |    0    |     0.3   -0.1
  1   |     2     |    0   1     |        0          |     0.3   -0.1        |       0         |    0    |     0.3   -0.1
  1   |     3     |    1   0     |        0          |     0.3   -0.1        |       1         |   -1    |     0.2   -0.1
  1   |     4     |    1   1     |        1          |     0.2   -0.1        |       0         |    1    |     0.3    0.0
Update rule: w_new = w_old + η (Yd − ŷ) x

Iteration 1:
• 0 * 0.3 + 0 * (-0.1) - 0.2 = -0.2 -> step(-0.2) = 0 (negative)
• error = 0 - 0 = 0 (no update for w1 and w2)
Iteration 2:
• 0 * 0.3 + 1 * (-0.1) - 0.2 = -0.3 -> step(-0.3) = 0 (negative)
• error = 0 - 0 = 0 (no update)
Iteration 3:
• 1 * 0.3 + 0 * (-0.1) - 0.2 = 0.1 -> step(0.1) = 1 (positive)
• error = 0 - 1 = -1 (apply update rule)
  w1 = 0.3 + (0.1 * -1 * 1) = 0.2
  w2 = -0.1 + (0.1 * -1 * 0) = -0.1
Example of Perceptron Learning
Epoch | Iteration | Inputs X1 X2 | Desired output Yd | Initial weights w1 w2 | Actual output ŷ | Error e | Final weights w1 w2
  1   |     1     |    0   0     |        0          |     0.3   -0.1        |       0         |    0    |     0.3   -0.1
  1   |     2     |    0   1     |        0          |     0.3   -0.1        |       0         |    0    |     0.3   -0.1
  1   |     3     |    1   0     |        0          |     0.3   -0.1        |       1         |   -1    |     0.2   -0.1
  1   |     4     |    1   1     |        1          |     0.2   -0.1        |       0         |    1    |     0.3    0.0
  2   |     5     |    0   0     |        0          |     0.3    0.0        |       0         |    0    |     0.3    0.0
  2   |     6     |    0   1     |        0          |     0.3    0.0        |       0         |    0    |     0.3    0.0
  2   |     7     |    1   0     |        0          |     0.3    0.0        |       1         |   -1    |     0.2    0.0
  2   |     8     |    1   1     |        1          |     0.2    0.0        |       1         |    0    |     0.2    0.0
  3   |     9     |    0   0     |        0          |     0.2    0.0        |       0         |    0    |     0.2    0.0
  3   |    10     |    0   1     |        0          |     0.2    0.0        |       0         |    0    |     0.2    0.0
  3   |    11     |    1   0     |        0          |     0.2    0.0        |       1         |   -1    |     0.1    0.0
  3   |    12     |    1   1     |        1          |     0.1    0.0        |       0         |    1    |     0.2    0.1
  4   |    13     |    0   0     |        0          |     0.2    0.1        |       0         |    0    |     0.2    0.1
  4   |    14     |    0   1     |        0          |     0.2    0.1        |       0         |    0    |     0.2    0.1
  4   |    15     |    1   0     |        0          |     0.2    0.1        |       1         |   -1    |     0.1    0.1
  4   |    16     |    1   1     |        1          |     0.1    0.1        |       1         |    0    |     0.1    0.1
  5   |    17     |    0   0     |        0          |     0.1    0.1        |       0         |    0    |     0.1    0.1
  5   |    18     |    0   1     |        0          |     0.1    0.1        |       0         |    0    |     0.1    0.1
  5   |    19     |    1   0     |        0          |     0.1    0.1        |       0         |    0    |     0.1    0.1
  5   |    20     |    1   1     |        1          |     0.1    0.1        |       1         |    0    |     0.1    0.1
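A minimal sketch that reproduces this table (illustrative code; it follows the slide's conventions: step activation against threshold θ = 0.2, learning rate 0.1, update wᵢ ← wᵢ + η·e·xᵢ):

```python
def step(a):
    """Step activation: 1 if a >= 0, else 0."""
    return 1 if a >= 0 else 0

# Logical AND training set: inputs (X1, X2) and desired outputs Yd
data = [((0, 0), 0), ((0, 1), 0), ((1, 0), 0), ((1, 1), 1)]

w = [0.3, -0.1]          # initial weights from the example
theta, eta = 0.2, 0.1    # threshold and learning rate from the example

for epoch in range(1, 6):
    for (x1, x2), yd in data:
        # round() guards against float round-off when the activation sits exactly at the threshold
        y = step(round(x1 * w[0] + x2 * w[1] - theta, 9))
        e = yd - y                     # error
        w[0] += eta * e * x1           # weight update: w_i <- w_i + eta * e * x_i
        w[1] += eta * e * x2
    print(epoch, [round(wi, 1) for wi in w])
# Ends with w1 = 0.1, w2 = 0.1, matching the last row of the table above
```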
Example of Perceptron Learning: OR
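The OR slide's content is not reproduced in this extract. A hedged sketch of the same procedure applied to the logical OR operator, keeping θ = 0.2 and learning rate 0.1 and assuming the same starting weights (the slide's actual initial values are not shown here):

```python
def step(a):
    return 1 if a >= 0 else 0

# Logical OR training set: inputs (X1, X2) and desired outputs Yd
data = [((0, 0), 0), ((0, 1), 1), ((1, 0), 1), ((1, 1), 1)]
w, theta, eta = [0.3, -0.1], 0.2, 0.1   # initial weights assumed, not from the slide

for epoch in range(10):
    errors = 0
    for (x1, x2), yd in data:
        y = step(round(x1 * w[0] + x2 * w[1] - theta, 9))
        e = yd - y
        errors += abs(e)
        w[0] += eta * e * x1
        w[1] += eta * e * x2
    if errors == 0:          # stop once a whole epoch passes with no mistakes
        break
print([round(wi, 1) for wi in w])   # e.g. [0.3, 0.2]: weights that realise OR with threshold 0.2
```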
