Artificial Neural Networks (III)
The figure below represents the AND, OR and Exclusive-OR
functions as two-dimensional plots based on the values of the
two inputs. Points in the input space where the function output
is 1 are indicated by black dots, and points where the output is
0 are indicated by white dots.
In (𝑎) and (𝑏), we can draw a line so that black dots are on one
side and white dots on the other, but dots shown in (𝑐) are not
separable by a single line.
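To make this concrete, here is a small sketch (not part of the original slides) in Python: it brute-forces a grid of candidate weights w1, w2 and threshold values for a single threshold neuron and reports whether any setting reproduces each truth table. The grid range and step size are arbitrary choices made for illustration.

# Brute-force search for a single-neuron (linear) solution to AND, OR and XOR.
# A solution exists for AND and OR but not for XOR, illustrating that XOR
# is not linearly separable.
import itertools

def solvable(target):
    """Return True if some (w1, w2, theta) reproduces the target truth table."""
    grid = [v / 2 for v in range(-8, 9)]          # candidate values -4.0 .. 4.0
    for w1, w2, theta in itertools.product(grid, repeat=3):
        ok = all(int(w1 * x1 + w2 * x2 - theta >= 0) == y
                 for (x1, x2), y in target.items())
        if ok:
            return True
    return False

AND = {(0, 0): 0, (0, 1): 0, (1, 0): 0, (1, 1): 1}
OR  = {(0, 0): 0, (0, 1): 1, (1, 0): 1, (1, 1): 1}
XOR = {(0, 0): 0, (0, 1): 1, (1, 0): 1, (1, 1): 0}

for name, table in [("AND", AND), ("OR", OR), ("XOR", XOR)]:
    print(name, "linearly separable:", solvable(table))
# Expected: AND True, OR True, XOR False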
How do we cope with problems which are not linearly
separable?
Multilayer neural networks
A multilayer perceptron with two hidden layers is shown below.
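Since the figure itself is not reproduced here, the short sketch below spells out the structure in code. The layer sizes (2 inputs, two hidden layers of 3 neurons each, 1 output) are assumptions made for illustration and need not match the figure.

# A minimal sketch of a multilayer perceptron with two hidden layers.
# The layer sizes are illustrative assumptions, not taken from the figure.
layer_sizes = [2, 3, 3, 1]

# Each pair of adjacent layers is fully connected, so the weights between
# layer k and layer k+1 form a matrix of shape (neurons in k, neurons in k+1).
for k in range(len(layer_sizes) - 1):
    rows, cols = layer_sizes[k], layer_sizes[k + 1]
    print(f"weights from layer {k} to layer {k + 1}: "
          f"{rows} x {cols} ({rows * cols} connections)")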
But why do we need a hidden layer?
Why is a middle layer in a multilayer network called a
‘hidden’ layer? What does this layer hide?
Can a neural network include more than two hidden
layers?
Yes, in principle a network can have any number of hidden layers. In practice, commercial ANNs incorporate three and sometimes four layers, including one or two hidden layers. Each layer can contain from 10 to 1,000 neurons.
Learning in a multilayer network proceeds the same way as for
a perceptron. A training set of input patterns is presented to the
network. The network computes its output pattern, and if there
is an error – or in other words a difference between actual and
desired output patterns – the weights are adjusted to reduce
this error.
How can we assess the blame for an error and divide it
among the contributing weights?
If the network's output pattern is different from the desired output, an error is calculated and then propagated backwards through the network from the output layer to the input layer. The weights are modified as the error is propagated.
Typically, a back-propagation network is a multilayer network that has three or four layers. The layers are fully connected, that is, every neuron in each layer is connected to every neuron in the adjacent forward layer.
As in a single-layer perceptron, each neuron first computes its net weighted input:

X = \sum_{i=1}^{n} x_i w_i - \theta

where 𝑛 is the number of inputs, and 𝜃 is the threshold applied to the neuron.
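As a quick worked example with made-up input, weight and threshold values, the net weighted input can be computed directly from this formula:

# Net weighted input of a single neuron: X = sum(x_i * w_i) - theta.
# The inputs, weights and threshold below are arbitrary example values.
inputs  = [1.0, 0.0, 1.0]
weights = [0.5, -0.3, 0.8]
theta   = 0.2

X = sum(x * w for x, w in zip(inputs, weights)) - theta
print(X)   # 0.5 + 0.0 + 0.8 - 0.2, i.e. approximately 1.1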
Next, this input value is passed through the activation function. However, unlike a perceptron, neurons in the back-propagation network use a sigmoid activation function:

Y^{sigmoid} = \frac{1}{1 + e^{-X}}
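Putting the pieces together, the following is a minimal sketch of a back-propagation network learning the Exclusive-OR function, using the net weighted input and the sigmoid activation given above. The network size (2 inputs, 2 hidden neurons, 1 output), the learning rate and the number of epochs are illustrative choices, and the update rule is the standard gradient-descent form rather than any particular textbook's exact algorithm.

import numpy as np

def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))

# XOR training set: input patterns and desired outputs.
X = np.array([[0, 0], [0, 1], [1, 0], [1, 1]], dtype=float)
y = np.array([[0], [1], [1], [0]], dtype=float)

rng = np.random.default_rng(0)
W1 = rng.uniform(-1, 1, size=(2, 2))   # input -> hidden weights
t1 = rng.uniform(-1, 1, size=(1, 2))   # hidden-layer thresholds
W2 = rng.uniform(-1, 1, size=(2, 1))   # hidden -> output weights
t2 = rng.uniform(-1, 1, size=(1, 1))   # output-layer threshold
lr = 0.5                               # learning rate (arbitrary choice)

for epoch in range(20000):
    # Forward pass: net weighted input, then sigmoid, at each layer.
    h = sigmoid(X @ W1 - t1)
    out = sigmoid(h @ W2 - t2)

    # Error at the output layer (desired minus actual).
    err = y - out

    # Backward pass: propagate the error from the output layer towards
    # the input layer, using the sigmoid derivative out * (1 - out).
    delta_out = err * out * (1 - out)
    delta_h = (delta_out @ W2.T) * h * (1 - h)

    # Adjust weights and thresholds as the error is propagated.
    W2 += lr * h.T @ delta_out
    t2 -= lr * delta_out.sum(axis=0, keepdims=True)
    W1 += lr * X.T @ delta_h
    t1 -= lr * delta_h.sum(axis=0, keepdims=True)

print(np.round(out, 2))
# After training, the outputs should be close to [[0], [1], [1], [0]];
# with only two hidden neurons, convergence can depend on the random start.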
It is worth noting, however, that while the sigmoid activation function has historically been popular, modern deep learning architectures often prefer other activation functions, such as ReLU (Rectified Linear Unit), because of limitations of the sigmoid such as the vanishing gradient problem.
Vanishing Gradient Problem
• Neural Network Training: Neural networks are trained by adjusting their weights. This is done by backpropagation, which uses gradients of the loss function with respect to the weights.
• Gradient: In calculus, the gradient is a vector that points
in the direction of the steepest increase of a function. In
the context of neural networks, the gradient tells us how
much the loss will change if we change the weights by a
small amount. If we know the direction in which the loss
increases the most, we can adjust the weights in the
opposite direction to reduce the loss.
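In code, that "opposite direction" adjustment is a single subtraction. The one-parameter loss and the learning rate below are toy stand-ins chosen for illustration:

# Gradient descent on a toy one-parameter loss L(w) = (w - 3)^2.
# The gradient dL/dw = 2 * (w - 3) points towards increasing loss,
# so we step in the opposite direction.
w = 0.0
learning_rate = 0.1
for step in range(50):
    grad = 2 * (w - 3)            # direction of steepest increase
    w = w - learning_rate * grad  # move the opposite way to reduce the loss
print(round(w, 3))   # close to 3.0, the minimum of the loss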
• Vanishing Gradients Issue: In deep networks, especially with certain activation functions such as the sigmoid, gradients can become extremely small as they are propagated backwards through the layers. Because each layer contributes a multiplicative factor, this is like multiplying many small numbers together: the product shrinks towards zero, and the weights in the early layers barely change.
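A quick numerical illustration (the layer counts are arbitrary, and the factor 0.25 is the sigmoid's maximum derivative, used here as a best case):

# The derivative of the sigmoid, s(x) * (1 - s(x)), is at most 0.25 (at x = 0).
# Multiplying even this best-case factor across many layers shrinks the
# gradient towards zero: the vanishing gradient problem.
max_sigmoid_derivative = 0.25
for depth in (2, 5, 10, 20):
    print(depth, max_sigmoid_derivative ** depth)
# 2  -> 0.0625
# 5  -> 0.0009765625
# 10 -> roughly 9.5e-07
# 20 -> roughly 9.1e-13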
Rectified Linear Units (ReLU)
The Rectified Linear Unit is the most commonly used activation function in deep learning models. The function returns 0 if it receives any negative input, but for any positive value x it returns that value unchanged, so it can be written as f(x) = max(0, x).
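A direct translation into code, with a few arbitrary sample inputs:

# ReLU returns 0 for negative inputs and the input itself otherwise: f(x) = max(0, x).
def relu(x):
    return max(0.0, x)

for x in (-2.0, -0.5, 0.0, 1.5, 3.0):
    print(x, relu(x))
# -2.0 and -0.5 map to 0.0; 0.0, 1.5 and 3.0 are returned unchanged.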
It's surprising that such a simple function (and one
composed of two linear pieces) can allow your model to
account for non-linearities and interactions so well. But the
ReLU function works great in most applications, and it is
very widely used as a result.
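One toy way (not from the slides) to see how two linear pieces can produce non-linearity: the absolute-value function, which is clearly not linear, can be built from just two ReLU units.

def relu(x):
    return max(0.0, x)

# |x| = relu(x) + relu(-x): a non-linear function built from two ReLU units.
def abs_from_relus(x):
    return relu(x) + relu(-x)

for x in (-3.0, -1.0, 0.0, 2.0):
    print(x, abs_from_relus(x))   # matches abs(x) in every case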