Unit-4

Artificial Neural Network


 An ANN has three kinds of layers: an input layer, hidden layers, and an output layer.
 Each ANN has a single input layer and a single output layer, but may have none, one, or many hidden layers.
 The structure of an ANN falls into one of several architectures, such as single-layer, multi-layer, feed-forward, and recurrent networks.
 Weights are associated with each input to a neuron in an ANN.
 An activation function is applied to the net input to calculate the output. The output is then compared with the target, and the weights are adjusted.
Components of ANN
 Inputs: Inputs are the set of values for which we need to predict an output value. They can be viewed as features or attributes in a dataset.
 Weights: Weights are real values attached to each input/feature; they convey the importance of the corresponding feature in predicting the final output.
 Bias: The bias shifts the activation function to the left or right. It is a constant that helps the model fit the given data as well as possible.
Components of ANN
 Summation Function: The summation function binds the weights and inputs together and calculates their sum.
 Activation Function: The activation function decides whether a neuron should be activated or not; that is, it decides, using simple mathematical operations, whether the neuron's input to the network is important in the process of prediction.
Neurons
 The building blocks for neural networks are artificial
neurons.
 These are simple computational units that have
weighted input signals and produce an output signal
using an activation function.
Basic Structure of ANN
Feed-forward Vs Back-propagation
 Two essential terms describe the movement of information through a network: feed-forward and back-propagation.
 Feed-forward propagation: the flow of information occurs in the forward direction. The input is used to calculate some intermediate function in the hidden layer, which is then used to calculate an output.
 Back-propagation: the weights of the network connections are repeatedly adjusted to minimize the difference between the actual output vector of the net and the desired output vector.
Perceptron
 A perceptron is a neural network unit that performs a precise computation to detect features in the input data.
 A perceptron is mainly used to classify data into two parts; therefore, it is also known as a Linear Binary Classifier.
 Given inputs x1 through xn, the output o(x1, …, xn) computed by the perceptron is:
o(x1, …, xn) = 1 if w0 + w1x1 + w2x2 + … + wnxn > 0, and −1 otherwise
where each wi is a real-valued constant, or weight, that determines the contribution of input xi to the perceptron output.
 The perceptron model works in two important steps:
 Step 1: First, multiply all input values by their corresponding weights, then add the products to determine the weighted sum. Mathematically, the weighted sum is:
∑wi*xi = x1*w1 + x2*w2 + … + xn*wn
 A special term called the bias 'b' is added to this weighted sum to improve the model's performance:
∑wi*xi + b
 Step 2: An activation function is applied to the above weighted sum, which gives an output either in binary form or as a continuous value:
Y = f(∑wi*xi + b)
This step (activation) function is vital in ensuring that the output is mapped to {0, 1} or {−1, 1}.
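
As a minimal sketch of these two steps in code (the input, weight, and bias values below are illustrative assumptions, not values from the text):

import numpy as np

def perceptron(x, w, b):
    """Step 1: weighted sum plus bias; Step 2: step activation."""
    weighted_sum = np.dot(w, x) + b       # sum(wi * xi) + b
    return 1 if weighted_sum > 0 else -1  # step activation function

# Illustrative values (assumed for the demo)
x = np.array([1.0, 0.5])    # inputs x1, x2
w = np.array([0.4, -0.2])   # weights w1, w2
b = 0.1                     # bias

print(perceptron(x, w, b))  # -> 1, since 0.4*1.0 - 0.2*0.5 + 0.1 = 0.4 > 0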
Multi-layer Perceptron Model
 In a single-layer perceptron there are only input and output layers; it is a two-layer architecture.
 A multi-layer perceptron (MLP) is a neural network that has multiple layers.
 MLPs are more powerful than single-layer networks: they can learn functions that are not linearly separable.
 A multi-layer perceptron has one input layer with one neuron (or node) for each input, one output layer with a single node for each output, and any number of hidden layers, where each hidden layer can have any number of nodes.
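
A minimal forward-pass sketch of such a network, assuming one hidden layer with sigmoid activations (the layer sizes and random values are illustrative assumptions):

import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

def mlp_forward(x, W_hidden, b_hidden, W_out, b_out):
    """Forward pass: input layer -> one hidden layer -> output layer."""
    h = sigmoid(W_hidden @ x + b_hidden)  # hidden-layer activations
    o = sigmoid(W_out @ h + b_out)        # output-layer activations
    return o

rng = np.random.default_rng(0)
x = rng.normal(size=3)              # 3 input features
W_hidden = rng.normal(size=(4, 3))  # 4 hidden nodes, 3 inputs each
b_hidden = np.zeros(4)
W_out = rng.normal(size=(2, 4))     # 2 output nodes
b_out = np.zeros(2)

print(mlp_forward(x, W_hidden, b_hidden, W_out, b_out))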
Multi-Layer Perceptron Model
 A schematic diagram of a Multi-Layer Perceptron
(MLP) is depicted below :
Perceptron Training Rule
 The perceptron training rule updates each weight according to
wi ← wi + Δwi, where Δwi = η (t − o) xi
Here t is the target output, o is the output generated by the perceptron, and η is a small positive constant called the learning rate.
Activation Functions of Perceptron
 The activation function applies a step rule (converting the numerical output into +1 or −1) to check whether the output of the weighting function is greater than zero.
Delta & Gradient Descent Rule
 The perceptron rule finds a successful weight vector when the training examples are linearly separable, but it can fail to converge if the examples are not linearly separable.
Delta & Gradient Descent Rule
 A second training rule, called the delta rule, is designed to overcome this difficulty.
 The key idea behind the delta rule is to use gradient descent to search the hypothesis space of possible weight vectors and find the weights that best fit the training examples.
 The delta training rule is best understood by considering the task of training an unthresholded perceptron; that is, a linear unit for which the output o is given by
o(x) = w0 + w1x1 + w2x2 + … + wnxn
Delta & Gradient Descent Rule
 In order to derive a weight learning rule for linear units, we specify a measure for the training error of a hypothesis (weight vector), relative to the training examples:
E(w) = ½ ∑d∈D (td − od)²
where D is the set of training examples, td is the target output for training example d, and od is the output of the linear unit for training example d.
 The direction of steepest descent can be found by computing the derivative of E with respect to each component of the vector w.
 This vector derivative is called the gradient of E with respect to w, written as
∇E(w) = [∂E/∂w0, ∂E/∂w1, …, ∂E/∂wn]
 The training rule for gradient descent is:
w ← w + Δw
where
Δw = −η ∇E(w)
Here η is a positive constant called the learning rate, which sets the step size. The negative sign is present because we want to move the weight vector in the direction that decreases E.
 This training rule can also be written in its component form as:
wi ← wi + Δwi
where
Δwi = −η ∂E/∂wi
 Finally, evaluating ∂E/∂wi for the error measure above gives the practical update rule:
Δwi = η ∑d∈D (td − od) xid
where xid is the value of input xi for training example d.
Gradient Descent Algorithm
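A minimal sketch of batch gradient descent for the linear unit above, implementing the final update Δwi = η ∑d (td − od) xid (the learning rate, epoch count, and training data are illustrative assumptions):

import numpy as np

def train_linear_unit(X, t, eta=0.05, epochs=100):
    """Batch gradient descent for an unthresholded linear unit o = w . x."""
    X = np.hstack([np.ones((X.shape[0], 1)), X])  # prepend x0 = 1 for w0
    w = np.zeros(X.shape[1])                      # initialise each wi to zero
    for _ in range(epochs):
        o = X @ w                 # outputs od for every training example d
        w += eta * X.T @ (t - o)  # delta rule: eta * sum_d (td - od) * xid
    return w

# Illustrative training data for the target t = 2*x + 1 (assumed)
X = np.array([[0.0], [1.0], [2.0], [3.0]])
t = np.array([1.0, 3.0, 5.0, 7.0])
print(train_linear_unit(X, t))  # approaches [1.0, 2.0]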
Derivation of Back-propagation
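As a sketch of what the derivation yields in practice, the following implements a single stochastic-gradient step for a one-hidden-layer sigmoid network, using the standard error terms δk = ok(1 − ok)(tk − ok) for output units and δh = oh(1 − oh) ∑k wkh δk for hidden units (the network shape and values are illustrative assumptions):

import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

def backprop_step(x, t, W1, W2, eta=0.1):
    """One stochastic-gradient step for a one-hidden-layer sigmoid network."""
    # Forward pass
    h = sigmoid(W1 @ x)                       # hidden outputs oh
    o = sigmoid(W2 @ h)                       # network outputs ok
    # Backward pass: error terms
    delta_o = o * (1 - o) * (t - o)           # output units: ok(1-ok)(tk-ok)
    delta_h = h * (1 - h) * (W2.T @ delta_o)  # hidden units: oh(1-oh)*sum_k wkh*delta_k
    # Weight updates: delta_wji = eta * delta_j * xji
    W2 += eta * np.outer(delta_o, h)
    W1 += eta * np.outer(delta_h, x)
    return W1, W2

rng = np.random.default_rng(1)
W1 = rng.normal(scale=0.5, size=(3, 2))  # 2 inputs -> 3 hidden units
W2 = rng.normal(scale=0.5, size=(1, 3))  # 3 hidden units -> 1 output
x, t = np.array([0.5, -1.0]), np.array([1.0])
W1, W2 = backprop_step(x, t, W1, W2)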
Generalization of Neural Networks
 Generalization describes a model's ability to react to new data: after being trained on a training set, the model can digest new data and make accurate predictions.
 A model's ability to generalize is central to its success.
 If a model has been trained too well on the training data, it will be unable to generalize. It will make inaccurate predictions when given new data, making the model useless even though it makes accurate predictions on the training data. This is called over-fitting. The inverse problem, under-fitting, also exists.
Generalization of Neural Networks
 Under-fitting happens when a model has not been trained enough on the data. Under-fitting makes the model just as useless: it is not capable of making accurate predictions, even on the training data.
 In any real-world application, the performance of an ANN mostly depends upon its generalization capability.
 Generalization of an ANN is its ability to handle unseen data.
 The generalization capability of the network is mostly determined by system complexity and the training of the network.
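
One common way to check generalization in practice is to hold out data that the model never sees during training and compare the training and test errors; a minimal sketch, assuming a simple linear model and synthetic data:

import numpy as np

# Illustrative data: a noisy line y = 2x + 1 (assumed for the demo)
rng = np.random.default_rng(2)
X = rng.uniform(-1, 1, size=40)
y = 2 * X + 1 + rng.normal(scale=0.1, size=40)

# Hold out 25% of the data; training never sees it
X_train, y_train = X[:30], y[:30]
X_test, y_test = X[30:], y[30:]

# Fit w, b by gradient descent on the training set only
w, b = 0.0, 0.0
for _ in range(500):
    err = y_train - (w * X_train + b)
    w += 0.1 * (err @ X_train) / len(X_train)
    b += 0.1 * err.mean()

def mse(Xs, ys):
    return np.mean((ys - (w * Xs + b)) ** 2)

# Similar train and test errors suggest the model generalizes;
# a low train error with a high test error indicates over-fitting.
print("train MSE:", mse(X_train, y_train), "test MSE:", mse(X_test, y_test))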
SOM Algorithm
 A Self-Organizing Map (SOM, or Kohonen Map) is a type of ANN.
 It follows an unsupervised learning approach and trains its network through a competitive learning algorithm.
 SOM is used for clustering and mapping (or dimensionality-reduction) techniques, mapping multidimensional data onto a lower-dimensional space, which reduces complex problems to forms that are easier to interpret.
 SOM has two layers: an input layer and an output layer.
SOM Algorithm
 The architecture of a Self-Organizing Map with two clusters and n input features per sample is shown below:
SOM Algorithm
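A simplified sketch of the competitive-learning loop behind SOM training, assuming two output clusters as in the architecture above; for brevity it updates only the winning unit and omits the neighbourhood function of a full SOM (data, learning rate, and epoch count are illustrative assumptions):

import numpy as np

def train_som(X, n_units=2, eta=0.5, epochs=20, seed=0):
    """Competitive learning: the winning (closest) unit moves toward the input."""
    rng = np.random.default_rng(seed)
    W = rng.uniform(size=(n_units, X.shape[1]))  # one weight vector per output unit
    for epoch in range(epochs):
        lr = eta * (1 - epoch / epochs)          # decay the learning rate over time
        for x in X:
            winner = np.argmin(np.linalg.norm(W - x, axis=1))  # best matching unit
            W[winner] += lr * (x - W[winner])    # pull the winner toward the input
    return W

# Illustrative data forming two obvious groups (assumed)
X = np.array([[0.1, 0.2], [0.0, 0.1], [0.9, 0.8], [1.0, 0.9]])
print(train_som(X))  # each row ends up near the centre of one group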
Deep Learning
 Deep learning is a machine learning technique.
 The inspiration for deep learning is the way the human brain filters information.
 The majority of modern deep learning architectures are based on ANNs.
Classification of Neural Networks:
 Shallow neural network: a shallow neural network has only one hidden layer between the input and output.
 Deep neural network: deep neural networks have more than one hidden layer. For instance, the GoogLeNet model for image recognition counts 22 layers.
 Nowadays, deep learning is used in many applications such as driverless cars, mobile phones, the Google Search engine, fraud detection, TVs, and so on.
