
Artificial Neural Networks
Introduction
 Artificial Neural Networks (ANN)
 Information processing paradigm inspired by biological nervous systems
 An ANN is composed of a system of neurons connected by synapses
 ANNs learn by example
 Adjust synaptic connections between neurons
History
 1943: McCulloch and Pitts model neural networks based on their understanding of neurology.
 Neurons embed simple logic functions:
 a or b
 a and b

 1950s:
 Farley and Clark
 IBM group that tried to model biological behavior
 Consulted neuroscientists at McGill whenever stuck
 Rochester, Holland, Haibt, and Duda

History
 Perceptron (Rosenblatt 1958)
 Three layer system:
 Input nodes
 Output node
 Association layer
 Can learn to connect or associate a given input to a random output unit
 Minsky and Papert
 Showed that a single layer perceptron cannot learn the XOR of two binary inputs
 Led to loss of interest (and funding) in the field
History
 Back-propagation learning method (Werbos 1974)
 Three layers of neurons
 Input, Output, Hidden
 Better learning rule for generic three-layer networks
 Renewed interest in the 1980s
 Successful applications in medicine, marketing, risk management, … (1990s)
 In need of another breakthrough.
ANN
 Promises
 Combine speed of silicon with proven success of carbon → artificial brains
Neuron Model
 Natural neurons
Neuron Model
 A neuron collects signals from its dendrites
 It sends out spikes of electrical activity through an axon, which splits into thousands of branches
 At the end of each branch, a synapse converts activity into either excitatory or inhibitory activity of a dendrite at another neuron
 A neuron fires when excitatory activity surpasses inhibitory activity
 Learning changes the effectiveness of the synapses
Neuron Model
 Abstract neuron model:
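In symbols (a standard formulation; the slide's figure is not reproduced here), a neuron with inputs $x_i$, weights $w_i$, and bias $b$ computes

$y = \varphi\left( \sum_i w_i x_i + b \right)$

where $\varphi$ is an activation function, discussed below.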
ANN Forward Propagation
 Bias Nodes
 Add one node to each layer that has constant output
 Forward propagation
 Calculate from input layer to output layer
 For each neuron:
 Calculate the weighted sum of its inputs
 Apply the activation function (see the sketch below)
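A minimal sketch of this per-neuron step in Python, assuming the logistic activation introduced on the next slides; the names phi and neuron_output are illustrative, and the bias node appears as a constant-1 input:

import math

def phi(nu):
    # Logistic activation; other firing rules are discussed below
    return 1.0 / (1.0 + math.exp(-nu))

def neuron_output(weights, inputs):
    # Weighted sum of the inputs, then the activation function
    net = sum(w * x for w, x in zip(weights, inputs))
    return phi(net)

# Bias node: a constant-1 input with its own weight
print(neuron_output([0.3, -0.7, 1.0], [0.5, 0.2, 1.0]))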
Neuron Model
 Firing Rules:
 Threshold rule:
 Calculate the weighted sum of the inputs
 Fire if it is larger than the threshold
 Perceptron rule
 Calculate the weighted sum of the inputs
 Output activation level is

$\varphi(\nu) = \begin{cases} 1 & \nu \ge \tfrac{1}{2} \\ \nu & 0 < \nu < \tfrac{1}{2} \\ 0 & \nu \le 0 \end{cases}$
Neuron Model
 Firing Rules: Sigmoid functions:
 Hyperbolic tangent function

$\varphi(\nu) = \tanh(\nu/2) = \dfrac{1 - \exp(-\nu)}{1 + \exp(-\nu)}$

 Logistic activation function

$\varphi(\nu) = \dfrac{1}{1 + \exp(-\nu)}$
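The firing rules above, written out as Python functions (a sketch; the logistic derivative is included because back-propagation will need it later):

import math

def threshold(nu, theta=0.5):
    # Threshold rule: fire iff the weighted sum reaches the threshold
    return 1.0 if nu >= theta else 0.0

def tanh_activation(nu):
    # tanh(nu/2) equals (1 - exp(-nu)) / (1 + exp(-nu))
    return math.tanh(nu / 2.0)

def logistic(nu):
    return 1.0 / (1.0 + math.exp(-nu))

def logistic_derivative(nu):
    # phi'(nu) = phi(nu) * (1 - phi(nu))
    s = logistic(nu)
    return s * (1.0 - s)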
ANN Forward Propagation
 Density plot of output (figure)
ANN Forward Propagation
ANN Forward Propagation
 The network can learn a non-linearly separable set of outputs.
 Need to map the output (a real value) into binary values.
ANN Training
 Weights are determined by training
 Back-propagation:
 On a given input, compare the actual output to the desired output
 Adjust the weights into the output nodes
 Work backwards through the various layers
 Start out with initial random weights
 Best to keep weights close to zero (<< 10)
ANN Training
 Weights are determined by training
 Need a training set
 Should be representative of the problem
 During each training epoch:
 Submit each training set element as input
 Calculate the error for the output neurons
 Calculate the average error during the epoch
 Adjust the weights
ANN Training
 Error is the (halved) sum of squared differences over the output layer

$E(\vec{x}) = \dfrac{1}{2} \sum_{k=1}^{K} \left( y_k(\vec{x}) - t_k(\vec{x}) \right)^2$

y – observed output
t – target output
ANN Training Example
(Network: inputs x1 and x2, hidden neurons 2 and 3, output neuron 4, plus a bias node; diagram not reproduced)

x1  x2  y     Error
0   0   0.69  0.472448
0   1   0.67  0.110583
1   0   0.70  0.0911618
1   1   0.68  0.457959

Average error is 0.283038
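The error column can be checked directly. A sketch assuming the targets are XOR(x1, x2) and that the table omits the factor 1/2 from the error formula; the recomputed values agree only approximately because the displayed y values are rounded:

rows = [  # (x1, x2, target t, observed y)
    (0, 0, 0, 0.69),
    (0, 1, 1, 0.67),
    (1, 0, 1, 0.70),
    (1, 1, 0, 0.68),
]
errors = [(y - t) ** 2 for _, _, t, y in rows]
print(errors)                     # approx. [0.476, 0.109, 0.090, 0.462]
print(sum(errors) / len(errors))  # approx. 0.284, close to 0.283038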
ANN Training Example
 Calculate the derivative of the error with respect to the weights and bias into the output layer neurons
ANN Training Example
New weights going into node 4:
We do this for all training inputs, then average out the changes.
net4 is the weighted sum of the inputs going into neuron 4:

net4(0,0) = 0.787754
net4(0,1) = 0.696717
net4(1,0) = 0.838124
net4(1,1) = 0.73877

(Network diagram with bias node not reproduced)
ANN Training
 ANN back-propagation is an empirical algorithm
ANN Training
 XOR is too simple an example, since the quality of an ANN is measured on a finite set of inputs.
 More relevant are ANNs that are trained on a training set and then unleashed on real data.
ANN Training
 Need to measure effectiveness of training
 Need training sets
 Need test sets
 There can be no interaction between test sets and training sets
 Example of a mistake:
 Train ANN on training set
 Test ANN on test set
 Results are poor
 Go back to training ANN
 After this, there is no assurance that ANN will work well in practice
 In a subtle way, the test set has become part of the training set
ANN Training
 Convergence
 ANN back-propagation uses gradient descent.
 Naïve implementations can
 overcorrect weights
 undercorrect weights
 In either case, convergence can be poor
 Stuck in the wrong place
 ANN starts with random weights and improves them
 If improvement stops, we stop algorithm
 No guarantee that we found the best set of weights
 Could be stuck in a local minimum
ANN Training
 Overtraining
 An ANN can be made to work too well on a training set
 But lose performance on test sets
(Figure: performance on the training set vs. the test set as a function of training time)
ANN Training
 Improving Convergence
 Many Operations Research Tools apply
 Simulated annealing
 Sophisticated gradient descent
ANN Design
 ANN is a largely empirical study
 "Seems to work in almost all cases that we know about"
 Known to be statistical pattern analysis
ANN Design
 Number of layers
 Apparently, three layers are almost always good enough and better than four layers
 Also: fewer layers are faster in execution and training
 How many hidden nodes?
 Many hidden nodes allow the network to learn more complicated patterns
 Because of overtraining, it is almost always best to start with too few hidden nodes and then increase their number
ANN Design
 Interpreting Output
 An ANN's output neurons do not give binary values
 Good or bad
 Need to define what counts as an accept
 Can indicate n degrees of certainty with n − 1 output neurons (see the sketch below)
 The number of firing output neurons is the degree of certainty
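Counting firing output neurons might look like this (a sketch; the 0.5 firing threshold is an assumption):

def degree_of_certainty(outputs, threshold=0.5):
    # n - 1 output neurons encode n degrees (0 .. n-1) of certainty
    return sum(1 for o in outputs if o > threshold)

print(degree_of_certainty([0.9, 0.8, 0.2]))  # 2 of 3 neurons fire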
ANN Applications
 Pattern recognition
 Network attacks
 Breast cancer
 …
 Handwriting recognition
 Pattern completion
 Auto-association
 ANN trained to reproduce input as output
 Noise reduction
 Compression
 Finding anomalies
 Time Series Completion
Pseudo-Code
 phi – activation function
 phid – derivative of activation function
Pseudo-Code
 Forward Propagation:
 Input nodes i, given input xi:
foreach input node i
    output_i = x_i
 Hidden layer nodes j
foreach hidden neuron j
    output_j = phi(Σ_i w_ji · output_i)
 Output layer neurons k
foreach output neuron k
    output_k = phi(Σ_j w_kj · output_j)
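One possible runnable version of this pseudo-code in Python: weight matrices are lists of per-neuron weight rows, and each row carries a trailing weight for the constant-1 bias node described earlier (the function and variable names are illustrative):

import math

def phi(nu):
    return 1.0 / (1.0 + math.exp(-nu))

def forward(x, w_hidden, w_output):
    # Input layer: outputs are the inputs themselves, plus the bias node
    layer = list(x) + [1.0]
    # Hidden layer: output_j = phi(sum_i w_ji * output_i)
    hidden = [phi(sum(w * o for w, o in zip(row, layer))) for row in w_hidden]
    hidden.append(1.0)  # bias node feeding the output layer
    # Output layer: output_k = phi(sum_j w_kj * output_j)
    return [phi(sum(w * o for w, o in zip(row, hidden))) for row in w_output]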
Pseudo-Code
ActivateLayer(input, output)
foreach input neuron i
    calculate output_i
foreach hidden neuron j
    calculate output_j
foreach output neuron k
    calculate output_k
output = {output_k}
Pseudo-Code
 Output Error
Error() {
    foreach input in InputSet
        Error_input = Σ_{k ∈ output neurons} (target_k − output_k)²
    return Average(Error_input over InputSet)
}
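The same computation in Python (a sketch; like the pseudo-code, it omits the factor 1/2 from the earlier error formula):

def output_error(outputs, targets):
    # Squared error summed over the output neurons, for one input
    return sum((t - o) ** 2 for t, o in zip(targets, outputs))

def average_error(all_outputs, all_targets):
    errors = [output_error(o, t) for o, t in zip(all_outputs, all_targets)]
    return sum(errors) / len(errors)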
Pseudo-Code
For each output neuron k calculate:

$\delta_k = \varphi'(net_k) \cdot (target_k - output_k)$

For each output neuron k and hidden layer neuron j calculate:

$\dfrac{\partial E}{\partial W_{kj}} = output_j \cdot \delta_k$
Pseudo-Code
For each hidden neuron j calculate:

$\delta_j = \varphi'(net_j) \cdot \sum_k \delta_k W_{kj}$

For each hidden neuron j and each input neuron i calculate:

$\dfrac{\partial E}{\partial W_{ji}} = output_i \cdot \delta_j$
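Both delta rules in code, as a sketch: phid is the activation derivative from the start of this section, net_out and net_hidden are the weighted sums saved during forward propagation, and w_output is indexed as w_output[k][j]:

def backprop_deltas(net_out, net_hidden, outputs, targets, w_output, phid):
    # Output layer: delta_k = phi'(net_k) * (target_k - output_k)
    delta_out = [phid(n) * (t - o)
                 for n, t, o in zip(net_out, targets, outputs)]
    # Hidden layer: delta_j = phi'(net_j) * sum_k delta_k * W_kj
    delta_hidden = [phid(net_hidden[j]) *
                    sum(d * w_output[k][j] for k, d in enumerate(delta_out))
                    for j in range(len(net_hidden))]
    return delta_out, delta_hidden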
Pseudo-Code
 These calculations were done for a single input.
 Now calculate the average gradient over all inputs (and for all weights).
 You also need to calculate the gradients for the bias weights and average them.
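A sketch of the resulting update step, assuming grad_sums[k][j] accumulates output_j · delta_k over all inputs (bias weights are handled the same way, since the bias is just a constant-1 input):

def apply_updates(weights, grad_sums, num_inputs, learning_rate):
    # With the slides' sign convention (delta uses target - output),
    # adding the averaged gradient term moves the error downhill
    for k, row in enumerate(weights):
        for j in range(len(row)):
            row[j] += learning_rate * grad_sums[k][j] / num_inputs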
ANN Training Example 2
 Start out with random, small weights

Node 0: x0
Node 1: x1
Node 2: o2 = φ(x0 + x1 − 0.5)
Node 3: o3 = φ(0.5·x1 − 1)
Node 4: o4 = φ(0.3·o2 − 0.7·o3 + 1)

(Network diagram with bias node not reproduced)
ANN Training Example 2
 Calculate outputs

x1  x2  y = o4
0   0   0.7160
0   1   0.7155
1   0   0.7308
1   1   0.7273
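These outputs can be reproduced in a few lines, assuming φ is the logistic function from earlier:

import math

def phi(nu):
    return 1.0 / (1.0 + math.exp(-nu))

def o4(x0, x1):
    o2 = phi(x0 + x1 - 0.5)
    o3 = phi(0.5 * x1 - 1.0)
    return phi(0.3 * o2 - 0.7 * o3 + 1.0)

for a, b in [(0, 0), (0, 1), (1, 0), (1, 1)]:
    print(a, b, round(o4(a, b), 4))  # matches the table above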
ANN Future
 ANNs can do some things really well
 They lack the structure found in most natural neural networks