
Artificial Neural Network:

Artificial neural networks (ANNs) provide a general, practical method for learning real-valued, discrete-valued, and
vector-valued functions from examples. Algorithms such as BACKPROPAGATION use gradient descent to tune
network parameters to best fit a training set of input-output pairs. ANN learning is robust to errors in the training
data and has been successfully applied to problems such as interpreting visual scenes, speech recognition, and
learning robot control strategies.

Biological Motivation

Biological learning systems are built of very complex webs of interconnected neurons. In rough
analogy, artificial neural networks are built out of a densely interconnected set of simple units, where
each unit takes a number of real-valued inputs (possibly the outputs of other units) and produces a
single real-valued output (which may become the input to many other units).

To develop a feel for this analogy, let us consider a few facts from neurobiology. The human brain, for example, is
estimated to contain a densely interconnected network of approximately 10^11 neurons, each connected, on average,
to 10^4 others. Neuron activity is typically excited or inhibited through connections to other neurons. The fastest
neuron switching times are known to be on the order of 10^-3 seconds, quite slow compared to computer switching
speeds. Yet humans are able to make surprisingly complex decisions, surprisingly quickly. For example, it requires
approximately 10^-1 seconds to visually recognize your mother. Notice that the sequence of neuron firings that can
take place during this 1/10 second interval cannot possibly be longer than a few hundred steps, given the switching
speed of single neurons (0.1 s divided by roughly 10^-3 s per firing allows only on the order of 100 sequential steps).

One motivation for ANN systems is to capture this kind of highly parallel computation based on distributed
representations. Most ANN software runs on sequential machines emulating distributed processes, although faster
versions of the algorithms have also been implemented on highly parallel machines and on specialized hardware
designed specifically for ANN applications.

Inconsistencies between ANNs and biological systems: the ANNs we consider here have individual units that output a
single constant value, whereas biological neurons output a complex time series of spikes.

NEURAL NETWORK REPRESENTATIONS:

A prototypical example of ANN learning is provided by Pomerleau's (1993) system ALVINN, which uses a learned
ANN to steer an autonomous vehicle driving at normal speeds on public highways. The input to the neural network
is a 30 x 32 grid of pixel intensities obtained from a forward-pointed camera mounted on the vehicle. The network
output is the direction in which the vehicle is steered. The ANN is trained to mimic the observed steering commands
of a human driving the vehicle for approximately 5 minutes. ALVINN has used its learned networks to successfully
drive at speeds up to 70 miles per hour and for distances of 90 miles on public highways (driving in the left lane of a
divided public highway, with other vehicles present).
The BACKPROPAGATION algorithm is the most commonly used ANN learning technique. It is appropriate for
problems with the following characteristics:

1. Instances are represented by many attribute-value pairs. The target function to be learned is defined
over instances that can be described by a vector of predefined features, such as the pixel values in the
ALVINN example. These input attributes may be highly correlated or independent of one another. Input
values can be any real values.
2. The target function output may be discrete-valued, real-valued, or a vector of several real- or
discrete-valued attributes. For example, in the ALVINN system the output is a vector of 30 attributes,
each corresponding to a recommendation regarding the steering direction. The value of each output is some
real number between 0 and 1, which in this case corresponds to the confidence in predicting the
corresponding steering direction.
3. The training examples may contain errors. ANN learning methods are quite robust to noise in the
training data.
4. Long training times are acceptable. Network training algorithms typically require longer training times
than, say, decision tree learning algorithms. Training times can range from a few seconds to many hours,
depending on factors such as the number of weights in the network, the number of training examples
considered, and the settings of various learning algorithm parameters.
5. Fast evaluation of the learned target function may be required. Although ANN learning times are
relatively long, evaluating the learned network, in order to apply it to a subsequent instance, is typically
very fast. For example, ALVINN applies its neural network several times per second to continually update
its steering command as the vehicle drives forward.
6. The ability of humans to understand the learned target function is not important. The weights learned
by neural networks are often difficult for humans to interpret.

Representational Power of Perceptrons:

The perceptron can be viewed as representing a hyperplane decision surface in the n-dimensional space of instances (i.e., points).
The perceptron outputs a 1 for instances lying on one side of the hyperplane and outputs a -1 for instances lying on
the other side. The equation for this decision hyperplane is w · x = 0. Of course, some sets of positive and negative
examples cannot be separated by any hyperplane. Those that can be separated are called linearly separable sets of
examples.
A single perceptron can be used to represent many boolean functions. For example, if we assume boolean values of
1 (true) and -1 (false), then one way to use a two-input perceptron to implement the AND function is to set the
weights w_0 = -0.8 and w_1 = w_2 = 0.5. This perceptron can be made to represent the OR function instead by altering the
threshold to w_0 = -0.3. In fact, AND and OR can be viewed as special cases of m-of-n functions: that is, functions
where at least m of the n inputs to the perceptron must be true. The OR function corresponds to m = 1 and the AND
function to m = n.
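
As a quick check, here is a minimal Python sketch of such a two-input perceptron. Note that the thresholds above separate the cases correctly when the inputs are encoded as 0 and 1; the unit's output uses the 1/-1 convention:

```python
# A minimal sketch of a two-input perceptron implementing AND and OR,
# assuming a 0/1 input encoding; outputs follow the 1 (true) / -1 (false)
# convention from the text.

def perceptron(weights, bias, inputs):
    """Output 1 if the weighted sum exceeds the threshold, else -1."""
    activation = bias + sum(w * x for w, x in zip(weights, inputs))
    return 1 if activation > 0 else -1

AND = lambda x1, x2: perceptron([0.5, 0.5], -0.8, [x1, x2])  # w0 = -0.8
OR  = lambda x1, x2: perceptron([0.5, 0.5], -0.3, [x1, x2])  # w0 = -0.3

for x1 in (0, 1):
    for x2 in (0, 1):
        print(x1, x2, "AND:", AND(x1, x2), "OR:", OR(x1, x2))
```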

Perceptrons can represent all of the primitive boolean functions AND, OR, NAND, and NOR. Unfortunately,
however, some boolean functions cannot be represented by a single perceptron, such as the XOR function, whose
value is 1 if and only if x_1 != x_2. However, every boolean function can be represented by some network of perceptrons
only two levels deep, in which the inputs are fed to multiple units, and the outputs of these units are then input to a
second, final stage.
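
For instance, XOR can be decomposed as (x_1 OR x_2) AND (x_1 NAND x_2), which maps directly onto a two-level network. A minimal sketch follows; the particular NAND weights and the 0/1 input encoding are illustrative choices, and the output-level AND unit is weighted for the 1/-1 values the hidden units emit:

```python
# A sketch of a two-level perceptron network computing XOR as
# (x1 OR x2) AND (x1 NAND x2). Inputs are encoded as 0/1; each unit
# outputs 1 or -1.

def unit(bias, weights, inputs):
    s = bias + sum(w * x for w, x in zip(weights, inputs))
    return 1 if s > 0 else -1

def xor(x1, x2):
    h_or   = unit(-0.3, [0.5, 0.5], [x1, x2])     # fires unless both inputs are 0
    h_nand = unit(0.8, [-0.5, -0.5], [x1, x2])    # fires unless both inputs are 1
    return unit(-0.8, [0.5, 0.5], [h_or, h_nand])  # AND over the 1/-1 hidden outputs

for x1 in (0, 1):
    for x2 in (0, 1):
        print(x1, x2, "XOR:", xor(x1, x2))
```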

The Perceptron Training Rule:

The precise learning problem is to determine a weight vector that causes the perceptron to produce the correct ±1
output for each of the given training examples. Several algorithms are known to solve this learning problem. Here
we consider two: the perceptron rule and the delta rule. These two algorithms are guaranteed to converge to
somewhat different acceptable hypotheses, under somewhat different conditions. They are important to ANNs
because they provide the basis for learning networks of many units. One way to learn an acceptable weight vector is
to begin with random weights, then iteratively apply the perceptron to each training example, modifying the
perceptron weights whenever it misclassifies an example.

Weights are modified at each step according to the perceptron training rule, which revises the weight w_i associated
with input x_i according to the rule:

w_i ← w_i + Δw_i, where Δw_i = η(t - o)x_i

Here t is the target output for the current training example, o is the output generated by the perceptron, and η is a
positive constant called the learning rate. The role of the learning rate is to moderate the degree to which weights are
changed at each step. It is usually set to some small value (e.g., 0.1).
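
A minimal sketch of this rule in Python, assuming training data given as (inputs, target) pairs with targets in {1, -1}; the dataset and parameter values are illustrative:

```python
# Perceptron training rule: start with small random weights and repeatedly
# apply w_i <- w_i + eta * (t - o) * x_i; the update is zero whenever the
# example is classified correctly (t == o).
import random

def train_perceptron(examples, eta=0.1, epochs=100):
    n = len(examples[0][0])
    w = [random.uniform(-0.05, 0.05) for _ in range(n + 1)]  # w[0] is the bias
    for _ in range(epochs):
        for x, t in examples:
            o = 1 if w[0] + sum(wi * xi for wi, xi in zip(w[1:], x)) > 0 else -1
            w[0] += eta * (t - o)                 # bias input is always 1
            for i, xi in enumerate(x):
                w[i + 1] += eta * (t - o) * xi
    return w

# Example: learn the linearly separable AND function (0/1 inputs, 1/-1 targets).
data = [((0, 0), -1), ((0, 1), -1), ((1, 0), -1), ((1, 1), 1)]
print(train_perceptron(data))
```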

Gradient Descent and the Delta Rule:

The delta rule is best understood by considering the task of training an unthresholded perceptron, that is, a linear
unit whose output is o = w · x. To derive a weight learning rule, we begin by specifying a measure of the training
error of a weight vector:

E(w) = (1/2) Σ_{d ∈ D} (t_d - o_d)^2

where D is the set of training examples, t_d is the target output for training example d, and o_d is the output of the
linear unit for training example d. By this definition, E is simply half the squared difference between the target
output t_d and the unit output o_d, summed over all training examples. Here we characterize E as a function of the
weight vector, because the linear unit output o depends on this weight vector.

Derivation of Gradient Descent:

Gradient descent searches the space of possible weight vectors by starting with an arbitrary initial weight vector,
then repeatedly altering it in the direction that produces the steepest descent along the error surface, i.e., in the
direction opposite the gradient of E with respect to the weights:

∇E(w) = [∂E/∂w_0, ∂E/∂w_1, ..., ∂E/∂w_n]

The training rule is therefore w_i ← w_i + Δw_i, with Δw_i = -η ∂E/∂w_i. Differentiating the definition of E above
gives ∂E/∂w_i = Σ_{d ∈ D} (t_d - o_d)(-x_id), so the gradient descent weight update becomes

Δw_i = η Σ_{d ∈ D} (t_d - o_d) x_id

where x_id denotes the single input component x_i for training example d.

Because the error surface contains only a single global minimum, this algorithm will converge to a weight vector
with minimum error, regardless of whether the training examples are linearly separable, provided a sufficiently
small learning rate is used.
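
A hedged sketch of this batch update in Python: the training loop below accumulates the gradient contribution (t_d - o_d)x_id over all of D before changing the weights. The toy dataset and learning rate are illustrative assumptions:

```python
# Batch gradient descent for a linear unit: one weight update per full pass
# over the training set D, using Delta w_i = eta * sum_d (t_d - o_d) * x_id.

def train_linear_unit(examples, eta=0.05, epochs=500):
    n = len(examples[0][0])
    w = [0.0] * (n + 1)  # w[0] is the bias weight (its input is always 1)
    for _ in range(epochs):
        delta = [0.0] * (n + 1)
        for x, t in examples:
            o = w[0] + sum(wi * xi for wi, xi in zip(w[1:], x))  # linear output
            delta[0] += eta * (t - o)
            for i, xi in enumerate(x):
                delta[i + 1] += eta * (t - o) * xi
        w = [wi + dwi for wi, dwi in zip(w, delta)]  # single update per epoch
    return w

# Fit the target t = 1 + 2*x1 - x2 from a few noise-free examples.
data = [((0, 0), 1), ((1, 0), 3), ((0, 1), 0), ((1, 1), 2), ((2, 1), 4)]
print(train_linear_unit(data))  # should approach [1, 2, -1]
```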

STOCHASTIC APPROXIMATION TO GRADIENT DESCENT:

Gradient descent is an important general paradigm for learning. It is a strategy for searching through a large or
infinite hypothesis space that can be applied whenever (1) the hypothesis space contains continuously parameterized
hypotheses (e.g., the weights in a linear unit), and (2) the error can be differentiated with respect to these hypothesis
parameters. The key practical difficulties in applying gradient descent are (1) converging to a local minimum can
sometimes be quite slow (i.e., it can require many thousands of gradient descent steps), and (2) if there are multiple
local minima in the error surface, then there is no guarantee that the procedure will find the global minimum.

One common variation on gradient descent intended to alleviate these difficulties is called incremental gradient
descent, or alternatively stochastic gradient descent. Whereas the gradient descent training rule presented in
Equation (4.7) computes weight updates after summing over all the training examples in D, the idea behind
stochastic gradient descent is to approximate this gradient descent search by updating weights incrementally,
following the calculation of the error for each individual example.
The key differences between standard gradient descent and stochastic gradient descent are:

1. In standard gradient descent, the error is summed over all examples before updating weights, whereas in
stochastic gradient descent weights are updated upon examining each training example.
2. Summing over multiple examples in standard gradient descent requires more computation per weight
update step. On the other hand, because it uses the true gradient, standard gradient descent is often used
with a larger step size per weight update than stochastic gradient descent.
3. In cases where there are multiple local minima with respect to E, stochastic gradient descent can
sometimes avoid falling into these local minima because it uses the various per-example errors E_d rather
than E to guide its search.
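
To make the contrast with the batch rule concrete, here is the same linear unit trained with the stochastic update Δw_i = η(t_d - o_d)x_id applied after each individual example; the dataset and parameter values are again illustrative:

```python
# Stochastic (incremental) gradient descent for a linear unit: the weights
# are updated immediately after computing the error for each example d,
# rather than after summing over all of D.

def train_linear_unit_sgd(examples, eta=0.05, epochs=500):
    n = len(examples[0][0])
    w = [0.0] * (n + 1)  # w[0] is the bias weight
    for _ in range(epochs):
        for x, t in examples:
            o = w[0] + sum(wi * xi for wi, xi in zip(w[1:], x))
            w[0] += eta * (t - o)          # update now, one example at a time
            for i, xi in enumerate(x):
                w[i + 1] += eta * (t - o) * xi
    return w

data = [((0, 0), 1), ((1, 0), 3), ((0, 1), 0), ((1, 1), 2), ((2, 1), 4)]
print(train_linear_unit_sgd(data))  # also approaches [1, 2, -1]
```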

MULTILAYER NETWORKS AND THE BACKPROPAGATION ALGORITHM

The multilayer networks learned by the BACKPROPAGATION algorithm are capable of expressing a rich
variety of nonlinear decision surfaces.

A Differentiable Threshold Unit: We need a unit whose output is a nonlinear function of its inputs, but
whose output is also a differentiable function of its inputs. One solution is the sigmoid unit, a unit very
much like a perceptron, but based on a smoothed, differentiable threshold function. Like the perceptron, the
sigmoid unit first computes a linear combination of its inputs, then applies a threshold to the result; in the
case of the sigmoid unit, however, the threshold output is a continuous function of its input. More precisely,
the sigmoid unit computes its output as

o = σ(w · x), where σ(y) = 1 / (1 + e^(-y))

σ is often called the sigmoid function or, alternatively, the logistic function. Note its output ranges between 0
and 1, increasing monotonically with its input.
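
A small sketch of a sigmoid unit in Python; the example weights are illustrative. The comment also notes the derivative σ'(y) = σ(y)(1 - σ(y)), which is what makes this unit convenient for gradient-based learning:

```python
# A sigmoid unit: a linear combination of the inputs passed through the
# logistic function sigma(y) = 1 / (1 + e^(-y)). Its derivative has the
# convenient form sigma'(y) = sigma(y) * (1 - sigma(y)), which
# BACKPROPAGATION exploits when computing gradients.
import math

def sigmoid(y):
    return 1.0 / (1.0 + math.exp(-y))

def sigmoid_unit(weights, bias, inputs):
    net = bias + sum(w * x for w, x in zip(weights, inputs))
    return sigmoid(net)  # smooth output in (0, 1), unlike the perceptron's hard 1/-1

print(sigmoid_unit([0.5, 0.5], -0.8, [1.0, 1.0]))  # ~0.55, just above the boundary
```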
