13 ANN (Artificial Neural Networks)
The future of AI
Restaurant Data Set
Limited Expressiveness of Perceptrons
The XOR affair
• Minsky and Papert (1969) showed that certain simple
functions cannot be represented by a perceptron
(e.g. Boolean XOR).
This result killed the field for over a decade!
• Mid-1980s: non-linear neural networks
(Rumelhart et al. 1986) revived the field
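To make the XOR limitation concrete: no single layer of weights can separate XOR, but one hidden layer of threshold units suffices. Below is a minimal hand-constructed sketch (weights chosen by hand rather than learned; the OR/NAND labels are ours):

```python
def step(z):
    """Hard threshold unit: fires iff its weighted input is positive."""
    return 1 if z > 0 else 0

def xor_net(x1, x2):
    """XOR computed by one hidden layer of threshold units (weights set by hand)."""
    h1 = step(x1 + x2 - 0.5)        # OR-like hidden unit
    h2 = step(-x1 - x2 + 1.5)       # NAND-like hidden unit
    return step(h1 + h2 - 1.5)      # AND of the two hidden units

for a in (0, 1):
    for b in (0, 1):
        print(a, b, xor_net(a, b))  # last column prints 0, 1, 1, 0
```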
Neural Networks
• Rich history, starting in the early 1940s
(McCulloch and Pitts 1943).
• Two views:
– Modeling the brain
– “Just” representation of complex functions
(Continuous; contrast decision trees)
• Much progress on both fronts.
• Has drawn interest from neuroscience,
cognitive science, AI, physics, statistics, and
CS/EE.
Neuron
Neural Structure
1. Cell body; one axon (delivers output to other connected neurons); many
dendrites (provide surface area for connections from other neurons).
2. The axon is a single long fiber, often 100 or more times the diameter of the
cell body in length. The axon connects via synapses to the dendrites of other cells.
Activation Functions:
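Two activation functions commonly used with such a unit are the hard step (threshold) function of the classic perceptron and the smooth sigmoid used for gradient-based training. A minimal sketch (the function names are ours):

```python
import numpy as np

def step(z):
    """Hard threshold: output 1 iff the weighted input exceeds 0."""
    return np.where(z > 0, 1.0, 0.0)

def sigmoid(z):
    """Smooth, differentiable 'soft threshold' with output in (0, 1)."""
    return 1.0 / (1.0 + np.exp(-z))

def unit_output(w, x, activation=sigmoid):
    """Output of a single unit: activation applied to the weighted sum of inputs."""
    return activation(np.dot(w, x))
```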
Backpropagation Training (Overview)
Training data:
– (x1, y1), …, (xn, yn), with target labels yz ∈ {0, 1}
Optimization Problem (single output neuron):
– Variables: network weights wij
– Obj.: minimize over w:  E = Σz=1..n (yz − o(xz))²
– Constraints: none
Algorithm: local search via gradient descent.
• Randomly initialize weights.
• Until performance is satisfactory,
– Compute the partial derivative ∂E/∂wij of the objective
function E for each weight wij
– Update each weight: wij ← wij − α · ∂E/∂wij (α is the learning rate)
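A minimal sketch of this procedure for a single sigmoid output unit with no hidden layer, assuming NumPy; the learning rate alpha, the epoch count, and the initialization scale are illustrative choices, not values from the slides:

```python
import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

def train(X, y, alpha=0.5, epochs=1000):
    """Gradient descent on E = sum_z (yz - o(xz))^2 for a single sigmoid unit."""
    rng = np.random.default_rng(0)
    w = rng.normal(scale=0.1, size=X.shape[1])   # randomly initialize weights
    b = 0.0
    for _ in range(epochs):
        o = sigmoid(X @ w + b)                   # network outputs o(xz)
        err = y - o                              # yz - o(xz)
        # dE/dwj = sum_z -2 * err_z * o_z * (1 - o_z) * x_zj
        grad_w = -2 * (err * o * (1 - o)) @ X
        grad_b = -2 * np.sum(err * o * (1 - o))
        w -= alpha * grad_w                      # wj <- wj - alpha * dE/dwj
        b -= alpha * grad_b
    return w, b
```

With hidden layers the same update is applied to every weight; propagating the error terms backwards through the layers to obtain the partial derivatives is what gives backpropagation its name.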
Smooth and Differentiable Threshold Function
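The standard choice here is the sigmoid σ(z) = 1/(1 + e^(−z)): it behaves like a softened step function, and its derivative can be written in terms of its own output, σ′(z) = σ(z)(1 − σ(z)), which keeps the gradient computation cheap. A small sanity-check sketch:

```python
import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

# The analytic derivative sigma'(z) = sigma(z) * (1 - sigma(z)) agrees with a
# numerical central-difference estimate.
z = np.linspace(-5.0, 5.0, 11)
analytic = sigmoid(z) * (1 - sigmoid(z))
numeric = (sigmoid(z + 1e-6) - sigmoid(z - 1e-6)) / 2e-6
assert np.allclose(analytic, numeric, atol=1e-5)
```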
• Given too many hidden units, a neural net will simply memorize the
input patterns (overfitting).
• Given too few hidden units, the network may not be able to
represent all of the necessary generalizations (underfitting).
How long should you train the net?
• If you train the net for too long, then you run the risk of
overfitting.
– Select the number of training iterations via cross-validation on a
held-out set (early stopping, sketched below).
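A minimal sketch of that early-stopping loop, assuming NumPy; update_step, eval_error, max_iters, and patience are all placeholder names and illustrative values:

```python
import numpy as np

def train_with_early_stopping(update_step, eval_error, max_iters=10000, patience=50):
    """Stop training once the error on the held-out set stops improving.

    update_step(): performs one weight update on the training data.
    eval_error(): returns the current error on the held-out (validation) data.
    Both are assumed to be closures over the network being trained.
    """
    best_err = np.inf
    best_iter = 0
    for t in range(max_iters):
        update_step()
        err = eval_error()
        if err < best_err:              # still improving on held-out data
            best_err, best_iter = err, t
        elif t - best_iter > patience:  # no improvement for a while -> stop
            break
    return best_iter, best_err
```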
Regularization
• Simpler models are better
• NNs with smaller/fewer weights are better
– Add a penalty on the total sum of absolute weights to the training objective
– Pareto-optimize the trade-off between data fit and the weight penalty
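A hedged sketch of that penalty: the sum of absolute weights (an L1 penalty) added to the squared-error objective. The coefficient lam and the function names are illustrative; lam is typically tuned on held-out data:

```python
import numpy as np

def regularized_loss(w, X, y, lam=0.01):
    """Squared error plus a penalty on the total sum of absolute weights."""
    o = 1.0 / (1.0 + np.exp(-(X @ w)))   # sigmoid outputs of a single-unit net
    data_term = np.sum((y - o) ** 2)     # fit to the training data
    penalty = lam * np.sum(np.abs(w))    # prefer smaller / fewer weights
    return data_term + penalty
```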
Design Decisions
• Choice of learning rate
• Stopping criterion – when should training stop?
• Network architecture
– How many hidden layers? How many hidden units
per layer?
– How should the units be connected? (Fully? Partially?
Using domain knowledge?)
• How many random restarts of the search (to escape local
optima) are needed to find a good optimum of the objective function?
Spiking Nets
• Represent continuous values using spike rates
– A unit emits a spike if the number of incoming spikes exceeds a threshold
– Implemented as a leaky counter (sketched below)
http://www.ine-news.org
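A minimal sketch of the leaky-counter idea (a leaky integrate-and-fire unit); the decay and threshold values are illustrative, not from the slides:

```python
def lif_neuron(incoming_spikes, decay=0.9, threshold=3.0):
    """Leaky integrate-and-fire unit: a leaky counter that spikes above a threshold.

    incoming_spikes: number of spikes arriving at each time step.
    Returns the output spike train (1 = spike, 0 = no spike).
    """
    v = 0.0                      # membrane potential (the "leaky counter")
    out = []
    for spikes in incoming_spikes:
        v = decay * v + spikes   # leak a little, then integrate incoming spikes
        if v > threshold:        # enough recent input -> emit a spike
            out.append(1)
            v = 0.0              # reset after firing
        else:
            out.append(0)
    return out

# A dense burst of input produces output spikes; sparse input leaks away.
print(lif_neuron([2, 2, 1, 0, 0, 1, 0, 4]))   # [0, 1, 0, 0, 0, 0, 0, 1]
```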
Spiking (figure from http://www.cs.uu.nl/research/techreps/repo/CS-2003/2003-008.pdf)
Recurrent networks
• Nodes connect
– Laterally
– Backwards
– To themselves
• Complex behavior
– Dynamics, Memory
www.stowa-nn.ihe.nl/ANN.htm
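A minimal sketch of a recurrent update, assuming NumPy: the hidden state is fed back into itself at each step, so the current output depends on the whole input history, which is the memory and dynamics mentioned above. Shapes and names are illustrative:

```python
import numpy as np

def run_recurrent(xs, W_in, W_rec, b):
    """Minimal recurrent layer: each node's output feeds back into the layer.

    xs: sequence of input vectors. The returned hidden states depend on the
    entire history of inputs, which is what gives the network memory.
    """
    h = np.zeros(W_rec.shape[0])
    states = []
    for x in xs:
        h = np.tanh(W_in @ x + W_rec @ h + b)   # lateral / backward / self connections
        states.append(h)
    return states
```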
Learning Network Topology
• Optimal Brain Damage algorithm: prunes a network
– Trains a fully connected network
– Removes the connections and nodes that contribute least
to performance, using information-theoretic criteria
– Repeats until performance starts decreasing
(a simplified pruning sketch follows this list)
• Tiling algorithm: Grows networks
– Start with a small network that classifies many
examples
– Repeatedly add more nodes to classify remaining
examples
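A simplified sketch of the Optimal Brain Damage pruning loop described above, assuming NumPy. For brevity the "contribution" of a connection is approximated here by its weight magnitude; the actual algorithm uses a second-derivative (saliency) estimate. The names eval_performance and retrain are placeholders for the surrounding training code:

```python
import numpy as np

def prune_network(weights, eval_performance, retrain, fraction=0.05):
    """Iteratively remove the least-important connections until performance drops.

    weights: flat array of connection weights of a trained, fully connected net.
    eval_performance(weights) -> float; retrain(weights) -> weights.
    """
    best_perf = eval_performance(weights)
    while True:
        candidate = weights.copy()
        alive = np.flatnonzero(candidate)
        # Approximate "contributes least" by smallest |w| (OBD uses saliency instead).
        k = max(1, int(fraction * alive.size))
        smallest = alive[np.argsort(np.abs(candidate[alive]))[:k]]
        candidate[smallest] = 0.0
        candidate = retrain(candidate)           # retrain the pruned network
        perf = eval_performance(candidate)
        if perf < best_perf:                     # performance starts decreasing -> stop
            return weights
        weights, best_perf = candidate, perf
```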
Hyper-Networks
• Use a network to generate a network
– E.g., to determine connection weight wij, use a network that
takes i and j as inputs and produces wij.
– In 2D: the generating network maps each coordinate pair (i, j) to a weight (sketched below)
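A minimal sketch of the idea, assuming NumPy: a small MLP takes the coordinates (i, j) of a connection in a target network and outputs the proposed weight wij. Layer sizes and the random initialization are illustrative:

```python
import numpy as np

def make_hypernetwork(hidden=16, seed=0):
    """Tiny hyper-network: an MLP mapping a connection's coordinates (i, j)
    to the weight wij of a target network."""
    rng = np.random.default_rng(seed)
    W1 = rng.normal(scale=0.5, size=(hidden, 2))
    b1 = np.zeros(hidden)
    W2 = rng.normal(scale=0.5, size=hidden)

    def weight_for(i, j):
        h = np.tanh(W1 @ np.array([float(i), float(j)]) + b1)
        return float(W2 @ h)                     # proposed weight wij
    return weight_for

# Fill in the weight matrix of a hypothetical 4x3 target layer.
g = make_hypernetwork()
W_target = np.array([[g(i, j) for j in range(3)] for i in range(4)])
```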