Machine Learning: Lecture 4: Artificial Neural Networks (Based On Chapter 4 of Mitchell T.., Machine Learning, 1997)

1) Artificial neural networks are computational models inspired by biological neural networks and composed of interconnected nodes that perform simple computations. 2) Multilayer neural networks can learn complex patterns by adjusting the weights between nodes through backpropagation, which minimizes error by propagating output errors back through the network. 3) Backpropagation uses gradient descent to update weights between nodes in a feedforward network based on the delta rule, which calculates an error term that is then backpropagated to update weights and reduce overall error.

Machine Learning: Lecture 4

Artificial Neural Networks

(Based on Chapter 4 of Mitchell, T., Machine Learning, 1997)

What is an Artificial Neural Network?
• It is a formalism for representing functions, inspired by biological systems and composed of parallel computing units that each compute a simple function.
• Some useful computations taking place in feedforward multilayer neural networks are:
  • Summation
  • Multiplication
  • Threshold (e.g., $1/(1+e^{-x})$, the sigmoidal threshold function). Other functions are also possible.

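As a minimal sketch of how these three computations combine in one unit, the following Python snippet multiplies each input by a weight, sums the products, and applies the sigmoidal threshold. The function names `sigmoid` and `unit_output` and the example weights are illustrative assumptions, not part of the slides.

```python
import math


def sigmoid(x: float) -> float:
    """Sigmoidal threshold function 1 / (1 + e^-x)."""
    return 1.0 / (1.0 + math.exp(-x))


def unit_output(weights: list[float], inputs: list[float]) -> float:
    # Multiplication: each input is scaled by its weight.
    products = [w * x for w, x in zip(weights, inputs)]
    # Summation: the weighted inputs are added together.
    total = sum(products)
    # Threshold: the sum is squashed into the interval (0, 1).
    return sigmoid(total)


# The weighted sum here is 0.5 - 0.5 = 0, and sigmoid(0) = 0.5.
print(unit_output([0.5, -0.5], [1.0, 1.0]))  # 0.5
```
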
Biological Motivation
• Biological learning systems are built of very complex webs of interconnected neurons.
• The information-processing abilities of biological neural systems must follow from highly parallel processes operating on representations that are distributed over many neurons.
• ANNs attempt to capture this mode of computation.

Multilayer Neural Network Representation
[Figure: example networks for heteroassociation and autoassociation, with labeled input units, hidden units, output units, and weights.]

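How is a function computed by a Multilayer Neural Network?
• $h_j = g\left(\sum_i w_{ji} x_i\right)$
• $y_1 = g\left(\sum_j w_{kj} h_j\right)$
where $g(x) = 1/(1+e^{-x})$.
Typically, $y_1 = 1$ for a positive example and $y_1 = 0$ for a negative example.
[Figure: a network with input units $x_1, \dots, x_6$ (index $i$), hidden units $h_1, h_2, h_3$ (index $j$), and one output unit $y_1$ (index $k$); the $w_{ji}$ connect inputs to hidden units and the $w_{kj}$ connect hidden units to the output. An inset plots the sigmoid $g$, which rises from 0 to 1 and equals 1/2 at 0.]
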
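The two formulas on this slide can be sketched as a forward pass in plain Python. The layer sizes match the 6-3-1 network in the figure; the helper name `forward`, the nested-list weight layout, and the all-zero weights are assumptions chosen only to make the arithmetic easy to follow.

```python
import math


def g(x: float) -> float:
    """Sigmoid g(x) = 1 / (1 + e^-x)."""
    return 1.0 / (1.0 + math.exp(-x))


def forward(x, w_ji, w_kj):
    """Return (hidden activations h, output activations y)."""
    # h_j = g(sum_i w_ji * x_i), one row of w_ji per hidden unit
    h = [g(sum(w * xi for w, xi in zip(row, x))) for row in w_ji]
    # y_k = g(sum_j w_kj * h_j), one row of w_kj per output unit
    y = [g(sum(w * hj for w, hj in zip(row, h))) for row in w_kj]
    return h, y


# Hypothetical 6-3-1 network with all weights zero:
# every unit sees a net input of 0 and outputs g(0) = 0.5.
x = [1.0, 0.0, 1.0, 0.0, 1.0, 0.0]
w_ji = [[0.0] * 6 for _ in range(3)]
w_kj = [[0.0] * 3]
h, y = forward(x, w_ji, w_kj)
print(h, y)  # [0.5, 0.5, 0.5] [0.5]
```

With trained weights, an output near 1 would be read as a positive example and an output near 0 as a negative one.
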
Learning in Multilayer Neural Networks
• Learning consists of searching through the space of all possible matrices of weight values for a combination of weights that satisfies a database of positive and negative examples (multi-class as well as regression problems are possible).
• Note that a neural network model with a set of adjustable weights defines a restricted hypothesis space corresponding to a family of functions. The size of this hypothesis space can be increased or decreased by increasing or decreasing the number of hidden units present in the network.

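One rough way to see how the number of hidden units changes the space being searched is to count the adjustable weights. The sketch below does this for a single-hidden-layer network; treating the weight count as a proxy for the size of the hypothesis space, and giving every unit one bias weight, are both assumptions made for illustration.

```python
def num_weights(d: int, m: int, k: int, bias: bool = True) -> int:
    """Count adjustable weights in a d-input, m-hidden, k-output network."""
    extra = 1 if bias else 0
    # Input-to-hidden weights plus hidden-to-output weights.
    return m * (d + extra) + k * (m + extra)


# Six inputs and one output, as in the earlier example network.
for m in (1, 3, 10):
    print(m, num_weights(d=6, m=m, k=1))
# 1 9
# 3 25
# 10 81
```
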
Appropriate Problems for Neural Network Learning
• Instances are represented by many attribute-value pairs (e.g., the pixels of a picture; see ALVINN [Mitchell, p. 84]).
• The target function output may be discrete-valued, real-valued, or a vector of several real- or discrete-valued attributes.
• The training examples may contain errors.
• Long training times are acceptable.
• Fast evaluation of the learned target function may be required.
• The ability for humans to understand the learned target function is not important.

History of Neural Networks
• 1943: McCulloch and Pitts proposed a model of a neuron, which led to the Perceptron (read [Mitchell, section 4.4]).
• 1960s: Widrow and Hoff explored Perceptron networks (which they called "Adelines") and the delta rule.
• 1962: Rosenblatt proved the convergence of the perceptron training rule.
• 1969: Minsky and Papert showed that the Perceptron cannot deal with nonlinearly separable data sets, even those that represent simple functions such as XOR.
• 1970-1985: Very little research on neural nets.
• 1986: Invention of Backpropagation [Rumelhart and McClelland, but also Parker and, earlier on, Werbos], which can learn from nonlinearly separable data sets.
• Since 1985: A lot of research in neural nets!

Backpropagation: Purpose and Implementation
• Purpose: To compute the weights of a feedforward multilayer neural network adaptively, given a set of labeled training examples.
• Method: By minimizing the following cost function (the sum of squared errors):
$$E = \frac{1}{2} \sum_{n=1}^{N} \sum_{k=1}^{K} \left[ y_k^n - f_k(x^n) \right]^2$$
where $N$ is the total number of training examples, $K$ is the total number of output units (useful for multiclass problems), and $f_k$ is the function implemented by the neural net.

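The cost function can be written directly as a double sum. In this sketch, `targets[n][k]` stands for $y_k^n$ and `outputs[n][k]` for $f_k(x^n)$; the function name and the list-of-lists layout are assumptions for illustration.

```python
def sum_squared_error(targets, outputs):
    """E = 1/2 * sum over examples n and output units k of (y_k^n - f_k(x^n))^2."""
    return 0.5 * sum(
        (y - f) ** 2
        for y_n, f_n in zip(targets, outputs)
        for y, f in zip(y_n, f_n)
    )


# Two examples, one output unit each, with the network outputting 0.5 both times.
targets = [[1.0], [0.0]]
outputs = [[0.5], [0.5]]
print(sum_squared_error(targets, outputs))  # 0.25
```
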
Backpropagation: Overview
• Backpropagation works by applying the gradient descent rule to a feedforward network.
• The algorithm is composed of two parts that are repeated over and over until a preset maximum number of epochs, EPmax, is reached.
• Part I, the feedforward pass: the activation values of the hidden and then the output units are computed.
• Part II, the backpropagation pass: the weights of the network are updated, starting with the hidden-to-output weights and followed by the input-to-hidden weights, with respect to the sum of squared errors and through a series of weight update rules called the Delta Rule.

Backpropagation: The Delta Rule I
• For the hidden-to-output connections (easy case):
$$\Delta w_{kj} = -\eta \frac{\partial E}{\partial w_{kj}} = \eta \sum_{n=1}^{N} \left[ y_k^n - f_k(x^n) \right] g'(h_k^n)\, V_j^n = \eta \sum_{n=1}^{N} \delta_k^n V_j^n$$
with
• $\eta$ corresponding to the learning rate (an extra parameter of the neural net)
• $h_k^n = \sum_{j=0}^{M} w_{kj} V_j^n$, where $M$ is the number of hidden units
• $V_j^n = g\left(\sum_{i=0}^{d} w_{ji} x_i^n\right)$, where $d$ is the number of input units
• $\delta_k^n = g'(h_k^n)\left(y_k^n - f_k(x^n)\right)$

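For a single training example, the hidden-to-output rule can be sketched as below. The sketch assumes the sigmoid for $g$, so that $g'(h) = g(h)(1 - g(h))$, and assumes `V[0] = 1.0` plays the role of the $j = 0$ term; the function name `output_layer_update` is hypothetical.

```python
import math


def g(x: float) -> float:
    return 1.0 / (1.0 + math.exp(-x))


def g_prime(x: float) -> float:
    # Derivative of the sigmoid: g'(x) = g(x) * (1 - g(x)).
    s = g(x)
    return s * (1.0 - s)


def output_layer_update(V, w_kj, y, eta):
    """Return (delta_k, delta_w_kj) for one example.

    V: hidden activations, w_kj: one row of weights per output unit, y: targets.
    """
    deltas, delta_w = [], []
    for k, row in enumerate(w_kj):
        h_k = sum(w * v for w, v in zip(row, V))   # net input h_k
        d_k = g_prime(h_k) * (y[k] - g(h_k))       # delta_k = g'(h_k)(y_k - f_k)
        deltas.append(d_k)
        delta_w.append([eta * d_k * v for v in V])  # eta * delta_k * V_j
    return deltas, delta_w


# With zero weights, h_k = 0, f_k = 0.5 and g'(0) = 0.25.
deltas, delta_w = output_layer_update(V=[1.0, 0.5], w_kj=[[0.0, 0.0]], y=[1.0], eta=1.0)
print(deltas, delta_w)  # [0.125] [[0.125, 0.0625]]
```

Summing these per-example updates over all $N$ examples gives the batch form of the rule shown on the slide.
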
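Backpropagation: The Delta Rule II
• For the input-to-hidden connections (hard case: no pre-fixed values for the hidden units):
$$\Delta w_{ji} = -\eta \frac{\partial E}{\partial w_{ji}} = -\eta \sum_{n=1}^{N} \frac{\partial E}{\partial V_j^n} \frac{\partial V_j^n}{\partial w_{ji}} \quad \text{(Chain Rule)}$$
$$= \eta \sum_{k,n} \left[ y_k^n - f_k(x^n) \right] g'(h_k^n)\, w_{kj}\, g'(h_j^n)\, x_i^n$$
$$= \eta \sum_{k,n} \delta_k^n\, w_{kj}\, g'(h_j^n)\, x_i^n$$
$$= \eta \sum_{n=1}^{N} \delta_j^n x_i^n$$
with
• $h_j^n = \sum_{i=0}^{d} w_{ji} x_i^n$
• $\delta_j^n = g'(h_j^n) \sum_{k=1}^{K} w_{kj} \delta_k^n$
• and all the other quantities already defined
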
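The hidden-layer rule reuses the output deltas, which is the "propagating backward" step. The following sketch handles one training example, again assuming the sigmoid for $g$. The function name `hidden_layer_update` is hypothetical, and for brevity the weights are indexed as `w_kj[k][j]` with no separate bias column.

```python
import math


def g(x: float) -> float:
    return 1.0 / (1.0 + math.exp(-x))


def g_prime(x: float) -> float:
    s = g(x)
    return s * (1.0 - s)


def hidden_layer_update(x, w_ji, w_kj, delta_k, eta):
    """Return (delta_j, delta_w_ji) for one example."""
    deltas, delta_w = [], []
    for j, row in enumerate(w_ji):
        h_j = sum(w * xi for w, xi in zip(row, x))            # net input h_j
        # Error arriving at hidden unit j: sum_k w_kj * delta_k
        back = sum(w_kj[k][j] * delta_k[k] for k in range(len(delta_k)))
        d_j = g_prime(h_j) * back                             # delta_j
        deltas.append(d_j)
        delta_w.append([eta * d_j * xi for xi in x])          # eta * delta_j * x_i
    return deltas, delta_w


# One hidden unit with zero input weights, one output unit with delta_k = 0.125.
d_j, dw_ji = hidden_layer_update(
    x=[1.0, 2.0], w_ji=[[0.0, 0.0]], w_kj=[[0.5]], delta_k=[0.125], eta=1.0
)
print(d_j, dw_ji)  # [0.015625] [[0.015625, 0.03125]]
```
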
Backpropagation: The Algorithm
1. Initialize the weights to small random values; create a random pool of all the training patterns; set EP, the number of epochs of training, to 0.
2. Pick a training pattern from the remaining pool of patterns and propagate it forward through the network.
3. Compute the deltas, $\delta_k$, for the output layer.
4. Compute the deltas, $\delta_j$, for the hidden layer by propagating the error backward.
5. Update all the connections such that
   $w_{ji}^{New} = w_{ji}^{Old} + \Delta w_{ji}$ and $w_{kj}^{New} = w_{kj}^{Old} + \Delta w_{kj}$
6. If any pattern remains in the pool, then go back to Step 2. If all the training patterns in the pool have been used, then set EP = EP + 1, and if EP < EPmax, then create a random pool of patterns and go to Step 2. If EP = EPmax, then stop.

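The six steps can be sketched end to end in plain Python. This is one possible reading rather than a reference implementation: it updates the weights after every pattern, as Step 5 describes, uses the sigmoid for $g$, and treats $x_0 = 1$ and $V_0 = 1$ as bias terms. The function names, the initialization range of ±0.1, the seed, and the OR data set are all assumptions.

```python
import math
import random


def g(x):
    return 1.0 / (1.0 + math.exp(-x))


def forward(x, w_ji, w_kj):
    xb = [1.0] + x  # x_0 = 1 acts as a bias input
    V = [1.0] + [g(sum(w * a for w, a in zip(row, xb))) for row in w_ji]  # V_0 = 1
    f = [g(sum(w * v for w, v in zip(row, V))) for row in w_kj]
    return xb, V, f


def train(patterns, n_hidden, eta, ep_max, seed=0):
    rng = random.Random(seed)
    d, K = len(patterns[0][0]), len(patterns[0][1])
    # Step 1: small random weights; EP starts at 0.
    w_ji = [[rng.uniform(-0.1, 0.1) for _ in range(d + 1)] for _ in range(n_hidden)]
    w_kj = [[rng.uniform(-0.1, 0.1) for _ in range(n_hidden + 1)] for _ in range(K)]
    for _ in range(ep_max):  # Step 6: stop once EP reaches EPmax
        pool = patterns[:]
        rng.shuffle(pool)  # a fresh random pool for each epoch
        for x, y in pool:  # Step 2: pick a pattern and propagate it forward
            xb, V, f = forward(x, w_ji, w_kj)
            # Step 3: output deltas, using g'(h) = g(h)(1 - g(h)).
            d_k = [fk * (1 - fk) * (yk - fk) for yk, fk in zip(y, f)]
            # Step 4: hidden deltas (V[j + 1] is hidden unit j, V[0] is the bias).
            d_j = [
                V[j + 1] * (1 - V[j + 1]) * sum(w_kj[k][j + 1] * d_k[k] for k in range(K))
                for j in range(n_hidden)
            ]
            # Step 5: w_new = w_old + delta_w for both layers.
            for k in range(K):
                for j in range(n_hidden + 1):
                    w_kj[k][j] += eta * d_k[k] * V[j]
            for j in range(n_hidden):
                for i in range(d + 1):
                    w_ji[j][i] += eta * d_j[j] * xb[i]
    return w_ji, w_kj


def error(patterns, w_ji, w_kj):
    """Sum of squared errors E over all patterns."""
    return 0.5 * sum(
        (yk - fk) ** 2
        for x, y in patterns
        for yk, fk in zip(y, forward(x, w_ji, w_kj)[2])
    )


# Toy data: logical OR. With the same seed, ep_max=0 returns the initial weights.
or_data = [([0.0, 0.0], [0.0]), ([0.0, 1.0], [1.0]), ([1.0, 0.0], [1.0]), ([1.0, 1.0], [1.0])]
before = error(or_data, *train(or_data, n_hidden=2, eta=0.5, ep_max=0))
after = error(or_data, *train(or_data, n_hidden=2, eta=0.5, ep_max=2000))
print(before, after)
```

The hidden deltas are computed before the hidden-to-output weights change, so both layers are updated from the same forward pass.
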
Backpropagation: The Momentum
• To this point, Backpropagation has the disadvantage of being too slow if $\eta$ is small, and it can oscillate too widely if $\eta$ is large.
• To solve this problem, we can add a momentum term to give each connection some inertia, forcing it to change in the direction of the downhill "force".
• New Delta Rule:
$$\Delta w_{pq}(t+1) = -\eta \frac{\partial E}{\partial w_{pq}} + \alpha\, \Delta w_{pq}(t)$$
where $p$ and $q$ are any input and hidden, or hidden and output, units; $t$ is a time step or epoch; and $\alpha$ is the momentum parameter, which regulates the amount of inertia of the weights.

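The new delta rule amounts to a one-line change in the weight update. The sketch below applies it to a single weight on a toy error surface $E(w) = w^2/2$, for which $\partial E/\partial w = w$; the surface, the function name `momentum_delta`, and the values of `eta` and `alpha` are assumptions chosen for illustration.

```python
def momentum_delta(grad: float, prev_delta: float, eta: float, alpha: float) -> float:
    """Delta_w(t+1) = -eta * dE/dw + alpha * Delta_w(t)."""
    return -eta * grad + alpha * prev_delta


# Toy error surface E(w) = w^2 / 2, so the gradient at w is simply w.
w, delta = 1.0, 0.0
for t in range(3):
    delta = momentum_delta(grad=w, prev_delta=delta, eta=0.1, alpha=0.9)
    w += delta
    print(t, w)
```

Setting `alpha` to 0 recovers the plain gradient descent update, and each step with `alpha > 0` carries over part of the previous change.
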
