Exercises On Backpropagation
Laurenz Wiskott
Institut für Neuroinformatik
Ruhr-Universität Bochum, Germany, EU
30 January 2017
Contents

1 Supervised learning
1.1 Introduction
1.4 Online learning rule
1.5 Examples

2 Supervised learning in multilayer networks
2.1.1 Exercise: Chain rule in a three-layer network
Teaching/Material/, where you can also find other teaching material such as programming exercises. The table of contents of the lecture notes is reproduced here to give an orientation as to when the exercises can reasonably be solved. For the best learning effect, I recommend first seriously trying to solve the exercises yourself before looking at the solutions.
1 Supervised learning
1.1 Introduction
Let $y^\mu$ be the scalar output of a network for a training pattern indexed with $\mu$, and $s^\mu$ the required output value. The error of a network over all $M$ training patterns is often defined as

$$E_2 := \frac{1}{M} \sum_{\mu} \frac{1}{2} \left( y^\mu - s^\mu \right)^2 . \qquad (1)$$
What are the advantages and disadvantages of these three measures? Calculate the derivative with respect to $y^1$. What is the role of the parameter ($> 0$)?
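As a quick check of the derivative asked for above, here is a minimal NumPy sketch; the function names and example values are mine, not from the notes, and the derivative of $E_2$ with respect to a single output $y^\nu$ works out to $(y^\nu - s^\nu)/M$:

```python
import numpy as np

# Minimal sketch (function names and values are illustrative, not from the
# notes): the error E_2 and its derivative with respect to one output y^nu.
def E2(y, s):
    """E_2 = (1/M) * sum_mu 0.5 * (y^mu - s^mu)^2."""
    return np.mean(0.5 * (y - s) ** 2)

def dE2_dy(y, s, nu):
    """dE_2/dy^nu = (y^nu - s^nu) / M; only pattern nu contributes."""
    return (y[nu] - s[nu]) / len(y)

y = np.array([0.2, 0.9, 0.4])       # network outputs y^mu
s = np.array([0.0, 1.0, 0.5])       # desired outputs s^mu
print(E2(y, s), dE2_dy(y, s, 1))
```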
(a) $E := x + y$ . (1)

(b) $E := x^2 + 2y^2$ . (2)
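For intuition about how gradient descent behaves on these two error surfaces, here is a minimal sketch; the step size $\eta$, start point, and number of steps are arbitrary choices of mine, not part of the exercise:

```python
import numpy as np

# Minimal sketch: plain gradient descent with fixed step size eta.
def descend(grad, start, eta=0.1, steps=20):
    p = np.array(start, dtype=float)
    for _ in range(steps):
        p -= eta * grad(p)          # (x, y) <- (x, y) - eta * grad E
    return p

# (a) E = x + y: the gradient (1, 1) is constant, so the iterate moves in a
# straight line forever and never converges (E is unbounded below).
p_a = descend(lambda p: np.array([1.0, 1.0]), start=[1.0, 1.0])

# (b) E = x^2 + 2y^2: the gradient is (2x, 4y); the y-coordinate shrinks
# faster than x, so the path bends toward the x-axis before approaching the
# minimum at the origin.
p_b = descend(lambda p: np.array([2 * p[0], 4 * p[1]]), start=[1.0, 1.0])
print(p_a, p_b)
```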
1.4 Online learning rule
1.5 Examples
Let the training set be $(x^\mu, s^\mu)$ for $\mu = 1, \ldots, M$, where $x^\mu$ is the input vector and $s^\mu$ the desired output, and let the error function be

$$E := \frac{1}{M} \sum_{\mu=1}^{M} \underbrace{\frac{1}{2} \left( y(x^\mu) - s^\mu \right)^2}_{=: E^\mu} . \qquad (3)$$
1. Try to get an intuition for E and describe how it differs from the linear case. Illustrate your statements
with a graph.
2. Derive from $E$ an incremental learning rule that uses gradient descent and is applied separately to each training example (see the sketch below).
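A minimal sketch of such an incremental rule, assuming plain gradient descent on the per-pattern error $E^\mu = \frac{1}{2}(y(x^\mu) - s^\mu)^2$. Since the concrete form of $y(x)$ is not fixed here, the unit and its derivative are passed in as functions; the quadratic unit below is purely illustrative:

```python
import numpy as np

# Incremental (online) gradient descent on the per-pattern error
#   E^mu = 0.5 * (y(x^mu) - s^mu)^2,
# giving the update  w <- w - eta * (y(x^mu) - s^mu) * dy/dw.
# The unit y and its derivative dy_dw are passed in as functions because the
# concrete form of y(x) is not fixed by this excerpt; eta is a chosen step size.
def online_step(w, x, s, y, dy_dw, eta=0.05):
    err = y(w, x) - s                    # residual y(x^mu) - s^mu
    return w - eta * err * dy_dw(w, x)   # gradient step on E^mu

# Purely illustrative non-linear unit y = (w^T x)^2 with dy/dw = 2 (w^T x) x:
y = lambda w, x: np.dot(w, x) ** 2
dy_dw = lambda w, x: 2 * np.dot(w, x) * x

w = np.array([0.5, -0.3])
for x_mu, s_mu in [(np.array([1.0, 2.0]), 1.0),
                   (np.array([0.0, 1.0]), 0.25)]:
    w = online_step(w, x_mu, s_mu, y, dy_dw)
print(w)
```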
The unit shall learn the training data $(x^\mu, s^\mu)$ for $\mu = 1, \ldots, M$, with $x^\mu$ denoting the input vectors and $s^\mu$ the desired output values. The error function is given by

$$F_w := \frac{1}{2} \sum_{\mu=1}^{M} \left( y(x^\mu) - s^\mu \right)^2 . \qquad (2)$$
1. For which values of $M$ can one generally expect an exact solution, i.e. $y(x^\mu) = s^\mu$ for all $\mu$?
2. Are there cases in which no exact solution exists even though M is small enough?
3. Derive a closed-form expression for the weight vector $w$ that minimizes the error function under general conditions (important for cases in which no exact solution exists).
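For question 3, here is a sketch of the standard least-squares route, assuming a linear unit $y(x) = w^T x$ (an assumption of mine, not stated in the excerpt): setting the gradient of $F_w$ to zero yields the normal equations, and the pseudoinverse covers the general case.

```python
import numpy as np

# Sketch assuming a linear unit y(x) = w^T x (my assumption). With the
# inputs stacked as rows of X,
#   F_w = 0.5 * ||X w - s||^2,
# and setting the gradient X^T (X w - s) to zero gives the normal equations
#   X^T X w = X^T s.
# The pseudoinverse solves these and also covers the degenerate cases
# (no exact solution, or infinitely many).
X = np.array([[1.0, 2.0],
              [3.0, 1.0],
              [2.0, 2.0]])          # M = 3 input vectors of dimension N = 2
s = np.array([1.0, 2.0, 2.0])       # desired outputs s^mu
w = np.linalg.pinv(X) @ s           # least-squares minimizer of F_w
print(w, X @ w - s)                 # residual vanishes only for an exact fit
```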
2 Supervised learning in multilayer networks
Consider a three-layer network with high-dimensional input $x$ and scalar output $a$ defined by

$$y_j := \sum_i u_{ji} x_i , \qquad (1)$$

$$z_k := \sum_j v_{kj} y_j , \qquad (2)$$

$$a := \sum_k w_k z_k . \qquad (3)$$
1. Make a sketch of the network and mark connections and units with the variables used above.
2. Calculate the derivative of $a$ with respect to $w_k$.
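Since $a$ depends on $w_k$ only through the product $w_k z_k$, the derivative is simply $\partial a / \partial w_k = z_k$. The following sketch (shapes and values arbitrary, chosen by me) checks this against a finite-difference estimate:

```python
import numpy as np

# Forward pass of the (linear) three-layer network from equations (1)-(3)
# and a finite-difference check of da/dw_k. All shapes and values arbitrary.
rng = np.random.default_rng(0)
x = rng.normal(size=5)              # high-dimensional input x_i
U = rng.normal(size=(4, 5))         # first-layer weights u_ji
V = rng.normal(size=(3, 4))         # second-layer weights v_kj
w = rng.normal(size=3)              # output weights w_k

def forward(w):
    y = U @ x                       # y_j = sum_i u_ji x_i       (1)
    z = V @ y                       # z_k = sum_j v_kj y_j       (2)
    return w @ z, z                 # a   = sum_k w_k z_k        (3)

a, z = forward(w)
k, eps = 1, 1e-6
w_pert = w.copy()
w_pert[k] += eps
numeric = (forward(w_pert)[0] - a) / eps
print(z[k], numeric)                # da/dw_k = z_k matches the estimate
```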