Lecture 16
Multilayer Perceptron
Course: DSAI 512 - Machine Learning
Instructor: Ercan Atam
List of contents for this lecture
❖ Multiple layers
❖ Universal approximation
Relevant readings for this lecture
➢ e-Chapter 7 of Yaser S. Abu-Mostafa, Malik Magdon-Ismail, Hsuan-Tien Lin, “Learning from Data”,
AMLBook, 2012.
➢ Chapter 6 (6.1-6.2) of Andreas Lindholm, Niklas Wahlström, Fredrik Lindsten, Thomas B. Schön,
“Machine Learning: A First Course for Engineers and Scientists”, Cambridge University Press, 2022.
The neural network - biologically inspired
Planes don’t flap their wings to fly.
Engineering success may start with biological inspiration, but it then takes a totally different path...
XOR: a limitation of the linear model (1)
XOR: a limitation of the linear model (2)
[Figure: the XOR target $f(\mathbf{x})$]
Why?
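The figure is omitted here, but a short algebraic answer can be sketched (assuming the usual ±1 coding of inputs and outputs for a perceptron, which is an assumption about the slide's convention). Suppose a single perceptron $\operatorname{sign}(w_0 + w_1 x_1 + w_2 x_2)$ implemented XOR. Then
\[
\begin{aligned}
f(+1,+1) = -1 &\;\Rightarrow\; w_0 + w_1 + w_2 < 0, & f(-1,-1) = -1 &\;\Rightarrow\; w_0 - w_1 - w_2 < 0,\\
f(+1,-1) = +1 &\;\Rightarrow\; w_0 + w_1 - w_2 > 0, & f(-1,+1) = +1 &\;\Rightarrow\; w_0 - w_1 + w_2 > 0.
\end{aligned}
\]
Adding the first pair gives $2w_0 < 0$; adding the second pair gives $2w_0 > 0$. Contradiction, so no single linear separator implements XOR.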
Decomposing XOR
AND → multiplication
OR → addition
Negation → bar (overline)
With this notation, the XOR of two perceptrons $h_1$ and $h_2$ is written $f = h_1\bar{h}_2 + \bar{h}_1 h_2$.
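As a quick sanity check of this notation, here is a minimal Python sketch (the helper names are illustrative, not from the slides) verifying that the OR-of-ANDs expression $h_1\bar{h}_2 + \bar{h}_1 h_2$ agrees with XOR on all four input combinations:

from itertools import product

def xor(h1, h2):
    return h1 != h2                               # true XOR of two Booleans

def or_of_ands(h1, h2):
    return (h1 and not h2) or (not h1 and h2)     # h1*h2bar + h1bar*h2 in the slide's notation

for h1, h2 in product([False, True], repeat=2):
    assert or_of_ands(h1, h2) == xor(h1, h2)
print("The OR-of-ANDs decomposition matches XOR on all four cases.")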
Perceptrons for OR and AND
[Figure: single perceptrons with inputs $u_1$, $u_2$ (and a constant bias) computing OR$(u_1, u_2)$ and AND$(u_1, u_2)$]
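A minimal numerical sketch of such perceptrons, assuming ±1-valued inputs and a constant bias input; the specific bias weights ±1.5 are an illustrative choice (any weights with the same sign pattern work):

import numpy as np

def perceptron(u1, u2, w0, w1, w2):
    # A single perceptron on inputs (1, u1, u2) with weights (w0, w1, w2).
    return int(np.sign(w0 + w1 * u1 + w2 * u2))

def OR(u1, u2):
    return perceptron(u1, u2, 1.5, 1.0, 1.0)      # +1 unless both inputs are -1

def AND(u1, u2):
    return perceptron(u1, u2, -1.5, 1.0, 1.0)     # +1 only when both inputs are +1

for u1 in (-1, 1):
    for u2 in (-1, 1):
        print(f"u1={u1:+d}, u2={u2:+d}  ->  OR={OR(u1, u2):+d}, AND={AND(u1, u2):+d}")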
How did we find that $f = h_1\bar{h}_2 + \bar{h}_1 h_2$?
We consider only the regions of $f$ that are “+” and use the “disjunctive normal form” (= OR of ANDs):
Note: you can check that the decomposition constructed from the “+” regions of $f$ alone also gives the correct value on the “−” regions of $f$.
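In general (a sketch of the construction; the cell names $r_k$ are illustrative), if the “+” region of $f$ is the union of cells $r_1, r_2, \dots$, each cell being an intersection of half-planes defined by perceptrons $h_1, h_2, \dots$, then
\[
f \;=\; r_1 + r_2 + \cdots \quad (\text{OR over the “+” cells}), \qquad
r_k \;=\; \tilde h_{k_1}\,\tilde h_{k_2}\cdots \quad (\text{AND of perceptrons, with } \tilde h \in \{h, \bar h\}),
\]
which is exactly an OR of ANDs. For XOR, the two “+” cells give $f = h_1\bar{h}_2 + \bar{h}_1 h_2$.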
Representing 𝑓 using OR and AND (1)
Step 1 (“OR”):
Representing 𝑓 using OR and AND (2)
Representing 𝑓 using OR and AND (3)
The multilayer perceptron (MLP)
MLP: [diagram of the resulting multilayer network of perceptrons implementing $f$]
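Putting the pieces together, here is a minimal Python sketch of such a multilayer perceptron for XOR (the layer functions and weights are illustrative; with ±1 values, negation is just a sign flip, so $\bar h = -h$):

def sgn(z):
    return 1.0 if z >= 0 else -1.0

# Layer 1: the two perceptrons h1, h2 (for the +-1-valued XOR they can simply pass the inputs through).
def layer1(x1, x2):
    return sgn(x1), sgn(x2)

# Layer 2: AND(h1, NOT h2) and AND(NOT h1, h2); NOT h = -h for +-1 values.
def layer2(h1, h2):
    return sgn(-1.5 + h1 - h2), sgn(-1.5 - h1 + h2)

# Layer 3 (output): OR of the two AND units.
def layer3(a1, a2):
    return sgn(1.5 + a1 + a2)

def mlp_xor(x1, x2):
    return layer3(*layer2(*layer1(x1, x2)))

for x1, x2 in [(-1, -1), (-1, 1), (1, -1), (1, 1)]:
    print((x1, x2), "->", mlp_xor(x1, x2))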
Universal approximation (1)
Any target function 𝑓 that can be decomposed into linear separators can be implemented by a 3-layer
perceptron.
Universal approximation (2)
If 𝑓 is not strictly decomposable into perceptrons, but has a smooth decision boundary, then a 3-layer
perceptron can come arbitrarily close to implementing it.
Pictorial proof: [figure] approximate the smooth region arbitrarily closely by a region bounded by many linear pieces, each piece realized by one perceptron; the previous construction then implements this piecewise-linear region with a 3-layer perceptron.
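A small numerical illustration of this pictorial argument (a sketch, under the assumption that the smooth region is the unit disc): AND together $m$ perceptrons, one per half-plane tangent to the disc, and check how closely the resulting region agrees with the disc as $m$ grows.

import numpy as np

rng = np.random.default_rng(0)
X = rng.uniform(-2, 2, size=(100_000, 2))          # test points in the plane
inside_disc = (X ** 2).sum(axis=1) <= 1.0          # target region: the unit disc

for m in (4, 8, 16, 64, 256):
    angles = 2 * np.pi * np.arange(m) / m
    normals = np.stack([np.cos(angles), np.sin(angles)], axis=1)
    # AND of m perceptrons: a point is "+" iff every half-plane  n_k . x <= 1  says "+".
    inside_polygon = (X @ normals.T <= 1.0).all(axis=1)
    agreement = (inside_polygon == inside_disc).mean()
    print(f"m = {m:3d} perceptrons: agreement with the disc = {agreement:.4f}")

The agreement approaches 1 as $m$ increases, which is the sense in which the 3-layer perceptron comes arbitrarily close.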
Approximation versus generalization
Minimizing 𝐸in for MLPs
❑ 𝐸in is not smooth (because of the “sign” function), so we cannot use gradient descent.
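Concretely (a sketch; the soft threshold below is the one used in the e-Chapter 7 reading): with the hard threshold, 𝐸in is piecewise constant in the weights, so its gradient is zero wherever it is defined. Replacing sign(𝑠) with a smooth soft threshold such as
\[
\theta(s) = \tanh(s) = \frac{e^{s} - e^{-s}}{e^{s} + e^{-s}}, \qquad \theta'(s) = 1 - \tanh^2(s),
\]
makes 𝐸in differentiable in the weights, so gradient descent becomes applicable. This soft-threshold network is the neural network of the next slide.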
The neural network
Zooming into a hidden node
$w_{ij}^{(l)}$ : the weight into node $j$ in layer $l$ from node $i$ in the previous layer.
[Figure: signal flow across layers $(l-1)$, $l$, $(l+1)$]
$\mathbf{x}^{(l-1)} \xrightarrow{W^{(l)}} \mathbf{s}^{(l)} \xrightarrow{\theta} \mathbf{x}^{(l)} \xrightarrow{W^{(l+1)}} \mathbf{s}^{(l+1)}$, i.e., $\mathbf{s}^{(l)} = \big(W^{(l)}\big)^{\!\top} \mathbf{x}^{(l-1)}$ and $\mathbf{x}^{(l)} = \theta\big(\mathbf{s}^{(l)}\big)$.
❑ Regression: replace 𝜃(𝑠) in the output node with the identity transformation (i.e., no transformation).
❑ Logistic regression: replace 𝜃(𝑠) in the output node with the logistic sigmoid.
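A minimal forward-propagation sketch in this notation (the layer sizes, the random weights, and the tanh hidden-layer nonlinearity are illustrative assumptions; the output transformation follows the two bullets above):

import numpy as np

def forward(x, weights, output="sign"):
    # Forward propagation: x^(0) = (1, x); s^(l) = (W^(l))^T x^(l-1); x^(l) = theta(s^(l)).
    xl = np.concatenate(([1.0], x))                   # prepend the constant bias coordinate
    for l, W in enumerate(weights, start=1):
        s = W.T @ xl                                  # s^(l)
        if l < len(weights):                          # hidden layers: soft threshold + bias
            xl = np.concatenate(([1.0], np.tanh(s)))
        elif output == "identity":                    # regression: no output transformation
            return s
        elif output == "sigmoid":                     # logistic regression: logistic sigmoid
            return 1.0 / (1.0 + np.exp(-s))
        else:                                         # classification: hard threshold
            return np.sign(s)

# Example: 2 inputs, one hidden layer of 3 tanh units, 1 output.
rng = np.random.default_rng(0)
W1 = rng.normal(size=(3, 3))     # (1 bias + 2 inputs) x 3 hidden units
W2 = rng.normal(size=(4, 1))     # (1 bias + 3 hidden units) x 1 output
print(forward(np.array([0.5, -1.0]), [W1, W2], output="sigmoid"))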
Summary
References
(utilized in the preparation of the lecture notes and MATLAB code)
▪ https://amlbook.com/eChapters/6-Oct2022-readeronly.pdf
▪ https://www.cs.rpi.edu/~magdon/courses/LFD-Slides/SlidesLect20.pdf