
4 DL Deep Neural Nets

The document discusses deep neural networks, focusing on their structure, including two-layer networks and hyperparameters like network depth and width. It emphasizes the complexity of deep networks compared to shallow ones and the importance of hyperparameter optimization in training. Additionally, it explores the representation of functions by neural networks based on chosen hyperparameters.




Deep neural networks
• Networks with more than one hidden layer
• Intuition becomes more difficult!



Deep neural networks
• Two-layer neural network
• Hyperparameters
• Notation change and general case
• Shallow vs. deep networks



Two-layer network

[Figure: a two-layer network, from http://udlbook.com]

Two-layer network as one equation

Still just a mathematical equation 
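The slide's equation itself is rendered as a figure; as a hedged reconstruction in the notation of http://udlbook.com (1 input, three hidden units per layer, 1 output; the symbol names are an assumption):

$$h_d = a[\theta_{d0} + \theta_{d1}x], \qquad d = 1, 2, 3$$
$$h'_d = a[\psi_{d0} + \psi_{d1}h_1 + \psi_{d2}h_2 + \psi_{d3}h_3], \qquad d = 1, 2, 3$$
$$y' = \phi'_0 + \phi'_1 h'_1 + \phi'_2 h'_2 + \phi'_3 h'_3$$

Substituting the first two lines into the third writes the whole network as one equation: a composition of linear maps and activations.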


Remember shallow net with 2 outputs?
• 1 input, 4 hidden units, 2 outputs



Figures from http://udlbook.com
Networks as composing functions

Consider the pre-activations at the second layer's hidden units


At this point, it’s a one-layer network with three outputs

Figures from http://udlbook.com
[Figure sequence: the outputs of the first network become the inputs to a second network; composing two shallow networks in this way yields a two-layer (deep) network]
Shallow network with 1 output …

[Diagram, built up in stages: the input x feeds three "bias + weight" units; each passes through an activation (e.g., ReLU) to give the hidden units h₁, h₂, h₃; a final "bias + weighted sum" combines them into the output y, a piecewise linear function of x.]


Shallow network with 3 outputs …

[Diagram, built up in stages: the input x feeds the same three hidden units h₁, h₂, h₃; three separate "bias + weighted sum" units then combine the hidden units into the outputs y₁, y₂, y₃, each a piecewise linear function of x.]


Two-layer network with 1 output …

[Diagram, built up in stages: the input x feeds the first hidden layer h₁, h₂, h₃ (bias + weight, then an activation such as ReLU); three "bias + weighted sum" units followed by activations produce the second hidden layer h′₁, h′₂, h′₃; a final bias + weighted sum gives the output y′. The hidden units are piecewise linear functions of x, and so is y′.]

Questions to consider: What changes with 2 outputs? With 3 layers? With 2 inputs? (A numpy sketch of the forward pass follows.)
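A minimal numpy sketch of this two-layer forward pass (all parameter values and names here are arbitrary placeholders, not from the slides):

```python
import numpy as np

def relu(z):
    return np.maximum(0.0, z)

# Two-layer network: 1 input, 3 hidden units per layer, 1 output.
rng = np.random.default_rng(0)
theta_b, theta_w = rng.normal(size=3), rng.normal(size=3)   # layer-1 biases + weights
psi_b, psi_W = rng.normal(size=3), rng.normal(size=(3, 3))  # layer-2 biases + weights
phi_b, phi_w = rng.normal(), rng.normal(size=3)             # output bias + weights

def two_layer(x):
    h = relu(theta_b + theta_w * x)    # first hidden layer h1..h3
    h_prime = relu(psi_b + psi_W @ h)  # second hidden layer h'1..h'3
    return phi_b + phi_w @ h_prime     # output y', piecewise linear in x

print(two_layer(0.5))
```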
Deep neural networks
• Two-layer neural network
• Hyperparameters
• Notation change and general case
• Shallow vs. deep networks



Hyperparameters
• K layers = depth of the network
• D_k hidden units per layer = width of the network

Are these learned in training?

• No: these are called hyperparameters, chosen before training the network
• We can retrain with different hyperparameters: this is hyperparameter optimization, or hyperparameter search (a sketch follows)
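A minimal sketch of such a search (train_and_validate is a hypothetical helper; any routine that trains a network of depth K and width D and returns its validation loss would do):

```python
import itertools

def hyperparameter_search(train_and_validate, depths=(2, 4, 8), widths=(16, 32, 64)):
    # Grid search over depth K and width D; keep the best validation loss.
    best, best_loss = None, float("inf")
    for K, D in itertools.product(depths, widths):
        loss = train_and_validate(K, D)
        if loss < best_loss:
            best, best_loss = (K, D), loss
    return best, best_loss
```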



Hyperparameters
• For fixed hyperparameters (e.g., K = 2 layers with D_k = 3 hidden units in each):
  • the model describes a family of functions
  • the parameters determine the particular function
• Hence, when we also consider the hyperparameters:

Neural networks represent a family of families of functions relating input to output.

Consider a deep neural network with 5 inputs, 2 outputs, and 20 hidden layers of 30 hidden units each.
What is the depth of this network? What is the width?

How many parameters are in that network (5 inputs, 2 outputs, 20 hidden layers of 30 hidden units each)?
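One way to check the arithmetic (a sketch; it assumes fully connected layers with one bias per unit):

```python
def count_params(n_in, hidden_widths, n_out):
    # Each layer transition contributes (fan_in * fan_out) weights + fan_out biases.
    sizes = [n_in] + list(hidden_widths) + [n_out]
    return sum(sizes[i] * sizes[i + 1] + sizes[i + 1] for i in range(len(sizes) - 1))

# 5 inputs, 20 hidden layers of 30 units, 2 outputs:
print(count_params(5, [30] * 20, 2))  # 17912 = 180 + 19*930 + 62
```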

Deep neural networks
• Two-layer neural network
• Hyperparameters
• Notation change and general case
• Shallow vs. deep networks



Notation change #1



Notation change #2



Notation change #3
Bias vector β, weight matrix Ω





General equations for deep network

Still just a mathematical equation 
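The equations on this slide are rendered as a figure; reconstructed in the bias-vector/weight-matrix notation above (following http://udlbook.com), a network with K hidden layers computes:

$$\mathbf{h}_1 = a[\boldsymbol{\beta}_0 + \boldsymbol{\Omega}_0\mathbf{x}]$$
$$\mathbf{h}_{k+1} = a[\boldsymbol{\beta}_k + \boldsymbol{\Omega}_k\mathbf{h}_k], \qquad k = 1, \ldots, K-1$$
$$\mathbf{y} = \boldsymbol{\beta}_K + \boldsymbol{\Omega}_K\mathbf{h}_K$$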


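The same equations as a loop (a sketch; the parameter lists are placeholders to be filled by training):

```python
import numpy as np

def forward(x, betas, Omegas, act=lambda z: np.maximum(0.0, z)):
    # betas[k], Omegas[k] are the bias vector and weight matrix of layer k;
    # the last pair is the linear output layer y = beta_K + Omega_K h_K.
    h = x
    for beta, Omega in zip(betas[:-1], Omegas[:-1]):
        h = act(beta + Omega @ h)
    return betas[-1] + Omegas[-1] @ h
```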
Example

Figures from http://udlbook.com
For a deep network with 4 inputs, two hidden layers of 10 and 8 hidden units, and 3 outputs, what are the sizes of each weight matrix Ω and bias vector β?
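A quick shape check (a sketch, using the convention that Ω_k maps the width of layer k to the width of layer k+1):

```python
import numpy as np

widths = [4, 10, 8, 3]  # input, hidden layer 1, hidden layer 2, output
for k in range(len(widths) - 1):
    Omega = np.zeros((widths[k + 1], widths[k]))  # Omega_0: 10x4, Omega_1: 8x10, Omega_2: 3x8
    beta = np.zeros(widths[k + 1])                # beta_0: 10, beta_1: 8, beta_2: 3
    print(f"Omega_{k}: {Omega.shape}, beta_{k}: {beta.shape}")
```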

Deep neural networks
• Two-layer neural network
• Hyperparameters
• Notation change and general case
• Shallow vs. deep networks



Shallow vs. deep networks
The best results are achieved by deep networks with many layers.
• 50 to 1000 layers for most applications
• Best results in:
  • Computer vision
  • Natural language processing
  • Graph neural networks
  • Generative models
  • Reinforcement learning
All use deep networks. But why?



1. Ability to approximate different functions?
Both shallow and deep networks obey the universal approximation theorem.

Argument: one hidden layer is enough; a deep network could arrange its remaining layers to compute the identity function.



2. Number of linear regions per parameter

  5 layers, 10 hidden units per layer: 471 parameters, 161,501 linear regions
  5 layers, 50 hidden units per layer: 10,801 parameters, >10^40 linear regions

Figures from http://udlbook.com


2. Number of linear regions per parameter
For a fixed parameter budget, deeper networks produce more linear regions than shallower ones.

• But there are dependencies between the regions
• Perhaps real-world functions contain similar symmetries? Unknown.



3. Depth efficiency
• Some functions require a shallow network with exponentially more hidden units than a deep network to achieve an equivalent approximation.

This is the depth efficiency of deep networks.

• But do the real-world functions we want to approximate have this property? Unknown.



4. Large structured networks
• Think about images as input: there might be ~1M pixels
• Fully connected networks are not practical
• We need different parts of the image to be processed similarly:
  • there is no point in independently learning to recognize the same object at every possible position in the image
• Solution: process local image regions in parallel, with weights that operate locally and are shared across the image
• This leads to convolutional networks
• Gradually integrating information from across the whole image requires multiple layers
5. Fitting and generalization
• Fitting deep models seems to be easier up to about 20 layers.
• Beyond that, fitting becomes harder as more hidden layers are added.
• Generalization is better in deep networks.


Figures from http://udlbook.com
With the same number of parameters, a deeper network generally has ……… linear regions compared to a shallow one.
☐ fewer
☐ equal
☐ more

Depth efficiency means …….
☐ Adding more hidden layers to a deep net can achieve an equivalent approximation of a shallow one.
☐ A shallow network might need exponentially more hidden units to achieve an equivalent approximation of a deep net.
Where are we going?
• We have defined families of very flexible networks that map multiple inputs to multiple outputs
• Now we need to train them:
  • How to choose loss functions
  • How to find minima of the loss function
  • How to do this in particular for deep networks
• Then we need to test them
