
Feed-forward Neural Networks (Part 2: learning)
Outline (part 2)
‣ Learning feed-forward neural networks
‣ SGD and back-propagation
Learning neural networks
Simple example
‣ A long chain-like neural network

[Diagram: chain network x → z1 → f1 → z2 → f2 → … → zL → fL, where weight wi produces zi from the previous activation and fi is the activation of zi]
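A minimal sketch of the forward pass for this chain, assuming tanh activations (the slides do not fix the activation function, so tanh here is an illustrative choice):

```python
import numpy as np

def chain_forward(x, w):
    """Forward pass through the chain: z_i = w_i * f_{i-1}, f_i = tanh(z_i), with f_0 = x."""
    f = x
    for w_i in w:
        z = w_i * f          # pre-activation of layer i
        f = np.tanh(z)       # activation of layer i (illustrative choice)
    return f                 # f_L, the network output

print(chain_forward(x=0.5, w=[1.0, -2.0, 0.5]))   # a chain of length L = 3
```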
Back-propagation
[Diagram: the same chain x → z1 → f1 → … → zL → fL, with derivatives propagated backwards from the loss to each weight wi]
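A sketch of back-propagation for the same chain. The tanh activations and squared loss are illustrative assumptions, not values fixed by the slides; the point is that the derivative with respect to each wi is obtained by walking the chain rule backwards, reusing the intermediate activations from the forward pass:

```python
import numpy as np

def chain_grad(x, w, y):
    """Back-propagation through the chain (illustrative: tanh units, squared loss (f_L - y)^2).
    Returns dLoss/dw_i for every layer."""
    # forward pass, remembering all intermediate activations f_0 .. f_L
    fs = [x]
    for w_i in w:
        fs.append(np.tanh(w_i * fs[-1]))
    # backward pass: walk the chain rule from the loss back to each weight
    grads = [0.0] * len(w)
    df = 2.0 * (fs[-1] - y)                  # dLoss/df_L
    for i in reversed(range(len(w))):
        dz = df * (1.0 - fs[i + 1] ** 2)     # tanh'(z_i) = 1 - f_i^2
        grads[i] = dz * fs[i]                # dz_i/dw_i = f_{i-1}
        df = dz * w[i]                       # dz_i/df_{i-1} = w_i
    return grads

print(chain_grad(x=0.5, w=[1.0, -2.0, 0.5], y=1.0))
```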
2 hidden units: training
[Figures: initial network (hidden units 1 and 2) and average hinge loss per epoch]
2 hidden units: training
‣ After ~10 passes through the data
[Figure: hidden unit activations (units 1 and 2) after training]
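A minimal sketch of the kind of training run these plots summarize, assuming tanh hidden units, a linear output unit, and the hinge loss named in the figure; the learning rate, initialization scale, and toy data are illustrative choices, not values from the slides:

```python
import numpy as np

rng = np.random.default_rng(0)

def train_sgd(X, y, n_hidden=2, epochs=10, lr=0.05):
    """SGD on a one-hidden-layer network with hinge loss (illustrative sketch)."""
    d = X.shape[1]
    W = 0.1 * rng.standard_normal((n_hidden, d))   # hidden-unit weights
    b = np.zeros(n_hidden)                         # hidden-unit offsets
    v = 0.1 * rng.standard_normal(n_hidden)        # output weights
    for _ in range(epochs):
        for i in rng.permutation(len(X)):
            h = np.tanh(W @ X[i] + b)              # hidden activations
            score = v @ h
            if y[i] * score < 1:                   # hinge loss max(0, 1 - y*score) is active
                dh = y[i] * v * (1.0 - h ** 2)     # back-prop through tanh
                v += lr * y[i] * h                 # gradient step on output weights
                W += lr * np.outer(dh, X[i])       # gradient step on hidden weights
                b += lr * dh                       # gradient step on offsets
    return W, b, v

# Toy usage: 2-D points with labels in {-1, +1}
X = rng.normal(size=(40, 2))
y = np.where(X[:, 0] * X[:, 1] > 0, 1.0, -1.0)
W, b, v = train_sgd(X, y)
```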
10 hidden units
‣ Randomly initialized weights (zero offset) for the hidden units
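A minimal sketch of the initialization described above (random weight directions, all offsets zero); the dimensions and scale are illustrative:

```python
import numpy as np

rng = np.random.default_rng(0)
n_hidden, d = 10, 2                               # 10 hidden units, 2-D inputs (illustrative)
W = 0.1 * rng.standard_normal((n_hidden, d))      # random hidden-unit weight directions
b = np.zeros(n_hidden)                            # zero offsets: every unit's boundary starts at the origin
```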
10 hidden units
‣ After ~10 epochs the hidden units are arranged in a manner sufficient for the task (but not otherwise perfect)
Decisions (and a harder task)
‣ 2 hidden units can no longer solve this task
[Figures: decision boundaries with 10 hidden units and with 100 hidden units]
Decision boundaries
‣ Symmetries introduced in initialization can persist… (e.g., with zero offsets every hidden-unit boundary initially passes through the origin)
[Figures: 100 hidden units with zero offset initialization vs. 100 hidden units with random offset initialization]
Size, optimization
‣ Many recent architectures use ReLU units (cheap to evaluate, sparsity); see the sketch below
‣ Easier to learn as large models…
[Figures: decision boundaries with 10, 100, and 500 hidden units]
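A small sketch of the ReLU point above: evaluating it is a single elementwise comparison (cheap), and it returns exact zeros for non-positive inputs (sparse activations). The comparison with tanh is illustrative:

```python
import numpy as np

def relu(z):
    # Cheap: one elementwise max; exactly zero for z <= 0, so activations are sparse
    return np.maximum(0.0, z)

z = np.array([-2.0, -0.5, 0.0, 0.5, 2.0])
print(relu(z))      # [0.  0.  0.  0.5 2. ]  -> many exact zeros
print(np.tanh(z))   # smooth, never exactly zero, needs exp() to evaluate
```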
Summary (part 2)
‣ Neural networks can be learned with SGD similarly to linear classifiers
‣ The derivatives necessary for SGD can be evaluated efficiently via back-propagation
‣ Multi-layer neural network models are complicated… we are no longer guaranteed to reach a global (only a local) optimum with SGD
‣ Larger models tend to be easier to learn… units only need to be adjusted so that they are, collectively, sufficient to solve the task
