
Intro4 ANN Deep CNN PDF

Deep learning is a family of techniques for learning compositional vector representations of complex data using neural networks. Deep neural networks learn hierarchical representations of data by building higher-level features from lower-level ones. Convolutional neural networks apply this idea to visual data by incorporating spatial structure through local connectivity and parameter sharing. Modern deep learning techniques like residual networks enable very deep networks to be trained effectively.


What is deep learning?

A family of techniques for learning compositional vector representations of complex data.

CS221 / Spring 2020 / Finn & Anari


Review: linear predictors

[diagram: inputs x1, x2, x3 connected by weights w to the output f_θ(x)]

Output:

f_θ(x) = w · x

Parameters: θ = w
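As a concrete illustration of the linear predictor above, here is a minimal numpy sketch; the values of w and x are made up and are not from the lecture.

import numpy as np

w = np.array([0.2, -1.0, 0.5])   # parameters theta = w (illustrative values)
x = np.array([1.0, 2.0, 3.0])    # input features x1, x2, x3

f_x = np.dot(w, x)               # f_theta(x) = w . x
print(f_x)                       # approx. -0.3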



Review: neural networks

[diagram: inputs x1, x2, x3 feed hidden units h1, h2 through weights V; the hidden units feed the output f_θ(x) through weights w]

Intermediate hidden units:

h_j(x) = σ(v_j · x),  where σ(z) = (1 + e^(−z))^(−1)

Output:

f_θ(x) = w · h(x)

Parameters: θ = (V, w)
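A minimal numpy sketch of this one-hidden-layer network; the weight values in V and w are illustrative, not the course's code.

import numpy as np

def sigma(z):
    return 1.0 / (1.0 + np.exp(-z))   # logistic activation

V = np.array([[0.1, -0.2, 0.3],       # v_1: weights of hidden unit h1
              [0.4,  0.0, -0.1]])     # v_2: weights of hidden unit h2
w = np.array([1.0, -1.0])             # output weights

x = np.array([1.0, 2.0, 3.0])
h = sigma(V @ x)                      # intermediate hidden units h(x)
f_x = np.dot(w, h)                    # f_theta(x) = w . h(x)
print(f_x)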



Deep neural networks

1-layer neural network:
score = w⊤ x

2-layer neural network:
score = w⊤ σ(V x)

3-layer neural network:
score = w⊤ σ(U σ(V x))
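To make the three score functions concrete, here is a small numpy sketch; the matrix sizes and random values are invented for illustration.

import numpy as np

def sigma(z):
    return 1.0 / (1.0 + np.exp(-z))

x = np.array([1.0, -2.0, 0.5])
w = np.array([0.3, -0.7, 0.2])
V = np.random.randn(3, 3)
U = np.random.randn(3, 3)

score1 = w @ x                         # 1-layer: w^T x
score2 = w @ sigma(V @ x)              # 2-layer: w^T sigma(V x)
score3 = w @ sigma(U @ sigma(V @ x))   # 3-layer: w^T sigma(U sigma(V x))
print(score1, score2, score3)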



Depth

[diagram: x → h → h′ → h″ → h‴ → ... → f_θ(x)]

Intuitions:
• Hierarchical feature representations
• Can simulate a bounded computation logic circuit (original motivation from McCulloch/Pitts, 1943)
• Learn this computation (and potentially more because networks are real-valued)
• Formal theory/understanding is still incomplete
• Some hypotheses emerging: double descent, lottery ticket hypothesis



[figure from Honglak Lee]

What’s learned?



Review: optimization

Regression:

Loss(x, y, θ) = (f_θ(x) − y)²

Key idea: minimize training loss

TrainLoss(θ) = (1 / |D_train|) Σ_{(x,y) ∈ D_train} Loss(x, y, θ)

min_{θ ∈ ℝ^d} TrainLoss(θ)

Algorithm: stochastic gradient descent

For t = 1, ..., T:
  For (x, y) ∈ D_train:
    θ ← θ − η_t ∇_θ Loss(x, y, θ)
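A small self-contained sketch of this SGD loop for the squared loss; the toy dataset and constant step size are invented for illustration.

import numpy as np

D_train = [(np.array([1.0, 2.0]), 3.0),
           (np.array([2.0, 1.0]), 3.0),
           (np.array([0.0, 1.0]), 1.0)]   # (x, y) pairs consistent with theta = (1, 1)

theta = np.zeros(2)
eta = 0.1                                  # step size eta_t, held constant here

for t in range(100):
    for x, y in D_train:
        residual = np.dot(theta, x) - y    # f_theta(x) - y
        grad = 2 * residual * x            # gradient of (f_theta(x) - y)^2 w.r.t. theta
        theta = theta - eta * grad         # SGD update
print(theta)                               # approaches (1, 1)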



Training

• Non-convex optimization

• No theoretical guarantees that it works

• Before 2000s, empirically very difficult to get working



What’s different today
Computation (time/memory) Information (data)



How to make it work

• More hidden units (over-parameterization)


• Adaptive step sizes (AdaGrad, Adam)
• Dropout to guard against overfitting
• Careful initialization (pre-training)
• Batch normalization

Model and optimization are tightly coupled
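Two of these tricks can be sketched in a few lines of numpy; the hyperparameters below are assumptions for illustration, not recommended settings.

import numpy as np

# AdaGrad-style adaptive step size: per-coordinate steps shrink as squared gradients accumulate.
def adagrad_update(theta, grad, sum_sq, eta=0.1, eps=1e-8):
    sum_sq = sum_sq + grad ** 2
    return theta - eta * grad / (np.sqrt(sum_sq) + eps), sum_sq

# Dropout: randomly zero hidden units during training, rescaling the rest (inverted dropout).
def dropout(h, p_drop=0.5):
    mask = np.random.rand(*h.shape) >= p_drop
    return h * mask / (1.0 - p_drop)

theta, sum_sq = np.zeros(2), np.zeros(2)
theta, sum_sq = adagrad_update(theta, np.array([0.5, -2.0]), sum_sq)
h = np.array([0.2, 1.5, -0.3, 0.8])
print(theta, dropout(h))   # roughly half the hidden units are zeroed, the rest scaled up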


Summary
• Deep networks learn hierarchical representations of data

• Train via SGD, use backpropagation to compute gradients

• Non-convex optimization, but works empirically given enough compute and data



Motivation

[diagram: an image x multiplied by a dense weight matrix W]

• Observation: images are not arbitrary vectors

• Goal: leverage spatial structure of images (translation equivariance)



Idea: Convolutions



[figure from Andrej Karpathy]

Prior knowledge

• Local connectivity: each hidden unit operates on a local image patch (3 instead of 7 connections per hidden unit)

• Parameter sharing: processing of each image patch is the same (3 parameters instead of 3 · 5)

• Intuition: try to match a pattern in the image (see the sketch below)
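A numpy sketch of these two priors on a length-7 input, mirroring the 3-connection, 3-parameter example above; the signal and filter values are made up.

import numpy as np

x = np.array([0., 1., 0., 2., 1., 0., 1.])   # length-7 input signal
filt = np.array([1., 0., -1.])                # 3 shared filter parameters

# Each of the 5 hidden units sees only a 3-wide local patch, and all patches
# reuse the same 3 weights (3 parameters instead of 3 * 5).
hidden = np.array([np.dot(filt, x[i:i + 3]) for i in range(5)])
print(hidden)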



Convolutional layers

• Instead of vector to vector, we do volume to volume


[Andrej Karpathy’s demo]
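As a rough sketch of what "volume to volume" means, the following numpy function slides C_out small filters over an H × W × C_in input volume (stride 1, no padding); the shapes are invented for illustration and this is not the demo's code.

import numpy as np

def conv_layer(volume, filters):
    """volume: (H, W, C_in); filters: (C_out, k, k, C_in)."""
    H, W, C_in = volume.shape
    C_out, k, _, _ = filters.shape
    out = np.zeros((H - k + 1, W - k + 1, C_out))
    for c in range(C_out):
        for i in range(H - k + 1):
            for j in range(W - k + 1):
                patch = volume[i:i + k, j:j + k, :]            # local k x k x C_in patch
                out[i, j, c] = np.sum(patch * filters[c])      # same filter weights everywhere
    return out

image = np.random.rand(32, 32, 3)          # e.g. a 32 x 32 RGB image
filters = np.random.randn(8, 5, 5, 3)      # 8 filters of size 5 x 5 x 3
print(conv_layer(image, filters).shape)    # (28, 28, 8): an output volume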



[figure from Andrej Karpathy]

Max-pooling

• Intuition: test if there exists a pattern in neighborhood

• Reduce computation, prevent overfitting
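A minimal numpy sketch of 2 × 2 max-pooling with stride 2 on a single activation map; it keeps only the strongest response in each neighborhood (the input values are illustrative).

import numpy as np

def max_pool_2x2(a):
    H, W = a.shape
    # Group the map into 2 x 2 blocks and take the maximum of each block.
    return a[:H // 2 * 2, :W // 2 * 2].reshape(H // 2, 2, W // 2, 2).max(axis=(1, 3))

a = np.arange(16.0).reshape(4, 4)
print(max_pool_2x2(a))   # 2 x 2 output of neighborhood maxima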



Example of function evaluation

[Andrej Karpathy’s demo]



[Krizhevsky et al., 2012]

AlexNet

• Non-linearity: use ReLU (max(z, 0)) instead of the logistic function

• Data augmentation: translation, horizontal reflection, intensity variation, dropout (guard against overfitting)

• Computation: parallelize across two GPUs (6 days)

• Results on ImageNet: 16.4% error (next best was 25.8%)



[He et al. 2015]

Residual networks

x ↦ σ(W x) + x

• Key idea: make it easy to learn the identity (good inductive bias)

• Enables training 152-layer networks

• Results on ImageNet: 3.6% error
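A minimal numpy sketch of this residual mapping, using ReLU (max(z, 0)) as the nonlinearity; W is a single made-up layer, not a trained 152-layer network.

import numpy as np

def residual_block(x, W):
    return np.maximum(W @ x, 0.0) + x    # learned part plus identity skip connection

x = np.array([1.0, -0.5, 2.0])
W = np.zeros((3, 3))                     # with W = 0 the block is exactly the identity
print(residual_block(x, W))              # prints x itself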



Summary
• Key idea 1: locality of connections, capture spatial structure

• Key idea 2: filters share parameters, capture translational equivariance

• Depth matters

• Applications to images, text, Go, drug design, etc.

