
Lecture 5: Training Neural Networks, Part I

Fei-Fei Li & Andrej Karpathy & Justin Johnson Lecture 5 - 1 20 Jan 2016
Administrative
A1 is due today (midnight)
I’m holding makeup office hours today: 5pm @ Gates 259

A2 will be released ~tomorrow. It’s meaty, but educational!

Also:
- We are shuffling the course schedule around a bit
- the grading scheme is subject to changes of a few %

Things you should know for your Project Proposal

“ConvNets need a lot


of data to train”
Finetuning! We rarely ever train ConvNets from scratch.

1. Train on ImageNet. 2. Finetune the network on your own data.

[Figure: the same ConvNet, first trained on ImageNet data, then finetuned on your data]
Transfer Learning with CNNs
1. Train on ImageNet.

2. If you have a small dataset: fix all weights (treat the CNN as a fixed feature extractor) and retrain only the classifier, i.e. swap out the Softmax layer at the end.

3. If you have a medium-sized dataset, “finetune” instead: use the old weights as initialization and train the full network or only some of the higher layers; retrain a bigger portion of the network, or even all of it.
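Option 2 above can be sketched in NumPy. This is a minimal illustration, not code from the slides: the tiny weight matrices stand in for pretrained ImageNet weights, and only the swapped-in classifier `W2` receives gradient updates while the feature extractor `W1` stays frozen.

```python
import numpy as np

rng = np.random.default_rng(0)

# Hypothetical "pretrained" weights standing in for ImageNet weights.
W1 = rng.standard_normal((4, 10)) * 0.1   # lower layer: fixed feature extractor
W2 = rng.standard_normal((3, 4)) * 0.1    # new classifier, swapped in for 3 classes

X = rng.standard_normal((10, 5))          # 5 examples of "your own data"
y = np.array([0, 1, 2, 1, 0])             # class labels

W1_frozen = W1.copy()
lr = 0.5
for _ in range(20):
    h = np.maximum(0, W1 @ X)             # frozen features: no gradient for W1
    scores = W2 @ h
    p = np.exp(scores - scores.max(axis=0))
    p /= p.sum(axis=0)                    # softmax probabilities, shape (3, 5)
    dscores = p.copy()
    dscores[y, np.arange(5)] -= 1         # gradient of cross-entropy wrt scores
    dscores /= 5
    W2 -= lr * (dscores @ h.T)            # only the classifier is updated

assert np.array_equal(W1, W1_frozen)      # feature extractor untouched
```

Finetuning (option 3) would simply propagate `dscores` further back and update `W1` as well, typically with a smaller learning rate.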

E.g. Caffe Model Zoo: Lots of pretrained ConvNets
https://github.com/BVLC/caffe/wiki/Model-Zoo

Things you should know for your Project Proposal

“We have infinite


compute available
because Terminal.”
You have finite compute.
Don’t be overly ambitious.
Where we are now...

Mini-batch SGD
Loop:
1. Sample a batch of data
2. Forward prop it through the graph, get loss
3. Backprop to calculate the gradients
4. Update the parameters using the gradient
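The four-step loop above can be sketched end to end in NumPy. This is an illustrative toy (a linear model with an MSE loss, not from the slides), but each line maps onto one step of mini-batch SGD.

```python
import numpy as np

rng = np.random.default_rng(1)
X = rng.standard_normal((1000, 3))               # toy dataset
w_true = np.array([2.0, -1.0, 0.5])
y = X @ w_true + 0.01 * rng.standard_normal(1000)

w = np.zeros(3)                                  # parameters to train
lr, batch_size = 0.1, 32
for step in range(300):
    idx = rng.integers(0, len(X), batch_size)    # 1. sample a batch of data
    Xb, yb = X[idx], y[idx]
    pred = Xb @ w                                # 2. forward prop, get loss
    loss = np.mean((pred - yb) ** 2)
    grad = 2.0 * Xb.T @ (pred - yb) / batch_size # 3. backprop the gradient wrt w
    w -= lr * grad                               # 4. update the parameters
```

For a neural network, steps 2 and 3 are just a longer forward pass and a chain-rule backward pass through the computational graph; the loop itself is unchanged.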

Where we are now...

[Figure: animations comparing optimization methods on loss surfaces; image credits to Alec Radford]

Neural Turing Machine

[Figure: computational graph of a Neural Turing Machine, from the input tape to the loss]

[Figure: a single gate in the computational graph: activations flow forward, and during backprop the gate’s “local gradient” is chained with the gradient flowing in from above]

Implementation: forward/backward API
Graph (or Net) object. (Rough pseudo code)
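A minimal sketch of what such a Graph/Net object might look like (the class and gate names here are illustrative, not from the slides): the net iterates over its gates in topological order on the forward pass and in reverse order on the backward pass, chaining gradients.

```python
class Net:
    """Minimal sketch of the forward/backward API."""
    def __init__(self, gates):
        self.gates = gates                 # topologically sorted gates/layers

    def forward(self, x):
        for gate in self.gates:
            x = gate.forward(x)            # each gate caches what backward needs
        return x                           # final output, e.g. the loss

    def backward(self):
        grad = 1.0                         # d(loss)/d(loss) = 1
        for gate in reversed(self.gates):
            grad = gate.backward(grad)     # chain rule, gate by gate
        return grad


class ScaleGate:
    """Toy gate f(x) = a*x, with local gradient a."""
    def __init__(self, a):
        self.a = a
    def forward(self, x):
        return self.a * x
    def backward(self, dout):
        return dout * self.a


net = Net([ScaleGate(2.0), ScaleGate(3.0)])
out = net.forward(1.0)   # 1.0 * 2 * 3 = 6.0
dx = net.backward()      # d(out)/dx = 2 * 3 = 6.0
```

Every gate only needs to implement `forward` and `backward`; the net itself never needs to know what any gate computes.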

Implementation: forward/backward API

[Figure: a multiply gate computing z = x * y, where x, y, z are scalars]
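The multiply gate is the simplest concrete instance of this API. On the forward pass it caches its inputs, because the backward pass needs them: for z = x * y, the local gradients are dz/dx = y and dz/dy = x. A minimal sketch:

```python
class MultiplyGate:
    def forward(self, x, y):
        self.x, self.y = x, y      # cache inputs; backward needs them
        return x * y

    def backward(self, dz):
        dx = dz * self.y           # local gradient: dz/dx = y
        dy = dz * self.x           # local gradient: dz/dy = x
        return dx, dy


gate = MultiplyGate()
z = gate.forward(3.0, -4.0)        # z = -12.0
dx, dy = gate.backward(1.0)        # dx = -4.0, dy = 3.0
```

Note the "gradient switcheroo": each input's gradient is the upstream gradient scaled by the *other* input.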

Example: Torch Layers

Neural Network: without the brain stuff

(Before) Linear score function: f = W x

(Now) 2-layer Neural Network: f = W2 max(0, W1 x)

or 3-layer Neural Network: f = W3 max(0, W2 max(0, W1 x))
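The 2-layer network is just two matrix multiplies with an elementwise max(0, ·) (ReLU) in between. A NumPy sketch with illustrative sizes (a flattened 32x32x3 image, a 100-unit hidden layer, 10 classes — the hidden size is an assumption, not fixed by the formula):

```python
import numpy as np

rng = np.random.default_rng(0)
x = rng.standard_normal((3072, 1))             # e.g. a flattened 32x32x3 image
W1 = rng.standard_normal((100, 3072)) * 0.01   # first layer weights
W2 = rng.standard_normal((10, 100)) * 0.01     # second layer weights

h = np.maximum(0, W1 @ x)                      # hidden layer, ReLU nonlinearity
s = W2 @ h                                     # class scores, shape (10, 1)
```

The 3-layer version simply repeats the pattern once more: `s = W3 @ np.maximum(0, W2 @ np.maximum(0, W1 @ x))`. Without the max(0, ·) between them, the stacked matrices would collapse into a single linear map.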

Neural Networks: Architectures

Left: a “2-layer Neural Net”, or “1-hidden-layer Neural Net”. Right: a “3-layer Neural Net”, or “2-hidden-layer Neural Net”. Both are built from “Fully-connected” layers.

Training Neural Networks
A bit of history...

