Deep Learning Basics Lecture 11 Practical Methodology

1. The document discusses practical methodology for designing deep learning systems. 2. It recommends determining goals, establishing an end-to-end pipeline, identifying bottlenecks, and making incremental improvements. 3. Choosing appropriate networks, optimization algorithms, and hyperparameters is also covered, with suggestions like using SGD with momentum and learning rate decay.

Uploaded by

baris

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

98 views

Deep Learning Basics Lecture 11 Practical Methodology

Uploaded by

baris

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 25

Deep Learning Basics

Lecture 11: Practical Methodology

Princeton University COS 495
Instructor: Yingyu Liang
Designing process
Practical methodology
• Important to know a variety of techniques and understand their pros
and cons

• In practice, “can do much better with a correct application of a

commonplace algorithm than by sloppily applying an obscure
algorithm”
Practical designing process
1. Determine your goals: input and output; evaluation metrics
2. Establish an end-to-end pipeline
3. Determine bottlenecks in performance
4. Repeatedly make incremental changes based on findings

From Andrew Ng’s lecture and the book deep Learning

Practical designing process
1. Determine your goals: input and output; evaluation metrics
• What is the input of the system?
• What is the output of the system?
• What can be regarded as a good system? Accuracy? Speed? Memory? …
2. Establish an end-to-end pipeline
3. Determine bottlenecks in performance
4. Repeatedly make incremental changes based on findings
Practical designing process
1. Determine your goals: input and output; evaluation metrics
2. Establish an end-to-end pipeline
• Design the system as soon as possible, no need to be perfect
• Can be based on existing systems for similar goals
3. Determine bottlenecks in performance
4. Repeatedly make incremental changes based on findings
Practical designing process
1. Determine your goals: input and output; evaluation metrics
2. Establish an end-to-end pipeline
3. Determine bottlenecks in performance
• Divide the system into components
• Diagnose which component performing worse than expected
• Overfitting? Underfitting? Bugs in the software? Bad/too small dataset? …
4. Repeatedly make incremental changes based on findings
Practical designing process
1. Determine your goals: input and output; evaluation metrics
2. Establish an end-to-end pipeline
3. Determine bottlenecks in performance
4. Repeatedly make incremental changes based on findings
• Do not make big changes (unless the system just too bad)
• Replace system component? Change optimization algorithm? Adjust
hyperparameters? Get more/new data?
To begin with
Deep learning?
• First question: do you really need deep learning systems?
• Maybe simple models like logistic regression/SVM suffice for your
goals (i.e., shallow models)

• Choose deep learning if

• The task fall into the areas that deep learning is known to perform well
• The task is complicated enough that deep models have a better chance to win
Which networks to choose?
• Based on the input and the goal

• Vector input, supervised learning: feedforward networks

• If know input topological structure, use convolution
• Activation function: typically ReLU
Which networks to choose?
• Based on the input and the goal

• Vector input, unsupervised: generative model; autoencoder; energy

based model
• Highly depend on your goal
Which networks to choose?
• Based on the input and the goal

• Sequential input: Recurrent network

• LSTM (long-short term memory network)
• GRU (Gated Recurrent Unit)
• Memory network
• Attention-based variants
Which optimization algorithm?
• SGD with momentum and a decaying learning rate

• Momentum: 0.5 at the beginning and 0.9 at the end

• Learning rate decaying schemes

• linearly until reaching a fixed minimum learning rate
• decaying exponentially
• decreasing the learning rate by a factor of 2-10 each time validation error
plateaus
What regularizations?
• 𝑙2 regularization
• Early stopping
• Dropout
• Batch Normalization: can replace dropout

• Data augmentation if the transformations known/easy to implement

Reusing models
• If your task is similar to another task studied: copy the
model/optimization algorithm/hyperparameters, improve them

• Even can copy the trained models and then fine-tune it

Whether to use unsupervised pretraining?
• NLP: yes, use word embeddings almost all the time

• Computer vision: not quite; unsupervised now only good for semi-
supervised learning (a few labeled data, a lot of unlabeled data)
Tuning hyperparameters
Why?
• Performance: training/test errors; reconstruction; generative ability…

• Resources: training time; test time; memory…

Two types of approaches
• Manually tune: need to understand the hyperparameters and their
effects on the goals

• Automatically tune: need resources

Manually tune
• Need to know: the relationship between hyperparameters and
training/test errors and computational resources (memory and
runtime)

• Example: increase number of hidden units in each layer will

• Increase the model capacity
• Increase the generalization error (= test error – training error)
• Increase memory and runtime
Automatically tune
• Grid search
• Random search
• Model-based optimization (another level of optimization)
• Variables: hyperparameters
• Objective: validation errors
Debugging strategies
Difficulties
• Do not know a prior what performance/behavior to expect
• Components of the model can adapt for each other
• One components fails but the other components adapt to cover the failure
Debugging
• Try a small dataset
• Faster, save time
• Inspect components
• Monitor histograms of activations and gradients
• Compare symbolic derivatives to numerical derivatives
• Compare training/validation/test errors
• Overfitting or underfitting?
• Focus on worst mistake
• On which data points it perform worst? Why?

Deep Learning-Question Bank-Module-Wise
67% (3)
Deep Learning-Question Bank-Module-Wise
5 pages
Psychology For Nurses and The Caring Professions
100% (1)
Psychology For Nurses and The Caring Professions
288 pages
Mazda6 Brosura
No ratings yet
Mazda6 Brosura
48 pages
ECE604 f20 hw3
0% (1)
ECE604 f20 hw3
3 pages
300 Spartan Medication Ebooklet
No ratings yet
300 Spartan Medication Ebooklet
201 pages
UAW Ford Contract Summary
No ratings yet
UAW Ford Contract Summary
24 pages
Instrukcja EasyMIG210 210s 215 225 EN-1
No ratings yet
Instrukcja EasyMIG210 210s 215 225 EN-1
28 pages
ME8793 Process Planning and Cost EStimation UNIT 5 Notes
No ratings yet
ME8793 Process Planning and Cost EStimation UNIT 5 Notes
26 pages
Design and Analysis of High Efficiency DC - DC Boost Converter For EV Charging Applications
No ratings yet
Design and Analysis of High Efficiency DC - DC Boost Converter For EV Charging Applications
9 pages
Document
No ratings yet
Document
29 pages
2.calculating Marginal Revenue From A Linear Deman...
No ratings yet
2.calculating Marginal Revenue From A Linear Deman...
5 pages
Lab Course File EC 601 DSP
No ratings yet
Lab Course File EC 601 DSP
17 pages
March 2023: Monthly Current Affairs
No ratings yet
March 2023: Monthly Current Affairs
149 pages
Arduino Obstacle Avoiding Robot Code
No ratings yet
Arduino Obstacle Avoiding Robot Code
4 pages
Brain-Inspired Computing System
100% (1)
Brain-Inspired Computing System
10 pages
Aadeis-Aafdei Newsletter March 2022 Vol-Vii No.2
No ratings yet
Aadeis-Aafdei Newsletter March 2022 Vol-Vii No.2
64 pages
CH 08
No ratings yet
CH 08
82 pages
Reaction Time Lab Report: Name: Archisman Nath Teacher: Mr. David Hill Class: SNC2D0
No ratings yet
Reaction Time Lab Report: Name: Archisman Nath Teacher: Mr. David Hill Class: SNC2D0
19 pages
IBOS - Shipping Business Amended Slyllabus
No ratings yet
IBOS - Shipping Business Amended Slyllabus
25 pages
Neet2020 Apst Merit
No ratings yet
Neet2020 Apst Merit
125 pages
BREB Supplementary 4th Edition
No ratings yet
BREB Supplementary 4th Edition
140 pages
Impacts of Nursing On Politics and Health Policy
No ratings yet
Impacts of Nursing On Politics and Health Policy
1 page
Syllogism Question PDF 1
No ratings yet
Syllogism Question PDF 1
5 pages
Control Systems - Introduction - Tutorialspoint 1
No ratings yet
Control Systems - Introduction - Tutorialspoint 1
3 pages
Investors Are Getting Ready For Jump in Market Volatility: For Personal, Non-Commercial Use Only
No ratings yet
Investors Are Getting Ready For Jump in Market Volatility: For Personal, Non-Commercial Use Only
28 pages
A Novel Deep Learning Framework Approach For Sugarcane Disease Detection
No ratings yet
A Novel Deep Learning Framework Approach For Sugarcane Disease Detection
20 pages
Chapter 2: Literature Review
No ratings yet
Chapter 2: Literature Review
38 pages
Information Memorandum - The Dunes Cotton Tree
No ratings yet
Information Memorandum - The Dunes Cotton Tree
17 pages
Central Action Plan 2021-22
No ratings yet
Central Action Plan 2021-22
73 pages
စာချုပ်စာတမ်းမှတ်ပုံတင်လက်စွဲဥပဒေ
No ratings yet
စာချုပ်စာတမ်းမှတ်ပုံတင်လက်စွဲဥပဒေ
328 pages
Visakha Vidyalaya Colombo 05 Grade 13 Physics 2018 3rd Term Test Paper 6375dec5cefbf
No ratings yet
Visakha Vidyalaya Colombo 05 Grade 13 Physics 2018 3rd Term Test Paper 6375dec5cefbf
50 pages
Power System Lab
No ratings yet
Power System Lab
24 pages
Texas Racing Commission Notice
No ratings yet
Texas Racing Commission Notice
1 page
Project AWARE - Hackathon Presentation
100% (1)
Project AWARE - Hackathon Presentation
11 pages
Samsung cl21m21mq Chassis Ks7a N r2 Gold Rush PDF
No ratings yet
Samsung cl21m21mq Chassis Ks7a N r2 Gold Rush PDF
63 pages
Ym3012 199204
No ratings yet
Ym3012 199204
8 pages
FY24 Preliminary Fiscal Plan
No ratings yet
FY24 Preliminary Fiscal Plan
539 pages
04.02 More Decisions PDF
No ratings yet
04.02 More Decisions PDF
6 pages
Genomic Dna by Ligation SQK lsk110 GDE - 9108 - v110 - Revx - 10nov2020 Minion
No ratings yet
Genomic Dna by Ligation SQK lsk110 GDE - 9108 - v110 - Revx - 10nov2020 Minion
26 pages
Statistics Case Study SEM-1
No ratings yet
Statistics Case Study SEM-1
8 pages
Guidance Note 3B Version 1.0
100% (1)
Guidance Note 3B Version 1.0
26 pages
Numerical Modeling of Ground-Penetrating Radar in 2-D Using MATLAB
No ratings yet
Numerical Modeling of Ground-Penetrating Radar in 2-D Using MATLAB
12 pages
Recon Automation Using OSINT
100% (1)
Recon Automation Using OSINT
5 pages
Potential Pathways For Decarbonizing China's Inland Waterway Shipping
No ratings yet
Potential Pathways For Decarbonizing China's Inland Waterway Shipping
4 pages
Win Mail
No ratings yet
Win Mail
655 pages
Davrados 2023 Obligations II Outline - LZ
No ratings yet
Davrados 2023 Obligations II Outline - LZ
68 pages
Post Lockdown Handbook
No ratings yet
Post Lockdown Handbook
59 pages
Bank of India 9
No ratings yet
Bank of India 9
60 pages
Axion Technical Services PVT LTD: Vehicle Inspection Report
100% (1)
Axion Technical Services PVT LTD: Vehicle Inspection Report
4 pages
Carp Hatchery Management - Sudhan - 1535281360
No ratings yet
Carp Hatchery Management - Sudhan - 1535281360
47 pages
Coaching Classroom Instruction
No ratings yet
Coaching Classroom Instruction
2 pages
100 Câu T Đ NG Nghĩa - T Trái Nghĩa
No ratings yet
100 Câu T Đ NG Nghĩa - T Trái Nghĩa
15 pages
Report 3
No ratings yet
Report 3
6 pages
Flistmbbs GSF
No ratings yet
Flistmbbs GSF
17 pages
August 1st Week Top60 (Eng) by AC
No ratings yet
August 1st Week Top60 (Eng) by AC
38 pages
The Hindu Explainer Compilation 2022-23
No ratings yet
The Hindu Explainer Compilation 2022-23
169 pages
8th Quarterly Progress Report of JEEViKA
No ratings yet
8th Quarterly Progress Report of JEEViKA
19 pages
Untitled
No ratings yet
Untitled
18 pages
MGT 7-Annual Return
No ratings yet
MGT 7-Annual Return
14 pages
Week 2 - Select and Train A Model
No ratings yet
Week 2 - Select and Train A Model
29 pages
DGM MID SEM
No ratings yet
DGM MID SEM
39 pages
Unit-2 Improving-Deep-Neural-Networks
No ratings yet
Unit-2 Improving-Deep-Neural-Networks
18 pages
OSRAM SFH 309 Datasheet
No ratings yet
OSRAM SFH 309 Datasheet
16 pages
Deep Learning Basics Lecture 2 Backpropagation
No ratings yet
Deep Learning Basics Lecture 2 Backpropagation
31 pages
Deep Learning Basics Lecture 1 Feedforward
No ratings yet
Deep Learning Basics Lecture 1 Feedforward
31 pages
Deep Learning Basics Lecture 3 Regularization I
No ratings yet
Deep Learning Basics Lecture 3 Regularization I
32 pages
Deep Learning Basics Lecture 6 Convolutional NN
No ratings yet
Deep Learning Basics Lecture 6 Convolutional NN
36 pages
Deep Learning Basics Lecture 4 Regularization II
No ratings yet
Deep Learning Basics Lecture 4 Regularization II
27 pages
Deep Learning Basics Lecture 8 Autoencoder & DBM
No ratings yet
Deep Learning Basics Lecture 8 Autoencoder & DBM
28 pages
ECE604 f20 hw1
No ratings yet
ECE604 f20 hw1
1 page
PYu-RC Group 51 RoHS L 12
No ratings yet
PYu-RC Group 51 RoHS L 12
10 pages
Lectures On Electromagnetic Theory - Weng Cho Chew
No ratings yet
Lectures On Electromagnetic Theory - Weng Cho Chew
591 pages
SFH 203 - en
No ratings yet
SFH 203 - en
15 pages
SFH 235 Fa - en
No ratings yet
SFH 235 Fa - en
15 pages
Quiz Q1
No ratings yet
Quiz Q1
3 pages
NA Ch4 Student
No ratings yet
NA Ch4 Student
116 pages
YW-Eshel
No ratings yet
YW-Eshel
8 pages
Signals and Systems Using MATLAB: Luis F. Chaparro
No ratings yet
Signals and Systems Using MATLAB: Luis F. Chaparro
18 pages
Lec2 Linear Regression With One Variable
No ratings yet
Lec2 Linear Regression With One Variable
48 pages
EEN08 Step by Step Buhat
No ratings yet
EEN08 Step by Step Buhat
10 pages
SDSM
No ratings yet
SDSM
13 pages
TTS Chapter8
No ratings yet
TTS Chapter8
17 pages
problem solving module 1
No ratings yet
problem solving module 1
4 pages
Applications of Stack: 20CS2013 L-Data Structures and Algorithms Lab
No ratings yet
Applications of Stack: 20CS2013 L-Data Structures and Algorithms Lab
5 pages
DAA - Non-Deterministic Algorithms
No ratings yet
DAA - Non-Deterministic Algorithms
13 pages
Complete Download (Ebook) Python Programming and Numerical Methods: A Guide for Engineers and Scientist by Qingkai Kong, Timmy Siauw, Alexandre Bayen ISBN 9780128195499, 0128195495 PDF All Chapters
100% (7)
Complete Download (Ebook) Python Programming and Numerical Methods: A Guide for Engineers and Scientist by Qingkai Kong, Timmy Siauw, Alexandre Bayen ISBN 9780128195499, 0128195495 PDF All Chapters
67 pages
Introduction To Numerical Methods With Examples in Javascript
No ratings yet
Introduction To Numerical Methods With Examples in Javascript
55 pages
Sampling and Reconstruction
No ratings yet
Sampling and Reconstruction
6 pages
4.2 - 1b - Gradient Descent - Wikipedia - Workedout
No ratings yet
4.2 - 1b - Gradient Descent - Wikipedia - Workedout
5 pages
Artificial Intelligence: Pathfinding
No ratings yet
Artificial Intelligence: Pathfinding
37 pages
Syndrome Decoding
No ratings yet
Syndrome Decoding
4 pages
21MA44T NM Syllabus
No ratings yet
21MA44T NM Syllabus
2 pages
DL Notes
No ratings yet
DL Notes
35 pages
Linear Block Codes
No ratings yet
Linear Block Codes
13 pages
Numerical Analysis Final Exam
No ratings yet
Numerical Analysis Final Exam
2 pages
Gujarat Technological University: Instructions
No ratings yet
Gujarat Technological University: Instructions
1 page
Matlab Simulink Lab Exercises Designed For Teaching Digital Signal Processing Applications
No ratings yet
Matlab Simulink Lab Exercises Designed For Teaching Digital Signal Processing Applications
14 pages
Exam 2 S 13 Key
No ratings yet
Exam 2 S 13 Key
11 pages
CS 610-103 - Data Structure & Algorithm
No ratings yet
CS 610-103 - Data Structure & Algorithm
5 pages
MCQ On Linear Programming Problem
88% (8)
MCQ On Linear Programming Problem
7 pages
Video 18
No ratings yet
Video 18
17 pages
Amit Kumar Ranjan Dsa Lab Exam
No ratings yet
Amit Kumar Ranjan Dsa Lab Exam
6 pages
(2021) A Heuristic Approach For Two Dimensional Rectangular Cutting
No ratings yet
(2021) A Heuristic Approach For Two Dimensional Rectangular Cutting
15 pages