Lecture 1
● Group project
○ Two to three person team
○ Poster presentation and write-up
A Crash Course on Deep Learning
Elements of Machine Learning
Model
Objective
Training
What’s Special About Deep Learning
Compositional
Model
Image Modeling
Convolutional Nets
Language/Speech
Recurrent Nets
Image Modeling and Convolutional Nets
Breakthrough of Image Classification
Evolution of ConvNets
• LeNet (LeCun, 1998)
– Basic structures: convolution, max-pooling, softmax
• AlexNet (Krizhevsky et al., 2012)
– ReLU, Dropout
• GoogLeNet (Szegedy et al., 2014)
– Multiple independent pathways (sparse weight matrix)
• Inception BN (Ioffe et al., 2015)
– Batch normalization
• Residual net (He et al., 2015)
– Residual pathway (identity skip connection)
Fully Connected Layer
Output
Input
Convolution = Spatial Locality + Sharing
Spatial Locality
Without Sharing
With Sharing
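The two ingredients above can be sketched directly: a single kernel of weights slides over the input (spatial locality), and the same kernel is reused at every position (sharing). A minimal single-channel NumPy sketch, not a framework implementation:

```python
import numpy as np

def conv2d(x, w):
    """Valid 2D convolution (cross-correlation) of a single-channel
    input x with one shared kernel w: each output looks only at a
    local window, and every window reuses the same weights."""
    H, W = x.shape
    kH, kW = w.shape
    out = np.zeros((H - kH + 1, W - kW + 1))
    for i in range(out.shape[0]):
        for j in range(out.shape[1]):
            out[i, j] = np.sum(x[i:i + kH, j:j + kW] * w)
    return out

x = np.arange(16, dtype=float).reshape(4, 4)
w = np.ones((3, 3)) / 9.0        # one shared 3x3 averaging kernel
y = conv2d(x, w)
print(y.shape)                    # (2, 2)
```

Without sharing, each of the 4 output positions would need its own 3x3 weights; with sharing there are only 9 parameters total.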
Convolution with Multiple Channels
Source: http://cs231n.github.io/convolutional-networks/
Pooling Layer
Can be replaced by strided convolution
Source: http://cs231n.github.io/convolutional-networks/
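A 2x2 max-pooling layer can be sketched as follows; the note that pooling can be replaced by a strided convolution means the same stride-2 downsampling can be folded into the convolution itself. A NumPy sketch, assuming the input sides are divisible by 2:

```python
import numpy as np

def max_pool2x2(x):
    """2x2 max pooling with stride 2: keep the largest value in
    each non-overlapping 2x2 window."""
    H, W = x.shape
    return x.reshape(H // 2, 2, W // 2, 2).max(axis=(1, 3))

x = np.array([[1., 2., 5., 6.],
              [3., 4., 7., 8.],
              [9., 1., 2., 3.],
              [0., 5., 4., 1.]])
print(max_pool2x2(x))   # [[4. 8.] [9. 4.]]
```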
LeNet (LeCun 1998)
• Convolution
• Pooling
• Flatten
• Fully connected
• Softmax output
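The layer stack above shrinks the spatial size step by step until the feature map is flattened. A small helper makes the bookkeeping concrete; the 28x28 input and the specific kernel sizes here are illustrative assumptions, not the exact LeNet-5 configuration:

```python
def conv_out(size, kernel, stride=1, pad=0):
    """Output spatial size of a convolution or pooling layer."""
    return (size + 2 * pad - kernel) // stride + 1

# Assumed LeNet-style trace on a 28x28 input:
s = 28
s = conv_out(s, kernel=5)             # 5x5 conv     -> 24
s = conv_out(s, kernel=2, stride=2)   # 2x2 pool     -> 12
s = conv_out(s, kernel=5)             # 5x5 conv     -> 8
s = conv_out(s, kernel=2, stride=2)   # 2x2 pool     -> 4
print(s)   # 4: flatten 4*4*channels, then fully connected + softmax
```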
AlexNet (Krizhevsky et al., 2012)
Challenges: From LeNet to AlexNet
● Overfitting prevention
○ Dropout regularization
• ReLU
• Why ReLU?
– Cheap to compute
– It is roughly linear, so gradients do not saturate for positive inputs
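Both points are visible in the definition itself: ReLU is just a max with zero, so it is cheap, and it is exactly linear on the positive side (NumPy sketch):

```python
import numpy as np

def relu(x):
    """Rectified linear unit: zero for negatives, identity for positives."""
    return np.maximum(x, 0)

x = np.array([-2.0, -0.5, 0.0, 0.5, 2.0])
print(relu(x))   # [0.  0.  0.  0.5 2. ]
```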
Dropout Regularization
● Randomly zero out neurons with probability 0.5
Dropout Mask
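The dropout mask can be sketched as a per-neuron Bernoulli sample. This sketch uses the common "inverted dropout" variant, in which survivors are rescaled by 1/(1-p) at training time so the expected activation matches test time (where dropout is simply turned off); the slides do not specify the variant, so the rescaling is an assumption:

```python
import numpy as np

def dropout(x, p=0.5, rng=None):
    """Inverted dropout: zero each activation with probability p,
    scale survivors by 1/(1-p) to keep the expectation unchanged."""
    rng = rng or np.random.default_rng(0)  # seeded here for reproducibility
    mask = rng.random(x.shape) >= p        # keep with probability 1 - p
    return x * mask / (1.0 - p)

y = dropout(np.ones(10), p=0.5)
print(y)   # about half zeros; survivors scaled to 2.0
```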
GoogLeNet: Multiple Pathways, Fewer Parameters
Vanishing and Exploding Value Problem
● Imagine each layer multiplies its input by the same weight matrix W
○ W > 1: exponential explosion
○ W < 1: exponential vanishing
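The exponential behavior is easy to demonstrate with the scalar version of this intuition: repeated multiplication by a fixed weight blows up when it exceeds 1 and collapses toward zero when it is below 1 (toy sketch):

```python
big, small = 1.0, 1.0
w_big, w_small = 1.5, 0.5
for _ in range(50):        # 50 "layers", each multiplying by the same weight
    big *= w_big
    small *= w_small
print(big)    # ~6.4e8: exponential explosion
print(small)  # ~8.9e-16: exponential vanishing
```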
• Subtract mean
• Divide by standard deviation
• Output is invariant to input scale!
– Scale input by a constant
– Output of BN remains the same
• Impact
– Easy to tune learning rate
– Less sensitive initialization
(Ioffe et al., 2015)
Scale Normalization (assumes zero mean): invariance to input magnitude!
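The invariance claim is direct to check: subtracting the mean and dividing by the standard deviation cancels any constant factor on the input (NumPy sketch over a single feature batch; the small epsilon inside the square root is the usual numerical-stability assumption):

```python
import numpy as np

def batch_norm(x, eps=1e-5):
    """Normalize one feature over the batch: subtract the batch mean,
    divide by the batch standard deviation."""
    return (x - x.mean()) / np.sqrt(x.var() + eps)

x = np.array([1.0, 2.0, 3.0, 4.0])
y1 = batch_norm(x)
y2 = batch_norm(10.0 * x)   # scale the input by a constant
print(np.allclose(y1, y2, atol=1e-3))   # True: output invariant to scale
```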
Residual Net (He et al., 2015)
● http://dlsys.cs.washington.edu/materials
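The residual pathway adds the block's input back to its output, so each block only has to learn a correction on top of the identity; gradients can flow through the identity path even when the learned part is poorly conditioned. A toy sketch (the stand-in function f represents an arbitrary layer):

```python
import numpy as np

def residual_block(x, f):
    """Residual connection: output = x + f(x)."""
    return x + f(x)

x = np.array([1.0, 2.0, 3.0])
y = residual_block(x, lambda v: 0.1 * v)   # f is a stand-in for conv layers
print(y)   # [1.1 2.2 3.3]
```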
Lab 1 on Thursday
● Walk through how to implement a simple model for digit recognition
using MXNet Gluon
● Focus is on data I/O, model definition, and the typical training loop
● Familiarize yourself with typical framework APIs for vision tasks