Deep Learning
Hung-yi Lee 李宏毅
Deep learning attracts a lot of attention.
• I believe you have seen many exciting results before.
[Figure: a “neuron” computes a weighted sum z of its inputs plus a bias, then applies an activation function σ(z).]
Neural Network
Different connections lead to different network structures.
Network parameters: all the weights and biases in the “neurons”.
Fully Connected Feedforward Network
[Figure: a layer of two sigmoid neurons. With input (1, −1), the first neuron (weights 1 and −2, bias 1) gets z = 4 and outputs 0.98; the second neuron (weights −1 and 1, bias 0) gets z = −2 and outputs 0.12.]
Sigmoid Function: $\sigma(z)=\dfrac{1}{1+e^{-z}}$
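To make the sigmoid concrete, here is a minimal Python sketch (NumPy assumed; the function name `sigmoid` is my own) that evaluates $\sigma(z)$ at the z values from the example layer above.

```python
import numpy as np

def sigmoid(z):
    # Sigmoid activation: squashes any real z into (0, 1).
    return 1.0 / (1.0 + np.exp(-z))

# z values appearing in the example neurons above.
print(sigmoid(4))   # ~0.98
print(sigmoid(-2))  # ~0.12
```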
Fully Connected Feedforward Network
[Figure: the same network evaluated layer by layer on two inputs. With input (1, −1), the first layer outputs (0.98, 0.12) and the final output is (0.62, 0.83); with input (0, 0), the final output is (0.51, 0.85).]
This is a function.
Input vector, output vector:
$f\left(\begin{bmatrix}1\\-1\end{bmatrix}\right)=\begin{bmatrix}0.62\\0.83\end{bmatrix}$   $f\left(\begin{bmatrix}0\\0\end{bmatrix}\right)=\begin{bmatrix}0.51\\0.85\end{bmatrix}$
Given a network structure, we define a function set.
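As a hedged illustration of “given a network structure, we define a function set”: the sketch below (helper names `make_network` and `forward` are my own; NumPy assumed) builds the same small structure twice with different randomly drawn parameters, and the two resulting networks map the same input to different outputs, i.e. they are two different functions drawn from the same set.

```python
import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

def make_network(rng, sizes=(2, 2, 2)):
    # One (W, b) pair per layer; choosing parameter values picks one function from the set.
    return [(rng.standard_normal((n_out, n_in)), rng.standard_normal(n_out))
            for n_in, n_out in zip(sizes[:-1], sizes[1:])]

def forward(params, x):
    a = np.asarray(x, dtype=float)
    for W, b in params:
        a = sigmoid(W @ a + b)
    return a

rng = np.random.default_rng(0)
f1 = make_network(rng)   # same structure,
f2 = make_network(rng)   # different parameters
x = np.array([1.0, -1.0])
print(forward(f1, x), forward(f2, x))  # two different outputs: two different functions
```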
Fully Connected Feedforward Network
[Figure: input layer x1 … xN, hidden layers (Layer 1, Layer 2, …, Layer L) of neurons, output layer y1 … yM.]
Deep = Many hidden layers
[Figure: depth vs. ImageNet error rate for well-known networks]
AlexNet (2012): 8 layers, 16.4% error
VGG (2014): 19 layers, 7.3% error
GoogleNet (2014): 22 layers, 6.7% error
Residual Net (2015): special structure, 3.57% error (drawn next to Taipei 101 for scale)
Ref: http://cs231n.stanford.edu/slides/winter1516_lecture8.pdf
Ref: https://www.youtube.com/watch?v=dxB6299gpvI
Matrix Operation
$\sigma\left(\begin{bmatrix}1 & -2\\ -1 & 1\end{bmatrix}\begin{bmatrix}1\\ -1\end{bmatrix}+\begin{bmatrix}1\\ 0\end{bmatrix}\right)=\sigma\left(\begin{bmatrix}4\\ -2\end{bmatrix}\right)=\begin{bmatrix}0.98\\ 0.12\end{bmatrix}$
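A minimal NumPy check of the matrix form above, using the weights and biases read off the example layer (variable names are my own):

```python
import numpy as np

W = np.array([[1.0, -2.0],
              [-1.0, 1.0]])   # one row of weights per neuron
b = np.array([1.0, 0.0])      # one bias per neuron
x = np.array([1.0, -1.0])     # input vector

z = W @ x + b                 # [4, -2]
a = 1.0 / (1.0 + np.exp(-z))  # sigmoid, elementwise
print(z, a)                   # [ 4. -2.] [~0.98 ~0.12]
```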
Neural Network
[Figure: x = (x1 … xN) passes through layers with parameters (W1, b1), (W2, b2), …, (WL, bL) to produce y = (y1 … yM).]
$a^1 = \sigma(W^1 x + b^1)$
$a^2 = \sigma(W^2 a^1 + b^2)$
$\quad\vdots$
$y = \sigma(W^L a^{L-1} + b^L)$
Neural Network
[Figure: the same network written as one nested function.]
$y = f(x) = \sigma\big(W^L \cdots \sigma\big(W^2\,\sigma(W^1 x + b^1) + b^2\big) \cdots + b^L\big)$
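The nested expression is just L repeated matrix operations, so the whole forward pass can be sketched as a loop; this assumes the per-layer weight matrices and biases are given as Python lists (names are illustrative):

```python
import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

def network_forward(Ws, bs, x):
    # y = f(x) = sigma(W^L ... sigma(W^2 sigma(W^1 x + b^1) + b^2) ... + b^L)
    a = x
    for W, b in zip(Ws, bs):
        a = sigmoid(W @ a + b)  # one layer = one matrix operation + activation
    return a

# With the single-layer example from the Matrix Operation slide:
Ws = [np.array([[1.0, -2.0], [-1.0, 1.0]])]
bs = [np.array([1.0, 0.0])]
print(network_forward(Ws, bs, np.array([1.0, -1.0])))  # ~[0.98, 0.12]
```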
Output Layer as Multi-Class Classifier
The hidden layers act as a feature extractor, replacing hand-crafted feature engineering.
[Figure: input layer x1 … xK → hidden layers → Softmax output layer y1 … yM, used as a multi-class classifier.]
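A brief sketch of a softmax output layer in NumPy (the max-subtraction for numerical stability is my addition, not from the slides): it turns the last layer's scores into positive values that sum to 1, which is what makes the output usable as a multi-class classifier.

```python
import numpy as np

def softmax(z):
    # Subtracting the max does not change the result but avoids overflow.
    e = np.exp(z - np.max(z))
    return e / e.sum()

scores = np.array([3.0, 1.0, -2.0])
print(softmax(scores), softmax(scores).sum())  # class probabilities, summing to 1.0
```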
Example Application
[Figure: a 16 × 16 image of a handwritten digit is fed to the network.]
Input: 16 × 16 = 256 pixels; ink → 1, no ink → 0.
Output: each dimension represents the confidence of a digit, e.g. y1 = 0.1 (“is 1”), y2 = 0.7 (“is 2”), …, y10 = 0.2 (“is 0”), so the image is recognized as “2”.
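A small sketch of the input encoding just described, assuming the 16 × 16 image is stored as a NumPy array with ink = 1 and no ink = 0; flattening it gives the 256-dim input vector.

```python
import numpy as np

image = np.zeros((16, 16))   # 16 x 16 image: 0 = no ink
image[4:12, 7] = 1.0         # a crude vertical stroke of "ink" (illustrative only)

x = image.flatten()          # 256-dim input vector x1 ... x256
print(x.shape, x.sum())      # (256,) and the number of inked pixels
```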
Example Application
• Handwriting Digit Recognition
[Figure: image → Neural Network → “2”]
What is needed is a function with
Input: 256-dim vector → Output: 10-dim vector (y1 “is 1”, y2 “is 2”, …, y10 “is 0”).
Example Application
[Figure: input layer x1 … xN → Layer 1, Layer 2, …, Layer L → output layer y1 “is 1”, y2 “is 2”, …, y10 “is 0”.]
A function set containing the candidates for handwriting digit recognition.
[Figure: given a set of parameters, an input image x (x1 … x256) is mapped by the network, with a Softmax output layer, to y (y1 … y10), which is compared against the one-hot target ŷ (…, ŷ2 = 0, …, ŷ10 = 0).]
Cross Entropy:
$l(y,\hat{y}) = -\sum_{i=1}^{10} \hat{y}_i \ln y_i$
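A direct transcription of the cross-entropy formula into NumPy (names `y` for the softmax output and `y_hat` for the one-hot target are my own; a 3-class example is used for brevity):

```python
import numpy as np

def cross_entropy(y, y_hat):
    # l(y, y_hat) = -sum_i y_hat_i * ln(y_i)
    return -np.sum(y_hat * np.log(y))

y = np.array([0.7, 0.2, 0.1])      # network output (after softmax)
y_hat = np.array([1.0, 0.0, 0.0])  # one-hot target
print(cross_entropy(y, y_hat))     # -ln(0.7) ~ 0.357
```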
Total Loss
For all training data:
$L = \sum_{n=1}^{N} l^n$
[Figure: each training example x^n is mapped by the NN to y^n and compared with its target ŷ^n, giving loss l^n; n = 1, 2, 3, ….]
Find a function in the function set that minimizes the total loss L.
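The total loss is simply the sum of the per-example losses; a minimal sketch with hypothetical outputs and targets for N = 3 training examples:

```python
import numpy as np

def cross_entropy(y, y_hat):
    return -np.sum(y_hat * np.log(y))

# Hypothetical predictions and one-hot targets for N = 3 training examples.
outputs = [np.array([0.7, 0.3]), np.array([0.4, 0.6]), np.array([0.9, 0.1])]
targets = [np.array([1.0, 0.0]), np.array([0.0, 1.0]), np.array([1.0, 0.0])]

L = sum(cross_entropy(y, y_hat) for y, y_hat in zip(outputs, targets))
print(L)  # total loss over all training data
```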
Gradient Descent
Compute the gradient of the total loss with respect to every parameter in θ:
$\nabla L = \begin{bmatrix} \partial L/\partial w_1 \\ \partial L/\partial w_2 \\ \vdots \\ \partial L/\partial b_1 \\ \vdots \end{bmatrix}$
Then repeatedly update each parameter by $-\mu\,\partial L/\partial(\cdot)$, e.g.:
$w_1$: 0.2 → 0.15 → 0.09 → …
$w_2$: −0.1 → 0.05 → 0.15 → …
$b_1$: 0.3 → 0.2 → 0.10 → …
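A hedged sketch of the update rule shown above: every parameter moves by −μ times its partial derivative of the total loss. The toy loss and the finite-difference gradient below are stand-ins of my own; the slides obtain the gradient by backpropagation.

```python
import numpy as np

mu = 0.1  # learning rate

def loss(theta):
    # Toy loss with a known minimum at theta = [1, -2] (illustrative only).
    return np.sum((theta - np.array([1.0, -2.0])) ** 2)

def numerical_gradient(f, theta, eps=1e-6):
    # Finite-difference stand-in for backpropagation.
    grad = np.zeros_like(theta)
    for i in range(len(theta)):
        d = np.zeros_like(theta); d[i] = eps
        grad[i] = (f(theta + d) - f(theta - d)) / (2 * eps)
    return grad

theta = np.array([0.2, -0.1])  # initial parameter values (as in the table above)
for step in range(3):
    theta = theta - mu * numerical_gradient(loss, theta)  # theta <- theta - mu * dL/dtheta
    print(step, theta)
```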
Gradient Descent
This is the “learning” of machines in deep learning. Even AlphaGo uses this approach.
People imagine …… Actually ……
libdnn (developed by NTU student 周伯威)
Ref: http://speech.ee.ntu.edu.tw/~tlkagk/courses/MLDS_2015_2/Lecture/DNN%20backprop.ecm.mp4/index.html
Three Steps for Deep Learning
Step 1: define a set of functions (a Neural Network)
Step 2: goodness of function
Step 3: pick the best function
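To tie the three steps together, a compact end-to-end sketch (toy data, helper names, and the finite-difference gradient are all my own stand-ins; backpropagation would normally supply the gradient): Step 1 fixes a network structure, i.e. a function set; Step 2 scores each candidate by total cross-entropy; Step 3 picks the best function with gradient descent.

```python
import numpy as np

rng = np.random.default_rng(0)

# Toy data: 2-dim inputs, 2 classes (one-hot targets).
X = np.array([[0.0, 0.0], [0.0, 1.0], [1.0, 0.0], [1.0, 1.0]])
Y = np.array([[1.0, 0.0], [0.0, 1.0], [0.0, 1.0], [1.0, 0.0]])

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

def softmax(z):
    e = np.exp(z - np.max(z))
    return e / e.sum()

# Step 1: define a set of functions -- a fixed 2-3-2 network structure;
# the parameter vector theta selects one function from that set.
sizes = [(3, 2), (2, 3)]  # (n_out, n_in) per layer
def unpack(theta):
    params, i = [], 0
    for n_out, n_in in sizes:
        W = theta[i:i + n_out * n_in].reshape(n_out, n_in); i += n_out * n_in
        b = theta[i:i + n_out]; i += n_out
        params.append((W, b))
    return params

def forward(theta, x):
    (W1, b1), (W2, b2) = unpack(theta)
    a = sigmoid(W1 @ x + b1)
    return softmax(W2 @ a + b2)

# Step 2: goodness of function -- total cross-entropy loss over all training data.
def total_loss(theta):
    return sum(-np.sum(y_hat * np.log(forward(theta, x))) for x, y_hat in zip(X, Y))

# Step 3: pick the best function -- gradient descent
# (finite differences stand in for backpropagation here).
def numerical_gradient(f, theta, eps=1e-5):
    grad = np.zeros_like(theta)
    for i in range(len(theta)):
        d = np.zeros_like(theta); d[i] = eps
        grad[i] = (f(theta + d) - f(theta - d)) / (2 * eps)
    return grad

n_params = sum(n_out * n_in + n_out for n_out, n_in in sizes)
theta = rng.standard_normal(n_params) * 0.5
print("initial loss:", total_loss(theta))
mu = 0.5
for step in range(300):
    theta = theta - mu * numerical_gradient(total_loss, theta)
print("final loss:  ", total_loss(theta))  # typically far below the initial loss
```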