Deep Learning Models
Deep Learning Models
2012-05-03
Byoung-Hee Kim
Biointelligence Lab, CSE,
Seoul National University
NOTE: most slides are from talks of Geoffrey Hinton, Andrew Ng, and Yoshua Bengio.
(C) 2012, SNU Biointelligence Lab, http://bi.snu.ac.kr/ 2
(C) 2012, SNU Biointelligence Lab, http://bi.snu.ac.kr/ 3
Input output target
Two!
Sketch of a typical
perceptron from the 1960s
(C) 2012, SNU Biointelligence Lab, http://bi.snu.ac.kr/ 10
Second generation neural networks (~1985)
hidden
layers
input vector
(C) 2012, SNU Biointelligence Lab, http://bi.snu.ac.kr/ 11
But, finding any model with deep architecture was not successful till 2006
(C) 2012, SNU Biointelligence Lab, http://bi.snu.ac.kr/ 12
http://www.iro.umontreal.ca/~pift6266/H10/notes/deepintro.html
(C) 2012, SNU Biointelligence Lab, http://bi.snu.ac.kr/ 13
(C) 2012, SNU Biointelligence Lab, http://bi.snu.ac.kr/ 14
(C) 2012, SNU Biointelligence Lab, http://bi.snu.ac.kr/ 15
(C) 2012, SNU Biointelligence Lab, http://bi.snu.ac.kr/ 16
(C) 2012, SNU Biointelligence Lab, http://bi.snu.ac.kr/ 17
Agenda
Computer Perception
Unsupervised feature learning
Various deep learning models
Application cases of deep learning models
Written digit recognition/generation (MNIST dataset)
Image classification
Audio recognition
Language modeling
Motion generation
References
Appendix
pixel 1
Learning
algorithm
pixel 2
Input
Motorbikes
Input space Non-Motorbikes
pixel 2
pixel 1
(C) 2012, SNU Biointelligence Lab, http://bi.snu.ac.kr/ 20
Feature Learning
handle
Feature Learning
wheel Extractor algorithm
Input
Motorbikes
Input space Non-Motorbikes Feature space
handle
pixel 2
pixel 1 wheel
(C) 2012, SNU Biointelligence Lab, http://bi.snu.ac.kr/ 21
How is computer perception done?
Object
detection
Audio
classification
Low-level Speaker
Audio
audio features identification
Helicopter
control
Low-level state
Helicopter
features Action
(C) 2012, SNU Biointelligence Lab, http://bi.snu.ac.kr/ 22
Learning representations
Feature Learning
Sensor
Representation algorithm
HoG RIFT
Textons
(C) 2012, SNU Biointelligence Lab, http://bi.snu.ac.kr/ GLOH 24
Audio features
Spectrogram MFCC
Auditory cortex
learns to see.
Auditory
Cortex
Unlabeled images
Learning
algorithm
Feature representation
1
These have a state
of 1 or 0.
p(si 1)
The probability of
turning on is 0
0
determined by the
weighted input bi s j w ji
from other units j
(plus a bias)
1
p( si 1)
1 exp(bi s j w ji )
j
(C) 2012, SNU Biointelligence Lab, http://bi.snu.ac.kr/ 42
Binary
Stochastic
Neuron
Gabor functions.
[Images from DeAngelis, Ohzawa & Freeman, 1995]
Sparse coding illustration
Natural Images Learned bases (f1 , , f64): Edges
50
100
150
200 50
250
100
300
150
350
200
400
250 50
450
300 100
500
50 100 150 200 250 300 350 400 450
150 500
350
200
400
250
450
300
500
50 100 150 200
350 250 300 350 400 450 500
400
450
500
50 100 150 200 250 300 350 400 450 500
Test example
Cars Motorcycles
Testing:
What is this?
Semi-supervised learning
Testing:
What is this?
Car Motorcycle
Self-taught learning
Testing:
What is this?
Car Motorcycle
Self-taught learning
Sparse codin
g, LCC, etc. f1, f2, , fk
Car Motorcycle
Convolutional DBN for Images
Convolutional DBN on face images
object models
object parts
(combination
of edges)
edges
pixels
Learning of object parts
Input images
Samples from
feedforward
Inference
(control)
Samples from
Full posterior
inference
An application to modeling motion capture data
(Taylor, Roweis & Hinton, 2007)
Review
Y. Bengio, Learning deep architectures for AI,
Foundations and Trends in Machine Learning,
2(1):1-127, 2009.
I. Arel, D.C. Rose, and T.P. Karnowski, Deep
machine learning A new frontier in Artificial
Intelligence Research, Computational
Intelligence Magazine, 14:12-18, 2010.