Convolutional Neural Networks For Visual Recognition
Convolutional Neural Networks For Visual Recognition
Convolutional Neural Networks For Visual Recognition
boris [email protected]
1
Acknowledgments
2
Agenda
1. Course overview
2. Introduction to Deep Learning
– Classical Computer Vision vs. Deep learning
3. Introduction to Convolutional Networks
– Basic CNN Architecture
– Large Scale Image Classifications
– How deep should be Conv Nets?
– Detection and Other Visual Apps
3
Course overview
1. Introduction
– Intro to Deep Learning
– Caffe: Getting started
– CNN: network topology, layers definition
2. CNN Training
– Backward propagation
– Optimization for Deep Learning: SGD : monentum, rate
adaptation, Adagrad, SGD with Line Search, CGD
– “Regularization” (Dropout , Maxout)
4
Course overview
5
Introduction to Deep Learning
6
Buzz…
7
Deep Learning – from Research to
Technology
9
Classical Computer Vision Pipeline.
CV experts
1. Select / develop features: SURF, HoG, SIFT, RIFT,
…
2. Add on top of this Machine Learning for multi-class
recognition and train classifier
Feature Detection,
Extraction: Classification
SIFT, HoG... Recognition
10
Deep Learning –based Vision Pipeline.
Deep Learning:
Build features automatically based on training data
Combine feature extraction and classification
DL experts: define NN topology and train NN
Detection,
Deep NN... Deep NN...
Classification
Recognition
CV ML
Deep AdaBoost
features
NN... …
HoG, SIFT
12
Deep Learning Basics
Deep Learning – is a set of machine learning
algorithms based on multi-layer networks
CAT DOG
OUTPUTS
HIDDEN
NODES
INPUTS
13
Deep Learning Basics
Deep Learning – is a set of machine learning
algorithms based on multi-layer networks
CAT DOG
Training
14
1
Deep Learning Basics
Deep Learning – is a set of machine learning
algorithms based on multi-layer networks
CAT DOG
15
1
Deep Learning Basics
Deep Learning – is a set of machine learning
algorithms based on multi-layer networks
CAT DOG
16
Deep Learning Taxonomy
Supervised:
–Convolutional NN ( LeCun)
–Recurrent Neural nets (Schmidhuber )
Unsupervised
–Deep Belief Nets / Stacked RBMs (Hinton)
–Stacked denoising autoencoders (Bengio)
–Sparse AutoEncoders ( LeCun, A. Ng, )
17
Convolutional Networks
18
Convolutional NN
19
What is Convolutional NN ?
CNN - multi-layer NN architecture
– Convolutional + Non-Linear Layer
– Sub-sampling Layer
– Convolutional +Non-L inear Layer
– Fully connected layers
Supervised
Classi-
Feature Extraction
fication
20
What is Convolutional NN ?
2x2
22
ILSVRC: Classification
23
Imagenet Classifications 2012
24
ILSVRC 2012: top rankers
http://www.image-net.org/challenges/LSVRC/2012/results.html
25
Imagenet 2013: top rankers
http://www.image-net.org/challenges/LSVRC/2013/results.php
26
Imagenet Classifications 2013
27
Conv Net Topology
5 convolutional layers
3 fully connected layers + soft-max
650K neurons , 60 Mln weights
28
Why ConvNet should be Deep?
30
Why ConvNet should be Deep?
31
Why ConvNet should be Deep?
32
Why ConvNet should be Deep?
33
Conv Nets:
beyond Visual Classification
34
CNN applications
Farabet, 2013
38
Conv NN: Action Detection
BUZZ
41
A lot of buzz about Deep Learning
42
A lot of buzz about Deep Learning