0% found this document useful (0 votes)

186 views45 pages

Convolutional Neural Networks For Visual Recognition

This document provides an overview and agenda for a course on convolutional neural networks for visual recognition. It introduces convolutional networks and their applications in computer vision tasks like image classification, object detection, and scene parsing. It discusses how convolutional networks have achieved state-of-the-art results in large-scale visual recognition challenges by learning hierarchical representations from data. The document also examines why deep convolutional networks are effective and outlines the course content which will cover network architectures, training, and applications in localization and detection.

Uploaded by

Keren Evangeline. I

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPTX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

186 views45 pages

Convolutional Neural Networks For Visual Recognition

Uploaded by

Keren Evangeline. I

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPTX, PDF, TXT or read online on Scribd

You are on page 1/ 45

Introduction:

Convolutional Neural Networks

for Visual Recognition

boris [email protected]
1
Acknowledgments

This presentation is heavily based on:

– http://cs.nyu.edu/~fergus/pmwiki/pmwiki.php
– http://deeplearning.net/reading-list/tutorials/
– http://deeplearning.net/tutorial/lenet.html
– http://ufldl.stanford.edu/wiki/index.php/UFLDL_Tutorial

… and many other

2
Agenda

1. Course overview
2. Introduction to Deep Learning
– Classical Computer Vision vs. Deep learning
3. Introduction to Convolutional Networks
– Basic CNN Architecture
– Large Scale Image Classifications
– How deep should be Conv Nets?
– Detection and Other Visual Apps

3
Course overview

1. Introduction
– Intro to Deep Learning
– Caffe: Getting started
– CNN: network topology, layers definition
2. CNN Training
– Backward propagation
– Optimization for Deep Learning: SGD : monentum, rate
adaptation, Adagrad, SGD with Line Search, CGD
– “Regularization” (Dropout , Maxout)

4
Course overview

3. Localization and Detection

– Overfeat
– R-CNN (Regions with CNN)
4. CPU / GPU performance optimization
– CUDA
– Vtune, OpenMP, and Intel MKL (Math Kernel Library)

5
Introduction to Deep Learning

6
Buzz…

7
Deep Learning – from Research to
Technology

Deep Learning - breakthrough in

visual and speech recognition 8
Classical Computer Vision Pipeline

9
Classical Computer Vision Pipeline.

CV experts
1. Select / develop features: SURF, HoG, SIFT, RIFT,
…
2. Add on top of this Machine Learning for multi-class
recognition and train classifier
Feature Detection,
Extraction: Classification
SIFT, HoG... Recognition

Classical CV feature definition is domain-

specific and time-consuming

10
Deep Learning –based Vision Pipeline.

Deep Learning:
 Build features automatically based on training data
 Combine feature extraction and classification
DL experts: define NN topology and train NN

Detection,
Deep NN... Deep NN...
Classification
Recognition

Deep Learning promise:

train good feature automatically,
same method for different domain
11
Computer Vision +Deep Learning +
Machine Learning
We want to combine Deep Learning + CV + ML
 Combine pre-defined features with learned features;
 Use best ML methods for multi-class recognition
CV+DL+ML experts needed to build the best-in-class

CV ML
Deep AdaBoost
features
NN... …
HoG, SIFT

Combine best of Computer Vision

Deep Learning and Machine Learning

12
Deep Learning Basics
Deep Learning – is a set of machine learning
algorithms based on multi-layer networks
CAT DOG

OUTPUTS

HIDDEN
NODES

INPUTS
13
Deep Learning Basics
Deep Learning – is a set of machine learning
algorithms based on multi-layer networks
CAT DOG

Training

1
Deep Learning Basics
Deep Learning – is a set of machine learning
algorithms based on multi-layer networks
CAT DOG

16
Deep Learning Taxonomy

Supervised:
–Convolutional NN ( LeCun)
–Recurrent Neural nets (Schmidhuber )

Unsupervised
–Deep Belief Nets / Stacked RBMs (Hinton)
–Stacked denoising autoencoders (Bengio)
–Sparse AutoEncoders ( LeCun, A. Ng, )

17
Convolutional Networks

18
Convolutional NN

Convolutional Neural Networks is extension of

traditional Multi-layer Perceptron, based on 3 ideas:
1. Local receive fields
2. Shared weights
3. Spatial / temporal sub-sampling
See LeCun paper (1998) on text recognition:
http://yann.lecun.com/exdb/publis/pdf/lecun-01a.pdf

19
What is Convolutional NN ?
CNN - multi-layer NN architecture
– Convolutional + Non-Linear Layer
– Sub-sampling Layer
– Convolutional +Non-L inear Layer
– Fully connected layers
 Supervised

Classi-
Feature Extraction
fication

20
What is Convolutional NN ?

2x2

Convolution + NL Sub-sampling Convolution + NL

21
CNN success story: ILSVRC 2012

Imagenet data base: 14 mln labeled images, 20K categories

22
ILSVRC: Classification

23
Imagenet Classifications 2012

24
ILSVRC 2012: top rankers

http://www.image-net.org/challenges/LSVRC/2012/results.html

N Error-5 Algorithm Team Authors

1 0.153 Deep Conv. Neural Univ. of Krizhevsky et al
Network Toronto
2 0.262 Features + Fisher ISI Gunji et al
Vectors + Linear
classifier
3 0.270 Features + FV + SVM OXFORD_VG Simonyan et al
G
4 0.271 SIFT + FV + PQ + SVM XRCE/INRIA Perronin et al
5 0.300 Color desc. + SVM Univ. of van de Sande et
Amsterdam al

25
Imagenet 2013: top rankers

http://www.image-net.org/challenges/LSVRC/2013/results.php

N Error-5 Algorithm Team Authors

1 0.117 Deep Convolutional Clarifi Zeiler
Neural Network
2 0.129 Deep Convolutional Nat.Univ Min LIN
Neural Networks Singapore
3 0.135 Deep Convolutional NYU Zeiler
Neural Networks Fergus
4 0.135 Deep Convolutional Andrew Howard
Neural Networks
5 0.137 Deep Convolutional Overfeat Pierre Sermanet
Neural Networks NYU et al

26
Imagenet Classifications 2013

27
Conv Net Topology

 5 convolutional layers
 3 fully connected layers + soft-max
 650K neurons , 60 Mln weights

28
Why ConvNet should be Deep?

Rob Fergus, NIPS 2013 29

Why ConvNet should be Deep?

30
Why ConvNet should be Deep?

31
Why ConvNet should be Deep?

32
Why ConvNet should be Deep?

33
Conv Nets:
beyond Visual Classification

34
CNN applications

CNN is a big Plenty low hanging fruits

hammer

You need just a right nail! 35

Conv NN: Detection

Sermanet, CVPR 2014

36
Conv NN: Scene parsing

Farabet, PAMI 2013

37
CNN: indoor semantic labeling RGBD

Farabet, 2013
38
Conv NN: Action Detection

Taylor, ECCV 2010

39
Conv NN: Image Processing

Eigen , ICCV 2010

40
BACKUP

BUZZ

41
A lot of buzz about Deep Learning

 July 2012 - Started DL lab

 Nov 2012- Big improvement in Speech, OCR:
– Speech – reduce Error Rate by 25%
– OCR – reduce Error rate by 30%
 2013 launched 5 DL based products
– Voice search
– Photo Wonder
– Visual search

42
A lot of buzz about Deep Learning

Microsoft On Deep Learning for Speech goto 3:00-5:10

43
A lot of buzz about Deep Learning

Why Google invest in Deep Learning

44
A lot of buzz about Deep Learning

NYU “Deep Learning” Professor LeCun Will Head

Facebook’s New Artificial Intelligence Lab, Dec 10,
2013
45

Machine Learning Systems
No ratings yet
Machine Learning Systems
1,748 pages
Video Diffusion Tutorial Prof Mike Shou NUS 2023 Dec 15
No ratings yet
Video Diffusion Tutorial Prof Mike Shou NUS 2023 Dec 15
274 pages
Npu AI
No ratings yet
Npu AI
54 pages
CNN Course V1.3
No ratings yet
CNN Course V1.3
19 pages
Deep Learning
No ratings yet
Deep Learning
80 pages
Machine Learning Algorithms, Real World Applications and Research
No ratings yet
Machine Learning Algorithms, Real World Applications and Research
21 pages
Week 1 Introduction To ML
100% (1)
Week 1 Introduction To ML
42 pages
Intro To Deep Learning
100% (1)
Intro To Deep Learning
35 pages
ATV - CVPR'23 Tutorial
No ratings yet
ATV - CVPR'23 Tutorial
152 pages
2023-05 On Device AI - Double-Edged Sword
No ratings yet
2023-05 On Device AI - Double-Edged Sword
9 pages
Lectures On Machine Learning
100% (1)
Lectures On Machine Learning
69 pages
CNN Short
No ratings yet
CNN Short
61 pages
Keras Succinctly
No ratings yet
Keras Succinctly
107 pages
Machine Learning Models: by Mayuri Bhandari
No ratings yet
Machine Learning Models: by Mayuri Bhandari
48 pages
Neural Network: Submitted By-Aarushi Sharma 4-CSE-A 1729010003
No ratings yet
Neural Network: Submitted By-Aarushi Sharma 4-CSE-A 1729010003
18 pages
Practical Guide To Keras
No ratings yet
Practical Guide To Keras
28 pages
The Data Science Framework: Juan J. Cuadrado-Gallego Yuri Demchenko
No ratings yet
The Data Science Framework: Juan J. Cuadrado-Gallego Yuri Demchenko
202 pages
Introduction To DBMS
No ratings yet
Introduction To DBMS
27 pages
Database Management Systems Lec 1a
No ratings yet
Database Management Systems Lec 1a
29 pages
Answer Key Cbse Sample Paper Class 8 Mathematics PDF
No ratings yet
Answer Key Cbse Sample Paper Class 8 Mathematics PDF
4 pages
Residue Number Systems (RNS)
No ratings yet
Residue Number Systems (RNS)
19 pages
Chatbots With Personality Using Deep Learning
No ratings yet
Chatbots With Personality Using Deep Learning
47 pages
Chapter Fundamental Concepts of Database Management
No ratings yet
Chapter Fundamental Concepts of Database Management
36 pages
Rm&i CH-5
No ratings yet
Rm&i CH-5
22 pages
Federated Learning Overview, Strategies, Applications, Tools and
No ratings yet
Federated Learning Overview, Strategies, Applications, Tools and
24 pages
Deep Learning Unit 1
No ratings yet
Deep Learning Unit 1
32 pages
Sign Language Recognition Using Deep Learning
No ratings yet
Sign Language Recognition Using Deep Learning
6 pages
Lecture 5
No ratings yet
Lecture 5
114 pages
Machine Learning Notes AndrewNg
No ratings yet
Machine Learning Notes AndrewNg
141 pages
Machine Learning
No ratings yet
Machine Learning
20 pages
Rameez - Ducted Fans Vs Propellers
No ratings yet
Rameez - Ducted Fans Vs Propellers
2 pages
OpenSTEP Developers Tutorial 4.0 Mach 1996
No ratings yet
OpenSTEP Developers Tutorial 4.0 Mach 1996
240 pages
Understanding and Coding Neural Networks From Scratch in Python and R
100% (1)
Understanding and Coding Neural Networks From Scratch in Python and R
15 pages
Introduction To Learning: Frederic Precioso 24/01/2019
No ratings yet
Introduction To Learning: Frederic Precioso 24/01/2019
179 pages
Deep Learning (MODULE-3)
No ratings yet
Deep Learning (MODULE-3)
85 pages
Cognite Computing
100% (1)
Cognite Computing
14 pages
Performance Metrics (Classification) : Enrique J. de La Hoz D
100% (1)
Performance Metrics (Classification) : Enrique J. de La Hoz D
30 pages
Neural Network and Fuzzy Logic
No ratings yet
Neural Network and Fuzzy Logic
46 pages
Machine Learning and Neural Networks: Riccardo Rizzo
100% (1)
Machine Learning and Neural Networks: Riccardo Rizzo
113 pages
3 Art Therapy Techniques To Deal With Anxiety PDF
No ratings yet
3 Art Therapy Techniques To Deal With Anxiety PDF
3 pages
Renewable Energy
No ratings yet
Renewable Energy
38 pages
Keras - TF2 - Book
No ratings yet
Keras - TF2 - Book
364 pages
Accelerating Microservices Design and Development Codex2533
No ratings yet
Accelerating Microservices Design and Development Codex2533
11 pages
Deep Learning Approaches For Network Int
No ratings yet
Deep Learning Approaches For Network Int
116 pages
Types of Neural Networks
No ratings yet
Types of Neural Networks
7 pages
IOT Mod-4
No ratings yet
IOT Mod-4
42 pages
Image Classification Using Pre-Trained Convolutional Neural Network in COLAB
No ratings yet
Image Classification Using Pre-Trained Convolutional Neural Network in COLAB
6 pages
Gradient Descent Algorithms and Variations - PyImageSearch
No ratings yet
Gradient Descent Algorithms and Variations - PyImageSearch
21 pages
Linking Information Systems To The Business Plan
No ratings yet
Linking Information Systems To The Business Plan
10 pages
General Framework For Object Detection
No ratings yet
General Framework For Object Detection
9 pages
Graph Neural Network The Next Frontier in Deep Learning
No ratings yet
Graph Neural Network The Next Frontier in Deep Learning
1 page
Data Visualization For Industry 4
No ratings yet
Data Visualization For Industry 4
3 pages
2025 GKS-G Final Round Successful Candidates
No ratings yet
2025 GKS-G Final Round Successful Candidates
58 pages
Deep Neural Network
No ratings yet
Deep Neural Network
12 pages
PyTorch Workflow Fundamentals
No ratings yet
PyTorch Workflow Fundamentals
1 page
Isotope Practice Questions
No ratings yet
Isotope Practice Questions
5 pages
An Introduction To Vision-Language Modeling: Aishwarya Agrawal Kate Saenko Asli Celikyilmaz Vikas Chandra
No ratings yet
An Introduction To Vision-Language Modeling: Aishwarya Agrawal Kate Saenko Asli Celikyilmaz Vikas Chandra
76 pages
Computer Education For Nepali School Students - QBASIC CLASS IX
No ratings yet
Computer Education For Nepali School Students - QBASIC CLASS IX
10 pages
Scalable-ML-3 4 1
No ratings yet
Scalable-ML-3 4 1
147 pages
Introduction To Neural Networks
No ratings yet
Introduction To Neural Networks
51 pages
A Survey of Evolution of Image Captioning PDF
No ratings yet
A Survey of Evolution of Image Captioning PDF
18 pages
19 Data Science and Machine Learning Tools For People Who Don't Know Programming
No ratings yet
19 Data Science and Machine Learning Tools For People Who Don't Know Programming
8 pages
Lecture 01 (Introduction To Pattern Recognition)
No ratings yet
Lecture 01 (Introduction To Pattern Recognition)
26 pages
WWW - Studyguide.pk: Different Observation Types and Inter-Observer Reliability
No ratings yet
WWW - Studyguide.pk: Different Observation Types and Inter-Observer Reliability
2 pages
6869173cb375f164b6288668 - ## - Periodic Table Jwala Notes
No ratings yet
6869173cb375f164b6288668 - ## - Periodic Table Jwala Notes
210 pages
1019 1024 1
No ratings yet
1019 1024 1
6 pages
STATS Stem and Leaf Plots
No ratings yet
STATS Stem and Leaf Plots
5 pages
Percakapan Bhs Inggris Talking About Friend
No ratings yet
Percakapan Bhs Inggris Talking About Friend
4 pages
Icici BNK Imp New ALL OVER File
No ratings yet
Icici BNK Imp New ALL OVER File
18 pages
Grouping
No ratings yet
Grouping
5 pages
Jonathan 170725114020
No ratings yet
Jonathan 170725114020
33 pages
Algorithm For Segmentation
No ratings yet
Algorithm For Segmentation
28 pages
Unit 2 - Lesson 1.1 - Vocab & Reading - Pages 14 & 15
No ratings yet
Unit 2 - Lesson 1.1 - Vocab & Reading - Pages 14 & 15
78 pages
Simple Libraries in Python
No ratings yet
Simple Libraries in Python
12 pages
Ultrasound in Obstet Gyne - 2004 - SCHW Rzler - Sex Specific Antenatal Reference Growth Charts For Uncomplicated
No ratings yet
Ultrasound in Obstet Gyne - 2004 - SCHW Rzler - Sex Specific Antenatal Reference Growth Charts For Uncomplicated
7 pages
IO Wheel Balancer WB220L - CE - 1.1 - ENG - Set910710984
No ratings yet
IO Wheel Balancer WB220L - CE - 1.1 - ENG - Set910710984
18 pages
Ministry Magazine - A Theological Approach To Pastoral Leadership Today
No ratings yet
Ministry Magazine - A Theological Approach To Pastoral Leadership Today
11 pages
Team M.A.V.S Food Truck Business Plan Draft1 1
No ratings yet
Team M.A.V.S Food Truck Business Plan Draft1 1
43 pages
A Comparison of Scale: Macro, Micro, Nano: Primary Knowledge Participant Guide
No ratings yet
A Comparison of Scale: Macro, Micro, Nano: Primary Knowledge Participant Guide
23 pages
Professional Socialization of Sisc+
No ratings yet
Professional Socialization of Sisc+
24 pages
Petronas Twin Towers
No ratings yet
Petronas Twin Towers
6 pages
BreakHis Data Paper
No ratings yet
BreakHis Data Paper
16 pages
Training of The American Actor 1St Edition Edition Arthur Bartow Download
No ratings yet
Training of The American Actor 1St Edition Edition Arthur Bartow Download
48 pages
Aon Pre-Hire Onboarding
No ratings yet
Aon Pre-Hire Onboarding
19 pages
HHW Xi 24-25
No ratings yet
HHW Xi 24-25
33 pages
VGGIN-Net Deep Transfer Network For Imbalanced Breast Cancer Dataset
No ratings yet
VGGIN-Net Deep Transfer Network For Imbalanced Breast Cancer Dataset
12 pages
Dir-2025 243182
No ratings yet
Dir-2025 243182
16 pages
BC Recurrence Prediction ML
No ratings yet
BC Recurrence Prediction ML
7 pages
Maxillary Incisor Based Objectives in Present Day o 2022 Seminars in Orthodo
No ratings yet
Maxillary Incisor Based Objectives in Present Day o 2022 Seminars in Orthodo
13 pages
Hauwam Muhammed - Updated CV
No ratings yet
Hauwam Muhammed - Updated CV
4 pages
Going To Exercise
No ratings yet
Going To Exercise
2 pages
Coxph Randomsurvforest
No ratings yet
Coxph Randomsurvforest
9 pages
Batch Size To Improve Result
No ratings yet
Batch Size To Improve Result
4 pages
Group2 Buntal Hats
No ratings yet
Group2 Buntal Hats
8 pages
NSTP 2 Worksheet 3 - Matias
No ratings yet
NSTP 2 Worksheet 3 - Matias
4 pages
Agricultural Projects, Seminars, Papers, Assignments and Essays
No ratings yet
Agricultural Projects, Seminars, Papers, Assignments and Essays
2 pages
The Datadog Handbook: A Guide to Monitoring, Metrics, and Tracing
From Everand
The Datadog Handbook: A Guide to Monitoring, Metrics, and Tracing
Robert Johnson
No ratings yet
Hopfield Networks: Fundamentals and Applications of The Neural Network That Stores Memories
From Everand
Hopfield Networks: Fundamentals and Applications of The Neural Network That Stores Memories
Fouad Sabry
No ratings yet

Convolutional Neural Networks For Visual Recognition

Uploaded by

Convolutional Neural Networks For Visual Recognition

Uploaded by

Introduction:

Convolutional Neural Networks

This presentation is heavily based on:

… and many other

3. Localization and Detection

Deep Learning - breakthrough in

Classical CV feature definition is domain-

Deep Learning promise:

Combine best of Computer Vision

Convolutional Neural Networks is extension of

Convolution + NL Sub-sampling Convolution + NL

Imagenet data base: 14 mln labeled images, 20K categories

N Error-5 Algorithm Team Authors

N Error-5 Algorithm Team Authors

Rob Fergus, NIPS 2013 29

CNN is a big Plenty low hanging fruits

You need just a right nail! 35

Sermanet, CVPR 2014

Farabet, PAMI 2013

Taylor, ECCV 2010

Eigen , ICCV 2010

 July 2012 - Started DL lab

Microsoft On Deep Learning for Speech goto 3:00-5:10

Why Google invest in Deep Learning

NYU “Deep Learning” Professor LeCun Will Head

You might also like