Unit 3: Non-CNN Approaches to Object Recognition

The document discusses the differences between object detection and image classification, highlighting that object detection provides bounding boxes and class labels for identified entities in an image. It covers traditional non-CNN approaches to object detection, such as Haar features and cascading classifiers, as well as the Viola-Jones algorithm. Applications of object detection include facial recognition, autonomous vehicles, and features like smile detection in smartphones.


Object Detection and Instance Segmentation
Topics
• Difference between object detection and image classification
• Traditional, non-CNN approaches to object detection
• R-CNN (Regions with CNN features)
• Fast R-CNN (fast region-based CNN)
• Faster R-CNN
Image classification vs object detection

NEED FOR OBJECT DETECTION
• How confident is the model that the identified entity is the one that is claimed?
• What if we are very confident that there is an entity, say a dog, in the image, but its scale and position in the image are not as prominent as those of its owner, a Person entity?
Image classification vs object detection
• Image classification
• tells you that there is at least one instance of an entity in the image, but not exactly how many of them there are
• does not tell you where the identified entity in the image is
• Object detection
• tells you the placement of each entity in the image
• gives you bounding boxes and class labels (along with the probability of detection) for all the entities identified in an image
Differences between object detection
and image classification
Scenario 1
• Assume you are watching the movie 101 Dalmatians
(https://www.youtube.com/watch?v=nT-pCZyKmcw)
• You want to know how many Dalmatians you can actually count in a given scene from that movie.
• Image classification could, at best, tell you that there is at least one dog or one Dalmatian, but not exactly how many of them there are.
Scenario 2
• You want to extract the image of the dog from the scene to search the web for its breed or for similar dogs.
• The problem is that searching with the whole image might not work, and without identifying individual objects in the image, you have to do the cut-extract-search job manually.
Object detection
• We need a technique that not only identifies the entities in an image but also tells us their placement in the image.
• Object detection gives you bounding boxes and class labels (along with the probability of detection) for all the entities identified in an image.
Applications
• Facial recognition features in Facebook and Google Photos
• Face recognition: Google Photos uses facial recognition technology to identify and group photos of the same person across different albums and time periods.
• Autonomous vehicles
• Object detection helps them detect other vehicles, pedestrians, cyclists, traffic signs, and obstacles on the road, enabling safe navigation.
• Applications in phones
• To find out how many of the guests at your party were actually enjoying it, you can even run object detection for smiling faces, i.e., a smile detector.
• The Smile Shutter feature on phones automatically takes the picture when most of the faces in the scene are detected as smiling.
Object detection

• Object detection may be considered a combination of two tasks:
• Getting the right bounding boxes (or as many of them as possible, to filter later)
• Classifying the object in each bounding box (while returning the classification confidence for filtering)
Object detection
• For region proposals (regions that we send as proposals for classifying objects), we need some mechanism for finding the best values for the following parameters:
• Starting (or center) coordinates from which to extract/draw the candidate bounding box
• Length of the candidate bounding box
• Width of the candidate bounding box
• Stride across each axis (distance from one starting location to the next along the horizontal x-axis and the vertical y-axis)
Object detection
• each object will have a
different scale, so we know
that one fixed value for L and
W for these boxes will not
work.
• extract N number of candidate
boxes per starting coordinate
in the image, where N
encompasses most of the
sizes/scales that may fit
classification problem.
Candidate box generation
• Let l represent the length of the image and w represent the width of the image.
• If we consider all combinations of length and width within the image dimensions, we get l×w candidate boxes for each starting coordinate.
• For example, if our image is 100×100 pixels, considering all possible combinations of length and width within this range would lead to 100×100 = 10,000 candidate boxes per starting coordinate.
• This is computationally expensive and impractical.
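The combinatorics above can be checked with a few lines of arithmetic. This is a minimal sketch of the 100×100 example from the slide; the stride-1 assumption (one starting coordinate per pixel) is mine, added to show how quickly the total explodes.

```python
# Candidate box explosion for a 100x100 image: every (length, width)
# pair up to the image size is a separate candidate box shape at each
# starting coordinate.
img_l, img_w = 100, 100

boxes_per_start = img_l * img_w       # 10,000 shapes per coordinate
total_starts = img_l * img_w          # assume one start per pixel (stride = 1)
total_boxes = boxes_per_start * total_starts

print(boxes_per_start)   # 10000
print(total_boxes)       # 100000000 -- 100 million boxes for one tiny image
```

Even before classifying anything, a brute-force enumeration of shapes and positions is already in the hundreds of millions, which is why the stride and a small fixed set of scales matter so much.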
Object detection
• How many starting coordinates do we need to visit in the image, extracting N boxes from each? This is determined by the stride.
• A big stride means fewer starting points and fewer boxes to look at, but the regions it skips may contain objects of interest in themselves.
• A small stride (say, 1 pixel in each direction) means more starting points and a lot of candidate boxes to examine.
• The choice of stride, i.e., the step size of the sliding window as it moves across the image, is crucial in balancing computational efficiency against the accuracy of object detection.
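The sliding-window enumeration described above can be sketched directly. This is an illustrative implementation, not code from the slides; the function name, the box sizes, and the stride values are my own choices, used only to show how the stride changes the number of candidate boxes.

```python
def candidate_boxes(img_w, img_h, sizes, stride):
    """Enumerate (x, y, w, h) candidate boxes: a sliding window of each
    size in `sizes`, stepping `stride` pixels along both axes."""
    boxes = []
    for (w, h) in sizes:
        # Only positions where the whole box fits inside the image.
        for y in range(0, img_h - h + 1, stride):
            for x in range(0, img_w - w + 1, stride):
                boxes.append((x, y, w, h))
    return boxes

# Same 100x100 image, same 32x32 window: a big stride yields far fewer
# boxes to classify than a small one.
few = candidate_boxes(100, 100, [(32, 32)], stride=16)
many = candidate_boxes(100, 100, [(32, 32)], stride=2)
print(len(few))    # 25   (5 positions per axis)
print(len(many))   # 1225 (35 positions per axis)
```

Going from stride 16 to stride 2 multiplies the work by roughly 50× for a single scale; with N scales per starting point, the gap widens proportionally, which is the trade-off the slide describes.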
Non-CNN approaches to object detection
• Libraries such as OpenCV include software bundles for smartphones, robotics projects, and many other uses, providing detection capabilities for specific objects (faces, smiles, and so on).

• Traditional approaches:
• Haar features
• Cascading classifiers
• Viola-Jones algorithm
Introduction
 Before CNNs, OpenCV libraries were used for object detection in smartphones, robotics projects, and many other applications.
 These were innovative ideas drawing inspiration from different fields of science and mathematics:
 Haar features
 Cascading classifiers
 Viola-Jones algorithm
 Haar features (from the Haar wavelet in mathematics)
 A Haar classifier, or Haar cascade classifier, is a machine learning object detection program that identifies objects in images and video.
 These features make it easy to find the edges or lines in an image, or to pick out areas where there is a sudden change in pixel intensity.
 Haar, or Haar-like, features are formations of rectangles with varying pixel density.
 They sum up the pixel intensities in adjacent rectangular regions at specific locations in the detection window.
Haar Features

 Based on the difference between the sums of pixel intensities across regions, they categorize the different subsections of the image: two-rectangle features, three-rectangle features, and four-rectangle features.
 They work better on monochrome (grayscale) images.
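The "difference between sums of pixel intensities" can be computed cheaply with an integral image, where any rectangle sum costs four lookups regardless of its size. The sketch below is illustrative, not from the slides: the function names, the toy 4×4 patch, and the feature placement are my own, chosen to show a two-rectangle (horizontal edge) feature firing on a bright-above-dark boundary.

```python
def integral_image(img):
    """ii[y][x] = sum of img over all rows < y and columns < x
    (one-pixel zero padding on top and left)."""
    h, w = len(img), len(img[0])
    ii = [[0] * (w + 1) for _ in range(h + 1)]
    for y in range(h):
        row_sum = 0
        for x in range(w):
            row_sum += img[y][x]
            ii[y + 1][x + 1] = ii[y][x + 1] + row_sum
    return ii

def rect_sum(ii, x, y, w, h):
    # Sum of any rectangle in four lookups.
    return ii[y + h][x + w] - ii[y][x + w] - ii[y + h][x] + ii[y][x]

def two_rect_horizontal(ii, x, y, w, h):
    """Horizontal-edge response: top-half sum minus bottom-half sum."""
    top = rect_sum(ii, x, y, w, h // 2)
    bottom = rect_sum(ii, x, y + h // 2, w, h // 2)
    return top - bottom

# Toy grayscale patch: bright top rows over dark bottom rows,
# i.e. a strong horizontal edge.
img = [[9, 9, 9, 9],
       [9, 9, 9, 9],
       [1, 1, 1, 1],
       [1, 1, 1, 1]]
ii = integral_image(img)
print(two_rect_horizontal(ii, 0, 0, 4, 4))  # 72 - 8 = 64
```

A large response means the two halves differ sharply in intensity, exactly the edge pattern the two-rectangle feature is meant to detect; on a flat patch the response would be near zero.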
Haar Features
• These features can be grouped into three major categories:
• Two-rectangle features
• Three-rectangle features
• Four-rectangle features

Two-rectangle features
• responsible for finding edges in a horizontal or a vertical direction
Three-rectangle features
• responsible for finding whether there is a lighter region surrounded by darker regions on either side, or vice versa
Four-rectangle features
• responsible for finding changes of pixel intensity across diagonals
Haar Features
(Figure: how a Haar feature traverses an image from left to right.)
 Haar feature challenges:
 Haar feature extraction involves calculating the difference between the sums of pixel intensities in adjacent rectangular regions.
 Even though the vast majority of sub-regions do not contain the target object, the classifier still processes them, leading to unnecessary computational overhead.
 There is no way to efficiently prioritize the analysis of regions more likely to contain the object of interest.
Cascading Classifiers
 A cascading classifier combines multiple Haar features in a hierarchy to build a classifier.
 Instead of analyzing the entire image with every Haar feature, cascades break the detection process into stages, each consisting of a set of features.
 This works because only a small fraction of the pixels in the entire image is related to the object in question.
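The staged early-rejection idea can be sketched in a few lines. This is a toy illustration of the cascade control flow only, not the real OpenCV cascade: the stage functions and thresholds below are invented for the example, standing in for trained Haar-feature tests.

```python
def run_cascade(window, stages):
    """`stages` is a list of (feature_fn, threshold) pairs, cheapest
    first. A window is rejected at the FIRST failing stage, so most
    windows never reach the expensive later stages."""
    for feature_fn, threshold in stages:
        if feature_fn(window) < threshold:
            return False        # early exit: later stages never run
    return True                 # survived every stage

# Illustrative stages over a flat list of pixel intensities:
# a cheap mean-brightness check, then a contrast check on survivors.
stages = [
    (lambda w: sum(w) / len(w), 3.0),   # stage 1: brightness
    (lambda w: max(w) - min(w), 4.0),   # stage 2: contrast
]

print(run_cascade([1, 1, 2, 2], stages))   # False: rejected at stage 1
print(run_cascade([5, 5, 5, 5], stages))   # False: rejected at stage 2
print(run_cascade([2, 9, 2, 9], stages))   # True: passes both stages
```

The key property is that a negative window costs only as many feature evaluations as the stage that rejects it, which is what makes scanning thousands of sub-windows per image tractable.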
 The Viola-Jones algorithm
 capable of delivering detections with high TPRs (True Positive Rates) and low FPRs (False Positive Rates)
 Constraints of the algorithm:
 It could only detect faces, not recognize them.
 The faces had to be present in the image in a frontal view; no other view could be detected.

 Heart of the algorithm: Haar(-like) features and cascading classifiers
 It uses a subset of Haar features to determine general features on a face, such as:
 Eyes (determined by a horizontal two-rectangle feature, with a dark horizontal rectangle above the eye forming the brow, followed by a lighter rectangle below)
 Nose (a vertical three-rectangle feature, with the nose as the central light rectangle and a darker rectangle on either side of the nose, forming the temples), and so on
The Viola-Jones algorithm (continued)
 These Haar-like features are then used in cascading classifiers to speed up the detection problem without losing the robustness of detection.
 Drawback
 Training these cascades for a new object was still very time consuming, and they had a lot of constraints.
Edge detection
