0% found this document useful (0 votes)

35 views

Advanced Deep Learning Based Object Detection Methods

This document discusses several advanced deep learning methods for object detection. It begins by describing improvements to non-maximum suppression methods, such as linear soft-NMS and Gaussian soft-NMS. It then discusses learning non-maximum suppression by including the suppression process in the training. The document also covers multi-scale object detection using FPN and single-stage detection using RetinaNet with focal loss. Finally, it summarizes Mask R-CNN for instance segmentation, pose estimation, and its use of RoiAlign for fine spatial information.

Uploaded by

seul alone

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

35 views

Advanced Deep Learning Based Object Detection Methods

Uploaded by

seul alone

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 36

Advanced Deep Learning based Object

Detection Methods
Improving Object Detection With One Line of Code
● Non-Maximum Suppression is a greedy
process.
○ It worked well enough in 2007 but it doesn’t
anymore.
● High scoring detections can be suppressed
just as low scoring detections.
○ Overlap with stronger detection is the only
criteria.
● Should one detection completely suppress
another detection, or simply reduce its
confidence?
Improving Object Detection With One Line of Code
● NMS:

● Linear Soft-NMS:

● Gaussian Soft-NMS:
○ Linear Soft-NMS is not continuous in terms of
overlap and a sudden penalty is applied when a
NMS threshold is reached.
○ Instead we can use a continuous function:
Improving Object Detection With One Line of Code
Improving Object Detection With One Line of Code
Learning Non-Maximum Suppression
● Object detectors are mostly trained
end-to-end, except for the NMS.
○ NMS is still fully hand-crafted, and forces a
trade-off between recall and precision.
● Training loss is not evaluation loss.
○ Training is performed without NMS
○ During evaluation, multiple detections for same
object count as false positives.
● Instead, train the network to include the
suppression process.
○ Only output one bounding box per object.
○ Learn how to handle close objects.
Learning Non-Maximum Suppression
● Additional blocks that: ● New loss:
○ Encode pairwise information. ○ Only one positive candidate per object.
○ For each detection, pool information from all ○ Instead of the current practice to take all
pairings. objects with IoU>50%
○ Update feature vector.
○ Repeat.
Learning Non-Maximum Suppression
Learning Non-Maximum Suppression
Multi-Scale Object Detection

● Multi-scale object detection using image pyramid

○ Predict different scales by applying same model at different image resolutions.
● Classic method.
● But also, in OverFeat.
● Slow. Requires multiple evaluation of the same model.
Multi-Scale Object Detection

● Predict multiple scale of objects using a single feature map.

● Same as Faster R-CNN.
● Fast
● Single model (same in training as in testing).
● Bad features resolution for small objects.
Multi-Scale Object Detection

● Predict different object sizes at different feature scales.

● Same as SSD.
● Good features resolution for small objects
● But features are much weaker than in deeper layers.
Feature Pyramid Network (FPN)

● Single model (same in training as in testing).

● Good features resolution for small objects.
● Strong features in all layers.
● Almost no overhead over SSD (= Fast).
Feature Pyramid Network (FPN)
Feature Pyramid Network (FPN)

● How important is top-down enrichment?

● How important are lateral connections?
● How important are pyramid representations?
Feature Pyramid Network (FPN)

● How important is top-down enrichment?

● How important are lateral connections?
● How important are pyramid representations?
Focal Loss for Dense Object Detection

● Can we train a single stage detector to be as accurate as two stage detectors?

● Contributions:
○ RetinaNet: Single stage object detector based on FPN backbone.
○ New loss.
Focal Loss for Dense Object Detection

● Class unbalance is an important issue for object detection.

● Previous solutions:
○ Random resampling at 1:3 ratio.
○ Hard negative resampling at 1:3 ratio.
● Both solutions means that at each step, we only a few samples actually matters
to the loss function.
● Instead, include all samples but use different weight for each class.
○ Regular cross entropy:
○ Weighted cross entropy:
Focal Loss for Dense Object Detection
● Using weight CE as baseline:
○ Can we do better?
○ Can we use different weight for each sample?
● Focal loss:
● Every sample is weighted according to its error.
○ We want to focus on samples which are
mislabeled.
Focal Loss for Dense Object Detection

● Different parameters for RetinaNet

Focal Loss for Dense Object Detection

● Comparison with online hard negative mining

Focal Loss for Dense Object Detection

● Accuracy/speed trade-offs
Focal Loss for Dense Object Detection

● Benchmark results
Also Read:
Deformable Convolutional Networks
https://arxiv.org/abs/1703.06211
YouTube Videos

● CS231n
○ Lecture 11 - Detection and segmentation https://youtu.be/nDPWywWRIRo
● Deep Learning for Objects and Scenes (CVPR 2017 Workshop)
○ Lecture 1: Learning Deep Representations for Visual Recognition, by Kaiming He
https://youtu.be/jHv37mKAhV4
○ Lecture 2: Deep Learning for Instance-level Object Understanding, by Ross Girshick
https://youtu.be/jHv37mKAhV4?t=39m4s
Looking for brilliant researchers

[email protected] /
[email protected]
Computer Vision Tasks

Source: CS231n Object detection http://cs231n.stanford.edu/slides/2016/winter1516_lecture8.pdf

Mask R-CNN
● Instance segmentation with pose
estimation for people.
● Extends faster R-CNN by adding new
branch for the instance mask task.
● Pose estimation can be added by simply
adding an additional branch.
● SOTA accuracy on detection, segmentation
and pose estimation at 5 FPS on GPU.
● https://arxiv.org/abs/1703.06870
● Girshick won young researcher award.
Mask R-CNN
Mask R-CNN
Mask R-CNN
Mask R-CNN
● RoiPool
○ Quantization breaks pixel-to-pixel alignment
○ Too coarse and not good for fine spatial
information required for mask.
● RoiAlign
○ Bilinearly sample the proposal region and avoid
the quantization.
○ Smoothly normalize features and predictions
into coordinate frame free of scale and aspect
ratio
Mask R-CNN
Mask R-CNN
● Backbone architecture
○ ResNet
○ ResNeXt
○ FPN
● Mask representation
○ FC vs. Convolutional
○ Multinomial vs. Independent Masks: softmax
vs. sigmoid
○ Class-Specific vs. Class-Agnostic Masks:
almost same accuracy
● Multi-task learning
○ Mask task improves object detection accuracy.
○ Keypoint task reduces object detection
accuracy.
Mask R-CNN
● Pose estimation
○ Simply add an additional branch.
○ Model a keypoint’s location as a one-hot mask,
and adopt Mask R-CNN to predict K masks.
○ Experiments are mainly to demonstrate the
generality of the Mask R-CNN framework.
○ RoiAlign improves this task’s accuracy as well.
Looking for brilliant researchers

[email protected]

Deep Neural Networks
No ratings yet
Deep Neural Networks
25 pages
Object and Face Detection Based On Center-Net 1
No ratings yet
Object and Face Detection Based On Center-Net 1
7 pages
Ding 2018 IOP Conf. Ser. Mater. Sci. Eng. 322 062024
No ratings yet
Ding 2018 IOP Conf. Ser. Mater. Sci. Eng. 322 062024
6 pages
Lecture 7 Deep Learning in Object Detection 2025
No ratings yet
Lecture 7 Deep Learning in Object Detection 2025
43 pages
Focal Loss For Dense Object Detection
No ratings yet
Focal Loss For Dense Object Detection
10 pages
End-to-End Object Detection with Fully Convolutional Network
No ratings yet
End-to-End Object Detection with Fully Convolutional Network
13 pages
Retina Net
No ratings yet
Retina Net
6 pages
Image and Video Analytics Unit 3
No ratings yet
Image and Video Analytics Unit 3
18 pages
Object Detection and Identification
67% (3)
Object Detection and Identification
20 pages
Report For Retinanet
No ratings yet
Report For Retinanet
7 pages
Overview_of_object_detection_based_on_deep_learnin
No ratings yet
Overview_of_object_detection_based_on_deep_learnin
7 pages
Wang NAS-FCOS Fast Neural Architecture Search For Object Detection CVPR 2020 Paper
No ratings yet
Wang NAS-FCOS Fast Neural Architecture Search For Object Detection CVPR 2020 Paper
9 pages
Objectdetection
No ratings yet
Objectdetection
7 pages
cv2021 Lec6 Object Detection - 1600 - PDF - Gdrive.vip
No ratings yet
cv2021 Lec6 Object Detection - 1600 - PDF - Gdrive.vip
60 pages
1 ObjectDetection
No ratings yet
1 ObjectDetection
46 pages
Od Segment
No ratings yet
Od Segment
53 pages
Final Report - Removed
No ratings yet
Final Report - Removed
43 pages
Object Detection
No ratings yet
Object Detection
57 pages
od_segment_221219_043435
No ratings yet
od_segment_221219_043435
40 pages
L10-Lecture-Detection.Segmentation-v2.5
No ratings yet
L10-Lecture-Detection.Segmentation-v2.5
35 pages
[email protected]
No ratings yet
[email protected]
9 pages
Efficient Detection of Small and Complex Objects for Autonomous Driving Using Deep Learning
No ratings yet
Efficient Detection of Small and Complex Objects for Autonomous Driving Using Deep Learning
5 pages
Fairmot Explained 1
No ratings yet
Fairmot Explained 1
19 pages
Second Progress Report UID - 17BCS2127
No ratings yet
Second Progress Report UID - 17BCS2127
13 pages
Assignment-2:DIP: Mr. Victor Mageto CP10101610245
No ratings yet
Assignment-2:DIP: Mr. Victor Mageto CP10101610245
10 pages
Object Detection using ELAN
No ratings yet
Object Detection using ELAN
6 pages
Focus-And-Detect A Small Object Detection Framework For Aerial Images
No ratings yet
Focus-And-Detect A Small Object Detection Framework For Aerial Images
9 pages
Development of Framework For Detecting Smoking Scenes
No ratings yet
Development of Framework For Detecting Smoking Scenes
5 pages
4. Object Detection and Segmentation
No ratings yet
4. Object Detection and Segmentation
37 pages
Module 6
No ratings yet
Module 6
83 pages
The Ultimate Guide To Object Detection
No ratings yet
The Ultimate Guide To Object Detection
16 pages
Introduction To Object Detection
No ratings yet
Introduction To Object Detection
24 pages
Lesson 07
No ratings yet
Lesson 07
59 pages
Cviii 2024 Ws
No ratings yet
Cviii 2024 Ws
45 pages
mv_cs4243_2024_amir_6_p2 (1)
No ratings yet
mv_cs4243_2024_amir_6_p2 (1)
95 pages
Vision
No ratings yet
Vision
24 pages
Incremental Training For Image Classification of Unseen Objects
No ratings yet
Incremental Training For Image Classification of Unseen Objects
19 pages
Final Presentation On Object Detection
No ratings yet
Final Presentation On Object Detection
10 pages
Object Detection
No ratings yet
Object Detection
13 pages
2022 - Enhanced Feature Fusion and Multiple Receptive Fields Object Detection
No ratings yet
2022 - Enhanced Feature Fusion and Multiple Receptive Fields Object Detection
12 pages
Object Detection Using TensorFlow
No ratings yet
Object Detection Using TensorFlow
21 pages
Knowledge-Based Systems
No ratings yet
Knowledge-Based Systems
10 pages
2004 10934v1 PDF
No ratings yet
2004 10934v1 PDF
17 pages
fdgdfd
No ratings yet
fdgdfd
15 pages
NN 09
No ratings yet
NN 09
34 pages
Havi Doc Batch 10
No ratings yet
Havi Doc Batch 10
17 pages
Tensor Flow
No ratings yet
Tensor Flow
5 pages
机器学习读书会嘉宾分享-计算机视觉-目标检测
No ratings yet
机器学习读书会嘉宾分享-计算机视觉-目标检测
52 pages
Literature Survey For Robotics
No ratings yet
Literature Survey For Robotics
6 pages
Deep Learning: Dr. Sanjeev Sharma
No ratings yet
Deep Learning: Dr. Sanjeev Sharma
61 pages
Focal Loss For Dense Object Detection
No ratings yet
Focal Loss For Dense Object Detection
9 pages
Object Detect
No ratings yet
Object Detect
12 pages
Real Time Object Detection in Surveillance Cameras With 2xjeq74wam
No ratings yet
Real Time Object Detection in Surveillance Cameras With 2xjeq74wam
8 pages
dsfd
No ratings yet
dsfd
10 pages
Li 2021 J. Phys.: Conf. Ser. 1827 012085
No ratings yet
Li 2021 J. Phys.: Conf. Ser. 1827 012085
11 pages
MINI PROJECT SYNOPSIS
No ratings yet
MINI PROJECT SYNOPSIS
6 pages
Object Detectionusing Machine Learningand Deep Learning
No ratings yet
Object Detectionusing Machine Learningand Deep Learning
9 pages
Object Detection Report
No ratings yet
Object Detection Report
27 pages
Machine Learning - Advanced Concepts
From Everand
Machine Learning - Advanced Concepts
Derrick Mwiti
No ratings yet
Image Classification: Step-by-step Classifying Images with Python and Techniques of Computer Vision and Machine Learning
From Everand
Image Classification: Step-by-step Classifying Images with Python and Techniques of Computer Vision and Machine Learning
Mark Magic
No ratings yet
Canny Edge Detector: Unveiling the Art of Visual Perception
From Everand
Canny Edge Detector: Unveiling the Art of Visual Perception
Fouad Sabry
No ratings yet
Fast Methods For Deep Learning Based Object Detection
No ratings yet
Fast Methods For Deep Learning Based Object Detection
43 pages
Cornernet: Detecting Objects As Paired Keypoints: Hei Law Jia Deng Princeton University, University of Michigan
No ratings yet
Cornernet: Detecting Objects As Paired Keypoints: Hei Law Jia Deng Princeton University, University of Michigan
24 pages
Caesar COCO-Stuff Thing and CVPR 2018 Paper
No ratings yet
Caesar COCO-Stuff Thing and CVPR 2018 Paper
10 pages
Cross-Dataset Training For Class Increasing Object Detection
No ratings yet
Cross-Dataset Training For Class Increasing Object Detection
10 pages
The openCV Installed With The Jetpack Does Not Have CUDA Supported PDF
No ratings yet
The openCV Installed With The Jetpack Does Not Have CUDA Supported PDF
11 pages
مصحف ورش طبعة الجزائر PDF
No ratings yet
مصحف ورش طبعة الجزائر PDF
707 pages
EI-331 - Design and Analysis of Algorithms - String Matching
No ratings yet
EI-331 - Design and Analysis of Algorithms - String Matching
18 pages
Name of The Course: Data Structures (DS) Assignment - 3: 1602-19-733-080 M. Meghana 19-11-2020
No ratings yet
Name of The Course: Data Structures (DS) Assignment - 3: 1602-19-733-080 M. Meghana 19-11-2020
14 pages
AI Papers
No ratings yet
AI Papers
9 pages
Weka Book Questions
0% (1)
Weka Book Questions
2 pages
BRANCH AND BOUND
No ratings yet
BRANCH AND BOUND
38 pages
CSC3A - ST1 Notes
No ratings yet
CSC3A - ST1 Notes
8 pages
2) Binary Search Tree PDF
No ratings yet
2) Binary Search Tree PDF
21 pages
Problem Solving: Algorithms and Flowcharts
No ratings yet
Problem Solving: Algorithms and Flowcharts
19 pages
Prims and Kruskal - ET - C2 - Roll No - 26
No ratings yet
Prims and Kruskal - ET - C2 - Roll No - 26
8 pages
Questions by Love Babbar:: Topic: Problem: Done (Yes or No)
No ratings yet
Questions by Love Babbar:: Topic: Problem: Done (Yes or No)
14 pages
06 Chapter 4 - Machine Learning
No ratings yet
06 Chapter 4 - Machine Learning
55 pages
Suprabha Islam (AI)
No ratings yet
Suprabha Islam (AI)
2 pages
DS-I - Introduction To Data Structure
No ratings yet
DS-I - Introduction To Data Structure
64 pages
Csc3205-Lexical - Analysis PDF
No ratings yet
Csc3205-Lexical - Analysis PDF
33 pages
(A) Arithmetic and Logical Operations On Digital Images
No ratings yet
(A) Arithmetic and Logical Operations On Digital Images
13 pages
Singly Linked List in Python: Objective
No ratings yet
Singly Linked List in Python: Objective
3 pages
2-3 Tree PDF
No ratings yet
2-3 Tree PDF
8 pages
Zagreb Indices
No ratings yet
Zagreb Indices
43 pages
Theory of Computation (CS F351) : BITS Pilani
No ratings yet
Theory of Computation (CS F351) : BITS Pilani
13 pages
Algorithm Design Techniques
No ratings yet
Algorithm Design Techniques
6 pages
Computer Science
No ratings yet
Computer Science
3 pages
Department of Computer Science Iqra University, Karachi: 1 Is Pseudocode 2 Is Awesomeness
No ratings yet
Department of Computer Science Iqra University, Karachi: 1 Is Pseudocode 2 Is Awesomeness
5 pages
Cuestionarios IA
No ratings yet
Cuestionarios IA
17 pages
Wipro Elite NLTH Coding Placement Questions
No ratings yet
Wipro Elite NLTH Coding Placement Questions
16 pages
Module-5 Clustering Algorithm
No ratings yet
Module-5 Clustering Algorithm
31 pages
Pointers
No ratings yet
Pointers
2 pages
Bks MaaSL 0306 ws00 Xxaann
No ratings yet
Bks MaaSL 0306 ws00 Xxaann
3 pages
Sri Vidya College of Engineering and Technology Course Material (Lecture Notes)
No ratings yet
Sri Vidya College of Engineering and Technology Course Material (Lecture Notes)
35 pages
Isas 2 Semester 2 Full DLC
No ratings yet
Isas 2 Semester 2 Full DLC
22 pages

Advanced Deep Learning Based Object Detection Methods

Uploaded by

Advanced Deep Learning Based Object Detection Methods

Uploaded by

Advanced Deep Learning based Object

● Multi-scale object detection using image pyramid

● Predict multiple scale of objects using a single feature map.

● Predict different object sizes at different feature scales.

● Single model (same in training as in testing).

● How important is top-down enrichment?

● How important is top-down enrichment?

● Can we train a single stage detector to be as accurate as two stage detectors?

● Class unbalance is an important issue for object detection.

● Different parameters for RetinaNet

● Comparison with online hard negative mining

Source: CS231n Object detection http://cs231n.stanford.edu/slides/2016/winter1516_lecture8.pdf

You might also like