0% found this document useful (0 votes)

8 views40 pages

Od Segment 221219 043435

The document discusses deep learning techniques for object detection and segmentation, comparing methods like RCNN, YOLO, and SSD. It explains the concepts of semantic and instance segmentation, highlighting architectures such as U-Net and Mask R-CNN. Additionally, it outlines various applications in fields like robotics, medical imaging, and autonomous vehicles.

Uploaded by

AvoidLeS5 1CEbornisTaKe

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

8 views40 pages

Od Segment 221219 043435

Uploaded by

AvoidLeS5 1CEbornisTaKe

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 40

Deep learning – Object Detection and Segmentation

Classification vs. Detection

✓ Dog

Dog
Dog
Object Detection
deer

cat
Object Detection as Classification
deer?
CNN cat?
background?
Object Detection as Classification
deer?
CNN cat?
background?
Object Detection as Classification
deer?
CNN cat?
background?
Object Detection as Classification
with Sliding Window
deer?
CNN cat?
background?
Object Detection as Classification
with Box Proposals
Histogram of Oriented Gradients (HOG) - 1986
Example:
Demo code:
HOG\HOG.py
Object Detection

• The RCNN Object Detector (2014)

• The Fast RCNN Object Detector (2015)
• The Faster RCNN Object Detector (2016)
• The YOLO Object Detector (2016)
• The SSD Object Detector (2016)
• Mask-RCNN (2017)
RCNN

https://people.eecs.berkeley.edu/~rbg/papers/r-cnn-cvpr.pdf
Rich feature hierarchies for accurate object detection and semantic
segmentation. Girshick et al. CVPR 2014.
Fast-RCNN

Idea: No need to recompute features for every box independently,

Regress refined bounding box coordinates.
https://arxiv.org/abs/1504.08083
https://github.com/sunshineatnoon/Paper-
Fast R-CNN. Girshick. ICCV 2015. Collection/blob/master/Fast-RCNN.md
Faster-RCNN

Idea: Integrate the Bounding

Box Proposals as part of the
CNN predictions

https://arxiv.org/abs/1506.01497
Ren et al. NIPS 2015.
YOLO- You Only Look Once

Idea: No bounding
box proposals.
Predict a class and a
box for every location
in a grid.

https://arxiv.org/abs/1506.02640 Redmon et al. CVPR 2016.

YOLO- You Only Look Once

Divide the image into 7x7 cells.

Each cell trains a detector.
Demo Code: The detector needs to predict the object’s class distributions.
YOLO\ytest.py The detector has 2 bounding-box predictors to predict
bounding-boxes and confidence scores.

https://arxiv.org/abs/1506.02640 Redmon et al. CVPR 2016.

SSD: Single Shot Detector

Idea: Similar to YOLO, but denser grid map, multiscale grid maps. +
Data augmentation + Hard negative mining + Other design choices i
n the network. Liu et al. ECCV 2016.
Demo Code: Non-Max Suppression: Non-Max Supression (NMS) is a technique used to select
NMS\Nms.py one bounding box for an object if multiple bounding boxes were detected with
varying probability scores by object detection algorithms(example: Faster R-
CNN,YOLO)
(Intersection over Union)
Segmentation
What is the difference?

Left image, every pixel belongs to a particular class (either background or person). Also, all the pixels belonging
to a particular class are represented by the same color (background as black and person as pink). This is an
example of semantic segmentation

Right image has also assigned a particular class to each pixel of the image. However, different objects of the
same class have different colors (Person 1 as red, Person 2 as green, background as black, etc.). This is an
example of instance segmentation
Thresholding

Edge Segmentation
Deep Learning-based methods

Convolutional Encoder-Decoder Architecture

SegNet -2015
Mask R-CNN

1. We take an image as input and pass it to the ConvNet, which returns the feature map for that image
2. Region proposal network (RPN) is applied on these feature maps. This returns the object proposals along with
their objectness score
3. A RoI pooling layer is applied to these proposals to bring down all the proposals to the same size
4. Finally, the proposals are passed to a fully connected layer to classify and output the bounding boxes for
objects. It also returns the mask for each proposal
U-Net – medical image segmentation

U-Net: The U-Net solves problems of general CNN networks used for medical image
segmentation, since it adopts a perfect symmetric structure and skip connection.

Different from common image segmentation, medical images usually contain noise and show
blurred boundaries. Therefore, it is very difficult to detect or recognize objects in medical
images only depending on image low-level features.

Meanwhile, it is also impossible to obtain accurate boundaries depending only on image

semantic features due to the lack of image detail information. Whereas, the U-Net effectively
fuses low-level and high-level image features by combining low-resolution and high-
resolution feature maps through skip connections, which is a perfect solution for medical
image segmentation tasks.

Currently, the U-Net has become the benchmark for most medical image segmentation tasks
and has inspired a lot of meaningful improvements
The low-level information helps to improve accuracy. The high-level information helps to extract complex features.

Demo code:
UNET\runtrain.py
Annotation
https://www.mdpi.com/2071-1050/13/3/1224/pdf
Image segmentation applications
Robotics (Machine Vision)
1. Instance segmentation for robotic grasping
2. Recycling object picking
3. Autonomous navigation and SLAM

https://youtu.be/aZkmeGIWZVw

Medical imaging
1.Medical image segmentation is the process of extracting the desired object
(organ) from a medical image (2D or 3D)
2. X-Ray segmentation
3. CT scan organ segmentation
4. Dental instance segmentation
5. Digital pathology cell segmentation
6. Surgical video annotation

https://youtu.be/wYdI12EN00M
3.Self Driving Cars
Drivable surface semantic segmentation
Car and pedestrian instance segmentation
In-vehicle object detection (stuff left behind by passengers)
Pothole detection and segmentation

and many …

The Hundred-Page Language Models Book - Andriy Burkov
93% (14)
The Hundred-Page Language Models Book - Andriy Burkov
209 pages
Od Segment
No ratings yet
Od Segment
53 pages
MV cs4243 2024 Amir 6 p2
No ratings yet
MV cs4243 2024 Amir 6 p2
95 pages
L10 Lecture Detection - Segmentation v2.5
No ratings yet
L10 Lecture Detection - Segmentation v2.5
35 pages
02 Semantic Segmentation 2024
No ratings yet
02 Semantic Segmentation 2024
53 pages
cs231n 2018 ds06
No ratings yet
cs231n 2018 ds06
38 pages
Vision
No ratings yet
Vision
24 pages
Lecture2.2 UnimodalRepresentations Part1 PDF
No ratings yet
Lecture2.2 UnimodalRepresentations Part1 PDF
92 pages
NN 09
No ratings yet
NN 09
34 pages
Deep Learning Algorithms For Object Detection
No ratings yet
Deep Learning Algorithms For Object Detection
43 pages
cv2021 Lec6 Object Detection - 1600 - PDF - Gdrive.vip
No ratings yet
cv2021 Lec6 Object Detection - 1600 - PDF - Gdrive.vip
60 pages
139 Pretrained Networks Object Detection
No ratings yet
139 Pretrained Networks Object Detection
22 pages
机器学习读书会嘉宾分享计算机视觉目标检测
No ratings yet
机器学习读书会嘉宾分享计算机视觉目标检测
52 pages
DL Unit 5
No ratings yet
DL Unit 5
63 pages
Yolo Family
No ratings yet
Yolo Family
40 pages
Presentation (Theoretical Evaluation)
No ratings yet
Presentation (Theoretical Evaluation)
107 pages
Object Detection and Segmentation - Part 2
No ratings yet
Object Detection and Segmentation - Part 2
36 pages
CSE4261 Lecture-12
No ratings yet
CSE4261 Lecture-12
24 pages
CVR FDP
No ratings yet
CVR FDP
37 pages
Advanced Deep Learning Based Object Detection Methods
No ratings yet
Advanced Deep Learning Based Object Detection Methods
36 pages
Week 5 - Fast RCNN
No ratings yet
Week 5 - Fast RCNN
17 pages
Hands On Hacking Become An Expert at Next Gen Penetration Testing and Purple Teaming 1 Edition Matthew Hickey Official Test Bank
No ratings yet
Hands On Hacking Become An Expert at Next Gen Penetration Testing and Purple Teaming 1 Edition Matthew Hickey Official Test Bank
323 pages
1 s2.0 S0031320322007075 Main
No ratings yet
1 s2.0 S0031320322007075 Main
12 pages
Object Detection1
No ratings yet
Object Detection1
29 pages
He 2017
No ratings yet
He 2017
9 pages
Comprehensive In-Depth Notes On Computer Vision Tasks & Vision Transformers
No ratings yet
Comprehensive In-Depth Notes On Computer Vision Tasks & Vision Transformers
5 pages
Contour Proposal Networks For Biomedical Instance Compressed
No ratings yet
Contour Proposal Networks For Biomedical Instance Compressed
17 pages
1 ObjectDetection
No ratings yet
1 ObjectDetection
46 pages
MMDetection Open MMLab Detection Toolbox and Benchmark
No ratings yet
MMDetection Open MMLab Detection Toolbox and Benchmark
13 pages
Li 2021 J. Phys.: Conf. Ser. 1827 012085
No ratings yet
Li 2021 J. Phys.: Conf. Ser. 1827 012085
11 pages
Detecting Small Signs From Large Images
No ratings yet
Detecting Small Signs From Large Images
9 pages
Mask
No ratings yet
Mask
12 pages
He Mask R-CNN Iccv 2017 Paper
No ratings yet
He Mask R-CNN Iccv 2017 Paper
9 pages
2018 - SeGAN - Adversarial Network With Multi-Scale L 1 Loss For Medical
No ratings yet
2018 - SeGAN - Adversarial Network With Multi-Scale L 1 Loss For Medical
10 pages
Real Time Object Detection System
No ratings yet
Real Time Object Detection System
31 pages
He Mask R-CNN ICCV 2017 Paper PDF
No ratings yet
He Mask R-CNN ICCV 2017 Paper PDF
9 pages
Mask R-CNN
No ratings yet
Mask R-CNN
4 pages
Deep Semantic Segmentation New Model of Natural and Medical Images
No ratings yet
Deep Semantic Segmentation New Model of Natural and Medical Images
4 pages
Deep Semantic Segmentation New Model of Natural and Medical Images
No ratings yet
Deep Semantic Segmentation New Model of Natural and Medical Images
4 pages
Object Detection With Deep Learning - A Review Summary
No ratings yet
Object Detection With Deep Learning - A Review Summary
11 pages
10 1109@access 2019 2932731
No ratings yet
10 1109@access 2019 2932731
9 pages
M10 - Introduction To TensorFlow, Deep Learning and Application
No ratings yet
M10 - Introduction To TensorFlow, Deep Learning and Application
25 pages
Eg-Transunet: A Transformer-Based U-Net With Enhanced and Guided Models For Biomedical Image Segmentation
No ratings yet
Eg-Transunet: A Transformer-Based U-Net With Enhanced and Guided Models For Biomedical Image Segmentation
22 pages
Computer VIsion Applications
No ratings yet
Computer VIsion Applications
30 pages
R-CNN (Object Detection) - A Beginners Guide To One of The Most - by Sharif Elfouly - Medium
No ratings yet
R-CNN (Object Detection) - A Beginners Guide To One of The Most - by Sharif Elfouly - Medium
6 pages
Zhang Et Al - 2020 - DENSE-Inception U-Net For Medical Image Segmentation2
No ratings yet
Zhang Et Al - 2020 - DENSE-Inception U-Net For Medical Image Segmentation2
40 pages
Unet
No ratings yet
Unet
8 pages
Mini Project Synopsis
No ratings yet
Mini Project Synopsis
6 pages
A Review of Object Detection Based On Convolutional Neural Network
No ratings yet
A Review of Object Detection Based On Convolutional Neural Network
6 pages
NNDL Unit 5
No ratings yet
NNDL Unit 5
21 pages
Object Detection and Identification
67% (3)
Object Detection and Identification
20 pages
1 Image Segmentation Using Deep Learning
No ratings yet
1 Image Segmentation Using Deep Learning
6 pages
Objectdetection
No ratings yet
Objectdetection
7 pages
The Ultimate Guide To Object Detection
No ratings yet
The Ultimate Guide To Object Detection
16 pages
Second Progress Report UID - 17BCS2127
No ratings yet
Second Progress Report UID - 17BCS2127
13 pages
IMINT Target Acquisition Using Deep Learning
No ratings yet
IMINT Target Acquisition Using Deep Learning
5 pages
Ding 2018 IOP Conf. Ser. Mater. Sci. Eng. 322 062024
No ratings yet
Ding 2018 IOP Conf. Ser. Mater. Sci. Eng. 322 062024
6 pages
Purchase Requisition Form
100% (1)
Purchase Requisition Form
9 pages
R2 Unet PDF
No ratings yet
R2 Unet PDF
12 pages
Object Detection
No ratings yet
Object Detection
57 pages
Second Edition Harry Potter and The Tabletop RPG
No ratings yet
Second Edition Harry Potter and The Tabletop RPG
53 pages
Book List 2023 24 For Website
No ratings yet
Book List 2023 24 For Website
10 pages
Theoretical and Conceptual Frameworks in Research: Conceptual Clarification
No ratings yet
Theoretical and Conceptual Frameworks in Research: Conceptual Clarification
16 pages
Pathoma Fundamentals of Pathology
No ratings yet
Pathoma Fundamentals of Pathology
232 pages
Intro - S4HANA - Using - Global - Bike - Case - Study - PP - Fiori - en - v3.3 (Step 8)
No ratings yet
Intro - S4HANA - Using - Global - Bike - Case - Study - PP - Fiori - en - v3.3 (Step 8)
6 pages
Love: A Philosophy of Pastoral Care and Counselling: Author: Affiliations
No ratings yet
Love: A Philosophy of Pastoral Care and Counselling: Author: Affiliations
11 pages
AP World Unit 6 Topic 3 NoteGuide Answer Key
100% (1)
AP World Unit 6 Topic 3 NoteGuide Answer Key
6 pages
Qep Nursing Philosophy Final Draft
No ratings yet
Qep Nursing Philosophy Final Draft
6 pages
English Grammer For Cadet
No ratings yet
English Grammer For Cadet
20 pages
Positive Attitude
100% (1)
Positive Attitude
9 pages
Recommendation Letter of SAAD From DR Hassan
No ratings yet
Recommendation Letter of SAAD From DR Hassan
2 pages
Ed Excel A Level Course Guide
100% (2)
Ed Excel A Level Course Guide
8 pages
Justenoughpython Pandas 220915 175329
No ratings yet
Justenoughpython Pandas 220915 175329
64 pages
Ni-Msme News Corner October 2024
No ratings yet
Ni-Msme News Corner October 2024
18 pages
Planning For A Smart Hospital 202210 - 221219 - 033507
No ratings yet
Planning For A Smart Hospital 202210 - 221219 - 033507
50 pages
Notice For Admission GrEnFIn EMJM 25 26 en LAST Signed 2
No ratings yet
Notice For Admission GrEnFIn EMJM 25 26 en LAST Signed 2
17 pages
Toefl Certificate
No ratings yet
Toefl Certificate
2 pages
BSSW 3 2 Proposal
No ratings yet
BSSW 3 2 Proposal
5 pages
Hmems80 2021 Week00 Step by Step PDF
No ratings yet
Hmems80 2021 Week00 Step by Step PDF
6 pages
Bank Management
No ratings yet
Bank Management
19 pages
2.4 Broyard, A. Doctor, Talk To Me.
No ratings yet
2.4 Broyard, A. Doctor, Talk To Me.
4 pages
Manhattan WMS Training
No ratings yet
Manhattan WMS Training
21 pages
地理学论文题目
100% (1)
地理学论文题目
6 pages
Life Worth Living - Expections and Requirements
No ratings yet
Life Worth Living - Expections and Requirements
2 pages
Rubrics Pericare Male
No ratings yet
Rubrics Pericare Male
2 pages
DLL - Tle-He 6 - Q3 - W4
No ratings yet
DLL - Tle-He 6 - Q3 - W4
3 pages
Daad Courses 2024 11 23
No ratings yet
Daad Courses 2024 11 23
6 pages
Grade 7 - Unit Plans: Shanghai Golden Apple School
No ratings yet
Grade 7 - Unit Plans: Shanghai Golden Apple School
15 pages
Introduction For RRL
No ratings yet
Introduction For RRL
6 pages
Ocd Essay
No ratings yet
Ocd Essay
3 pages
A.OSAMA-CX Team Leader
No ratings yet
A.OSAMA-CX Team Leader
2 pages
Machine Learning - Advanced Concepts
From Everand
Machine Learning - Advanced Concepts
Derrick Mwiti
No ratings yet
Object Detection: Advances, Applications, and Algorithms
From Everand
Object Detection: Advances, Applications, and Algorithms
Fouad Sabry
No ratings yet
Pyramid Image Processing: Exploring the Depths of Visual Analysis
From Everand
Pyramid Image Processing: Exploring the Depths of Visual Analysis
Fouad Sabry
No ratings yet

Od Segment 221219 043435

Uploaded by

Od Segment 221219 043435

Uploaded by

Deep learning – Object Detection and Segmentation

Classification vs. Detection

• The RCNN Object Detector (2014)

Idea: No need to recompute features for every box independently,

Idea: Integrate the Bounding

https://arxiv.org/abs/1506.02640 Redmon et al. CVPR 2016.

Divide the image into 7x7 cells.

https://arxiv.org/abs/1506.02640 Redmon et al. CVPR 2016.

Convolutional Encoder-Decoder Architecture

Meanwhile, it is also impossible to obtain accurate boundaries depending only on image

You might also like