0% found this document useful (0 votes)

12 views

Instance Segmentation

Uploaded by

Babil King

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

12 views

Instance Segmentation

Uploaded by

Babil King

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 51

Instance Segmentation

Riley Simmons-Edler, Berthy Feng

Instance Segmentation Task

● Label each foreground pixel with object

and instance
● Object detection + semantic
segmentation

Slide Credit: Kaiming He

In This Lecture...

● Microsoft COCO dataset

● Mask R-CNN (fully supervised)
● MaskX R-CNN (partially supervised)
Microsoft COCO:
Common Objects in Context
Tsung-Yi Lin, Michael Maire, Serge Belongie, et al.
“Microsoft COCO: Common Objects in Context.” arXiv,
2015.
Previous Datasets
● ImageNet: many object
categories
● PASCAL VOC: object
detection in natural images,
small number of classes
● SUN: labeling scene types and
commonly occurring objects,
but not many instances per
category
Image Credit: Tsung-Yi Lin et al.
Goal: Push research in scene understanding

1. Detecting non-iconic views

2. Contextual reasoning between objects
3. Precise 2D localization of objects
MS COCO Dataset
❖ 91 object
classes
❖ 328,000
images
❖ 2.5 million
labeled
instances

Image Credit: Tsung-Yi Lin et al.

Image Collection & Annotation
Object Categories

Image Credit: Tsung-Yi Lin et al.

Non-Iconic Image Collection

Image Credit: Tsung-Yi Lin et al.

Annotation

Image Credit: Tsung-Yi Lin et al.

Dataset Evaluation
Statistics

Image Credit: Tsung-Yi Lin et al.

Statistics

Image Credit: Tsung-Yi Lin et al.

COCO Detection Challenge

Image Credit: Tsung-Yi Lin et al.

COCO Keypoint Challenge

Image Credit: Tsung-Yi Lin et al.

COCO Stuff Challenge

Image Credit: Tsung-Yi Lin et al.

COCO Places Challenges

Image Credit: Tsung-Yi Lin et al.

Mask R-CNN
Kaiming He, Georgia Gkioxari, Piotr Dollar, and Ross
Girshick. “Mask R-CNN.” ICCV, 2017.
Faster R-CNN
Fast R-CNN

Image Credit: Shaoqing Ren et al. Image Credit: Tomasz Grel

Insight: Region Proposal and Detection Use
Same Features

Image Credit: Shaoqing Ren et al.

Faster R-CNN = RPN + Fast R-CNN
RPN = Fully Convolutional Network
Extending to Instance
Segmentation
Visual Perception Problems

Slide Credit: Kaiming He

Instance Segmentation Methods

Slide Credit: Kaiming He

Insight: Mask Prediction in Parallel

Slide Credit: Kaiming He

RoIPool

Image Credit: Tomasz Grel

RoIPool

Slide Credit: Kaiming He

RoIAlign

Slide Credit: Kaiming He

Mask R-CNN
Mask R-CNN Results
Examples

● Mask AP =
35.7

Image Credit: Kaiming He et al.

Comparisons

Image Credit: Kaiming He et al.

Comparisons

Image Credit: Kaiming He et al.

Application: Human Pose Estimation

Image Credit: Kaiming He et al.

Mask R-CNN Recap

● Add parallel mask prediction head to Faster-RCNN

● RoIAlign allows for precise localization
● Mask R-CNN improves on AP of previous state-of-the-art, can be
applied in human pose estimation
Learning to Segment Every Thing
Ronghang Hu, Piotr Dollar, Kaiming He, Trevor Darrell, and
Ross Girshick. “Learning to Segment Every Thing.” arXiv,
2017.
Partially Supervised Model
Motivation for a Partially Supervised Model

A = set of object B = set of object

categories with categories with only
complete mask bounding boxes (no
annotations segmentation
annotations)

How can we know C = A U B?

Image Credit: Ronghang Hu et al.

Transfer Learning

Image Credit: Ronghang Hu et al.

Weight Transfer Function

Image Credit: Ronghang Hu et al.

Training
● Train bounding box head using standard box detection losses on all
classes in A U B
● Train mask head, weight transfer function using mask loss on classes in A

Image Credit: Ronghang Hu et al.

Stage-Wise Training
1. Detection training ● Train detection once and then
2. Segmentation training fine-tune weight transfer function
● Inferior performance

Image Credit: Ronghang Hu et al.

End-to-End Joint Training

● Jointly train detection head and mask head end-to-end

● Want detection weights to stay constant between A and B

Image Credit: Ronghang Hu et al.

End-to-End Training Better

Image Credit: Ronghang Hu et al.

Mask Prediction
Baseline: Class-agonistic FCN mask prediction

Extension: FCN+MLP mask heads

Image Credit: Ronghang Hu et al.

Results
Examples

Image Credit: Ronghang Hu et al.

Comparisons

Image Credit: Ronghang Hu et al.

Segmenting Everything

Image Credit: Ronghang Hu et al.

Izinja Zehlathi by Precious Moloi 1-27
100% (1)
Izinja Zehlathi by Precious Moloi 1-27
168 pages
Surpac Tutorial - Pit Design - Block Modelling
0% (1)
Surpac Tutorial - Pit Design - Block Modelling
20 pages
SERVICE & PARTS ABhgjjGHT sn1500-2889
75% (4)
SERVICE & PARTS ABhgjjGHT sn1500-2889
146 pages
ACI Concrete International 2021 Vol43 No2
100% (1)
ACI Concrete International 2021 Vol43 No2
60 pages
1803.01534-PANet
No ratings yet
1803.01534-PANet
11 pages
Review: Deepmask (Instance Segmentation) : An Instance Segment Proposal Method Driven by Convolutional Neural Networks
No ratings yet
Review: Deepmask (Instance Segmentation) : An Instance Segment Proposal Method Driven by Convolutional Neural Networks
6 pages
Maskrcnn PDF
No ratings yet
Maskrcnn PDF
12 pages
He Mask R-CNN Iccv 2017 Paper
No ratings yet
He Mask R-CNN Iccv 2017 Paper
9 pages
He Mask R-CNN ICCV 2017 Paper PDF
No ratings yet
He Mask R-CNN ICCV 2017 Paper PDF
9 pages
Mask
No ratings yet
Mask
12 pages
He 2017
No ratings yet
He 2017
9 pages
5. Object Detection and Segmentation - part 2
No ratings yet
5. Object Detection and Segmentation - part 2
36 pages
Lecture-22-CAP6412_Spring2018_Mask-RCNN_New
No ratings yet
Lecture-22-CAP6412_Spring2018_Mask-RCNN_New
36 pages
Instance and Panoptic Seg Using Conditional Convolutions
No ratings yet
Instance and Panoptic Seg Using Conditional Convolutions
18 pages
Term Paper - DL
No ratings yet
Term Paper - DL
22 pages
Journal Pre-Proofs: Neurocomputing
No ratings yet
Journal Pre-Proofs: Neurocomputing
37 pages
02 Semantic Segmentation 2024
No ratings yet
02 Semantic Segmentation 2024
53 pages
Dlcv2017d3l1segmentation 170623173102
No ratings yet
Dlcv2017d3l1segmentation 170623173102
36 pages
Mazen Hany Abd El Salam Hassan
No ratings yet
Mazen Hany Abd El Salam Hassan
8 pages
Vision
No ratings yet
Vision
24 pages
od_segment_221219_043435
No ratings yet
od_segment_221219_043435
40 pages
Lecture-22-MaskRCNN
No ratings yet
Lecture-22-MaskRCNN
36 pages
NN 09
No ratings yet
NN 09
34 pages
8DL
No ratings yet
8DL
6 pages
Lecture 5 - CNNs For Detection and Segmentation
No ratings yet
Lecture 5 - CNNs For Detection and Segmentation
62 pages
2210.03105
No ratings yet
2210.03105
12 pages
lecture4
No ratings yet
lecture4
46 pages
Harley MSC Thesis Menos Especializadpo
No ratings yet
Harley MSC Thesis Menos Especializadpo
71 pages
The Ultimate Guide To Object Detection
No ratings yet
The Ultimate Guide To Object Detection
16 pages
Facemask Detection Using MMdetection Toolbox
No ratings yet
Facemask Detection Using MMdetection Toolbox
6 pages
05 CNN 2
No ratings yet
05 CNN 2
92 pages
Dlcvd3l4objects 160803161336
No ratings yet
Dlcvd3l4objects 160803161336
31 pages
8-Image Detection and Segmentation
No ratings yet
8-Image Detection and Segmentation
73 pages
Part 2
No ratings yet
Part 2
225 pages
3 SipMask: Spatial Information Preservation for Fast Image and Video Instance Segmentation
No ratings yet
3 SipMask: Spatial Information Preservation for Fast Image and Video Instance Segmentation
17 pages
cs231n 2018 ds06
No ratings yet
cs231n 2018 ds06
38 pages
Lect-7 Segmentation Localization
No ratings yet
Lect-7 Segmentation Localization
151 pages
The Framework For Object Detection: Generalized R-CNN
No ratings yet
The Framework For Object Detection: Generalized R-CNN
127 pages
Recent Progress in Semantic Image Segmentation: Xiaolong Liu Zhidong Deng Yuhan Yang
No ratings yet
Recent Progress in Semantic Image Segmentation: Xiaolong Liu Zhidong Deng Yuhan Yang
18 pages
Bolya YOLACT Real-Time Instance Segmentation ICCV 2019 Paper
No ratings yet
Bolya YOLACT Real-Time Instance Segmentation ICCV 2019 Paper
10 pages
CVlecture 4
No ratings yet
CVlecture 4
62 pages
Object Detection and Identification
67% (3)
Object Detection and Identification
20 pages
Deep Semantic Segmentation New Model of Natural and Medical Images
No ratings yet
Deep Semantic Segmentation New Model of Natural and Medical Images
4 pages
Computer VIsion Applications
No ratings yet
Computer VIsion Applications
30 pages
Fully Convolutional Networks For Semantic Segmentation: Jonathan Long Evan Shelhamer Trevor Darrell UC Berkeley
No ratings yet
Fully Convolutional Networks For Semantic Segmentation: Jonathan Long Evan Shelhamer Trevor Darrell UC Berkeley
10 pages
CVlecture 6
No ratings yet
CVlecture 6
33 pages
mv_cs4243_2024_amir_6_p2 (1)
No ratings yet
mv_cs4243_2024_amir_6_p2 (1)
95 pages
Rec03 - Deep Architectures
No ratings yet
Rec03 - Deep Architectures
65 pages
R-CNN (Object Detection) - A Beginners Guide To One of The Most - by Sharif Elfouly - Medium
No ratings yet
R-CNN (Object Detection) - A Beginners Guide To One of The Most - by Sharif Elfouly - Medium
6 pages
Deep Semantic Segmentation New Model of Natural and Medical Images
No ratings yet
Deep Semantic Segmentation New Model of Natural and Medical Images
4 pages
Du_2018_J._Phys.__Conf._Ser._1004_012029
No ratings yet
Du_2018_J._Phys.__Conf._Ser._1004_012029
9 pages
Segmentation Transformer: Object-Contextual Representations For Semantic Segmentation
No ratings yet
Segmentation Transformer: Object-Contextual Representations For Semantic Segmentation
21 pages
Object Detection
No ratings yet
Object Detection
57 pages
UNet For Semantic Segmentation - DTD - 19april2024
No ratings yet
UNet For Semantic Segmentation - DTD - 19april2024
20 pages
WBNet Weakly Supervised Salient Object Detection Via Scri - 2024 - Pattern Reco
No ratings yet
WBNet Weakly Supervised Salient Object Detection Via Scri - 2024 - Pattern Reco
15 pages
Segmentation-Aware Convolutional Networks Using Local Attention Masks
No ratings yet
Segmentation-Aware Convolutional Networks Using Local Attention Masks
11 pages
Semantic Segmentation of Images
No ratings yet
Semantic Segmentation of Images
76 pages
Lin2014 Chapter MicrosoftCOCOCommonObjectsInCo
No ratings yet
Lin2014 Chapter MicrosoftCOCOCommonObjectsInCo
16 pages
Adaptis: Adaptive Instance Selection Network
No ratings yet
Adaptis: Adaptive Instance Selection Network
11 pages
Visual Servoing Robot For Pick and Place Under Partial Occlusion - Ii
No ratings yet
Visual Servoing Robot For Pick and Place Under Partial Occlusion - Ii
18 pages
CenterMask
No ratings yet
CenterMask
10 pages
Object Detection in Pytorch Using Mask R-CNN
No ratings yet
Object Detection in Pytorch Using Mask R-CNN
4 pages
Machine Learning - Advanced Concepts
From Everand
Machine Learning - Advanced Concepts
Derrick Mwiti
No ratings yet
Geometric Feature Learning: Unlocking Visual Insights through Geometric Feature Learning
From Everand
Geometric Feature Learning: Unlocking Visual Insights through Geometric Feature Learning
Fouad Sabry
No ratings yet
Photographize 112016
No ratings yet
Photographize 112016
88 pages
Photographize 052016
No ratings yet
Photographize 052016
92 pages
Liang Instance Segmentation in 3D Scenes Using Semantic Superpoint Tree Networks ICCV 2021 Paper
No ratings yet
Liang Instance Segmentation in 3D Scenes Using Semantic Superpoint Tree Networks ICCV 2021 Paper
10 pages
Lec 02 Cam Models
No ratings yet
Lec 02 Cam Models
44 pages
Photographize 092016
No ratings yet
Photographize 092016
90 pages
Photographize 072016
No ratings yet
Photographize 072016
90 pages
RV ProjectiveGeometry
No ratings yet
RV ProjectiveGeometry
36 pages
Photographize 012016
No ratings yet
Photographize 012016
90 pages
Challenges and Opportunities in Geometric Modelling of Complex Bio-Inspired 3D Objects Designed For Additive Manufacturing
No ratings yet
Challenges and Opportunities in Geometric Modelling of Complex Bio-Inspired 3D Objects Designed For Additive Manufacturing
42 pages
08 ParametricCurves Web
No ratings yet
08 ParametricCurves Web
10 pages
Distortion in Perspective Projection
No ratings yet
Distortion in Perspective Projection
5 pages
Engg - Graphics
No ratings yet
Engg - Graphics
140 pages
13 Projection
No ratings yet
13 Projection
10 pages
9 Projection Geometry
No ratings yet
9 Projection Geometry
124 pages
BBC Focus - February 2018
No ratings yet
BBC Focus - February 2018
100 pages
BBC Focus - February 2016
No ratings yet
BBC Focus - February 2016
116 pages
BBC Focus - Health Breakthroughs - Volume 8 2018
No ratings yet
BBC Focus - Health Breakthroughs - Volume 8 2018
100 pages
BBC F Efa 2017
No ratings yet
BBC F Efa 2017
100 pages
BBC Focus - December 2018
No ratings yet
BBC Focus - December 2018
108 pages
2016-07-01 BBC Focus
No ratings yet
2016-07-01 BBC Focus
116 pages
Understanding Color Models
No ratings yet
Understanding Color Models
11 pages
2016-09-01 BBC Focus
No ratings yet
2016-09-01 BBC Focus
116 pages
2016-11-01 BBC Focus
No ratings yet
2016-11-01 BBC Focus
124 pages
2016-12-01 BBC Focus
No ratings yet
2016-12-01 BBC Focus
132 pages
Architects Datafile (ADF) - July 2022
No ratings yet
Architects Datafile (ADF) - July 2022
84 pages
الحلولية ووحدة الوجود
No ratings yet
الحلولية ووحدة الوجود
354 pages
Style
No ratings yet
Style
48 pages
St. Peter'S Engineering College: (Ugc-Autonomous)
No ratings yet
St. Peter'S Engineering College: (Ugc-Autonomous)
6 pages
Jurnal Novita Kristiani 155120601111067
No ratings yet
Jurnal Novita Kristiani 155120601111067
13 pages
DX Diag
No ratings yet
DX Diag
53 pages
Optum NAF - High School Internship - Info Event Final 2 - 10 - 2021
No ratings yet
Optum NAF - High School Internship - Info Event Final 2 - 10 - 2021
1 page
Behavior Modification Techniques
No ratings yet
Behavior Modification Techniques
34 pages
Thermostat Manual
No ratings yet
Thermostat Manual
76 pages
5K3-LV-Manual Battery
No ratings yet
5K3-LV-Manual Battery
47 pages
CV RIAbreuBlondetPolanco EN
No ratings yet
CV RIAbreuBlondetPolanco EN
6 pages
Foundations of Marketing 8th Edition download pdf
100% (3)
Foundations of Marketing 8th Edition download pdf
24 pages
The Development and Validation of A New
No ratings yet
The Development and Validation of A New
12 pages
конспект БФ АЕО 23 - 24
No ratings yet
конспект БФ АЕО 23 - 24
5 pages
Louis Dupré - The Enlightenment and the Intellectual Foundations of Modern Culture-Yale University Press (2008)
No ratings yet
Louis Dupré - The Enlightenment and the Intellectual Foundations of Modern Culture-Yale University Press (2008)
414 pages
Itco Oil Spill
No ratings yet
Itco Oil Spill
25 pages
Four Noble Truths 2
No ratings yet
Four Noble Truths 2
14 pages
Reaction To Frustration Article
No ratings yet
Reaction To Frustration Article
15 pages
Assignment 1
No ratings yet
Assignment 1
3 pages
Département de Raffinage Et Pétrochimie: Cycle
No ratings yet
Département de Raffinage Et Pétrochimie: Cycle
27 pages
Ideology of Pakistan in The Light of Quid-e-Azams Sayings
0% (1)
Ideology of Pakistan in The Light of Quid-e-Azams Sayings
2 pages
Celcom
No ratings yet
Celcom
2 pages
RESUME
No ratings yet
RESUME
2 pages
THE ORIGINS OF HAWTHORNE AND POE Summary (2)
No ratings yet
THE ORIGINS OF HAWTHORNE AND POE Summary (2)
2 pages
Beauty For Ashes 5:26:24
No ratings yet
Beauty For Ashes 5:26:24
28 pages
Instant ebooks textbook Decoding Signs of Identity Egyptian Workmen s Marks in Archaeological Historical Comparative and Theoretical Perspective Proceedings of a Conference in Leiden 13 15 December 2013 1st Edition K.J. Van Der Moezel (Editor) download all chapters
100% (11)
Instant ebooks textbook Decoding Signs of Identity Egyptian Workmen s Marks in Archaeological Historical Comparative and Theoretical Perspective Proceedings of a Conference in Leiden 13 15 December 2013 1st Edition K.J. Van Der Moezel (Editor) download all chapters
84 pages
DR9
No ratings yet
DR9
2 pages

Instance Segmentation

Uploaded by

Instance Segmentation

Uploaded by

Instance Segmentation

Riley Simmons-Edler, Berthy Feng

● Label each foreground pixel with object

Slide Credit: Kaiming He

● Microsoft COCO dataset

1. Detecting non-iconic views

Image Credit: Tsung-Yi Lin et al.

Image Credit: Tsung-Yi Lin et al.

Image Credit: Tsung-Yi Lin et al.

Image Credit: Tsung-Yi Lin et al.

Image Credit: Tsung-Yi Lin et al.

Image Credit: Tsung-Yi Lin et al.

Image Credit: Tsung-Yi Lin et al.

Image Credit: Tsung-Yi Lin et al.

Image Credit: Tsung-Yi Lin et al.

Image Credit: Tsung-Yi Lin et al.

Image Credit: Shaoqing Ren et al. Image Credit: Tomasz Grel

Image Credit: Shaoqing Ren et al.

Slide Credit: Kaiming He

Slide Credit: Kaiming He

Slide Credit: Kaiming He

Image Credit: Tomasz Grel

Slide Credit: Kaiming He

Slide Credit: Kaiming He

Image Credit: Kaiming He et al.

Image Credit: Kaiming He et al.

Image Credit: Kaiming He et al.

Image Credit: Kaiming He et al.

● Add parallel mask prediction head to Faster-RCNN

A = set of object B = set of object

How can we know C = A U B?

Image Credit: Ronghang Hu et al.

Image Credit: Ronghang Hu et al.

Image Credit: Ronghang Hu et al.

Image Credit: Ronghang Hu et al.

Image Credit: Ronghang Hu et al.

● Jointly train detection head and mask head end-to-end

Image Credit: Ronghang Hu et al.

Image Credit: Ronghang Hu et al.

Extension: FCN+MLP mask heads

Image Credit: Ronghang Hu et al.

Image Credit: Ronghang Hu et al.

Image Credit: Ronghang Hu et al.

Image Credit: Ronghang Hu et al.

You might also like