Generalized Few-shot Semantic Segmentation

Tian, Zhuotao; Lai, Xin; Jiang, Li; Liu, Shu; Shu, Michelle; Zhao, Hengshuang; Jia, Jiaya

Computer Science > Computer Vision and Pattern Recognition

arXiv:2010.05210 (cs)

[Submitted on 11 Oct 2020 (v1), last revised 31 May 2022 (this version, v4)]

Abstract: Training semantic segmentation models requires a large amount of finely annotated data, which makes it hard to adapt quickly to novel classes that lack such data. Few-Shot Segmentation (FS-Seg) tackles this problem, but under many constraints. In this paper, we introduce a new benchmark, called Generalized Few-Shot Semantic Segmentation (GFS-Seg), to analyze the generalization ability of simultaneously segmenting the novel categories with very few examples and the base categories with sufficient examples. It is the first study showing that previous representative state-of-the-art FS-Seg methods fall short in GFS-Seg and that the performance discrepancy mainly comes from the constrained setting of FS-Seg. To make GFS-Seg tractable, we set up a GFS-Seg baseline that achieves decent performance without structural changes to the original model. Then, since context is essential for semantic segmentation, we propose Context-Aware Prototype Learning (CAPL), which significantly improves performance by 1) leveraging the co-occurrence prior knowledge from support samples, and 2) dynamically enriching the classifier with contextual information, conditioned on the content of each query image. Both contributions are experimentally shown to have substantial practical merit. Extensive experiments on Pascal-VOC and COCO demonstrate the effectiveness of CAPL, and CAPL generalizes well to FS-Seg by achieving competitive performance. Code is available at this https URL.

Comments:	Accepted to CVPR 2022
Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2010.05210 [cs.CV]
	(or arXiv:2010.05210v4 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2010.05210

Submission history

From: Zhuotao Tian
[v1] Sun, 11 Oct 2020 10:13:21 UTC (20,240 KB)
[v2] Thu, 19 Nov 2020 12:18:13 UTC (20,231 KB)
[v3] Sat, 27 Nov 2021 15:00:13 UTC (7,521 KB)
[v4] Tue, 31 May 2022 07:01:12 UTC (11,566 KB)
