SPGNet: Semantic Prediction Guidance for Scene Parsing

Cheng, Bowen; Chen, Liang-Chieh; Wei, Yunchao; Zhu, Yukun; Huang, Zilong; Xiong, Jinjun; Huang, Thomas; Hwu, Wen-Mei; Shi, Honghui

Computer Science > Computer Vision and Pattern Recognition

arXiv:1908.09798 (cs)

[Submitted on 26 Aug 2019]

Title:SPGNet: Semantic Prediction Guidance for Scene Parsing

Authors:Bowen Cheng, Liang-Chieh Chen, Yunchao Wei, Yukun Zhu, Zilong Huang, Jinjun Xiong, Thomas Huang, Wen-Mei Hwu, Honghui Shi

View PDF

Abstract:Multi-scale context module and single-stage encoder-decoder structure are commonly employed for semantic segmentation. The multi-scale context module refers to the operations to aggregate feature responses from a large spatial extent, while the single-stage encoder-decoder structure encodes the high-level semantic information in the encoder path and recovers the boundary information in the decoder path. In contrast, multi-stage encoder-decoder networks have been widely used in human pose estimation and show superior performance than their single-stage counterpart. However, few efforts have been attempted to bring this effective design to semantic segmentation. In this work, we propose a Semantic Prediction Guidance (SPG) module which learns to re-weight the local features through the guidance from pixel-wise semantic prediction. We find that by carefully re-weighting features across stages, a two-stage encoder-decoder network coupled with our proposed SPG module can significantly outperform its one-stage counterpart with similar parameters and computations. Finally, we report experimental results on the semantic segmentation benchmark Cityscapes, in which our SPGNet attains 81.1% on the test set using only 'fine' annotations.

Comments:	ICCV 2019
Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:1908.09798 [cs.CV]
	(or arXiv:1908.09798v1 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.1908.09798

Submission history

From: Bowen Cheng [view email]
[v1] Mon, 26 Aug 2019 16:58:12 UTC (6,063 KB)

Full-text links:

Access Paper:

view license

Current browse context:

cs.CV

< prev | next >

new | recent | 2019-08

Change to browse by:

References & Citations

DBLP - CS Bibliography

listing | bibtex

Bowen Cheng
Liang-Chieh Chen
Yunchao Wei
Yukun Zhu
Zilong Huang

…

export BibTeX citation

Computer Science > Computer Vision and Pattern Recognition

Title:SPGNet: Semantic Prediction Guidance for Scene Parsing

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:SPGNet: Semantic Prediction Guidance for Scene Parsing

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators