FishNet: A Versatile Backbone for Image, Region, and Pixel Level Prediction

Sun, Shuyang; Pang, Jiangmiao; Shi, Jianping; Yi, Shuai; Ouyang, Wanli


arXiv:1901.03495 (cs)

[Submitted on 11 Jan 2019]



Abstract: The basic principles for designing convolutional neural network (CNN) structures for predicting objects at different levels, e.g., image level, region level, and pixel level, are diverging. Generally, network structures designed specifically for image classification are used directly as the default backbone for other tasks, including detection and segmentation, but backbones are seldom designed to unify the advantages of networks built for pixel-level or region-level prediction tasks, which may require very deep features with high resolution. Towards this goal, we design a fish-like network, called FishNet. In FishNet, the information of all resolutions is preserved and refined for the final task. Besides, we observe that existing works still cannot directly propagate the gradient information from deep layers to shallow layers. Our design can better handle this problem. Extensive experiments have been conducted to demonstrate the remarkable performance of FishNet. In particular, on ImageNet-1k, the accuracy of FishNet is able to surpass the performance of DenseNet and ResNet with fewer parameters. FishNet was applied as one of the modules in the winning entry of the COCO Detection 2018 challenge. The code is available at this https URL.
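To make the idea of preserving information from all resolutions concrete, here is a minimal NumPy sketch. It is not the authors' implementation: the function names, the tensor shapes, nearest-neighbour upsampling, and the choice of channel concatenation as the fusion step are all assumptions made for illustration, since the abstract does not specify them.

```python
import numpy as np

def upsample_nearest(x, factor=2):
    """Nearest-neighbour upsampling of a (C, H, W) feature map (illustrative choice)."""
    return x.repeat(factor, axis=1).repeat(factor, axis=2)

def fuse(shallow, deep):
    """Concatenate a high-resolution shallow map with an upsampled deep map.

    Concatenation along the channel axis keeps both inputs intact in the
    output, which is one simple way to "preserve" features of every resolution.
    This is an assumed fusion rule, not taken from the abstract.
    """
    factor = shallow.shape[1] // deep.shape[1]
    return np.concatenate([shallow, upsample_nearest(deep, factor)], axis=0)

# Hypothetical feature maps: few channels at high resolution, many at low.
shallow = np.random.rand(8, 16, 16)
deep = np.random.rand(32, 4, 4)

fused = fuse(shallow, deep)
print(fused.shape)  # (40, 16, 16)
```

In this sketch the shallow features pass through unchanged and every deep feature value can still be read back from the fused map, so no resolution is discarded before the final task.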

Comments:	NeurIPS 2018. Code available at this https URL
Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:1901.03495 [cs.CV]
	(or arXiv:1901.03495v1 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.1901.03495

Submission history

From: Shuyang Sun
[v1] Fri, 11 Jan 2019 06:43:56 UTC (1,334 KB)
