Depth-Adapted CNN for RGB-D cameras

Wu, Zongwei; Allibert, Guillaume; Stolz, Christophe; Demonceaux, Cedric

Computer Science > Computer Vision and Pattern Recognition

arXiv:2009.09976 (cs)

[Submitted on 21 Sep 2020 (v1), last revised 23 Sep 2020 (this version, v2)]

Title:Depth-Adapted CNN for RGB-D cameras

Authors:Zongwei Wu, Guillaume Allibert, Christophe Stolz, Cedric Demonceaux

View PDF

Abstract:Conventional 2D Convolutional Neural Networks (CNN) extract features from an input image by applying linear filters. These filters compute the spatial coherence by weighting the photometric information on a fixed neighborhood without taking into account the geometric information. We tackle the problem of improving the classical RGB CNN methods by using the depth information provided by the RGB-D cameras. State-of-the-art approaches use depth as an additional channel or image (HHA) or pass from 2D CNN to 3D CNN. This paper proposes a novel and generic procedure to articulate both photometric and geometric information in CNN architecture. The depth data is represented as a 2D offset to adapt spatial sampling locations. The new model presented is invariant to scale and rotation around the X and the Y axis of the camera coordinate system. Moreover, when depth data is constant, our model is equivalent to a regular CNN. Experiments of benchmarks validate the effectiveness of our model.

Comments:	Accepted manuscript in ACCV 2020 (Oral)
Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2009.09976 [cs.CV]
	(or arXiv:2009.09976v2 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2009.09976

Submission history

From: Zongwei Wu [view email]
[v1] Mon, 21 Sep 2020 15:58:32 UTC (3,929 KB)
[v2] Wed, 23 Sep 2020 09:45:21 UTC (3,929 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:Depth-Adapted CNN for RGB-D cameras

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:Depth-Adapted CNN for RGB-D cameras

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators