A Lightweight Neural Network for Monocular View Generation with Occlusion Handling

Evain, Simon; Guillemot, Christine

doi:10.1109/TPAMI.2019.2960689

arXiv:2007.12577 (cs)

[Submitted on 24 Jul 2020]

Title: A Lightweight Neural Network for Monocular View Generation with Occlusion Handling

Authors: Simon Evain, Christine Guillemot

Abstract: In this article, we present a very lightweight neural network architecture, trained on stereo data pairs, which performs view synthesis from a single image. With the growing success of multi-view formats, this problem is increasingly relevant. The network returns a prediction built from disparity estimation and fills in wrongly predicted regions using an occlusion handling technique. To do so, during training, the network learns to estimate the left-right consistency structural constraint on the pair of stereo input images, so that it can replicate it at test time from a single image. The method is built upon the idea of blending two predictions: a prediction based on disparity estimation, and a prediction based on direct minimization in occluded regions. The network is also able to identify these occluded areas, at training and at test time, by checking the pixelwise left-right consistency of the produced disparity maps. At test time, the approach can thus generate a left-side and a right-side view from one input image, as well as a depth map and a pixelwise confidence measure in the prediction. The method outperforms state-of-the-art approaches, both visually and metric-wise, on the challenging KITTI dataset, while reducing the required number of parameters by a factor of 5 or 10 (to 6.5 M).
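The two ideas named in the abstract, a pixelwise left-right consistency check and the blending of two predictions, can be illustrated with a minimal sketch. This is not the authors' network or code: the function names, the hard threshold, the sign convention for disparities, and the binary (rather than soft) blending are all assumptions made for illustration, and images are plain nested lists to keep the example dependency-free.

```python
def left_right_consistency_mask(disp_left, disp_right, threshold=0.5):
    """Mark pixels whose left and right disparities agree (hypothetical helper).

    Assumed convention: a left-view pixel at column x with disparity d
    corresponds to column x - d in the right view.
    """
    height, width = len(disp_left), len(disp_left[0])
    mask = [[False] * width for _ in range(height)]
    for y in range(height):
        for x in range(width):
            d = disp_left[y][x]
            x_right = int(round(x - d))
            # Pixels that map outside the image are treated as inconsistent.
            if 0 <= x_right < width:
                mask[y][x] = abs(d - disp_right[y][x_right]) <= threshold
    return mask


def blend_predictions(pred_disparity, pred_direct, mask):
    """Keep the disparity-based prediction where the mask is True,
    and fall back to the direct prediction elsewhere (assumed hard blending)."""
    return [
        [a if m else b for a, b, m in zip(row_a, row_b, row_m)]
        for row_a, row_b, row_m in zip(pred_disparity, pred_direct, mask)
    ]


# Toy one-row example with made-up values.
disp_left = [[0.0, 0.0, 1.0, 3.0]]
disp_right = [[0.0, 1.0, 1.0, 1.0]]
mask = left_right_consistency_mask(disp_left, disp_right)
blended = blend_predictions([[10, 20, 30, 40]], [[1, 2, 3, 4]], mask)
```

In this toy row, pixels whose two disparities disagree by more than the threshold are treated as likely occluded, and the blended output takes its value from the second prediction at those positions.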

Comments:	Accepted at IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI) in December 2019
Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2007.12577 [cs.CV]
	(or arXiv:2007.12577v1 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2007.12577
Related DOI:	https://doi.org/10.1109/TPAMI.2019.2960689

Submission history

From: Simon Evain
[v1] Fri, 24 Jul 2020 15:29:01 UTC (23,834 KB)
