Robust Monocular SLAM for Egocentric Videos

Patra, Suvam; Gupta, Kartikeya; Ahmad, Faran; Arora, Chetan; Banerjee, Subhashis

Computer Science > Computer Vision and Pattern Recognition

arXiv:1707.05564 (cs)

[Submitted on 18 Jul 2017 (v1), last revised 17 Nov 2018 (this version, v2)]

Title:Robust Monocular SLAM for Egocentric Videos

Authors:Suvam Patra, Kartikeya Gupta, Faran Ahmad, Chetan Arora, Subhashis Banerjee

View PDF

Abstract:Regardless of the tremendous progress, a truly general purpose pipeline for Simultaneous Localization and Mapping (SLAM) remains a challenge. We investigate the reported failure of state of the art (SOTA) SLAM techniques on egocentric videos. We find that the dominant 3D rotations, low parallax between successive frames, and primarily forward motion in egocentric videos are the most common causes of failures. The incremental nature of SOTA SLAM, in the presence of unreliable pose and 3D estimates in egocentric videos, with no opportunities for global loop closures, generates drifts and leads to the eventual failures of such techniques. Taking inspiration from batch mode Structure from Motion (SFM) techniques, we propose to solve SLAM as an SFM problem over the sliding temporal windows. This makes the problem well constrained. Further, we propose to initialize the camera poses using 2D rotation averaging, followed by translation averaging before structure estimation using bundle adjustment. This helps in stabilizing the camera poses when 3D estimates are not reliable. We show that the proposed SLAM technique, incorporating the two key ideas works successfully for long, shaky egocentric videos where other SOTA techniques have been reported to fail. Qualitative and quantitative comparisons on publicly available egocentric video datasets validate our results.

Comments:	Accepted for publication at IEEE WACV 2019
Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:1707.05564 [cs.CV]
	(or arXiv:1707.05564v2 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.1707.05564

Submission history

From: Suvam Patra [view email]
[v1] Tue, 18 Jul 2017 11:31:13 UTC (7,842 KB)
[v2] Sat, 17 Nov 2018 20:00:08 UTC (7,634 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:Robust Monocular SLAM for Egocentric Videos

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:Robust Monocular SLAM for Egocentric Videos

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators