Disentangling Human Dynamics for Pedestrian Locomotion Forecasting with Noisy Supervision

Mangalam, Karttikeya; Adeli, Ehsan; Lee, Kuan-Hui; Gaidon, Adrien; Niebles, Juan Carlos

Computer Science > Computer Vision and Pattern Recognition

arXiv:1911.01138 (cs)

[Submitted on 4 Nov 2019 (v1), last revised 13 Apr 2020 (this version, v2)]

Title:Disentangling Human Dynamics for Pedestrian Locomotion Forecasting with Noisy Supervision

Authors:Karttikeya Mangalam, Ehsan Adeli, Kuan-Hui Lee, Adrien Gaidon, Juan Carlos Niebles

View PDF

Abstract:We tackle the problem of Human Locomotion Forecasting, a task for jointly predicting the spatial positions of several keypoints on the human body in the near future under an egocentric setting. In contrast to the previous work that aims to solve either the task of pose prediction or trajectory forecasting in isolation, we propose a framework to unify the two problems and address the practically useful task of pedestrian locomotion prediction in the wild. Among the major challenges in solving this task is the scarcity of annotated egocentric video datasets with dense annotations for pose, depth, or egomotion. To surmount this difficulty, we use state-of-the-art models to generate (noisy) annotations and propose robust forecasting models that can learn from this noisy supervision. We present a method to disentangle the overall pedestrian motion into easier to learn subparts by utilizing a pose completion and a decomposition module. The completion module fills in the missing key-point annotations and the decomposition module breaks the cleaned locomotion down to global (trajectory) and local (pose keypoint movements). Further, with Quasi RNN as our backbone, we propose a novel hierarchical trajectory forecasting network that utilizes low-level vision domain specific signals like egomotion and depth to predict the global trajectory. Our method leads to state-of-the-art results for the prediction of human locomotion in the egocentric view. Project pade: this https URL

Comments:	Accepted to WACV 2020 (Oral)
Subjects:	Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Robotics (cs.RO)
Cite as:	arXiv:1911.01138 [cs.CV]
	(or arXiv:1911.01138v2 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.1911.01138

Submission history

From: Karttikeya Mangalam [view email]
[v1] Mon, 4 Nov 2019 11:30:12 UTC (9,740 KB)
[v2] Mon, 13 Apr 2020 19:33:42 UTC (9,740 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:Disentangling Human Dynamics for Pedestrian Locomotion Forecasting with Noisy Supervision

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:Disentangling Human Dynamics for Pedestrian Locomotion Forecasting with Noisy Supervision

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators