A simple yet effective baseline for 3d human pose estimation

Martinez, Julieta; Hossain, Rayat; Romero, Javier; Little, James J.

Computer Science > Computer Vision and Pattern Recognition

arXiv:1705.03098v1 (cs)

[Submitted on 8 May 2017 (this version), latest version 4 Aug 2017 (v2)]

Title:A simple yet effective baseline for 3d human pose estimation

Authors:Julieta Martinez, Rayat Hossain, Javier Romero, James J. Little

View PDF

Abstract:Following the success of deep convolutional networks, state-of-the-art methods for 3d human pose estimation have focused on deep end-to-end systems that predict 3d joint locations given raw image pixels. Despite their excellent performance, it is often not easy to understand whether their remaining error stems from a limited 2d pose (visual) understanding, or from a failure to map 2d poses into 3-dimensional positions. With the goal of understanding these sources of error, we set out to build a system that given 2d joint locations predicts 3d positions. Much to our surprise, we have found that, with current technology, "lifting" ground truth 2d joint locations to 3d space is a task that can be solved with a remarkably low error rate: a relatively simple deep feed-forward network outperforms the best reported result by about 30% on Human3.6M, the largest publicly available 3d pose estimation benchmark. Furthermore, training our system on the output of an off-the-shelf state-of-the-art 2d detector (i.e., using images as input) yields results on par with the state of the art -- this includes an array of systems that have been trained end-to-end specifically for this task. Our results indicate that a large portion of the error of state-of-the-art deep 3d pose estimation systems stems from their visual analysis, and suggests directions to further advance the state of the art in 3d human pose estimation.

Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:1705.03098 [cs.CV]
	(or arXiv:1705.03098v1 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.1705.03098

Submission history

From: Julieta Martinez [view email]
[v1] Mon, 8 May 2017 21:48:37 UTC (8,831 KB)
[v2] Fri, 4 Aug 2017 18:36:24 UTC (8,916 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:A simple yet effective baseline for 3d human pose estimation

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:A simple yet effective baseline for 3d human pose estimation

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators