Diff-DOPE: Differentiable Deep Object Pose Estimation

Tremblay, Jonathan; Wen, Bowen; Blukis, Valts; Sundaralingam, Balakumar; Tyree, Stephen; Birchfield, Stan

arXiv:2310.00463 (cs)

[Submitted on 30 Sep 2023]

Abstract: We introduce Diff-DOPE, a 6-DoF pose refiner that takes as input an image, a 3D textured model of an object, and an initial pose of the object. The method uses differentiable rendering to update the object pose to minimize the visual error between the image and the projection of the model. We show that this simple, yet effective, idea is able to achieve state-of-the-art results on pose estimation datasets. Our approach is a departure from recent methods in which the pose refiner is a deep neural network trained on a large synthetic dataset to map inputs to refinement steps. Rather, our use of differentiable rendering allows us to avoid training altogether. Our approach performs multiple gradient descent optimizations in parallel with different random learning rates to avoid local minima from symmetric objects, similar appearances, or wrong step size. Various modalities can be used, e.g., RGB, depth, intensity edges, and object segmentation masks. We present experiments examining the effect of various choices, showing that the best results are found when the RGB image is accompanied by an object mask and depth image to guide the optimization process.
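
The idea of running several gradient descent optimizations with different random learning rates and keeping the best result can be illustrated with a minimal sketch. This is not the Diff-DOPE implementation. As an assumption made purely for illustration, the differentiable renderer is replaced by a toy planar alignment problem: a pose `(tx, ty, theta)` moves a handful of 2D model points, and the loss is the mean squared distance to the observed points. The names `MODEL`, `transform`, `loss_and_grad`, and `refine`, the log-uniform learning-rate range, and the run and step counts are all hypothetical, and the runs are executed one after another rather than in parallel.

```python
import math
import random

# Toy stand-in for "render and compare": a planar pose (tx, ty, theta) moves
# a few 2D model points, and the loss is the mean squared distance to the
# observed points. A real refiner would compare rendered and observed images.
MODEL = [(1.0, 0.0), (0.0, 2.0), (-1.5, 0.5), (0.5, -1.0)]


def transform(pose, pts):
    """Rotate the points by theta, then translate by (tx, ty)."""
    tx, ty, th = pose
    c, s = math.cos(th), math.sin(th)
    return [(c * x - s * y + tx, s * x + c * y + ty) for x, y in pts]


def loss_and_grad(pose, observed):
    """Mean squared alignment error and its analytic gradient w.r.t. the pose."""
    tx, ty, th = pose
    c, s = math.cos(th), math.sin(th)
    n = len(MODEL)
    loss = gx = gy = gth = 0.0
    for (x, y), (ox, oy) in zip(MODEL, observed):
        rx = c * x - s * y + tx - ox  # residual in x
        ry = s * x + c * y + ty - oy  # residual in y
        loss += (rx * rx + ry * ry) / n
        gx += 2.0 * rx / n
        gy += 2.0 * ry / n
        # Chain rule through the rotation: d(px)/d(theta) = -s*x - c*y,
        # d(py)/d(theta) = c*x - s*y.
        gth += 2.0 * (rx * (-s * x - c * y) + ry * (c * x - s * y)) / n
    return loss, (gx, gy, gth)


def refine(init_pose, observed, n_runs=16, n_steps=200, seed=0):
    """Run several gradient descents from the same initial pose, each with its
    own random learning rate, and return the pose with the lowest final loss."""
    rng = random.Random(seed)
    best_loss, best_pose = float("inf"), init_pose
    for _ in range(n_runs):
        # Assumed range: log-uniform learning rate between 1e-3 and about 0.2.
        lr = 10 ** rng.uniform(-3.0, -0.7)
        pose = init_pose
        for _ in range(n_steps):
            _, grad = loss_and_grad(pose, observed)
            pose = tuple(p - lr * g for p, g in zip(pose, grad))
        final_loss, _ = loss_and_grad(pose, observed)
        if final_loss < best_loss:
            best_loss, best_pose = final_loss, pose
    return best_pose, best_loss


if __name__ == "__main__":
    true_pose = (0.3, -0.2, 0.4)
    observed = transform(true_pose, MODEL)
    pose, loss = refine((0.0, 0.0, 0.0), observed)
    print("refined pose:", pose, "final loss:", loss)
```

In this sketch, a run whose learning rate is too small stops short of the optimum and is simply outvoted by a run with a better step size, which is the motivation the abstract gives for randomizing the learning rate. The toy loss has no symmetric or look-alike minima, so it does not demonstrate that part of the claim.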

Comments:	Submitted to ICRA 2023. Project page is at this https URL
Subjects:	Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
Cite as:	arXiv:2310.00463 [cs.CV]
	(or arXiv:2310.00463v1 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2310.00463

Submission history

From: Stan Birchfield
[v1] Sat, 30 Sep 2023 18:52:57 UTC (9,423 KB)
