Sim2Real Predictivity: Does Evaluation in Simulation Predict Real-World Performance?

Kadian, Abhishek; Truong, Joanne; Gokaslan, Aaron; Clegg, Alexander; Wijmans, Erik; Lee, Stefan; Savva, Manolis; Chernova, Sonia; Batra, Dhruv

doi:10.1109/LRA.2020.3013848

Computer Science > Computer Vision and Pattern Recognition

arXiv:1912.06321 (cs)

[Submitted on 13 Dec 2019 (v1), last revised 17 Aug 2020 (this version, v2)]

Title:Sim2Real Predictivity: Does Evaluation in Simulation Predict Real-World Performance?

Authors:Abhishek Kadian, Joanne Truong, Aaron Gokaslan, Alexander Clegg, Erik Wijmans, Stefan Lee, Manolis Savva, Sonia Chernova, Dhruv Batra

View PDF

Abstract:Does progress in simulation translate to progress on robots? If one method outperforms another in simulation, how likely is that trend to hold in reality on a robot? We examine this question for embodied PointGoal navigation, developing engineering tools and a research paradigm for evaluating a simulator by its sim2real predictivity. First, we develop Habitat-PyRobot Bridge (HaPy), a library for seamless execution of identical code on simulated agents and robots, transferring simulation-trained agents to a LoCoBot platform with a one-line code change. Second, we investigate the sim2real predictivity of Habitat-Sim for PointGoal navigation. We 3D-scan a physical lab space to create a virtualized replica, and run parallel tests of 9 different models in reality and simulation. We present a new metric called Sim-vs-Real Correlation Coefficient (SRCC) to quantify predictivity. We find that SRCC for Habitat as used for the CVPR19 challenge is low (0.18 for the success metric), suggesting that performance differences in this simulator-based challenge do not persist after physical deployment. This gap is largely due to AI agents learning to exploit simulator imperfections, abusing collision dynamics to 'slide' along walls, leading to shortcuts through otherwise non-navigable space. Naturally, such exploits do not work in the real world. Our experiments show that it is possible to tune simulation parameters to improve sim2real predictivity (e.g. improving $SRCC_{Succ}$ from 0.18 to 0.844), increasing confidence that in-simulation comparisons will translate to deployed systems in reality.

Subjects:	Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Robotics (cs.RO)
Cite as:	arXiv:1912.06321 [cs.CV]
	(or arXiv:1912.06321v2 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.1912.06321
Journal reference:	IEEE Robotics and Automation Letters (RA-L) 2020
Related DOI:	https://doi.org/10.1109/LRA.2020.3013848

Submission history

From: Joanne Truong [view email]
[v1] Fri, 13 Dec 2019 04:29:38 UTC (8,411 KB)
[v2] Mon, 17 Aug 2020 03:26:55 UTC (41,078 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:Sim2Real Predictivity: Does Evaluation in Simulation Predict Real-World Performance?

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:Sim2Real Predictivity: Does Evaluation in Simulation Predict Real-World Performance?

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators