Multimodal and Force-Matched Imitation Learning with a See-Through Visuotactile Sensor

Ablett, Trevor; Limoyo, Oliver; Sigal, Adam; Jilani, Affan; Kelly, Jonathan; Siddiqi, Kaleem; Hogan, Francois; Dudek, Gregory

doi:10.1109/TRO.2024.3521864

Computer Science > Robotics

arXiv:2311.01248 (cs)

[Submitted on 2 Nov 2023 (v1), last revised 26 Jan 2025 (this version, v5)]

Title:Multimodal and Force-Matched Imitation Learning with a See-Through Visuotactile Sensor

Authors:Trevor Ablett, Oliver Limoyo, Adam Sigal, Affan Jilani, Jonathan Kelly, Kaleem Siddiqi, Francois Hogan, Gregory Dudek

View PDF HTML (experimental)

Abstract:Contact-rich tasks continue to present many challenges for robotic manipulation. In this work, we leverage a multimodal visuotactile sensor within the framework of imitation learning (IL) to perform contact-rich tasks that involve relative motion (e.g., slipping and sliding) between the end-effector and the manipulated object. We introduce two algorithmic contributions, tactile force matching and learned mode switching, as complimentary methods for improving IL. Tactile force matching enhances kinesthetic teaching by reading approximate forces during the demonstration and generating an adapted robot trajectory that recreates the recorded forces. Learned mode switching uses IL to couple visual and tactile sensor modes with the learned motion policy, simplifying the transition from reaching to contacting. We perform robotic manipulation experiments on four door-opening tasks with a variety of observation and algorithm configurations to study the utility of multimodal visuotactile sensing and our proposed improvements. Our results show that the inclusion of force matching raises average policy success rates by 62.5%, visuotactile mode switching by 30.3%, and visuotactile data as a policy input by 42.5%, emphasizing the value of see-through tactile sensing for IL, both for data collection to allow force matching, and for policy execution to enable accurate task feedback. Project site: this https URL

Comments:	14 pages, 22 figures
Subjects:	Robotics (cs.RO); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
Cite as:	arXiv:2311.01248 [cs.RO]
	(or arXiv:2311.01248v5 [cs.RO] for this version)
	https://doi.org/10.48550/arXiv.2311.01248
Journal reference:	IEEE Transactions on Robotics (T-RO), Vol. 41, pp. 946-959, Jan. 2025
Related DOI:	https://doi.org/10.1109/TRO.2024.3521864

Submission history

From: Jonathan Kelly [view email]
[v1] Thu, 2 Nov 2023 14:02:42 UTC (5,807 KB)
[v2] Fri, 22 Dec 2023 19:27:53 UTC (12,958 KB)
[v3] Wed, 26 Jun 2024 17:40:14 UTC (13,609 KB)
[v4] Tue, 3 Dec 2024 20:56:48 UTC (13,609 KB)
[v5] Sun, 26 Jan 2025 15:03:06 UTC (13,609 KB)

Computer Science > Robotics

Title:Multimodal and Force-Matched Imitation Learning with a See-Through Visuotactile Sensor

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Robotics

Title:Multimodal and Force-Matched Imitation Learning with a See-Through Visuotactile Sensor

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators