Adapting Segment Anything Model for Unseen Object Instance Segmentation

Cao, Rui; Song, Chuanxin; Yang, Biqi; Wang, Jiangliu; Heng, Pheng-Ann; Liu, Yun-Hui

arXiv:2409.15481 (cs)

[Submitted on 23 Sep 2024]

Abstract: Unseen Object Instance Segmentation (UOIS) is crucial for autonomous robots operating in unstructured environments. Previous approaches require full supervision on large-scale tabletop datasets for effective pretraining. In this paper, we propose UOIS-SAM, a data-efficient solution for the UOIS task that leverages SAM's high accuracy and strong generalization capabilities. UOIS-SAM integrates two key components: (i) a Heatmap-based Prompt Generator (HPG) that generates class-agnostic point prompts with precise foreground prediction, and (ii) a Hierarchical Discrimination Network (HDNet) that adapts SAM's mask decoder, mitigating issues introduced by the SAM baseline, such as background confusion and over-segmentation, especially in scenarios involving occlusion and texture-rich objects. Extensive experimental results on OCID, OSD, and additional photometrically challenging datasets, including PhoCAL and HouseCat6D, demonstrate that, even when trained on only 10% of the training samples used by previous methods, UOIS-SAM achieves state-of-the-art performance in unseen object segmentation, highlighting its effectiveness and robustness in various tabletop scenes.
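
The abstract does not say how HPG turns its foreground heatmap into point prompts. As an illustration only, one common way to get class-agnostic points from a heatmap is to keep local maxima above a score threshold. The sketch below assumes that approach; the function name `heatmap_to_point_prompts`, the 3x3 neighbourhood, and the default threshold of 0.5 are hypothetical choices, not details taken from the paper.

```python
from typing import List, Tuple


def heatmap_to_point_prompts(
    heatmap: List[List[float]],
    threshold: float = 0.5,
) -> List[Tuple[int, int]]:
    """Return (x, y) pixel coordinates of local maxima in a 2D heatmap.

    Hypothetical helper, not the paper's HPG: a pixel is kept when its
    score is at least `threshold` and no 8-connected neighbour has a
    strictly higher score. Tied plateau pixels are all returned.
    """
    height = len(heatmap)
    width = len(heatmap[0]) if height else 0
    points: List[Tuple[int, int]] = []
    for y in range(height):
        for x in range(width):
            score = heatmap[y][x]
            if score < threshold:
                continue  # too weak to count as foreground
            is_peak = True
            for dy in (-1, 0, 1):
                for dx in (-1, 0, 1):
                    ny, nx = y + dy, x + dx
                    inside = 0 <= ny < height and 0 <= nx < width
                    if (dy or dx) and inside and heatmap[ny][nx] > score:
                        is_peak = False
            if is_peak:
                points.append((x, y))  # (x, y) order, row-major scan
    return points


if __name__ == "__main__":
    demo = [
        [0.0, 0.1, 0.0, 0.0, 0.0],
        [0.1, 0.9, 0.1, 0.0, 0.0],
        [0.0, 0.1, 0.0, 0.2, 0.0],
        [0.0, 0.0, 0.2, 0.8, 0.2],
        [0.0, 0.0, 0.0, 0.2, 0.3],
    ]
    print(heatmap_to_point_prompts(demo))
```

The sketch returns points in (x, y) pixel order because that is the convention the publicly released segment-anything package uses for point prompts. A learned generator such as the one the abstract describes would presumably produce the heatmap itself, which is outside the scope of this example.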

Comments:	Submitted to ICRA 2025
Subjects:	Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2409.15481 [cs.RO]
	(or arXiv:2409.15481v1 [cs.RO] for this version)
	https://doi.org/10.48550/arXiv.2409.15481

Submission history

From: Rui Cao
[v1] Mon, 23 Sep 2024 19:05:50 UTC (1,466 KB)
