Adapting Segment Anything Model for Unseen Object Instance Segmentation
Cao, Rui, Song, Chuanxin, Yang, Biqi, Wang, Jiangliu, Heng, Pheng-Ann, Liu, Yun-Hui
–arXiv.org Artificial Intelligence
Unseen Object Instance Segmentation (UOIS) is crucial for autonomous robots operating in unstructured environments. Previous approaches require full supervision on large-scale tabletop datasets for effective pretraining. In this paper, we propose UOIS-SAM, a data-efficient solution for the UOIS task that leverages SAM's high accuracy and strong generalization capabilities. UOIS-SAM integrates two key components: (i) a Heatmap-based Prompt Generator (HPG) to generate class-agnostic point prompts with precise foreground prediction, and (ii) a Hierarchical Discrimination Network (HDNet) that adapts SAM's mask decoder, mitigating issues introduced by the SAM baseline, such as background confusion and over-segmentation, especially in scenarios involving occlusion and texture-rich objects. Extensive experimental results on OCID, OSD, and additional photometrically challenging datasets including PhoCAL and HouseCat6D, demonstrate that, even using only 10% of the training samples compared to previous methods, UOIS-SAM achieves state-of-the-art performance in unseen object segmentation, highlighting its effectiveness and robustness in various tabletop scenes.
arXiv.org Artificial Intelligence
Sep-23-2024
- Genre:
- Research Report (0.40)
- Industry:
- Media (0.37)
- Technology:
- Information Technology
- Artificial Intelligence
- Machine Learning > Neural Networks (0.68)
- Robots (1.00)
- Vision (1.00)
- Sensing and Signal Processing > Image Processing (1.00)
- Artificial Intelligence
- Information Technology