Language Guided Robotic Grasping with Fine-Grained Instructions

Qiang Sun, Haitao Lin, Ying Fu, Yanwei Fu, Xiangyang Xue

科研成果: 书/报告/会议事项章节会议稿件同行评审

摘要

Given a single RGB image and the attribute-rich language instructions, this paper investigates the novel problem of using Fine-grained instructions for the Language guided robotic Grasping (FLarG). This problem is made challenging by learning fine-grained language descriptions to ground target objects. Recent advances have been made in visually grounding the objects simply by several coarse attributes [1]. However, these methods have poor performance as they cannot well align the multi-modal features, and do not make the best of recent powerful large pre-trained vision and language models, e.g., CLIP. To this end, this paper proposes a FLarG pipeline including stages of CLIP-guided object localization, and 6-DoF category-level object pose estimation for grasping. Specially, we first take the CLIP-based segmentation model CRIS as the backbone and propose an end-to-end DyCRIS model that uses a novel dynamic mask strategy to well fuse the multi-level language and vision features. Then, the well-trained instance segmentation backbone Mask R-CNN is adopted to further improve the predicted mask of our DyCRIS. Finally, the target object pose is inferred for the robotics grasping by using the recent 6-DoF object pose estimation method. To validate our CLIP-enhanced pipeline, we also construct a validation dataset for our FLarG task and name it RefNOCS. Extensive results on RefNOCS have shown the utility and effectiveness of our proposed method. The project homepage is available at https://sunqiang85.github.ioIFLarG/.

源语言英语
主期刊名2023 IEEE/RSJ International Conference on Intelligent Robots and Systems, IROS 2023
出版商Institute of Electrical and Electronics Engineers Inc.
1319-1326
页数8
ISBN(电子版)9781665491907
DOI
出版状态已出版 - 2023
活动2023 IEEE/RSJ International Conference on Intelligent Robots and Systems, IROS 2023 - Detroit, 美国
期限: 1 10月 20235 10月 2023

出版系列

姓名IEEE International Conference on Intelligent Robots and Systems
ISSN(印刷版)2153-0858
ISSN(电子版)2153-0866

会议

会议2023 IEEE/RSJ International Conference on Intelligent Robots and Systems, IROS 2023
国家/地区美国
Detroit
时期1/10/235/10/23

指纹

探究 'Language Guided Robotic Grasping with Fine-Grained Instructions' 的科研主题。它们共同构成独一无二的指纹。

引用此