MFS enhanced SAM: Achieving superior performance in bimodal few-shot segmentation

Ying Zhao, Kechen Song*, Wenqi Cui, Hang Ren, Yunhui Yan

*此作品的通讯作者

科研成果: 期刊稿件文章同行评审

1 引用 (Scopus)

摘要

Recently, Segment Anything Model (SAM) has become popular in computer vision field because of its powerful image segmentation ability and high interactivity of various prompts, which opens a new era of large vision foundation models. But is SAM really omnipotent? In this letter, we establish a comprehensive bimodal few-shot segmentation indoor dataset VT-840-5i, and compare SAM with eight state-of-the-art few-shot segmentation (FSS) methods on two benchmark datasets. Qualitative and quantitative experiment results show that although SAM is very effective in general object segmentation, it still has room for improvement in some challenging scenarios. Therefore, we introduce thermal infrared auxiliary information into the segmentation task and provide multiple fusion strategies (MFS) for readers to choose the most suitable approach for the specific task. Finally, we discuss several potential research trends about SAM in the future. Our test results are available at: https://github.com/VDT-2048/Bi-SAM.

源语言英语
文章编号103946
期刊Journal of Visual Communication and Image Representation
97
DOI
出版状态已出版 - 12月 2023
已对外发布

指纹

探究 'MFS enhanced SAM: Achieving superior performance in bimodal few-shot segmentation' 的科研主题。它们共同构成独一无二的指纹。

引用此