融合双阶段特征与 Transformer 编码的交互式图像分割

Translated title of the contribution: Interactive Image Segmentation Based on Fusion of Two-Stage Feature and Transformer Encoder
  • Jun Feng
  • , Tian Zhang
  • , Yichen Shi
  • , Hui Wang
  • , Jingjing Hu

Research output: Contribution to journalArticlepeer-review

Abstract

In order to segment the foreground objects that users are interested in quickly and accurately, and obtain high-quality and low-cost annotation segmentation data, an interactive image segmentation algorithm based on two-stage feature fusion and Transformer encoder is proposed. Firstly, lightweight Transformer backbone network is adopted to extract multi-scale feature coding for input image, which can make better use of context information. Then, the subjective prior knowledge is introduced by means of click interaction, and the interactive features are integrated into Transformer network through the primary and enhanced stages in turn. Finally, the atrous convolution, attention mechanism and multi-layer perceptron are combined to decode the feature map obtained by the backbone network. Experimental results show that mNoC@90% values of the proposed algorithm on the GrabCut, Berkeley and DAVIS datasets reach 2.18, 4.04 and 7.39 respectively, which is better than other comparison algorithms. And the time and space complexity is lower than that of f-BRS-B. The proposed algorithm has good stability to the disturbance change of interactive click position and click type. It shows that the proposed algorithm can quickly, accurately and stably segment users’ interested objects, and improve user interaction experience.

Translated title of the contributionInteractive Image Segmentation Based on Fusion of Two-Stage Feature and Transformer Encoder
Original languageChinese (Traditional)
Pages (from-to)831-843
Number of pages13
JournalJisuanji Fuzhu Sheji Yu Tuxingxue Xuebao/Journal of Computer-Aided Design and Computer Graphics
Volume36
Issue number6
DOIs
Publication statusPublished - Jun 2024

Fingerprint

Dive into the research topics of 'Interactive Image Segmentation Based on Fusion of Two-Stage Feature and Transformer Encoder'. Together they form a unique fingerprint.

Cite this