Efficient Semantic-Guidance High-Resolution Video Matting

Yue Yu*, Ding Li, Yulin Yang

*Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

Video matting has made significant progress in trimap-based field. However, researchers are increasingly interested in auxiliary-free matting because it is more useful in real-world applications. We propose a new efficient semantic-guidance high-resolution video matting network for human body. We apply the convolutional network as the backbone while also employing the transformer in the encoder, which is used to utilize semantic features, while ensuring that the network is not overly bloated. In addition, a channel-wise attention mechanism is introduced in the decoder to improve the representation of semantic feature. In comparison to the current state-of-the-art methods, the method proposed in this paper achieves better results while maintaining the speed and efficiency of prediction. We can complete the real-time auxiliary-free matting for high-resolution video (4K or HD).

Original languageEnglish
Title of host publicationAdvances in Computer Graphics - 40th Computer Graphics International Conference, CGI 2023, Proceedings
EditorsBin Sheng, Lei Bi, Jinman Kim, Nadia Magnenat-Thalmann, Daniel Thalmann
PublisherSpringer Science and Business Media Deutschland GmbH
Pages143-154
Number of pages12
ISBN (Print)9783031500688
DOIs
Publication statusPublished - 2024
Event40th Computer Graphics International Conference, CGI 2023 - Shanghai, China
Duration: 28 Aug 20231 Sept 2023

Publication series

NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume14495
ISSN (Print)0302-9743
ISSN (Electronic)1611-3349

Conference

Conference40th Computer Graphics International Conference, CGI 2023
Country/TerritoryChina
CityShanghai
Period28/08/231/09/23

Keywords

  • Attention Mechanism
  • Auxiliary-Free Video Matting Network
  • Transformer
  • Video Matting

Fingerprint

Dive into the research topics of 'Efficient Semantic-Guidance High-Resolution Video Matting'. Together they form a unique fingerprint.

Cite this