Group-Wise Semantic Mining for Weakly Supervised Semantic Segmentation

Xueyi Li; Tianfei Zhou; Jianwu Li; Yi Zhou; Zhaoxiang Zhang

Group-Wise Semantic Mining for Weakly Supervised Semantic Segmentation

Xueyi Li, Tianfei Zhou^*, Jianwu Li, Yi Zhou, Zhaoxiang Zhang

^*此作品的通讯作者

计算机学院

科研成果: 书/报告/会议事项章节 › 会议稿件 › 同行评审

80 引用（Scopus）

摘要

Acquiring sufficient ground-truth supervision to train deep visual models has been a bottleneck over the years due to the data-hungry nature of deep learning. This is exacerbated in some structured prediction tasks, such as semantic segmentation, which requires pixel-level annotations. This work addresses weakly supervised semantic segmentation (WSSS), with the goal of bridging the gap between image-level annotations and pixel-level segmentation. We formulate WSSS as a novel group-wise learning task that explicitly models semantic dependencies in a group of images to estimate more reliable pseudo ground-truths, which can be used for training more accurate segmentation models. In particular, we devise a graph neural network (GNN) for group-wise semantic mining, wherein input images are represented as graph nodes, and the underlying relations between a pair of images are characterized by an efficient co-attention mechanism. Moreover, in order to prevent the model from paying excessive attention to common semantics only, we further propose a graph dropout layer, encouraging the model to learn more accurate and complete object responses. The whole network is end-to-end trainable by iterative message passing, which propagates interaction cues over the images to progressively improve the performance. We conduct experiments on the popular PASCAL VOC 2012 and COCO benchmarks, and our model yields state-of-the-art performance. Our code is available at: https://github.com/Lixy1997/Group-WSSS.

源语言	英语
主期刊名	35th AAAI Conference on Artificial Intelligence, AAAI 2021
出版商	Association for the Advancement of Artificial Intelligence
页	1984-1992
页数	9
ISBN（电子版）	9781713835974
出版状态	已出版 - 2021
活动	35th AAAI Conference on Artificial Intelligence, AAAI 2021 - Virtual, Online 期限: 2 2月 2021 → 9 2月 2021

出版系列

姓名	35th AAAI Conference on Artificial Intelligence, AAAI 2021
卷	3A

会议

会议	35th AAAI Conference on Artificial Intelligence, AAAI 2021
市	Virtual, Online
时期	2/02/21 → 9/02/21

其它文件与链接

链接到 Scopus 的出版物

引用此

@inproceedings{f47a33d12ae04f8bb63d09cce5a05e6c,

title = "Group-Wise Semantic Mining for Weakly Supervised Semantic Segmentation",

abstract = "Acquiring sufficient ground-truth supervision to train deep visual models has been a bottleneck over the years due to the data-hungry nature of deep learning. This is exacerbated in some structured prediction tasks, such as semantic segmentation, which requires pixel-level annotations. This work addresses weakly supervised semantic segmentation (WSSS), with the goal of bridging the gap between image-level annotations and pixel-level segmentation. We formulate WSSS as a novel group-wise learning task that explicitly models semantic dependencies in a group of images to estimate more reliable pseudo ground-truths, which can be used for training more accurate segmentation models. In particular, we devise a graph neural network (GNN) for group-wise semantic mining, wherein input images are represented as graph nodes, and the underlying relations between a pair of images are characterized by an efficient co-attention mechanism. Moreover, in order to prevent the model from paying excessive attention to common semantics only, we further propose a graph dropout layer, encouraging the model to learn more accurate and complete object responses. The whole network is end-to-end trainable by iterative message passing, which propagates interaction cues over the images to progressively improve the performance. We conduct experiments on the popular PASCAL VOC 2012 and COCO benchmarks, and our model yields state-of-the-art performance. Our code is available at: https://github.com/Lixy1997/Group-WSSS.",

author = "Xueyi Li and Tianfei Zhou and Jianwu Li and Yi Zhou and Zhaoxiang Zhang",

note = "Publisher Copyright: Copyright {\textcopyright} 2021, Association for the Advancement of Artificial Intelligence (www.aaai.org). All rights reserved; 35th AAAI Conference on Artificial Intelligence, AAAI 2021 ; Conference date: 02-02-2021 Through 09-02-2021",

year = "2021",

language = "English",

series = "35th AAAI Conference on Artificial Intelligence, AAAI 2021",

publisher = "Association for the Advancement of Artificial Intelligence",

pages = "1984--1992",

booktitle = "35th AAAI Conference on Artificial Intelligence, AAAI 2021",

}

Li, X, Zhou, T , Li, J, Zhou, Y & Zhang, Z 2021, Group-Wise Semantic Mining for Weakly Supervised Semantic Segmentation. 在 35th AAAI Conference on Artificial Intelligence, AAAI 2021. 35th AAAI Conference on Artificial Intelligence, AAAI 2021, 卷 3A, Association for the Advancement of Artificial Intelligence, 页码 1984-1992, 35th AAAI Conference on Artificial Intelligence, AAAI 2021, Virtual, Online, 2/02/21.

Group-Wise Semantic Mining for Weakly Supervised Semantic Segmentation. / Li, Xueyi; Zhou, Tianfei ; Li, Jianwu 等.
35th AAAI Conference on Artificial Intelligence, AAAI 2021. Association for the Advancement of Artificial Intelligence, 2021. 页码 1984-1992 (35th AAAI Conference on Artificial Intelligence, AAAI 2021; 卷 3A).

科研成果: 书/报告/会议事项章节 › 会议稿件 › 同行评审

TY - GEN

T1 - Group-Wise Semantic Mining for Weakly Supervised Semantic Segmentation

AU - Li, Xueyi

AU - Zhou, Tianfei

AU - Li, Jianwu

AU - Zhou, Yi

AU - Zhang, Zhaoxiang

PY - 2021

Y1 - 2021

N2 - Acquiring sufficient ground-truth supervision to train deep visual models has been a bottleneck over the years due to the data-hungry nature of deep learning. This is exacerbated in some structured prediction tasks, such as semantic segmentation, which requires pixel-level annotations. This work addresses weakly supervised semantic segmentation (WSSS), with the goal of bridging the gap between image-level annotations and pixel-level segmentation. We formulate WSSS as a novel group-wise learning task that explicitly models semantic dependencies in a group of images to estimate more reliable pseudo ground-truths, which can be used for training more accurate segmentation models. In particular, we devise a graph neural network (GNN) for group-wise semantic mining, wherein input images are represented as graph nodes, and the underlying relations between a pair of images are characterized by an efficient co-attention mechanism. Moreover, in order to prevent the model from paying excessive attention to common semantics only, we further propose a graph dropout layer, encouraging the model to learn more accurate and complete object responses. The whole network is end-to-end trainable by iterative message passing, which propagates interaction cues over the images to progressively improve the performance. We conduct experiments on the popular PASCAL VOC 2012 and COCO benchmarks, and our model yields state-of-the-art performance. Our code is available at: https://github.com/Lixy1997/Group-WSSS.

AB - Acquiring sufficient ground-truth supervision to train deep visual models has been a bottleneck over the years due to the data-hungry nature of deep learning. This is exacerbated in some structured prediction tasks, such as semantic segmentation, which requires pixel-level annotations. This work addresses weakly supervised semantic segmentation (WSSS), with the goal of bridging the gap between image-level annotations and pixel-level segmentation. We formulate WSSS as a novel group-wise learning task that explicitly models semantic dependencies in a group of images to estimate more reliable pseudo ground-truths, which can be used for training more accurate segmentation models. In particular, we devise a graph neural network (GNN) for group-wise semantic mining, wherein input images are represented as graph nodes, and the underlying relations between a pair of images are characterized by an efficient co-attention mechanism. Moreover, in order to prevent the model from paying excessive attention to common semantics only, we further propose a graph dropout layer, encouraging the model to learn more accurate and complete object responses. The whole network is end-to-end trainable by iterative message passing, which propagates interaction cues over the images to progressively improve the performance. We conduct experiments on the popular PASCAL VOC 2012 and COCO benchmarks, and our model yields state-of-the-art performance. Our code is available at: https://github.com/Lixy1997/Group-WSSS.

UR - http://www.scopus.com/inward/record.url?scp=85130065167&partnerID=8YFLogxK

M3 - Conference contribution

AN - SCOPUS:85130065167

T3 - 35th AAAI Conference on Artificial Intelligence, AAAI 2021

SP - 1984

EP - 1992

BT - 35th AAAI Conference on Artificial Intelligence, AAAI 2021

PB - Association for the Advancement of Artificial Intelligence

T2 - 35th AAAI Conference on Artificial Intelligence, AAAI 2021

Y2 - 2 February 2021 through 9 February 2021

ER -

Group-Wise Semantic Mining for Weakly Supervised Semantic Segmentation

摘要

出版系列

会议

其它文件与链接

指纹

引用此