Regional Semantic Contrast and Aggregation for Weakly Supervised Semantic Segmentation

Tianfei Zhou; Meijie Zhang; Fang Zhao; Jianwu Li

doi:10.1109/CVPR52688.2022.00426

Regional Semantic Contrast and Aggregation for Weakly Supervised Semantic Segmentation

Tianfei Zhou, Meijie Zhang, Fang Zhao, Jianwu Li^*

^*此作品的通讯作者

计算机学院

科研成果: 书/报告/会议事项章节 › 会议稿件 › 同行评审

88 引用（Scopus）

摘要

Learning semantic segmentation from weakly-labeled (e.g., image tags only) data is challenging since it is hard to infer dense object regions from sparse semantic tags. Despite being broadly studied, most current efforts directly learn from limited semantic annotations carried by individual image or image pairs, and struggle to obtain integral localization maps. Our work alleviates this from a novel perspective, by exploring rich semantic contexts synergistically among abundant weakly-labeled training data for network learning and inference. In particular, we propose regional semantic contrast and aggregation (RCA). RCA is equipped with a regional memory bank to store massive, diverse object patterns appearing in training data, which acts as strong support for exploration of dataset-level semantic structure. Particularly, we propose i) semantic contrast to drive network learning by contrasting massive categorical object regions, leading to a more holistic object pattern understanding, and ii) semantic aggregation to gather diverse relational contexts in the memory to enrich semantic repre-sentations. In this manner, RCA earns a strong capability of fine-grained semantic understanding, and eventually establishes new state-of-the-art results on two popular benchmarks, i.e., PASCAL VOC 2012 and COCO 2014.

源语言	英语
主期刊名	Proceedings - 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition, CVPR 2022
出版商	IEEE Computer Society
页	4289-4299
页数	11
ISBN（电子版）	9781665469463
DOI	https://doi.org/10.1109/CVPR52688.2022.00426
出版状态	已出版 - 2022
活动	2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition, CVPR 2022 - New Orleans, 美国期限: 19 6月 2022 → 24 6月 2022

出版系列

姓名	Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition
卷	2022-June
ISSN（印刷版）	1063-6919

会议

会议	2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition, CVPR 2022
国家/地区	美国
市	New Orleans
时期	19/06/22 → 24/06/22

访问文件

10.1109/CVPR52688.2022.00426

其它文件与链接

链接到 Scopus 的出版物

引用此

Zhou, T., Zhang, M., Zhao, F., & Li, J. (2022). Regional Semantic Contrast and Aggregation for Weakly Supervised Semantic Segmentation. 在 Proceedings - 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition, CVPR 2022 (页码 4289-4299). (Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition; 卷 2022-June). IEEE Computer Society. https://doi.org/10.1109/CVPR52688.2022.00426

Zhou, Tianfei ; Zhang, Meijie ; Zhao, Fang 等. / Regional Semantic Contrast and Aggregation for Weakly Supervised Semantic Segmentation. Proceedings - 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition, CVPR 2022. IEEE Computer Society, 2022. 页码 4289-4299 (Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition).

@inproceedings{eaf88748569140d39a7f270fbe9c9543,

title = "Regional Semantic Contrast and Aggregation for Weakly Supervised Semantic Segmentation",

abstract = "Learning semantic segmentation from weakly-labeled (e.g., image tags only) data is challenging since it is hard to infer dense object regions from sparse semantic tags. Despite being broadly studied, most current efforts directly learn from limited semantic annotations carried by individual image or image pairs, and struggle to obtain integral localization maps. Our work alleviates this from a novel perspective, by exploring rich semantic contexts synergistically among abundant weakly-labeled training data for network learning and inference. In particular, we propose regional semantic contrast and aggregation (RCA). RCA is equipped with a regional memory bank to store massive, diverse object patterns appearing in training data, which acts as strong support for exploration of dataset-level semantic structure. Particularly, we propose i) semantic contrast to drive network learning by contrasting massive categorical object regions, leading to a more holistic object pattern understanding, and ii) semantic aggregation to gather diverse relational contexts in the memory to enrich semantic repre-sentations. In this manner, RCA earns a strong capability of fine-grained semantic understanding, and eventually establishes new state-of-the-art results on two popular benchmarks, i.e., PASCAL VOC 2012 and COCO 2014.",

keywords = "Scene analysis and understanding, Segmentation, grouping and shape analysis",

author = "Tianfei Zhou and Meijie Zhang and Fang Zhao and Jianwu Li",

note = "Publisher Copyright: {\textcopyright} 2022 IEEE.; 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition, CVPR 2022 ; Conference date: 19-06-2022 Through 24-06-2022",

year = "2022",

doi = "10.1109/CVPR52688.2022.00426",

language = "English",

series = "Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition",

publisher = "IEEE Computer Society",

pages = "4289--4299",

booktitle = "Proceedings - 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition, CVPR 2022",

address = "United States",

}

Zhou, T, Zhang, M, Zhao, F & Li, J 2022, Regional Semantic Contrast and Aggregation for Weakly Supervised Semantic Segmentation. 在 Proceedings - 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition, CVPR 2022. Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 卷 2022-June, IEEE Computer Society, 页码 4289-4299, 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition, CVPR 2022, New Orleans, 美国, 19/06/22. https://doi.org/10.1109/CVPR52688.2022.00426

Regional Semantic Contrast and Aggregation for Weakly Supervised Semantic Segmentation. / Zhou, Tianfei; Zhang, Meijie; Zhao, Fang 等.
Proceedings - 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition, CVPR 2022. IEEE Computer Society, 2022. 页码 4289-4299 (Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition; 卷 2022-June).

科研成果: 书/报告/会议事项章节 › 会议稿件 › 同行评审

TY - GEN

T1 - Regional Semantic Contrast and Aggregation for Weakly Supervised Semantic Segmentation

AU - Zhou, Tianfei

AU - Zhang, Meijie

AU - Zhao, Fang

AU - Li, Jianwu

PY - 2022

Y1 - 2022

N2 - Learning semantic segmentation from weakly-labeled (e.g., image tags only) data is challenging since it is hard to infer dense object regions from sparse semantic tags. Despite being broadly studied, most current efforts directly learn from limited semantic annotations carried by individual image or image pairs, and struggle to obtain integral localization maps. Our work alleviates this from a novel perspective, by exploring rich semantic contexts synergistically among abundant weakly-labeled training data for network learning and inference. In particular, we propose regional semantic contrast and aggregation (RCA). RCA is equipped with a regional memory bank to store massive, diverse object patterns appearing in training data, which acts as strong support for exploration of dataset-level semantic structure. Particularly, we propose i) semantic contrast to drive network learning by contrasting massive categorical object regions, leading to a more holistic object pattern understanding, and ii) semantic aggregation to gather diverse relational contexts in the memory to enrich semantic repre-sentations. In this manner, RCA earns a strong capability of fine-grained semantic understanding, and eventually establishes new state-of-the-art results on two popular benchmarks, i.e., PASCAL VOC 2012 and COCO 2014.

AB - Learning semantic segmentation from weakly-labeled (e.g., image tags only) data is challenging since it is hard to infer dense object regions from sparse semantic tags. Despite being broadly studied, most current efforts directly learn from limited semantic annotations carried by individual image or image pairs, and struggle to obtain integral localization maps. Our work alleviates this from a novel perspective, by exploring rich semantic contexts synergistically among abundant weakly-labeled training data for network learning and inference. In particular, we propose regional semantic contrast and aggregation (RCA). RCA is equipped with a regional memory bank to store massive, diverse object patterns appearing in training data, which acts as strong support for exploration of dataset-level semantic structure. Particularly, we propose i) semantic contrast to drive network learning by contrasting massive categorical object regions, leading to a more holistic object pattern understanding, and ii) semantic aggregation to gather diverse relational contexts in the memory to enrich semantic repre-sentations. In this manner, RCA earns a strong capability of fine-grained semantic understanding, and eventually establishes new state-of-the-art results on two popular benchmarks, i.e., PASCAL VOC 2012 and COCO 2014.

KW - Scene analysis and understanding

KW - Segmentation

KW - grouping and shape analysis

UR - http://www.scopus.com/inward/record.url?scp=85143062884&partnerID=8YFLogxK

U2 - 10.1109/CVPR52688.2022.00426

DO - 10.1109/CVPR52688.2022.00426

M3 - Conference contribution

AN - SCOPUS:85143062884

T3 - Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition

SP - 4289

EP - 4299

BT - Proceedings - 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition, CVPR 2022

PB - IEEE Computer Society

T2 - 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition, CVPR 2022

Y2 - 19 June 2022 through 24 June 2022

ER -

Zhou T, Zhang M, Zhao F, Li J. Regional Semantic Contrast and Aggregation for Weakly Supervised Semantic Segmentation. 在 Proceedings - 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition, CVPR 2022. IEEE Computer Society. 2022. 页码 4289-4299. (Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition). doi: 10.1109/CVPR52688.2022.00426

Regional Semantic Contrast and Aggregation for Weakly Supervised Semantic Segmentation

摘要

出版系列

会议

访问文件

其它文件与链接

指纹

引用此