TY - JOUR
T1 - Part Decomposition and Refinement Network for Human Parsing
AU - Yang, Lu
AU - Liu, Zhiwei
AU - Zhou, Tianfei
AU - Song, Qing
N1 - Publisher Copyright:
© 2014 Chinese Association of Automation.
PY - 2022/6/1
Y1 - 2022/6/1
N2 - Dear Editor, This letter is concerned with human parsing based on part-wise semantic prediction. Human body can be regarded as a whole structure composed of different semantic parts, and the mainstream single human parser uses semantic segmentation pipeline to solve this problem. However, the differences between human parsing and semantic segmentation tasks bring some issues that are inevitable to avoid. In this paper, we propose a novel method called part decomposition and refinement network (PDRNet), which adopt part-wise mask prediction other than pixel-wise semantic prediction to tackle human parsing task. Specifically, we decompose the human body into different semantic parts and design a decomposition module to learn the central position of each part. The refinement module is proposed to obtain the mask of each human part by learning convolution kernel and convolved feature. In inference stage, the predicted human part masks are combined into a complete human parsing result. Through the decomposition, refinement and combination of human parts, PDRNet greatly reduces the confusion between the target human and the background human, and also significantly improves the semantic consistency of human part. Extensive experiments show that PDRNet performs favorably against state-of-the-art methods on several human parsing benchmarks, including LIP, CIHP and Pascal-Person-Part.
AB - Dear Editor, This letter is concerned with human parsing based on part-wise semantic prediction. Human body can be regarded as a whole structure composed of different semantic parts, and the mainstream single human parser uses semantic segmentation pipeline to solve this problem. However, the differences between human parsing and semantic segmentation tasks bring some issues that are inevitable to avoid. In this paper, we propose a novel method called part decomposition and refinement network (PDRNet), which adopt part-wise mask prediction other than pixel-wise semantic prediction to tackle human parsing task. Specifically, we decompose the human body into different semantic parts and design a decomposition module to learn the central position of each part. The refinement module is proposed to obtain the mask of each human part by learning convolution kernel and convolved feature. In inference stage, the predicted human part masks are combined into a complete human parsing result. Through the decomposition, refinement and combination of human parts, PDRNet greatly reduces the confusion between the target human and the background human, and also significantly improves the semantic consistency of human part. Extensive experiments show that PDRNet performs favorably against state-of-the-art methods on several human parsing benchmarks, including LIP, CIHP and Pascal-Person-Part.
UR - http://www.scopus.com/inward/record.url?scp=85131743366&partnerID=8YFLogxK
U2 - 10.1109/JAS.2022.105647
DO - 10.1109/JAS.2022.105647
M3 - Article
AN - SCOPUS:85131743366
SN - 2329-9266
VL - 9
SP - 1111
EP - 1114
JO - IEEE/CAA Journal of Automatica Sinica
JF - IEEE/CAA Journal of Automatica Sinica
IS - 6
ER -