基于高分辨率网络的人体姿态估计方法

Hao Pan Ren; Wen Ming Wang; De Jian Wei; Yan Yan Gao; Zhi Hui Kang; Quan Yu Wang

doi:10.11996/JG.j.2095-302X.2021030432

基于高分辨率网络的人体姿态估计方法

Hao Pan Ren, Wen Ming Wang^*, De Jian Wei, Yan Yan Gao, Zhi Hui Kang, Quan Yu Wang

^*此作品的通讯作者

计算机学院

Beijing Institute of Technology

科研成果: 期刊稿件 › 文章 › 同行评审

8 引用（Scopus）

摘要

Human pose estimation plays a vital role in human-computer interaction and behavior recognition applications, but the changing scale of feature maps poses a challenge to the relevant methods in predicting the correct human poses. In order to heighten the accuracy of pose estimation, the method for the parallel network multi-scale fusion and that for generating high-quality feature maps were combined for human pose estimation. On the basis of human detection, RefinedHRNet adopted the method for parallel network multi-scale fusion to expand the receptive field in the stage using a dilated convolution module to maintain context information. In addition, RefinedHRNet employed a deconvolution module and an up-sampling module between stages to generate high-quality feature maps. Then, the parallel network feature maps with the highest resolution (1/4 of the input image size) were utilized for pose estimation. Finally, Object Keypoint Similarity (OKS) was used to evaluate the accuracy of keypoint recognition. Experimenting on the COCO2017 test set, the pose estimation accuracy of our proposed method RefinedHRNet is 0.4% higher than the HRNet network model.

投稿的翻译标题	Human pose estimation based on high-resolution net
源语言	繁体中文
页（从-至）	432-438
页数	7
期刊	Journal of Graphics
卷	42
期	3
DOI	https://doi.org/10.11996/JG.j.2095-302X.2021030432
出版状态	已出版 - 30 6月 2021

关键词

high-quality feature maps
human detection
multi-scale fusion
object keypoint similarity
pose estimation

访问文件

10.11996/JG.j.2095-302X.2021030432

其它文件与链接

链接到 Scopus 的出版物

引用此

@article{fccfc61ea165473c8076029be1c4dd4f,

title = "基于高分辨率网络的人体姿态估计方法",

abstract = "Human pose estimation plays a vital role in human-computer interaction and behavior recognition applications, but the changing scale of feature maps poses a challenge to the relevant methods in predicting the correct human poses. In order to heighten the accuracy of pose estimation, the method for the parallel network multi-scale fusion and that for generating high-quality feature maps were combined for human pose estimation. On the basis of human detection, RefinedHRNet adopted the method for parallel network multi-scale fusion to expand the receptive field in the stage using a dilated convolution module to maintain context information. In addition, RefinedHRNet employed a deconvolution module and an up-sampling module between stages to generate high-quality feature maps. Then, the parallel network feature maps with the highest resolution (1/4 of the input image size) were utilized for pose estimation. Finally, Object Keypoint Similarity (OKS) was used to evaluate the accuracy of keypoint recognition. Experimenting on the COCO2017 test set, the pose estimation accuracy of our proposed method RefinedHRNet is 0.4% higher than the HRNet network model.",

keywords = "high-quality feature maps, human detection, multi-scale fusion, object keypoint similarity, pose estimation",

author = "Ren, {Hao Pan} and Wang, {Wen Ming} and Wei, {De Jian} and Gao, {Yan Yan} and Kang, {Zhi Hui} and Wang, {Quan Yu}",

year = "2021",

month = jun,

day = "30",

doi = "10.11996/JG.j.2095-302X.2021030432",

language = "繁体中文",

volume = "42",

pages = "432--438",

journal = "Journal of Graphics",

issn = "2095-302X",

publisher = "Editorial of Board of Journal of Graphics",

number = "3",

}

TY - JOUR

T1 - 基于高分辨率网络的人体姿态估计方法

AU - Ren, Hao Pan

AU - Wang, Wen Ming

AU - Wei, De Jian

AU - Gao, Yan Yan

AU - Kang, Zhi Hui

AU - Wang, Quan Yu

PY - 2021/6/30

Y1 - 2021/6/30

N2 - Human pose estimation plays a vital role in human-computer interaction and behavior recognition applications, but the changing scale of feature maps poses a challenge to the relevant methods in predicting the correct human poses. In order to heighten the accuracy of pose estimation, the method for the parallel network multi-scale fusion and that for generating high-quality feature maps were combined for human pose estimation. On the basis of human detection, RefinedHRNet adopted the method for parallel network multi-scale fusion to expand the receptive field in the stage using a dilated convolution module to maintain context information. In addition, RefinedHRNet employed a deconvolution module and an up-sampling module between stages to generate high-quality feature maps. Then, the parallel network feature maps with the highest resolution (1/4 of the input image size) were utilized for pose estimation. Finally, Object Keypoint Similarity (OKS) was used to evaluate the accuracy of keypoint recognition. Experimenting on the COCO2017 test set, the pose estimation accuracy of our proposed method RefinedHRNet is 0.4% higher than the HRNet network model.

AB - Human pose estimation plays a vital role in human-computer interaction and behavior recognition applications, but the changing scale of feature maps poses a challenge to the relevant methods in predicting the correct human poses. In order to heighten the accuracy of pose estimation, the method for the parallel network multi-scale fusion and that for generating high-quality feature maps were combined for human pose estimation. On the basis of human detection, RefinedHRNet adopted the method for parallel network multi-scale fusion to expand the receptive field in the stage using a dilated convolution module to maintain context information. In addition, RefinedHRNet employed a deconvolution module and an up-sampling module between stages to generate high-quality feature maps. Then, the parallel network feature maps with the highest resolution (1/4 of the input image size) were utilized for pose estimation. Finally, Object Keypoint Similarity (OKS) was used to evaluate the accuracy of keypoint recognition. Experimenting on the COCO2017 test set, the pose estimation accuracy of our proposed method RefinedHRNet is 0.4% higher than the HRNet network model.

KW - high-quality feature maps

KW - human detection

KW - multi-scale fusion

KW - object keypoint similarity

KW - pose estimation

UR - http://www.scopus.com/inward/record.url?scp=85141705287&partnerID=8YFLogxK

U2 - 10.11996/JG.j.2095-302X.2021030432

DO - 10.11996/JG.j.2095-302X.2021030432

M3 - 文章

AN - SCOPUS:85141705287

SN - 2095-302X

VL - 42

SP - 432

EP - 438

JO - Journal of Graphics

JF - Journal of Graphics

IS - 3

ER -

基于高分辨率网络的人体姿态估计方法

摘要

关键词

访问文件

其它文件与链接

指纹

引用此