A Global to Local Double Embedding Method for Multi-person Pose Estimation

Yiming Xu; Jiaxin Li; Yan Ding; Hua Liang Wei

doi:10.1007/978-3-030-69541-5_6

A Global to Local Double Embedding Method for Multi-person Pose Estimation

Yiming Xu, Jiaxin Li, Yan Ding^*, Hua Liang Wei

^*此作品的通讯作者

宇航学院

科研成果: 书/报告/会议事项章节 › 会议稿件 › 同行评审

摘要

Multi-person pose estimation is a fundamental and challenging problem to many computer vision tasks. Most existing methods can be broadly categorized into two classes: top-down and bottom-up methods. Both of the two types of methods involve two stages, namely, person detection and joints detection. Conventionally, the two stages are implemented separately without considering their interactions between them, and this may inevitably cause some issue intrinsically. In this paper, we present a novel method to simplify the pipeline by implementing person detection and joints detection simultaneously. We propose a Double Embedding (DE) method to complete the multi-person pose estimation task in a global-to-local way. DE consists of Global Embedding (GE) and Local Embedding (LE). GE encodes different person instances and processes information covering the whole image and LE encodes the local limbs information. GE functions for the person detection in top-down strategy while LE connects the rest joints sequentially which functions for joint grouping and information processing in A bottom-up strategy. Based on LE, we design the Mutual Refine Machine (MRM) to reduce the prediction difficulty in complex scenarios. MRM can effectively realize the information communicating between keypoints and further improve the accuracy. We achieve the competitive results on benchmarks MSCOCO, MPII and CrowdPose, demonstrating the effectiveness and generalization ability of our method.

源语言	英语
主期刊名	Computer Vision – ACCV 2020 - 15th Asian Conference on Computer Vision, 2020, Revised Selected Papers
编辑	Hiroshi Ishikawa, Cheng-Lin Liu, Tomas Pajdla, Jianbo Shi
出版商	Springer Science and Business Media Deutschland GmbH
页	88-103
页数	16
ISBN（印刷版）	9783030695408
DOI	https://doi.org/10.1007/978-3-030-69541-5_6
出版状态	已出版 - 2021
活动	15th Asian Conference on Computer Vision, ACCV 2020 - Virtual, Online 期限: 30 11月 2020 → 4 12月 2020

出版系列

姓名	Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
卷	12626 LNCS
ISSN（印刷版）	0302-9743
ISSN（电子版）	1611-3349

会议

会议	15th Asian Conference on Computer Vision, ACCV 2020
市	Virtual, Online
时期	30/11/20 → 4/12/20

访问文件

10.1007/978-3-030-69541-5_6

其它文件与链接

链接到 Scopus 的出版物

引用此

Xu, Y., Li, J., Ding, Y., & Wei, H. L. (2021). A Global to Local Double Embedding Method for Multi-person Pose Estimation. 在 H. Ishikawa, C.-L. Liu, T. Pajdla, & J. Shi (编辑), Computer Vision – ACCV 2020 - 15th Asian Conference on Computer Vision, 2020, Revised Selected Papers (页码 88-103). (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); 卷 12626 LNCS). Springer Science and Business Media Deutschland GmbH. https://doi.org/10.1007/978-3-030-69541-5_6

Xu, Yiming ; Li, Jiaxin ; Ding, Yan 等. / A Global to Local Double Embedding Method for Multi-person Pose Estimation. Computer Vision – ACCV 2020 - 15th Asian Conference on Computer Vision, 2020, Revised Selected Papers. 编辑 / Hiroshi Ishikawa ; Cheng-Lin Liu ; Tomas Pajdla ; Jianbo Shi. Springer Science and Business Media Deutschland GmbH, 2021. 页码 88-103 (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)).

@inproceedings{850825c65f304921b9ca4714b1c0ddc7,

title = "A Global to Local Double Embedding Method for Multi-person Pose Estimation",

abstract = "Multi-person pose estimation is a fundamental and challenging problem to many computer vision tasks. Most existing methods can be broadly categorized into two classes: top-down and bottom-up methods. Both of the two types of methods involve two stages, namely, person detection and joints detection. Conventionally, the two stages are implemented separately without considering their interactions between them, and this may inevitably cause some issue intrinsically. In this paper, we present a novel method to simplify the pipeline by implementing person detection and joints detection simultaneously. We propose a Double Embedding (DE) method to complete the multi-person pose estimation task in a global-to-local way. DE consists of Global Embedding (GE) and Local Embedding (LE). GE encodes different person instances and processes information covering the whole image and LE encodes the local limbs information. GE functions for the person detection in top-down strategy while LE connects the rest joints sequentially which functions for joint grouping and information processing in A bottom-up strategy. Based on LE, we design the Mutual Refine Machine (MRM) to reduce the prediction difficulty in complex scenarios. MRM can effectively realize the information communicating between keypoints and further improve the accuracy. We achieve the competitive results on benchmarks MSCOCO, MPII and CrowdPose, demonstrating the effectiveness and generalization ability of our method.",

author = "Yiming Xu and Jiaxin Li and Yan Ding and Wei, {Hua Liang}",

note = "Publisher Copyright: {\textcopyright} 2021, Springer Nature Switzerland AG.; 15th Asian Conference on Computer Vision, ACCV 2020 ; Conference date: 30-11-2020 Through 04-12-2020",

year = "2021",

doi = "10.1007/978-3-030-69541-5_6",

language = "English",

isbn = "9783030695408",

series = "Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)",

publisher = "Springer Science and Business Media Deutschland GmbH",

pages = "88--103",

editor = "Hiroshi Ishikawa and Cheng-Lin Liu and Tomas Pajdla and Jianbo Shi",

booktitle = "Computer Vision – ACCV 2020 - 15th Asian Conference on Computer Vision, 2020, Revised Selected Papers",

address = "Germany",

}

Xu, Y, Li, J, Ding, Y & Wei, HL 2021, A Global to Local Double Embedding Method for Multi-person Pose Estimation. 在 H Ishikawa, C-L Liu, T Pajdla & J Shi (编辑), Computer Vision – ACCV 2020 - 15th Asian Conference on Computer Vision, 2020, Revised Selected Papers. Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), 卷 12626 LNCS, Springer Science and Business Media Deutschland GmbH, 页码 88-103, 15th Asian Conference on Computer Vision, ACCV 2020, Virtual, Online, 30/11/20. https://doi.org/10.1007/978-3-030-69541-5_6

A Global to Local Double Embedding Method for Multi-person Pose Estimation. / Xu, Yiming; Li, Jiaxin; Ding, Yan 等.
Computer Vision – ACCV 2020 - 15th Asian Conference on Computer Vision, 2020, Revised Selected Papers. 编辑 / Hiroshi Ishikawa; Cheng-Lin Liu; Tomas Pajdla; Jianbo Shi. Springer Science and Business Media Deutschland GmbH, 2021. 页码 88-103 (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); 卷 12626 LNCS).

科研成果: 书/报告/会议事项章节 › 会议稿件 › 同行评审

TY - GEN

T1 - A Global to Local Double Embedding Method for Multi-person Pose Estimation

AU - Xu, Yiming

AU - Li, Jiaxin

AU - Ding, Yan

AU - Wei, Hua Liang

PY - 2021

Y1 - 2021

N2 - Multi-person pose estimation is a fundamental and challenging problem to many computer vision tasks. Most existing methods can be broadly categorized into two classes: top-down and bottom-up methods. Both of the two types of methods involve two stages, namely, person detection and joints detection. Conventionally, the two stages are implemented separately without considering their interactions between them, and this may inevitably cause some issue intrinsically. In this paper, we present a novel method to simplify the pipeline by implementing person detection and joints detection simultaneously. We propose a Double Embedding (DE) method to complete the multi-person pose estimation task in a global-to-local way. DE consists of Global Embedding (GE) and Local Embedding (LE). GE encodes different person instances and processes information covering the whole image and LE encodes the local limbs information. GE functions for the person detection in top-down strategy while LE connects the rest joints sequentially which functions for joint grouping and information processing in A bottom-up strategy. Based on LE, we design the Mutual Refine Machine (MRM) to reduce the prediction difficulty in complex scenarios. MRM can effectively realize the information communicating between keypoints and further improve the accuracy. We achieve the competitive results on benchmarks MSCOCO, MPII and CrowdPose, demonstrating the effectiveness and generalization ability of our method.

AB - Multi-person pose estimation is a fundamental and challenging problem to many computer vision tasks. Most existing methods can be broadly categorized into two classes: top-down and bottom-up methods. Both of the two types of methods involve two stages, namely, person detection and joints detection. Conventionally, the two stages are implemented separately without considering their interactions between them, and this may inevitably cause some issue intrinsically. In this paper, we present a novel method to simplify the pipeline by implementing person detection and joints detection simultaneously. We propose a Double Embedding (DE) method to complete the multi-person pose estimation task in a global-to-local way. DE consists of Global Embedding (GE) and Local Embedding (LE). GE encodes different person instances and processes information covering the whole image and LE encodes the local limbs information. GE functions for the person detection in top-down strategy while LE connects the rest joints sequentially which functions for joint grouping and information processing in A bottom-up strategy. Based on LE, we design the Mutual Refine Machine (MRM) to reduce the prediction difficulty in complex scenarios. MRM can effectively realize the information communicating between keypoints and further improve the accuracy. We achieve the competitive results on benchmarks MSCOCO, MPII and CrowdPose, demonstrating the effectiveness and generalization ability of our method.

UR - http://www.scopus.com/inward/record.url?scp=85103346984&partnerID=8YFLogxK

U2 - 10.1007/978-3-030-69541-5_6

DO - 10.1007/978-3-030-69541-5_6

M3 - Conference contribution

AN - SCOPUS:85103346984

SN - 9783030695408

T3 - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)

SP - 88

EP - 103

BT - Computer Vision – ACCV 2020 - 15th Asian Conference on Computer Vision, 2020, Revised Selected Papers

A2 - Ishikawa, Hiroshi

A2 - Liu, Cheng-Lin

A2 - Pajdla, Tomas

A2 - Shi, Jianbo

PB - Springer Science and Business Media Deutschland GmbH

T2 - 15th Asian Conference on Computer Vision, ACCV 2020

Y2 - 30 November 2020 through 4 December 2020

ER -

Xu Y, Li J, Ding Y, Wei HL. A Global to Local Double Embedding Method for Multi-person Pose Estimation. 在 Ishikawa H, Liu CL, Pajdla T, Shi J, 编辑, Computer Vision – ACCV 2020 - 15th Asian Conference on Computer Vision, 2020, Revised Selected Papers. Springer Science and Business Media Deutschland GmbH. 2021. 页码 88-103. (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)). doi: 10.1007/978-3-030-69541-5_6

A Global to Local Double Embedding Method for Multi-person Pose Estimation

摘要

出版系列

会议

访问文件

其它文件与链接

指纹

引用此