A Medium Granularity Model for Human Pose Estimation in Video

Qing Xuan Shi; Hui Jun Di; Yao Lu; Xue Dong Tian

doi:10.16383/j.aas.2018.c160847

A Medium Granularity Model for Human Pose Estimation in Video

Qing Xuan Shi, Hui Jun Di, Yao Lu^*, Xue Dong Tian

^*Corresponding author for this work

School of Computer Science and Technology

Research output: Contribution to journal › Article › peer-review

4 Citations (Scopus)

Abstract

Human pose estimation has attracted much attention in the computer vision community due to its potential applications in action recognition, human-computer interaction, etc. To focus on pose estimation in videos, a medium granularity spatio-temporal probabilistic graphical model using body part tracklets as entities is presented in this paper. The optimal tracklet for each body part is acquired by spatiotemporal approximate reasoning through iterative spatial and temporal parsing, and the final human pose estimation is achieved by merging these optimal tracklets. To generate reliable tracklet proposals, global motion cue is adopted to propagate pose detections from individual frames to the whole video, and the trajectories from this propagation are segmented into fixed-length overlapping tracklets. To deal with the double counting problem, symmetric parts are coupled to one virtual node, so that the loops in spatial model are removed and the constaints between symmetric parts are maintained. The experiment on three datasets shows the proposed method achieves a higher accuracy than other pose estimation methods.

Original language	English
Pages (from-to)	646-655
Number of pages	10
Journal	Zidonghua Xuebao/Acta Automatica Sinica
Volume	44
Issue number	4
DOIs	https://doi.org/10.16383/j.aas.2018.c160847
Publication status	Published - Apr 2018

Keywords

Hidden Markov model
Human pose estimation
Markov random field
Medium granularity model

Access to Document

10.16383/j.aas.2018.c160847

Cite this

Shi, Q. X., Di, H. J., Lu, Y., & Tian, X. D. (2018). A Medium Granularity Model for Human Pose Estimation in Video. Zidonghua Xuebao/Acta Automatica Sinica, 44(4), 646-655. https://doi.org/10.16383/j.aas.2018.c160847

@article{3af0e430412a4b15a1a85b9516c2994f,

title = "A Medium Granularity Model for Human Pose Estimation in Video",

abstract = "Human pose estimation has attracted much attention in the computer vision community due to its potential applications in action recognition, human-computer interaction, etc. To focus on pose estimation in videos, a medium granularity spatio-temporal probabilistic graphical model using body part tracklets as entities is presented in this paper. The optimal tracklet for each body part is acquired by spatiotemporal approximate reasoning through iterative spatial and temporal parsing, and the final human pose estimation is achieved by merging these optimal tracklets. To generate reliable tracklet proposals, global motion cue is adopted to propagate pose detections from individual frames to the whole video, and the trajectories from this propagation are segmented into fixed-length overlapping tracklets. To deal with the double counting problem, symmetric parts are coupled to one virtual node, so that the loops in spatial model are removed and the constaints between symmetric parts are maintained. The experiment on three datasets shows the proposed method achieves a higher accuracy than other pose estimation methods.",

keywords = "Hidden Markov model, Human pose estimation, Markov random field, Medium granularity model",

author = "Shi, {Qing Xuan} and Di, {Hui Jun} and Yao Lu and Tian, {Xue Dong}",

year = "2018",

month = apr,

doi = "10.16383/j.aas.2018.c160847",

language = "English",

volume = "44",

pages = "646--655",

journal = "Zidonghua Xuebao/Acta Automatica Sinica",

issn = "0254-4156",

publisher = "Science Press",

number = "4",

}

TY - JOUR

T1 - A Medium Granularity Model for Human Pose Estimation in Video

AU - Shi, Qing Xuan

AU - Di, Hui Jun

AU - Lu, Yao

AU - Tian, Xue Dong

PY - 2018/4

Y1 - 2018/4

N2 - Human pose estimation has attracted much attention in the computer vision community due to its potential applications in action recognition, human-computer interaction, etc. To focus on pose estimation in videos, a medium granularity spatio-temporal probabilistic graphical model using body part tracklets as entities is presented in this paper. The optimal tracklet for each body part is acquired by spatiotemporal approximate reasoning through iterative spatial and temporal parsing, and the final human pose estimation is achieved by merging these optimal tracklets. To generate reliable tracklet proposals, global motion cue is adopted to propagate pose detections from individual frames to the whole video, and the trajectories from this propagation are segmented into fixed-length overlapping tracklets. To deal with the double counting problem, symmetric parts are coupled to one virtual node, so that the loops in spatial model are removed and the constaints between symmetric parts are maintained. The experiment on three datasets shows the proposed method achieves a higher accuracy than other pose estimation methods.

AB - Human pose estimation has attracted much attention in the computer vision community due to its potential applications in action recognition, human-computer interaction, etc. To focus on pose estimation in videos, a medium granularity spatio-temporal probabilistic graphical model using body part tracklets as entities is presented in this paper. The optimal tracklet for each body part is acquired by spatiotemporal approximate reasoning through iterative spatial and temporal parsing, and the final human pose estimation is achieved by merging these optimal tracklets. To generate reliable tracklet proposals, global motion cue is adopted to propagate pose detections from individual frames to the whole video, and the trajectories from this propagation are segmented into fixed-length overlapping tracklets. To deal with the double counting problem, symmetric parts are coupled to one virtual node, so that the loops in spatial model are removed and the constaints between symmetric parts are maintained. The experiment on three datasets shows the proposed method achieves a higher accuracy than other pose estimation methods.

KW - Hidden Markov model

KW - Human pose estimation

KW - Markov random field

KW - Medium granularity model

UR - http://www.scopus.com/inward/record.url?scp=85049516841&partnerID=8YFLogxK

U2 - 10.16383/j.aas.2018.c160847

DO - 10.16383/j.aas.2018.c160847

M3 - Article

AN - SCOPUS:85049516841

SN - 0254-4156

VL - 44

SP - 646

EP - 655

JO - Zidonghua Xuebao/Acta Automatica Sinica

JF - Zidonghua Xuebao/Acta Automatica Sinica

IS - 4

ER -

A Medium Granularity Model for Human Pose Estimation in Video

Abstract

Keywords

Access to Document

Other files and links

Fingerprint

Cite this