Region-based Mixture Models for human action recognition in low-resolution videos

Ying Zhao; Huijun Di; Jian Zhang; Yao Lu; Feng Lv; Yufang Li

doi:10.1016/j.neucom.2017.03.033

Region-based Mixture Models for human action recognition in low-resolution videos

Ying Zhao, Huijun Di, Jian Zhang, Yao Lu^*, Feng Lv, Yufang Li

^*此作品的通讯作者

计算机学院

科研成果: 期刊稿件 › 文章 › 同行评审

14 引用（Scopus）

摘要

State-of-the-art performance in human action recognition is achieved by the use of dense trajectories which are extracted by optical flow algorithms. However, optical flow algorithms are far from perfect in low-resolution (LR) videos. In addition, the spatial and temporal layout of features is a powerful cue for action discrimination. While, most existing methods encode the layout by previously segmenting body parts which is not feasible in LR videos. Addressing the problems, we adopt the Layered Elastic Motion Tracking (LEMT) method to extract a set of long-term motion trajectories and a long-term common shape from each video sequence, where the extracted trajectories are much denser than those of sparse interest points (SIPs); then we present a hybrid feature representation to integrate both of the shape and motion features; and finally we propose a Region-based Mixture Model (RMM) to be utilized for action classification. The RMM encodes the spatial layout of features without any needs of body parts segmentation. Experimental results show that the approach is effective and, more importantly, the approach is more general for LR recognition tasks.

源语言	英语
页（从-至）	1-15
页数	15
期刊	Neurocomputing
卷	247
DOI	https://doi.org/10.1016/j.neucom.2017.03.033
出版状态	已出版 - 19 7月 2017

访问文件

10.1016/j.neucom.2017.03.033

其它文件与链接

链接到 Scopus 的出版物

引用此

@article{75e1bc7697964db9a486ed85804765d9,

title = "Region-based Mixture Models for human action recognition in low-resolution videos",

abstract = "State-of-the-art performance in human action recognition is achieved by the use of dense trajectories which are extracted by optical flow algorithms. However, optical flow algorithms are far from perfect in low-resolution (LR) videos. In addition, the spatial and temporal layout of features is a powerful cue for action discrimination. While, most existing methods encode the layout by previously segmenting body parts which is not feasible in LR videos. Addressing the problems, we adopt the Layered Elastic Motion Tracking (LEMT) method to extract a set of long-term motion trajectories and a long-term common shape from each video sequence, where the extracted trajectories are much denser than those of sparse interest points (SIPs); then we present a hybrid feature representation to integrate both of the shape and motion features; and finally we propose a Region-based Mixture Model (RMM) to be utilized for action classification. The RMM encodes the spatial layout of features without any needs of body parts segmentation. Experimental results show that the approach is effective and, more importantly, the approach is more general for LR recognition tasks.",

keywords = "Action recognition, Elastic motion tracking, Expectation Maximization (EM) algorithm, Low-resolution, Mixture model",

author = "Ying Zhao and Huijun Di and Jian Zhang and Yao Lu and Feng Lv and Yufang Li",

note = "Publisher Copyright: {\textcopyright} 2017",

year = "2017",

month = jul,

day = "19",

doi = "10.1016/j.neucom.2017.03.033",

language = "English",

volume = "247",

pages = "1--15",

journal = "Neurocomputing",

issn = "0925-2312",

publisher = "Elsevier B.V.",

}

TY - JOUR

T1 - Region-based Mixture Models for human action recognition in low-resolution videos

AU - Zhao, Ying

AU - Di, Huijun

AU - Zhang, Jian

AU - Lu, Yao

AU - Lv, Feng

AU - Li, Yufang

PY - 2017/7/19

Y1 - 2017/7/19

N2 - State-of-the-art performance in human action recognition is achieved by the use of dense trajectories which are extracted by optical flow algorithms. However, optical flow algorithms are far from perfect in low-resolution (LR) videos. In addition, the spatial and temporal layout of features is a powerful cue for action discrimination. While, most existing methods encode the layout by previously segmenting body parts which is not feasible in LR videos. Addressing the problems, we adopt the Layered Elastic Motion Tracking (LEMT) method to extract a set of long-term motion trajectories and a long-term common shape from each video sequence, where the extracted trajectories are much denser than those of sparse interest points (SIPs); then we present a hybrid feature representation to integrate both of the shape and motion features; and finally we propose a Region-based Mixture Model (RMM) to be utilized for action classification. The RMM encodes the spatial layout of features without any needs of body parts segmentation. Experimental results show that the approach is effective and, more importantly, the approach is more general for LR recognition tasks.

AB - State-of-the-art performance in human action recognition is achieved by the use of dense trajectories which are extracted by optical flow algorithms. However, optical flow algorithms are far from perfect in low-resolution (LR) videos. In addition, the spatial and temporal layout of features is a powerful cue for action discrimination. While, most existing methods encode the layout by previously segmenting body parts which is not feasible in LR videos. Addressing the problems, we adopt the Layered Elastic Motion Tracking (LEMT) method to extract a set of long-term motion trajectories and a long-term common shape from each video sequence, where the extracted trajectories are much denser than those of sparse interest points (SIPs); then we present a hybrid feature representation to integrate both of the shape and motion features; and finally we propose a Region-based Mixture Model (RMM) to be utilized for action classification. The RMM encodes the spatial layout of features without any needs of body parts segmentation. Experimental results show that the approach is effective and, more importantly, the approach is more general for LR recognition tasks.

KW - Action recognition

KW - Elastic motion tracking

KW - Expectation Maximization (EM) algorithm

KW - Low-resolution

KW - Mixture model

UR - http://www.scopus.com/inward/record.url?scp=85017164854&partnerID=8YFLogxK

U2 - 10.1016/j.neucom.2017.03.033

DO - 10.1016/j.neucom.2017.03.033

M3 - Article

AN - SCOPUS:85017164854

SN - 0925-2312

VL - 247

SP - 1

EP - 15

JO - Neurocomputing

JF - Neurocomputing

ER -

Region-based Mixture Models for human action recognition in low-resolution videos

摘要

访问文件

其它文件与链接

指纹

引用此