Video representation by dense trajectories motion map applied to human activity recognition

Sheeraz Arif; Tehseen Ul-Hassan; Fida Hussain; Jing Wang; Zesong Fei

doi:10.1080/1206212X.2018.1486001

Video representation by dense trajectories motion map applied to human activity recognition

Sheeraz Arif^*, Tehseen Ul-Hassan, Fida Hussain, Jing Wang, Zesong Fei

^*此作品的通讯作者

信息与电子学院

Beijing Institute of Technology

科研成果: 期刊稿件 › 文章 › 同行评审

3 引用（Scopus）

摘要

This paper introduces an efficient video representation method based on the dense trajectory motion map (DTM). We utilize the salient features of dense trajectories and motion descriptor to integrate the discriminative information of a video into a map. Firstly, we extract the dense trajectories features by using dense optical flow then multiple descriptors are computed along trajectories to capture appearance and motion information. This result is then integrated into frames difference to integrate entire discriminative information and motion energy to get our first motion map. For the final DTM each generated motion map will be integrated with the absolute frame difference of next two frames till the end of entire video. Finally, we process the resultant DTM by exploring the efficient long-term recurrent convolutional network module for encoding and action label generation. The developed approach is shown better and had comparable recognition results over the existing methods when applied to the publically available human action datasets.

源语言	英语
页（从-至）	474-484
页数	11
期刊	International Journal of Computers and Applications
卷	42
期	5
DOI	https://doi.org/10.1080/1206212X.2018.1486001
出版状态	已出版 - 3 7月 2020

访问文件

10.1080/1206212X.2018.1486001

其它文件与链接

链接到 Scopus 的出版物

引用此

@article{47add3b2ecc54e218efaab838cf4ea96,

title = "Video representation by dense trajectories motion map applied to human activity recognition",

abstract = "This paper introduces an efficient video representation method based on the dense trajectory motion map (DTM). We utilize the salient features of dense trajectories and motion descriptor to integrate the discriminative information of a video into a map. Firstly, we extract the dense trajectories features by using dense optical flow then multiple descriptors are computed along trajectories to capture appearance and motion information. This result is then integrated into frames difference to integrate entire discriminative information and motion energy to get our first motion map. For the final DTM each generated motion map will be integrated with the absolute frame difference of next two frames till the end of entire video. Finally, we process the resultant DTM by exploring the efficient long-term recurrent convolutional network module for encoding and action label generation. The developed approach is shown better and had comparable recognition results over the existing methods when applied to the publically available human action datasets.",

keywords = "Dense trajectories, LRCN, action recognition, dense motion map",

author = "Sheeraz Arif and Tehseen Ul-Hassan and Fida Hussain and Jing Wang and Zesong Fei",

note = "Publisher Copyright: {\textcopyright} 2018, {\textcopyright} 2018 Informa UK Limited, trading as Taylor & Francis Group.",

year = "2020",

month = jul,

day = "3",

doi = "10.1080/1206212X.2018.1486001",

language = "English",

volume = "42",

pages = "474--484",

journal = "International Journal of Computers and Applications",

issn = "1206-212X",

publisher = "Taylor and Francis Ltd.",

number = "5",

}

TY - JOUR

T1 - Video representation by dense trajectories motion map applied to human activity recognition

AU - Arif, Sheeraz

AU - Ul-Hassan, Tehseen

AU - Hussain, Fida

AU - Wang, Jing

AU - Fei, Zesong

PY - 2020/7/3

Y1 - 2020/7/3

N2 - This paper introduces an efficient video representation method based on the dense trajectory motion map (DTM). We utilize the salient features of dense trajectories and motion descriptor to integrate the discriminative information of a video into a map. Firstly, we extract the dense trajectories features by using dense optical flow then multiple descriptors are computed along trajectories to capture appearance and motion information. This result is then integrated into frames difference to integrate entire discriminative information and motion energy to get our first motion map. For the final DTM each generated motion map will be integrated with the absolute frame difference of next two frames till the end of entire video. Finally, we process the resultant DTM by exploring the efficient long-term recurrent convolutional network module for encoding and action label generation. The developed approach is shown better and had comparable recognition results over the existing methods when applied to the publically available human action datasets.

AB - This paper introduces an efficient video representation method based on the dense trajectory motion map (DTM). We utilize the salient features of dense trajectories and motion descriptor to integrate the discriminative information of a video into a map. Firstly, we extract the dense trajectories features by using dense optical flow then multiple descriptors are computed along trajectories to capture appearance and motion information. This result is then integrated into frames difference to integrate entire discriminative information and motion energy to get our first motion map. For the final DTM each generated motion map will be integrated with the absolute frame difference of next two frames till the end of entire video. Finally, we process the resultant DTM by exploring the efficient long-term recurrent convolutional network module for encoding and action label generation. The developed approach is shown better and had comparable recognition results over the existing methods when applied to the publically available human action datasets.

KW - Dense trajectories

KW - LRCN

KW - action recognition

KW - dense motion map

UR - http://www.scopus.com/inward/record.url?scp=85050518402&partnerID=8YFLogxK

U2 - 10.1080/1206212X.2018.1486001

DO - 10.1080/1206212X.2018.1486001

M3 - Article

AN - SCOPUS:85050518402

SN - 1206-212X

VL - 42

SP - 474

EP - 484

JO - International Journal of Computers and Applications

JF - International Journal of Computers and Applications

IS - 5

ER -

Video representation by dense trajectories motion map applied to human activity recognition

摘要

访问文件

其它文件与链接

指纹

引用此