Inferring social roles in long timespan video sequence

Jiangen Zhang*, Wenze Hu, Benjamin Yao, Yongtian Wang, Song Chun Zhu

*Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution › peer-review

6 citations (Scopus)

Abstract

In this paper, we present a method for inferring social roles of agents (persons) from their daily activities in long surveillance video sequences. We define activities as interactions between an agent's position and semantic hotspots within the scene. Given a surveillance video, our method first tracks the locations of agents and then automatically discovers semantic hotspots in the scene. By enumerating spatial/temporal relations between an agent's feet and hotspots in a scene, we define a set of atomic actions, which in turn compose sub-events and events. The numbers and types of events performed by an agent are assumed to be driven by his/her social role. With the grammar model induced by composition rules, an adapted Earley parser algorithm is used to parse the trajectories into events, sub-events and atomic actions. With probabilistic output of events, the roles of agents can be predicted under the Bayesian inference framework. Experiments are carried out on a challenging 8.5-hour video from a surveillance camera in the lobby of a research lab. The video contains 7 different social roles: manager, researcher, developer, engineer, staff, visitor and mailman. Results show that our proposed method can predict the role of each agent with high precision.
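The final step described in the abstract, predicting a role from parsed events under Bayesian inference, can be illustrated with a minimal sketch. The abstract does not specify the likelihood model, so this sketch assumes events are independent given the role (a multinomial-style likelihood) and a uniform prior. The role subset, the event names, and every probability below are hypothetical values chosen for illustration, not figures from the paper.

```python
import math

# Hypothetical subset of roles and event types; the abstract lists seven
# roles, only three are used here to keep the sketch short.
ROLES = ["researcher", "visitor", "mailman"]

# Assumed uniform prior P(role).
PRIOR = {role: 1.0 / len(ROLES) for role in ROLES}

# Assumed per-role event distributions P(event | role); made-up numbers.
LIKELIHOOD = {
    "researcher": {"enter_office": 0.60, "use_printer": 0.35, "deliver_mail": 0.05},
    "visitor":    {"enter_office": 0.80, "use_printer": 0.10, "deliver_mail": 0.10},
    "mailman":    {"enter_office": 0.10, "use_printer": 0.05, "deliver_mail": 0.85},
}


def role_posterior(event_counts):
    """Return P(role | events), assuming events are independent given the role.

    event_counts maps an event name to a count; counts may be fractional
    to stand in for soft (probabilistic) event detections.
    """
    log_scores = {}
    for role in ROLES:
        log_p = math.log(PRIOR[role])
        for event, count in event_counts.items():
            log_p += count * math.log(LIKELIHOOD[role][event])
        log_scores[role] = log_p

    # Normalize in log space (log-sum-exp) to avoid underflow on long sequences.
    max_log = max(log_scores.values())
    unnormalized = {r: math.exp(s - max_log) for r, s in log_scores.items()}
    total = sum(unnormalized.values())
    return {r: v / total for r, v in unnormalized.items()}


def predict_role(event_counts):
    """Return the role with the highest posterior probability."""
    posterior = role_posterior(event_counts)
    return max(posterior, key=posterior.get)


if __name__ == "__main__":
    observed = {"deliver_mail": 3, "enter_office": 1}
    print(role_posterior(observed))
    print(predict_role(observed))
```

In this sketch the event counts would come from the parser output; accepting fractional counts is one simple way to carry event probabilities through to the role posterior, though the paper may combine them differently.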

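Original language: English
Title of host publication: 2011 IEEE International Conference on Computer Vision Workshops, ICCV Workshops 2011
Pages: 1456-1463
Number of pages: 8
DOI: https://doi.org/10.1109/ICCVW.2011.6130422
Publication status: Published - 2011
Event: 2011 IEEE International Conference on Computer Vision Workshops, ICCV Workshops 2011 - Barcelona, Spain
Duration: 6 Nov 2011 to 13 Nov 2011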

Publication series

Name: Proceedings of the IEEE International Conference on Computer Vision

Conference

Conference: 2011 IEEE International Conference on Computer Vision Workshops, ICCV Workshops 2011
Country/Territory: Spain
City: Barcelona
Period: 6 Nov 2011 to 13 Nov 2011

Cite this

Zhang, J., Hu, W., Yao, B., Wang, Y., & Zhu, S. C. (2011). Inferring social roles in long timespan video sequence. In 2011 IEEE International Conference on Computer Vision Workshops, ICCV Workshops 2011 (pp. 1456-1463). Article 6130422 (Proceedings of the IEEE International Conference on Computer Vision). https://doi.org/10.1109/ICCVW.2011.6130422