摘要
In this paper, we present a model for learning atomic actions for complex activities classification. A video sequence is first represented by a collection of visual interest points. The model automatically clusters visual words into atomic actions based on their co-occurrence and temporal proximity using an extension of Hierarchical Dirichlet Process (HDP) mixture model. Our approach is robust to noisy interest points caused by various conditions because HDP is a generative model. Based on the atomic actions learned from our model, we use both a Naive Bayesian and a linear SVM classifier for activity classification. We first use a synthetic example to demonstrate the intermediate result, then we apply on the complex Olympic Sport 16-class dataset and show that our model outperforms other state-of-art methods.
源语言 | 英语 |
---|---|
文章编号 | 6298410 |
页(从-至) | 278-283 |
页数 | 6 |
期刊 | Proceedings - IEEE International Conference on Multimedia and Expo |
DOI | |
出版状态 | 已出版 - 2012 |
活动 | 2012 13th IEEE International Conference on Multimedia and Expo, ICME 2012 - Melbourne, VIC, 澳大利亚 期限: 9 7月 2012 → 13 7月 2012 |