Abstract
In this paper, we present a model for learning atomic actions for the classification of complex activities. A video sequence is first represented by a collection of visual interest points. The model automatically clusters visual words into atomic actions based on their co-occurrence and temporal proximity, using an extension of the Hierarchical Dirichlet Process (HDP) mixture model. Because the HDP is a generative model, our approach is robust to noisy interest points arising under various conditions. Based on the atomic actions learned by our model, we use both a Naive Bayes classifier and a linear SVM for activity classification. We first use a synthetic example to demonstrate the intermediate results, and then apply the model to the complex 16-class Olympic Sports dataset, showing that it outperforms other state-of-the-art methods.
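As a rough illustration of the final stage described above, the sketch below shows how videos, once summarised as histograms over learned atomic actions, could be fed to the two classifiers mentioned in the abstract. This is not the authors' implementation: the atomic-action histograms here are synthetic placeholders (the HDP learning step is assumed to have already produced them), and scikit-learn's `MultinomialNB` and `LinearSVC` stand in for the Naive Bayes and linear SVM classifiers.

```python
# Minimal sketch, assuming each video has already been reduced to a
# histogram of counts over K learned atomic actions (placeholder data below).
import numpy as np
from sklearn.model_selection import train_test_split
from sklearn.naive_bayes import MultinomialNB
from sklearn.svm import LinearSVC
from sklearn.metrics import accuracy_score

rng = np.random.default_rng(0)
n_videos, n_atomic_actions, n_classes = 320, 40, 16  # 16 classes, as in Olympic Sports

# Hypothetical bag-of-atomic-actions counts: one histogram per video.
X = rng.poisson(lam=3.0, size=(n_videos, n_atomic_actions))
y = rng.integers(0, n_classes, size=n_videos)  # placeholder activity labels

X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.25, random_state=0)

# Fit the two classifiers named in the abstract and report test accuracy.
for name, clf in [("Naive Bayes", MultinomialNB()),
                  ("Linear SVM", LinearSVC(C=1.0, max_iter=10000))]:
    clf.fit(X_train, y_train)
    acc = accuracy_score(y_test, clf.predict(X_test))
    print(f"{name}: accuracy = {acc:.3f}")
```

With real HDP-derived histograms in place of the random counts, the same two-classifier comparison would reproduce the evaluation protocol outlined in the abstract.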
Original language | English |
---|---|
Article number | 6298410 |
Pages (from-to) | 278-283 |
Number of pages | 6 |
Journal | Proceedings - IEEE International Conference on Multimedia and Expo |
DOIs | |
Publication status | Published - 2012 |
Event | 2012 13th IEEE International Conference on Multimedia and Expo, ICME 2012, Melbourne, VIC, Australia; duration: 9 Jul 2012 → 13 Jul 2012 |
Keywords
- Activity classification
- atomic action
- temporal relation