Coupling-and-decoupling: A hierarchical model for occlusion-free object detection

Bo Li, Xi Song, Tianfu Wu*, Wenze Hu, Mingtao Pei

*Corresponding author for this work

Research output: Contribution to journalArticlepeer-review

7 Citations (Scopus)

Abstract

Handling occlusion is a very challenging problem in object detection. This paper presents a method of learning a hierarchical model for X-to-X occlusion-free object detection (e.g., car-to-car and person-to-person occlusions in our experiments). The proposed method is motivated by an intuitive coupling-and-decoupling strategy. In the learning stage, the pair of occluding X's (e.g., car pairs or person pairs) is represented directly and jointly by a hierarchical And-Or directed acyclic graph (AOG) which accounts for the statistically significant co-occurrence (i.e., coupling). The structure and the parameters of the AOG are learned using the latent structural SVM (LSSVM) framework. In detection, a dynamic programming (DP) algorithm is utilized to find the best parse trees for all sliding windows with detection scores being greater than the learned threshold. Then, the two single X's are decoupled from the declared detections of X-to-X occluding pairs together with some non-maximum suppression (NMS) post-processing. In experiments, our method is tested on both a roadside-car dataset collected by ourselves (which will be released with this paper) and two public person datasets, the MPII-2Person dataset and the TUD-Crossing dataset. Our method is compared with state-of-the-art deformable part-based methods, and obtains comparable or better detection performance.

Original languageEnglish
Pages (from-to)3254-3264
Number of pages11
JournalPattern Recognition
Volume47
Issue number10
DOIs
Publication statusPublished - Oct 2014

Keywords

  • And-Or graph
  • Deformable part-based model
  • Latent structural SVM
  • Object detection
  • Occlusion modeling

Fingerprint

Dive into the research topics of 'Coupling-and-decoupling: A hierarchical model for occlusion-free object detection'. Together they form a unique fingerprint.

Cite this