Revisiting color-event based tracking: A unified network, dataset, and metric

  • Chuanming Tang
  • , Xiao Wang*
  • , Ju Huang
  • , Bo Jiang
  • , Lin Zhu
  • , Shifeng Chen
  • , Jianlin Zhang
  • , Yaowei Wang
  • , Yonghong Tian
  • *Corresponding author for this work

Research output: Contribution to journalArticlepeer-review

Abstract

Combining Color and Event cameras (also called Dynamic Vision Sensors, DVS) for robust object tracking is a newly emerging research topic in recent years. Existing color-event tracking frameworks usually contain multiple scattered modules which may lead to low efficiency and high computational complexity, including feature extraction, fusion, matching, interactive learning, etc. In this paper, we propose a single-stage backbone network for Color-Event Unified Tracking (CEUTrack) that achieves the above functions simultaneously. Given the event points and color frames, we first transform the points into voxels and crop the template and search regions for both modalities, respectively. Then, these regions are projected into tokens and jointly fed into the adaptive vision Transformer network. The output features will be fed into a tracking head for target object localization. Our proposed CEUTrack is simple, effective, and efficient, achieving over 75 FPS and SOTA performance. To better validate the effectiveness of our model and address the data deficiency of the color-event tracking task, we propose a generic and large-scale benchmark dataset for color-event tracking, termed COESOT, which contains 90 categories and 1354 video sequences. Furthermore, a new evaluation criterion has been proposed, aiming to better assess tracking results by measuring the difficulty level of video frames. We hope the newly proposed method and dataset provide a better platform for color-event-based tracking. The dataset, toolkit, and source code have been released on https://github.com/Event-AHU/COESOT.

Original languageEnglish
Article number112718
JournalPattern Recognition
Volume172
DOIs
Publication statusPublished - Apr 2026

Keywords

  • Color-event tracking
  • Dataset and unified network
  • Visual tracking

Fingerprint

Dive into the research topics of 'Revisiting color-event based tracking: A unified network, dataset, and metric'. Together they form a unique fingerprint.

Cite this