DDMOT: Diffusion Based Multi-Object Tracking with Deep Association via Sensor Fusion

  • Baichuan Zhang
  • , Chengpu Yu
  • , Jingchen Xu
  • , Yunji Feng
  • , Yinni Liu

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

In recent years, most fusion-based multi-object tracking (MOT) works rely more heavily on LiDAR results due to the poor motion prediction performance of camera-based trackers. While these methods achieve good accuracy, they fail to fully utilize the visual feature advantages of cameras, resulting in degraded performance under occlusion scenarios. To address these limitations, we propose a diffusion model-based multi-object tracking method via sensor fusion. The diffusion model reformulates the motion prediction task as a displacement difference generation problem, which effectively handles nonlinear motion patterns. Furthermore, we design a GDIoU-based data association method and a adaptive lifecycle management system to fully leverage the perceptual capabilities of both camera and LiDAR sensors. Experimental results on the KITTI dataset demonstrate that our proposed method outperforms baseline methods in tracking accuracy.

Original languageEnglish
Title of host publicationProceedings of the 44th Chinese Control Conference, CCC 2025
EditorsJian Sun, Hongpeng Yin
PublisherIEEE Computer Society
Pages8296-8301
Number of pages6
ISBN (Electronic)9789887581611
DOIs
Publication statusPublished - 2025
Event44th Chinese Control Conference, CCC 2025 - Chongqing, China
Duration: 28 Jul 202530 Jul 2025

Publication series

NameChinese Control Conference, CCC
ISSN (Print)1934-1768
ISSN (Electronic)2161-2927

Conference

Conference44th Chinese Control Conference, CCC 2025
Country/TerritoryChina
CityChongqing
Period28/07/2530/07/25

Keywords

  • Adaptive lifecycle
  • Diffusion model
  • Multi-object tracking
  • Sensor fusion

Fingerprint

Dive into the research topics of 'DDMOT: Diffusion Based Multi-Object Tracking with Deep Association via Sensor Fusion'. Together they form a unique fingerprint.

Cite this