UniRTL: A universal RGBT and low-light benchmark for object tracking

Lian Zhang; Lingxue Wang; Yuzhen Wu; Mingkun Chen; Dezhi Zheng; Liangcai Cao; Bangze Zeng; Yi Cai

doi:10.1016/j.patcog.2024.110984

UniRTL: A universal RGBT and low-light benchmark for object tracking

Lian Zhang, Lingxue Wang^*, Yuzhen Wu, Mingkun Chen, Dezhi Zheng, Liangcai Cao, Bangze Zeng, Yi Cai

^*此作品的通讯作者

科研成果: 期刊稿件 › 文章 › 同行评审

摘要

Solving single- and multiple-object tracking problems with a single network is challenging in the RGBT tracking. We present a universal RGBT and low-light benchmark (UniRTL), which contains 3 × 626 videos for SOT and 3 × 50 videos for MOT, totally with more than 158K frame triplet. The dataset is divided into low-, middle-, and high-illuminance categories based on the measurement of the scene illuminance. We also propose a SOT and MOT unified tracking-with-detection tracker (Unismot) that comprises a detector, first-frame target prior (FTP), and data associator. SOT and MOT are unified by feeding FTP into the detector and data associator. Re-ID long-term matching module and reusing low-score bounding boxes are proposed to augment SOT and MOT performance, respectively. Experiments demonstrate that Unismot performs as well as or better than its counterparts on established RGBT tracking datasets. This work promotes a universal multimodal tracking throughout day and night.

源语言	英语
文章编号	110984
期刊	Pattern Recognition
卷	158
DOI	https://doi.org/10.1016/j.patcog.2024.110984
出版状态	已出版 - 2月 2025

访问文件

10.1016/j.patcog.2024.110984

其它文件与链接

链接到 Scopus 的出版物

引用此

@article{6e828a0981d948e092b9d48ef8263329,

title = "UniRTL: A universal RGBT and low-light benchmark for object tracking",

abstract = "Solving single- and multiple-object tracking problems with a single network is challenging in the RGBT tracking. We present a universal RGBT and low-light benchmark (UniRTL), which contains 3 × 626 videos for SOT and 3 × 50 videos for MOT, totally with more than 158K frame triplet. The dataset is divided into low-, middle-, and high-illuminance categories based on the measurement of the scene illuminance. We also propose a SOT and MOT unified tracking-with-detection tracker (Unismot) that comprises a detector, first-frame target prior (FTP), and data associator. SOT and MOT are unified by feeding FTP into the detector and data associator. Re-ID long-term matching module and reusing low-score bounding boxes are proposed to augment SOT and MOT performance, respectively. Experiments demonstrate that Unismot performs as well as or better than its counterparts on established RGBT tracking datasets. This work promotes a universal multimodal tracking throughout day and night.",

keywords = "Multitask benchmark, RGBT and low-light benchmark, RGBT and low-light image, Unified object tracking",

author = "Lian Zhang and Lingxue Wang and Yuzhen Wu and Mingkun Chen and Dezhi Zheng and Liangcai Cao and Bangze Zeng and Yi Cai",

note = "Publisher Copyright: {\textcopyright} 2024 Elsevier Ltd",

year = "2025",

month = feb,

doi = "10.1016/j.patcog.2024.110984",

language = "English",

volume = "158",

journal = "Pattern Recognition",

issn = "0031-3203",

publisher = "Elsevier Ltd.",

}

TY - JOUR

T1 - UniRTL

T2 - A universal RGBT and low-light benchmark for object tracking

AU - Zhang, Lian

AU - Wang, Lingxue

AU - Wu, Yuzhen

AU - Chen, Mingkun

AU - Zheng, Dezhi

AU - Cao, Liangcai

AU - Zeng, Bangze

AU - Cai, Yi

PY - 2025/2

Y1 - 2025/2

N2 - Solving single- and multiple-object tracking problems with a single network is challenging in the RGBT tracking. We present a universal RGBT and low-light benchmark (UniRTL), which contains 3 × 626 videos for SOT and 3 × 50 videos for MOT, totally with more than 158K frame triplet. The dataset is divided into low-, middle-, and high-illuminance categories based on the measurement of the scene illuminance. We also propose a SOT and MOT unified tracking-with-detection tracker (Unismot) that comprises a detector, first-frame target prior (FTP), and data associator. SOT and MOT are unified by feeding FTP into the detector and data associator. Re-ID long-term matching module and reusing low-score bounding boxes are proposed to augment SOT and MOT performance, respectively. Experiments demonstrate that Unismot performs as well as or better than its counterparts on established RGBT tracking datasets. This work promotes a universal multimodal tracking throughout day and night.

AB - Solving single- and multiple-object tracking problems with a single network is challenging in the RGBT tracking. We present a universal RGBT and low-light benchmark (UniRTL), which contains 3 × 626 videos for SOT and 3 × 50 videos for MOT, totally with more than 158K frame triplet. The dataset is divided into low-, middle-, and high-illuminance categories based on the measurement of the scene illuminance. We also propose a SOT and MOT unified tracking-with-detection tracker (Unismot) that comprises a detector, first-frame target prior (FTP), and data associator. SOT and MOT are unified by feeding FTP into the detector and data associator. Re-ID long-term matching module and reusing low-score bounding boxes are proposed to augment SOT and MOT performance, respectively. Experiments demonstrate that Unismot performs as well as or better than its counterparts on established RGBT tracking datasets. This work promotes a universal multimodal tracking throughout day and night.

KW - Multitask benchmark

KW - RGBT and low-light benchmark

KW - RGBT and low-light image

KW - Unified object tracking

UR - http://www.scopus.com/inward/record.url?scp=85203416089&partnerID=8YFLogxK

U2 - 10.1016/j.patcog.2024.110984

DO - 10.1016/j.patcog.2024.110984

M3 - Article

AN - SCOPUS:85203416089

SN - 0031-3203

VL - 158

JO - Pattern Recognition

JF - Pattern Recognition

M1 - 110984

ER -

UniRTL: A universal RGBT and low-light benchmark for object tracking

摘要

访问文件

其它文件与链接

指纹

引用此