TY - GEN
T1 - Towards Discriminative Feature Learning for Multi-object Tracking in UAV Captured Videos
AU - Wu, Jiapeng
AU - Jin, Qiuyu
AU - Han, Yuqi
AU - Fang, Hao
AU - Kang, Xiaohui
AU - Deng, Chenwei
N1 - Publisher Copyright:
© 2024 IEEE.
PY - 2024
Y1 - 2024
N2 - Recently, multi-object tracking (MOT) based on unmanned aerial vehicle (UAV) platform has become an important topic. However, in aerial photography scenes, the lack of object's appearance texture remains a challenge, as trackers are prone to confuse objects with similar appearances and lead to ID switches. Nonetheless, most of the existing methods mainly model appearance features using short-time clues, and such limited information makes it difficult to distinguish similar objects. To address this issue, we propose a novel Discriminative Multi-object Tracker (DistMOT), aiming to utilize high-quality long-term templates to mine distinctive object appearance, and further leverage the richer information of historical templates to distinguish similar objects. To this end, a Selective Memory Bank (SMB) is introduced to store multi-view historical templates; meanwhile, the Uncertainty-augmented Contrastive Learning (UACL) strategy is proposed to focus more attention on hard samples in the SMB, thereby forcing the model to highlight inter-object differential features and intra-object invariant features. Finally, the historical template differences of similar objects are considered for more accurate discrimination. Extensive experiments on the VisDrone-MOT and UAVDT datasets demonstrate the superiority of our method. Code is available at https://github.com/JackWoo0831/DistMOT
AB - Recently, multi-object tracking (MOT) based on unmanned aerial vehicle (UAV) platform has become an important topic. However, in aerial photography scenes, the lack of object's appearance texture remains a challenge, as trackers are prone to confuse objects with similar appearances and lead to ID switches. Nonetheless, most of the existing methods mainly model appearance features using short-time clues, and such limited information makes it difficult to distinguish similar objects. To address this issue, we propose a novel Discriminative Multi-object Tracker (DistMOT), aiming to utilize high-quality long-term templates to mine distinctive object appearance, and further leverage the richer information of historical templates to distinguish similar objects. To this end, a Selective Memory Bank (SMB) is introduced to store multi-view historical templates; meanwhile, the Uncertainty-augmented Contrastive Learning (UACL) strategy is proposed to focus more attention on hard samples in the SMB, thereby forcing the model to highlight inter-object differential features and intra-object invariant features. Finally, the historical template differences of similar objects are considered for more accurate discrimination. Extensive experiments on the VisDrone-MOT and UAVDT datasets demonstrate the superiority of our method. Code is available at https://github.com/JackWoo0831/DistMOT
KW - UAV
KW - contrastive learning
KW - multi-object tracking
UR - https://www.scopus.com/pages/publications/86000022616
U2 - 10.1109/ICSIDP62679.2024.10867998
DO - 10.1109/ICSIDP62679.2024.10867998
M3 - Conference contribution
AN - SCOPUS:86000022616
T3 - IEEE International Conference on Signal, Information and Data Processing, ICSIDP 2024
BT - IEEE International Conference on Signal, Information and Data Processing, ICSIDP 2024
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 2nd IEEE International Conference on Signal, Information and Data Processing, ICSIDP 2024
Y2 - 22 November 2024 through 24 November 2024
ER -