CSMR: A Multi-Modal Registered Dataset for Complex Scenarios

Chenrui Li, Kun Gao, Zibo Hu, Zhijia Yang, Mingfeng Cai, Haobo Cheng, Zhenyu Zhu*

*Corresponding author for this work

Research output: Contribution to journal > Article > peer-review

Abstract

Complex scenarios pose challenges for computer vision tasks such as image fusion, object detection, and image-to-image translation. On the one hand, they involve fluctuating weather and lighting conditions, under which even images of the same scene can look different. On the other hand, the large amount of textural detail in the images introduces considerable interference that can conceal useful information. An effective remedy is to exploit the complementary details in multi-modal images, such as visible-light and infrared images: visible-light images contain rich textural information, while infrared images capture temperature information. In this study, we propose a multi-modal registered dataset for complex scenarios under various environmental conditions, targeting security surveillance and the monitoring of low-slow-small targets. The dataset contains 30,819 images, in which targets are labeled with YOLO-format bounding boxes in three classes: “person”, “car”, and “drone”. We compared our dataset with those used in the literature for computer vision tasks, including image fusion, object detection, and image-to-image translation. The results show that introducing complementary information through image fusion can compensate for details missing from the original images, and they reveal the limitations of single-modal images for visual tasks in complex scenarios.
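
As an illustration of the YOLO-format labels mentioned in the abstract, the sketch below converts one label line into a pixel-space bounding box. It relies on the common YOLO text convention (class index, then box center and size, all normalized to the image dimensions). The class order in `CLASS_NAMES`, the helper name `parse_yolo_line`, and the 640x512 image size in the usage note are assumptions made for this example and are not taken from the dataset itself.

```python
from typing import NamedTuple

# Assumed index-to-name order; the dataset's actual mapping may differ.
CLASS_NAMES = ["person", "car", "drone"]


class PixelBox(NamedTuple):
    label: str
    x_min: float
    y_min: float
    x_max: float
    y_max: float


def parse_yolo_line(line: str, img_w: int, img_h: int) -> PixelBox:
    """Convert one 'class xc yc w h' label line (normalized) to pixel corners."""
    cls, xc, yc, w, h = line.split()
    # Scale the normalized center and size to pixel units.
    xc_px, w_px = float(xc) * img_w, float(w) * img_w
    yc_px, h_px = float(yc) * img_h, float(h) * img_h
    return PixelBox(
        CLASS_NAMES[int(cls)],
        xc_px - w_px / 2,
        yc_px - h_px / 2,
        xc_px + w_px / 2,
        yc_px + h_px / 2,
    )
```

For a hypothetical 640x512 image, `parse_yolo_line("2 0.5 0.5 0.25 0.5", 640, 512)` yields a "drone" box centered in the frame, one quarter of the image wide and half of it tall.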

Original language: English
Article number: 844
Journal: Remote Sensing
Volume: 17
Issue number: 5
Publication status: Published - Mar 2025
Externally published: Yes

Keywords

  • image fusion
  • image-to-image translation
  • infrared and visible dataset
  • object detection
