Abstract
This paper proposes a novel multimodal collaborative perception framework to enhance the situational awareness of autonomous vehicles. First, a multimodal fusion baseline system is built that effectively integrates Light Detection and Ranging (LiDAR) point clouds and camera images, providing a comparative benchmark for subsequent research. Second, several well-known feature fusion strategies are investigated in the context of collaborative scenarios, including channel-wise concatenation, element-wise summation, and transformer-based methods. This study aims to seamlessly integrate intermediate representations from different sensor modalities and to provide a thorough assessment of their effects on model performance. Extensive experiments are conducted on OPV2V, a large-scale open-source simulation dataset. The results show that attention-based multimodal fusion outperforms the alternative strategies, delivering more precise target localization in complex traffic scenarios and thereby enhancing the safety and reliability of autonomous driving systems.
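As a rough illustration of the three intermediate-feature fusion strategies named in the abstract (channel-wise concatenation, element-wise summation, and transformer-style attention), the sketch below fuses LiDAR and camera feature maps in PyTorch. All module names, tensor shapes, and hyperparameters are illustrative assumptions and do not reflect the authors' actual implementation or the OPV2V pipeline.

```python
# Hypothetical sketch of three intermediate-feature fusion strategies for
# LiDAR and camera branches. Shapes and hyperparameters are assumptions.
import torch
import torch.nn as nn


class ConcatFusion(nn.Module):
    """Channel-wise concatenation followed by a 1x1 conv to restore channels."""
    def __init__(self, channels: int):
        super().__init__()
        self.proj = nn.Conv2d(2 * channels, channels, kernel_size=1)

    def forward(self, lidar_feat, camera_feat):
        return self.proj(torch.cat([lidar_feat, camera_feat], dim=1))


class SumFusion(nn.Module):
    """Element-wise summation of the two modality features."""
    def forward(self, lidar_feat, camera_feat):
        return lidar_feat + camera_feat


class AttentionFusion(nn.Module):
    """Cross-attention fusion: LiDAR features attend to camera features."""
    def __init__(self, channels: int, num_heads: int = 4):
        super().__init__()
        self.attn = nn.MultiheadAttention(channels, num_heads, batch_first=True)
        self.norm = nn.LayerNorm(channels)

    def forward(self, lidar_feat, camera_feat):
        b, c, h, w = lidar_feat.shape
        q = lidar_feat.flatten(2).transpose(1, 2)    # (B, H*W, C) queries
        kv = camera_feat.flatten(2).transpose(1, 2)  # (B, H*W, C) keys/values
        fused, _ = self.attn(q, kv, kv)
        fused = self.norm(fused + q)                 # residual connection
        return fused.transpose(1, 2).reshape(b, c, h, w)


if __name__ == "__main__":
    # Toy feature maps standing in for intermediate LiDAR/camera representations.
    lidar = torch.randn(2, 64, 32, 32)
    camera = torch.randn(2, 64, 32, 32)
    for fusion in (ConcatFusion(64), SumFusion(), AttentionFusion(64)):
        out = fusion(lidar, camera)
        print(type(fusion).__name__, tuple(out.shape))
```

In this reading, concatenation and summation combine the modalities with fixed rules, while the attention variant learns spatially varying weights over the camera features, which is one plausible reason the abstract reports better localization for the attention-based fusion.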
| Translated title of the contribution | Collaborative Perception Method Based on Multisensor Fusion |
|---|---|
| Original language | Traditional Chinese |
| Pages (from-to) | 87-96 |
| Number of pages | 10 |
| Journal | Journal of Radars |
| Volume | 13 |
| Issue | 1 |
| DOI | |
| Publication status | Published - 2024 |
Keywords
- 3D object detection
- Autonomous driving
- Collaborative perception
- Intelligent transportation systems
- Multimodal fusion