Abstract
Multi-modal sensor fusion techniques have advanced the development of autonomous driving, yet perception in complex environments remains a challenging problem. To tackle this problem, we propose the Open Multi-modal Perception dataset (OpenMPD), a multi-modal perception benchmark targeted at difficult examples. Compared with existing datasets, OpenMPD focuses more on complex urban traffic scenes with overexposure or darkness, crowded environments, unstructured roads, and intersections. It acquires multi-modal data with a vehicle carrying six cameras and four LiDARs for a 360-degree field of view, collecting 180 clips of 20-second synchronized images at 20 Hz and point clouds at 10 Hz. In particular, we applied a 128-beam LiDAR to provide high-resolution point clouds for better 3D environment understanding and sensor fusion. We sampled 15 K keyframes at equal intervals from the clips for annotation, including 2D/3D object detection, 3D object tracking, and 2D semantic segmentation. Moreover, we provide benchmarks for all four tasks to evaluate algorithms and conduct extensive 2D/3D detection and segmentation experiments on OpenMPD. Data and further information are available at http://www.openmpd.com/.
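For a concrete sense of the data volume these numbers imply, the short sketch below works out the per-sensor frame counts and the approximate keyframe spacing. The constants are taken directly from the abstract; the assumption that the 15 K keyframes are spread uniformly over the full camera timeline is ours, and the script is purely illustrative rather than part of any OpenMPD toolkit.

```python
# Back-of-the-envelope dataset scale, using the figures quoted in the abstract.
# Assumes every clip is exactly 20 s and sensors run at their nominal rates.

NUM_CLIPS = 180          # 20-second clips
CLIP_SECONDS = 20
NUM_CAMERAS = 6
NUM_LIDARS = 4
CAMERA_HZ = 20           # synchronized images
LIDAR_HZ = 10            # point clouds
NUM_KEYFRAMES = 15_000   # annotated keyframes sampled at equal intervals

# Raw frames per sensor stream across all clips.
images_per_camera = NUM_CLIPS * CLIP_SECONDS * CAMERA_HZ   # 72,000
sweeps_per_lidar = NUM_CLIPS * CLIP_SECONDS * LIDAR_HZ     # 36,000

total_images = images_per_camera * NUM_CAMERAS             # 432,000
total_sweeps = sweeps_per_lidar * NUM_LIDARS               # 144,000

# Approximate spacing between annotated keyframes if they are drawn
# uniformly from the camera timeline (an assumption, not stated in the paper).
keyframe_stride_s = (NUM_CLIPS * CLIP_SECONDS) / NUM_KEYFRAMES  # 0.24 s

if __name__ == "__main__":
    print(f"images per camera:        {images_per_camera}")
    print(f"LiDAR sweeps per sensor:  {sweeps_per_lidar}")
    print(f"total images / sweeps:    {total_images} / {total_sweeps}")
    print(f"approx. keyframe spacing: {keyframe_stride_s:.2f} s")
```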
| Original language | English |
|---|---|
| Pages (from-to) | 2437-2447 |
| Number of pages | 11 |
| Journal | IEEE Transactions on Vehicular Technology |
| Volume | 71 |
| Issue number | 3 |
| DOIs | |
| Publication status | Published - 1 Mar 2022 |
| Externally published | Yes |
Keywords
- Autonomous driving
- complex scenes
- dataset
- multimodal fusion
- perception