TY - JOUR
T1 - S2R-Bench
T2 - A Sim-to-Real Evaluation Benchmark for Autonomous Driving
AU - Wang, Li
AU - Yang, Guangqi
AU - Yang, Lei
AU - Zhang, Xinyu
AU - Song, Ziying
AU - Chen, Ying
AU - Liu, Lin
AU - Gao, Junjie
AU - Li, Zhiwei
AU - Yang, Qingshan
AU - Li, Jun
AU - Wang, Liangliang
AU - Yu, Wenhao
AU - Yang, Chao
AU - Xu, Bin
AU - Wang, Weida
AU - Liu, Huaping
N1 - Publisher Copyright:
© The Author(s) 2025.
PY - 2025/12
Y1 - 2025/12
N2 - Safety is a long-standing and final pursuit in the development of autonomous driving systems, with many safety challenge arising from perception. How to effectively evaluate the safety and reliability of perception algorithms is becoming an emerging issue. Despite its critical importance, existing perception methods exhibit a limitation in their robustness, primarily due to benchmarks being entirely simulated, which fail to align predicted results with actual outcomes, particularly under extreme weather conditions and sensor anomalies that are prevalent in real-world scenarios. To fill this gap, in this study, we propose a Sim-to-Real Evaluation Benchmark for autonomous driving (S2R-Bench). We collect diverse sensor anomaly data across various road conditions to evaluate the robustness of perception methods comprehensive and realistic manner. This is the first corruption robustness dataset based on real-world scenarios, encompassing various road conditions, weather conditions, lighting intensities, and time periods. By comparing real-world data with simulated data, we demonstrate the reliability the collected data for real-world use and aim to foster research on more robust perception models for autonomous driving.
AB - Safety is a long-standing and final pursuit in the development of autonomous driving systems, with many safety challenge arising from perception. How to effectively evaluate the safety and reliability of perception algorithms is becoming an emerging issue. Despite its critical importance, existing perception methods exhibit a limitation in their robustness, primarily due to benchmarks being entirely simulated, which fail to align predicted results with actual outcomes, particularly under extreme weather conditions and sensor anomalies that are prevalent in real-world scenarios. To fill this gap, in this study, we propose a Sim-to-Real Evaluation Benchmark for autonomous driving (S2R-Bench). We collect diverse sensor anomaly data across various road conditions to evaluate the robustness of perception methods comprehensive and realistic manner. This is the first corruption robustness dataset based on real-world scenarios, encompassing various road conditions, weather conditions, lighting intensities, and time periods. By comparing real-world data with simulated data, we demonstrate the reliability the collected data for real-world use and aim to foster research on more robust perception models for autonomous driving.
UR - https://www.scopus.com/pages/publications/105025825372
U2 - 10.1038/s41597-025-06255-3
DO - 10.1038/s41597-025-06255-3
M3 - Article
C2 - 41345133
AN - SCOPUS:105025825372
SN - 2052-4463
VL - 12
JO - Scientific data
JF - Scientific data
IS - 1
M1 - 2006
ER -