TY - JOUR
T1 - VRUFinder
T2 - A Real-Time, Infrastructure-Sensor- Enabled Framework for Recognizing Vulnerable Road Users
AU - Shi, Jian
AU - Kieu, Le Minh
AU - Sun, Dongxian
AU - Tan, Haiqiu
AU - Gao, Ming
AU - Guo, Baicang
AU - Wang, Wuhong
N1 - Publisher Copyright:
© 2001-2012 IEEE.
PY - 2024/3/15
Y1 - 2024/3/15
N2 - The provision of real-time, accurate perception of vulnerable road users (VRUs) via infrastructure-sensor-based devices is integral to roadside perception in vehicle-infrastructure collaboration system. However, prevailing data and algorithms fall short of accomplishing this task effectively on high-resolution imagery. In response, we introduce a visual perception framework, VRUFinder, designed specifically for infrastructure-enabled deployment, and a multiview symmetrical knowledge distillation methodology for VRU recognition. This approach amalgamates various teacher networks into streamlined student networks from diverse perspectives. By integrating our novel logical connectivity and quality judgment model, we enhance the existing state-of-the-art algorithms of YOLOv7 and StrongSORT. Moreover, we present VRUNet, a novel dataset for VRU recognition, furnishing high-resolution, top-down perspective images with visual sensor acquisition system. To the best of our knowledge, datasets of this nature are seldom found in current VRU recognition research. The effectiveness of our approach is substantiated through a series of ablation experiments and engineering case study on a low computational infrastructure-sensor-enabled device. By encapsulating our approach, we provide mature solutions for commercial infrastructure-sensor-based devices, which will contribute to the development of connected and automated vehicles and intelligent transportation systems.
AB - The provision of real-time, accurate perception of vulnerable road users (VRUs) via infrastructure-sensor-based devices is integral to roadside perception in vehicle-infrastructure collaboration system. However, prevailing data and algorithms fall short of accomplishing this task effectively on high-resolution imagery. In response, we introduce a visual perception framework, VRUFinder, designed specifically for infrastructure-enabled deployment, and a multiview symmetrical knowledge distillation methodology for VRU recognition. This approach amalgamates various teacher networks into streamlined student networks from diverse perspectives. By integrating our novel logical connectivity and quality judgment model, we enhance the existing state-of-the-art algorithms of YOLOv7 and StrongSORT. Moreover, we present VRUNet, a novel dataset for VRU recognition, furnishing high-resolution, top-down perspective images with visual sensor acquisition system. To the best of our knowledge, datasets of this nature are seldom found in current VRU recognition research. The effectiveness of our approach is substantiated through a series of ablation experiments and engineering case study on a low computational infrastructure-sensor-enabled device. By encapsulating our approach, we provide mature solutions for commercial infrastructure-sensor-based devices, which will contribute to the development of connected and automated vehicles and intelligent transportation systems.
KW - Infrastructure-sensor-based perception
KW - object recognition
KW - vehicle-infrastructure collaboration
KW - video structured description framework (VSDF)
KW - vulnerable road users (VRUs)
UR - http://www.scopus.com/inward/record.url?scp=85184013509&partnerID=8YFLogxK
U2 - 10.1109/JSEN.2024.3356528
DO - 10.1109/JSEN.2024.3356528
M3 - Article
AN - SCOPUS:85184013509
SN - 1530-437X
VL - 24
SP - 8885
EP - 8901
JO - IEEE Sensors Journal
JF - IEEE Sensors Journal
IS - 6
ER -