TY - JOUR
T1 - 面向视频结构化的细粒度车辆检测分类模型
AU - Shi, Jian
AU - Cheng, Qian
AU - Jin, Lisheng
AU - Hu, Yaoguang
AU - Jiang, Xiaobei
AU - Guo, Baicang
AU - Wang, Wuhong
N1 - Publisher Copyright:
© 2021, Society of Automotive Engineers of China. All right reserved.
PY - 2021/10/25
Y1 - 2021/10/25
N2 - In order to solve the problem of limited understanding of complex traffic scenes in driverless environment perception technology, this paper proposes a roadside-oriented video structured description framework, which can enrich the fine-grained information of different targets in traffic scenes and improve the understanding ability of complex traffic scenes. For the proposed framework, this paper provides an engineering fine-grained vehicle detection and classification model. The YOLOv4 algorithm is optimized by channel pruning strategy, and the volume of the compressed model, YOLOv4-Pruned, is reduced by about 60% compared with the original model under the condition that mAP is almost unchanged. A vehicle classification method with 16 types and 12 colors is designed, which can effectively cover all vehicles in the current traffic scene. And the classification accuracy of the test set can reach 93%. The fine-grained vehicle detection and classification model designed in this paper is stable at 23FPS under 1920 × 1080 pixel input, NVIDIA Geforce RTX 2080ti, and the unquantified model is stable at 13FPS under Hisilicon-Hi3516DV300.
AB - In order to solve the problem of limited understanding of complex traffic scenes in driverless environment perception technology, this paper proposes a roadside-oriented video structured description framework, which can enrich the fine-grained information of different targets in traffic scenes and improve the understanding ability of complex traffic scenes. For the proposed framework, this paper provides an engineering fine-grained vehicle detection and classification model. The YOLOv4 algorithm is optimized by channel pruning strategy, and the volume of the compressed model, YOLOv4-Pruned, is reduced by about 60% compared with the original model under the condition that mAP is almost unchanged. A vehicle classification method with 16 types and 12 colors is designed, which can effectively cover all vehicles in the current traffic scene. And the classification accuracy of the test set can reach 93%. The fine-grained vehicle detection and classification model designed in this paper is stable at 23FPS under 1920 × 1080 pixel input, NVIDIA Geforce RTX 2080ti, and the unquantified model is stable at 13FPS under Hisilicon-Hi3516DV300.
KW - Driverless technology
KW - Fine⁃grained vehicle detection and classification
KW - Roadside environment perception
KW - Video structuring description algorithm
UR - http://www.scopus.com/inward/record.url?scp=85118230514&partnerID=8YFLogxK
U2 - 10.19562/j.chinasae.qcgc.2021.10.002
DO - 10.19562/j.chinasae.qcgc.2021.10.002
M3 - 文章
AN - SCOPUS:85118230514
SN - 1000-680X
VL - 43
SP - 1427
EP - 1434
JO - Qiche Gongcheng/Automotive Engineering
JF - Qiche Gongcheng/Automotive Engineering
IS - 10
ER -