TY - JOUR
T1 - Object-Level Attention Prediction for Drivers in the Information-Rich Traffic Environment
AU - Liu, Qingxiao
AU - Yao, Hui
AU - Lu, Chao
AU - Liu, Haiou
AU - Yi, Yangtian
AU - Chen, Huiyan
N1 - Publisher Copyright:
© 1982-2012 IEEE.
PY - 2024/6/1
Y1 - 2024/6/1
N2 - An object-level attention prediction framework for drivers in the urban environment with rich semantic and motion information is proposed in this article. The proposed framework is based on the visual working memory mechanism, which decomposes the perception process into three phases, external stimuli, cognitive constructing, and memory search. In the external stimuli phase, semantic and motion information of surrounding objects is obtained. In the cognitive constructing phase, the neighbor-based hierarchical clustering method is applied to extract both independent and dependent features of traffic participants and driving events. In the memory search phase, the heterogeneous motif graph neural network is utilized to construct visual memory layers and integrate multilevel features for attention reasoning. Finally, the feature embedding is fed into a multilayer perceptron to predict the object-level visual attention. Training and testing data are collected from crowded and dynamic traffic scenes. Experimental results show that the proposed framework can achieve a superior object-level prediction performance in the information-rich environments compared with the state-of-the-art methods. In addition, the proposed framework can reduce the time bias of visual attention effectively.
AB - An object-level attention prediction framework for drivers in the urban environment with rich semantic and motion information is proposed in this article. The proposed framework is based on the visual working memory mechanism, which decomposes the perception process into three phases, external stimuli, cognitive constructing, and memory search. In the external stimuli phase, semantic and motion information of surrounding objects is obtained. In the cognitive constructing phase, the neighbor-based hierarchical clustering method is applied to extract both independent and dependent features of traffic participants and driving events. In the memory search phase, the heterogeneous motif graph neural network is utilized to construct visual memory layers and integrate multilevel features for attention reasoning. Finally, the feature embedding is fed into a multilayer perceptron to predict the object-level visual attention. Training and testing data are collected from crowded and dynamic traffic scenes. Experimental results show that the proposed framework can achieve a superior object-level prediction performance in the information-rich environments compared with the state-of-the-art methods. In addition, the proposed framework can reduce the time bias of visual attention effectively.
KW - Graph model
KW - motif structure
KW - object-level attention
KW - visual attention prediction
UR - http://www.scopus.com/inward/record.url?scp=85165887731&partnerID=8YFLogxK
U2 - 10.1109/TIE.2023.3294547
DO - 10.1109/TIE.2023.3294547
M3 - Article
AN - SCOPUS:85165887731
SN - 0278-0046
VL - 71
SP - 6396
EP - 6406
JO - IEEE Transactions on Industrial Electronics
JF - IEEE Transactions on Industrial Electronics
IS - 6
ER -