Object-Level Attention Prediction for Drivers in the Information-Rich Traffic Environment

Qingxiao Liu; Hui Yao; Chao Lu; Haiou Liu; Yangtian Yi; Huiyan Chen

doi:10.1109/TIE.2023.3294547

Object-Level Attention Prediction for Drivers in the Information-Rich Traffic Environment

Qingxiao Liu, Hui Yao, Chao Lu^*, Haiou Liu, Yangtian Yi, Huiyan Chen

^*此作品的通讯作者

机械与车辆学院

Beijing Institute of Technology

科研成果: 期刊稿件 › 文章 › 同行评审

摘要

An object-level attention prediction framework for drivers in the urban environment with rich semantic and motion information is proposed in this article. The proposed framework is based on the visual working memory mechanism, which decomposes the perception process into three phases, external stimuli, cognitive constructing, and memory search. In the external stimuli phase, semantic and motion information of surrounding objects is obtained. In the cognitive constructing phase, the neighbor-based hierarchical clustering method is applied to extract both independent and dependent features of traffic participants and driving events. In the memory search phase, the heterogeneous motif graph neural network is utilized to construct visual memory layers and integrate multilevel features for attention reasoning. Finally, the feature embedding is fed into a multilayer perceptron to predict the object-level visual attention. Training and testing data are collected from crowded and dynamic traffic scenes. Experimental results show that the proposed framework can achieve a superior object-level prediction performance in the information-rich environments compared with the state-of-the-art methods. In addition, the proposed framework can reduce the time bias of visual attention effectively.

源语言	英语
页（从-至）	6396-6406
页数	11
期刊	IEEE Transactions on Industrial Electronics
卷	71
期	6
DOI	https://doi.org/10.1109/TIE.2023.3294547
出版状态	已出版 - 1 6月 2024

访问文件

10.1109/TIE.2023.3294547

其它文件与链接

链接到 Scopus 的出版物

引用此

Liu, Q., Yao, H., Lu, C., Liu, H., Yi, Y., & Chen, H. (2024). Object-Level Attention Prediction for Drivers in the Information-Rich Traffic Environment. IEEE Transactions on Industrial Electronics, 71(6), 6396-6406. https://doi.org/10.1109/TIE.2023.3294547

@article{e87b0d1b56244684ae6f2715c48da78f,

title = "Object-Level Attention Prediction for Drivers in the Information-Rich Traffic Environment",

abstract = "An object-level attention prediction framework for drivers in the urban environment with rich semantic and motion information is proposed in this article. The proposed framework is based on the visual working memory mechanism, which decomposes the perception process into three phases, external stimuli, cognitive constructing, and memory search. In the external stimuli phase, semantic and motion information of surrounding objects is obtained. In the cognitive constructing phase, the neighbor-based hierarchical clustering method is applied to extract both independent and dependent features of traffic participants and driving events. In the memory search phase, the heterogeneous motif graph neural network is utilized to construct visual memory layers and integrate multilevel features for attention reasoning. Finally, the feature embedding is fed into a multilayer perceptron to predict the object-level visual attention. Training and testing data are collected from crowded and dynamic traffic scenes. Experimental results show that the proposed framework can achieve a superior object-level prediction performance in the information-rich environments compared with the state-of-the-art methods. In addition, the proposed framework can reduce the time bias of visual attention effectively.",

keywords = "Graph model, motif structure, object-level attention, visual attention prediction",

author = "Qingxiao Liu and Hui Yao and Chao Lu and Haiou Liu and Yangtian Yi and Huiyan Chen",

note = "Publisher Copyright: {\textcopyright} 1982-2012 IEEE.",

year = "2024",

month = jun,

day = "1",

doi = "10.1109/TIE.2023.3294547",

language = "English",

volume = "71",

pages = "6396--6406",

journal = "IEEE Transactions on Industrial Electronics",

issn = "0278-0046",

publisher = "IEEE Industrial Electronics Society",

number = "6",

}

TY - JOUR

T1 - Object-Level Attention Prediction for Drivers in the Information-Rich Traffic Environment

AU - Liu, Qingxiao

AU - Yao, Hui

AU - Lu, Chao

AU - Liu, Haiou

AU - Yi, Yangtian

AU - Chen, Huiyan

PY - 2024/6/1

Y1 - 2024/6/1

N2 - An object-level attention prediction framework for drivers in the urban environment with rich semantic and motion information is proposed in this article. The proposed framework is based on the visual working memory mechanism, which decomposes the perception process into three phases, external stimuli, cognitive constructing, and memory search. In the external stimuli phase, semantic and motion information of surrounding objects is obtained. In the cognitive constructing phase, the neighbor-based hierarchical clustering method is applied to extract both independent and dependent features of traffic participants and driving events. In the memory search phase, the heterogeneous motif graph neural network is utilized to construct visual memory layers and integrate multilevel features for attention reasoning. Finally, the feature embedding is fed into a multilayer perceptron to predict the object-level visual attention. Training and testing data are collected from crowded and dynamic traffic scenes. Experimental results show that the proposed framework can achieve a superior object-level prediction performance in the information-rich environments compared with the state-of-the-art methods. In addition, the proposed framework can reduce the time bias of visual attention effectively.

AB - An object-level attention prediction framework for drivers in the urban environment with rich semantic and motion information is proposed in this article. The proposed framework is based on the visual working memory mechanism, which decomposes the perception process into three phases, external stimuli, cognitive constructing, and memory search. In the external stimuli phase, semantic and motion information of surrounding objects is obtained. In the cognitive constructing phase, the neighbor-based hierarchical clustering method is applied to extract both independent and dependent features of traffic participants and driving events. In the memory search phase, the heterogeneous motif graph neural network is utilized to construct visual memory layers and integrate multilevel features for attention reasoning. Finally, the feature embedding is fed into a multilayer perceptron to predict the object-level visual attention. Training and testing data are collected from crowded and dynamic traffic scenes. Experimental results show that the proposed framework can achieve a superior object-level prediction performance in the information-rich environments compared with the state-of-the-art methods. In addition, the proposed framework can reduce the time bias of visual attention effectively.

KW - Graph model

KW - motif structure

KW - object-level attention

KW - visual attention prediction

UR - http://www.scopus.com/inward/record.url?scp=85165887731&partnerID=8YFLogxK

U2 - 10.1109/TIE.2023.3294547

DO - 10.1109/TIE.2023.3294547

M3 - Article

AN - SCOPUS:85165887731

SN - 0278-0046

VL - 71

SP - 6396

EP - 6406

JO - IEEE Transactions on Industrial Electronics

JF - IEEE Transactions on Industrial Electronics

IS - 6

ER -

Object-Level Attention Prediction for Drivers in the Information-Rich Traffic Environment

摘要

访问文件

其它文件与链接

指纹

引用此