TY - GEN
T1 - Gaze Target Prediction with the Understanding of 3D Scenes
AU - Gao, Leru
AU - Sun, Fengxi
AU - Liu, Yue
N1 - Publisher Copyright:
© The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd. 2026.
PY - 2026
Y1 - 2026
N2 - The goal of the gaze target prediction is to determine the location where the person is focusing and the probability of the gaze falls outside the image. Although prior works have addressed this task by regressing heatmaps centered on the gaze location, they typically fail to incorporate the scene's semantic information. In this work, we first generate 3D point cloud of the given image based on depth estimation and camera intrinsics. Then we combine the point cloud and estimated 3D gaze vector to generate the 3D field of view (FoV) heatmap. Scene contextual cues are finally merged to get the output heatmap. Our method achieves competitive results on the ChildPlay, GazeFollow, and VideoAttentionTarget datasets.
AB - The goal of the gaze target prediction is to determine the location where the person is focusing and the probability of the gaze falls outside the image. Although prior works have addressed this task by regressing heatmaps centered on the gaze location, they typically fail to incorporate the scene's semantic information. In this work, we first generate 3D point cloud of the given image based on depth estimation and camera intrinsics. Then we combine the point cloud and estimated 3D gaze vector to generate the 3D field of view (FoV) heatmap. Scene contextual cues are finally merged to get the output heatmap. Our method achieves competitive results on the ChildPlay, GazeFollow, and VideoAttentionTarget datasets.
KW - 3D scene understanding
KW - gaze estimation
KW - gaze target prediction
UR - https://www.scopus.com/pages/publications/105027932979
U2 - 10.1007/978-981-95-4966-5_9
DO - 10.1007/978-981-95-4966-5_9
M3 - Conference contribution
AN - SCOPUS:105027932979
SN - 9789819549658
T3 - Communications in Computer and Information Science
SP - 129
EP - 143
BT - Image and Graphics Technologies and Applications - 20th Chinese Conference, IGTA 2025, Revised Selected Papers
A2 - Wang, Yongtian
A2 - Chen, Yi
PB - Springer Science and Business Media Deutschland GmbH
T2 - 20th Chinese Conference on Image and Graphics Technologies and Applications, IGTA 2025
Y2 - 9 August 2025 through 10 August 2025
ER -