TY - JOUR
T1 - 面向边缘 GPU 设备的快速光流估计算法
AU - Shi, Ke
AU - Nie, Suzhen
AU - Li, Dongxing
AU - Cao, Jie
AU - Sheng, Yunlong
AU - Yao, Bin
AU - Chen, Honglin
N1 - Publisher Copyright:
© 2025 Editorial office of Journal of Applied Optics. All rights reserved.
PY - 2025/3
Y1 - 2025/3
N2 - An optical flow estimation network suitable for edge GPU devices was proposed, aiming to solve the problem that dense optical flow estimation was difficult to deploy on embedded systems due to huge quantity of computation. Firstly, to fully exploit the GPU resources, an efficient feature extraction network was designed to reduce memory access costs. Secondly, by adopting a flat-shaped iterative update module to estimate the optical flow, the size of the model was further reduced, and the utilization of GPU bandwidth was improved. Experimental results on different datasets show that the proposed model has efficient inference capability and excellent flow estimation performance. In particular, compared with the advanced lightweight models, the proposed model reduces the error by 12.8% with only 0.54 Mb parameters, and improves the inference speed by 22.2%, demonstrating the satisfactory performance on embedded development boards.
AB - An optical flow estimation network suitable for edge GPU devices was proposed, aiming to solve the problem that dense optical flow estimation was difficult to deploy on embedded systems due to huge quantity of computation. Firstly, to fully exploit the GPU resources, an efficient feature extraction network was designed to reduce memory access costs. Secondly, by adopting a flat-shaped iterative update module to estimate the optical flow, the size of the model was further reduced, and the utilization of GPU bandwidth was improved. Experimental results on different datasets show that the proposed model has efficient inference capability and excellent flow estimation performance. In particular, compared with the advanced lightweight models, the proposed model reduces the error by 12.8% with only 0.54 Mb parameters, and improves the inference speed by 22.2%, demonstrating the satisfactory performance on embedded development boards.
KW - edge GPU devices
KW - embedded systems
KW - inference speed
KW - optical flow estimation
UR - http://www.scopus.com/inward/record.url?scp=105002756229&partnerID=8YFLogxK
U2 - 10.5768/JAO202546.0202008
DO - 10.5768/JAO202546.0202008
M3 - 文章
AN - SCOPUS:105002756229
SN - 1002-2082
VL - 46
SP - 355
EP - 363
JO - Journal of Applied Optics
JF - Journal of Applied Optics
IS - 2
ER -