TY - JOUR
T1 - Scalable Hardware Architecture for High-Throughput Implementation of ESPRIT Algorithm
AU - Huang, Yanjie
AU - Wang, Weijiang
AU - Xue, Chengbo
AU - Jiang, Rongkun
AU - Dang, Hua
AU - Ren, Shiwei
N1 - Publisher Copyright:
© 2001-2012 IEEE.
PY - 2025
Y1 - 2025
N2 - Estimating signal parameter via rotational invariance technique (ESPRIT) algorithm is a high-performance method for direction of arrival (DOA) estimation. Its most cumbersome aspect is performing eigenvalue decomposition twice, particularly with general square matrices. Current implementations lack flexibility in configuring the number of array elements, signal sources, and snapshots at the same time. To tackle these challenges, we propose a scalable hardware acceleration scheme for the QR decomposition algorithm on a Field Programmable Gate Array (FPGA). Our QR Decomposition (QRD) module leverages orientation displacement, parallel computing, and logic reuse to enhance speed and generality. Building upon this foundation, a reconfigurable design scheme with a parallel covariance matrix computation module for ESPRIT is proposed to increase calculation speed and throughput. Our ESPRIT implementation supports up to 26 array elements, 25 signal sources, and 2,048 snapshots. As evidenced by experiments, the root mean squared error (RMSE) of the QRD module and ESPRIT implementation achieve 0.02 and 0.04° respectively. On average, when the number of array elements, signal sources, and snapshots are set to 4, 1, 64 and 8, 1, 128 individually, our ESPRIT implementation completes DOA estimation in 22.04 µs and 60.50 µs, with throughput of 45,372 and 16,529 separately. In comparison with CPU implementations, our FPGA implementations offer up to 88.66% and 89.84% time savings for QRD and ESPRIT respectively.
AB - Estimating signal parameter via rotational invariance technique (ESPRIT) algorithm is a high-performance method for direction of arrival (DOA) estimation. Its most cumbersome aspect is performing eigenvalue decomposition twice, particularly with general square matrices. Current implementations lack flexibility in configuring the number of array elements, signal sources, and snapshots at the same time. To tackle these challenges, we propose a scalable hardware acceleration scheme for the QR decomposition algorithm on a Field Programmable Gate Array (FPGA). Our QR Decomposition (QRD) module leverages orientation displacement, parallel computing, and logic reuse to enhance speed and generality. Building upon this foundation, a reconfigurable design scheme with a parallel covariance matrix computation module for ESPRIT is proposed to increase calculation speed and throughput. Our ESPRIT implementation supports up to 26 array elements, 25 signal sources, and 2,048 snapshots. As evidenced by experiments, the root mean squared error (RMSE) of the QRD module and ESPRIT implementation achieve 0.02 and 0.04° respectively. On average, when the number of array elements, signal sources, and snapshots are set to 4, 1, 64 and 8, 1, 128 individually, our ESPRIT implementation completes DOA estimation in 22.04 µs and 60.50 µs, with throughput of 45,372 and 16,529 separately. In comparison with CPU implementations, our FPGA implementations offer up to 88.66% and 89.84% time savings for QRD and ESPRIT respectively.
KW - Direction of arrival (DOA) estimation
KW - estimating signal parameter via rotational invariance technique (ESPRIT) algorithm
KW - field programmable gate array (FPGA)
KW - hardware implementation
KW - QR decomposition algorithm
KW - uniform linear array (ULA)
UR - http://www.scopus.com/inward/record.url?scp=105005998103&partnerID=8YFLogxK
U2 - 10.1109/JSEN.2025.3568483
DO - 10.1109/JSEN.2025.3568483
M3 - Article
AN - SCOPUS:105005998103
SN - 1530-437X
JO - IEEE Sensors Journal
JF - IEEE Sensors Journal
ER -