摘要
This paper proposes an improved method for Winograd algorithm to solve the problem that the existing methods of long sequences Fast Fourier Transform (FFT) on the TS201 processor does not take full account of the Cache's miss influence on efficiency. The new method makes maximum use of the Cache's advantages in reading and writing by optimizing the access method of rows and columns to avoid three explicitly matrix transposition, and hiding the twiddle factor multiplication by reconfiguration butterfly computation. Test results show that the performance of Cache-optimized implementation of FFT is significantly improved, and it can be used for fast acquisition of pulse-compression in radar system.
源语言 | 英语 |
---|---|
页(从-至) | 1774-1178 |
页数 | 597 |
期刊 | Dianzi Yu Xinxi Xuebao/Journal of Electronics and Information Technology |
卷 | 35 |
期 | 7 |
DOI | |
出版状态 | 已出版 - 7月 2013 |