TY - JOUR
T1 - Compression of Head-Related Transfer Function Based on Tucker and Tensor Train Decomposition
AU - Wang, Jing
AU - Liu, Min
AU - Xie, Xiang
AU - Kuang, Jingming
N1 - Publisher Copyright:
© 2013 IEEE.
PY - 2019
Y1 - 2019
N2 - Head-related transfer function (HRTF) plays an important role in three-dimensional spatial sound system. However, the direct application of a large amount of original HRTF data would involve a great deal of computational burden, especially for high-spatial-resolution individual HRTF. To address this problem, we propose a novel compression method (called TT-Tucker) combining Tucker model with tensor train decomposition based on a 5-order HRTF tensor model developed in subspaces of an ear, subject, azimuth, elevation, and frequency. Lots of HRTF data can be decomposed into several low-parametric factors representing the key spectrum information of HRTF by capturing the hidden interactions among different subspaces. To evaluate the reconstruction performance, the numerical experiments were conducted on the CIPIC HRTF database. Under the same compression ratio of nearly 98%, the results suggest that the proposed method has a better performance in spectral distortion and signal-to-distortion ratio than that of the usual tensor method and the standard method principal component analysis (PCA). Moreover, the subjective listening test shows that the TT-Tucker method performs better in that, the compressed and reconstructed HRTF is closer to the original HRTF in the sound localization similarity.
AB - Head-related transfer function (HRTF) plays an important role in three-dimensional spatial sound system. However, the direct application of a large amount of original HRTF data would involve a great deal of computational burden, especially for high-spatial-resolution individual HRTF. To address this problem, we propose a novel compression method (called TT-Tucker) combining Tucker model with tensor train decomposition based on a 5-order HRTF tensor model developed in subspaces of an ear, subject, azimuth, elevation, and frequency. Lots of HRTF data can be decomposed into several low-parametric factors representing the key spectrum information of HRTF by capturing the hidden interactions among different subspaces. To evaluate the reconstruction performance, the numerical experiments were conducted on the CIPIC HRTF database. Under the same compression ratio of nearly 98%, the results suggest that the proposed method has a better performance in spectral distortion and signal-to-distortion ratio than that of the usual tensor method and the standard method principal component analysis (PCA). Moreover, the subjective listening test shows that the TT-Tucker method performs better in that, the compressed and reconstructed HRTF is closer to the original HRTF in the sound localization similarity.
KW - Head-related transfer function
KW - Tucker model
KW - compression
KW - tensor train
UR - http://www.scopus.com/inward/record.url?scp=85065184534&partnerID=8YFLogxK
U2 - 10.1109/ACCESS.2019.2906364
DO - 10.1109/ACCESS.2019.2906364
M3 - Article
AN - SCOPUS:85065184534
SN - 2169-3536
VL - 7
SP - 39639
EP - 39651
JO - IEEE Access
JF - IEEE Access
M1 - 8672134
ER -