Compression of Head-Related Transfer Function Based on Tucker and Tensor Train Decomposition

Jing Wang; Min Liu; Xiang Xie; Jingming Kuang

doi:10.1109/ACCESS.2019.2906364

Compression of Head-Related Transfer Function Based on Tucker and Tensor Train Decomposition

Jing Wang^*, Min Liu, Xiang Xie, Jingming Kuang

^*Corresponding author for this work

School of Information and Electronics

Beijing Institute of Technology

Research output: Contribution to journal › Article › peer-review

4 Citations (Scopus)

Abstract

Head-related transfer function (HRTF) plays an important role in three-dimensional spatial sound system. However, the direct application of a large amount of original HRTF data would involve a great deal of computational burden, especially for high-spatial-resolution individual HRTF. To address this problem, we propose a novel compression method (called TT-Tucker) combining Tucker model with tensor train decomposition based on a 5-order HRTF tensor model developed in subspaces of an ear, subject, azimuth, elevation, and frequency. Lots of HRTF data can be decomposed into several low-parametric factors representing the key spectrum information of HRTF by capturing the hidden interactions among different subspaces. To evaluate the reconstruction performance, the numerical experiments were conducted on the CIPIC HRTF database. Under the same compression ratio of nearly 98%, the results suggest that the proposed method has a better performance in spectral distortion and signal-to-distortion ratio than that of the usual tensor method and the standard method principal component analysis (PCA). Moreover, the subjective listening test shows that the TT-Tucker method performs better in that, the compressed and reconstructed HRTF is closer to the original HRTF in the sound localization similarity.

Original language	English
Article number	8672134
Pages (from-to)	39639-39651
Number of pages	13
Journal	IEEE Access
Volume	7
DOIs	https://doi.org/10.1109/ACCESS.2019.2906364
Publication status	Published - 2019

Keywords

Head-related transfer function
Tucker model
compression
tensor train

Access to Document

10.1109/ACCESS.2019.2906364

Cite this

@article{e65459cae9d948049e44f51c12fd0015,

title = "Compression of Head-Related Transfer Function Based on Tucker and Tensor Train Decomposition",

abstract = "Head-related transfer function (HRTF) plays an important role in three-dimensional spatial sound system. However, the direct application of a large amount of original HRTF data would involve a great deal of computational burden, especially for high-spatial-resolution individual HRTF. To address this problem, we propose a novel compression method (called TT-Tucker) combining Tucker model with tensor train decomposition based on a 5-order HRTF tensor model developed in subspaces of an ear, subject, azimuth, elevation, and frequency. Lots of HRTF data can be decomposed into several low-parametric factors representing the key spectrum information of HRTF by capturing the hidden interactions among different subspaces. To evaluate the reconstruction performance, the numerical experiments were conducted on the CIPIC HRTF database. Under the same compression ratio of nearly 98%, the results suggest that the proposed method has a better performance in spectral distortion and signal-to-distortion ratio than that of the usual tensor method and the standard method principal component analysis (PCA). Moreover, the subjective listening test shows that the TT-Tucker method performs better in that, the compressed and reconstructed HRTF is closer to the original HRTF in the sound localization similarity.",

keywords = "Head-related transfer function, Tucker model, compression, tensor train",

author = "Jing Wang and Min Liu and Xiang Xie and Jingming Kuang",

note = "Publisher Copyright: {\textcopyright} 2013 IEEE.",

year = "2019",

doi = "10.1109/ACCESS.2019.2906364",

language = "English",

volume = "7",

pages = "39639--39651",

journal = "IEEE Access",

issn = "2169-3536",

publisher = "Institute of Electrical and Electronics Engineers Inc.",

}

TY - JOUR

T1 - Compression of Head-Related Transfer Function Based on Tucker and Tensor Train Decomposition

AU - Wang, Jing

AU - Liu, Min

AU - Xie, Xiang

AU - Kuang, Jingming

PY - 2019

Y1 - 2019

N2 - Head-related transfer function (HRTF) plays an important role in three-dimensional spatial sound system. However, the direct application of a large amount of original HRTF data would involve a great deal of computational burden, especially for high-spatial-resolution individual HRTF. To address this problem, we propose a novel compression method (called TT-Tucker) combining Tucker model with tensor train decomposition based on a 5-order HRTF tensor model developed in subspaces of an ear, subject, azimuth, elevation, and frequency. Lots of HRTF data can be decomposed into several low-parametric factors representing the key spectrum information of HRTF by capturing the hidden interactions among different subspaces. To evaluate the reconstruction performance, the numerical experiments were conducted on the CIPIC HRTF database. Under the same compression ratio of nearly 98%, the results suggest that the proposed method has a better performance in spectral distortion and signal-to-distortion ratio than that of the usual tensor method and the standard method principal component analysis (PCA). Moreover, the subjective listening test shows that the TT-Tucker method performs better in that, the compressed and reconstructed HRTF is closer to the original HRTF in the sound localization similarity.

AB - Head-related transfer function (HRTF) plays an important role in three-dimensional spatial sound system. However, the direct application of a large amount of original HRTF data would involve a great deal of computational burden, especially for high-spatial-resolution individual HRTF. To address this problem, we propose a novel compression method (called TT-Tucker) combining Tucker model with tensor train decomposition based on a 5-order HRTF tensor model developed in subspaces of an ear, subject, azimuth, elevation, and frequency. Lots of HRTF data can be decomposed into several low-parametric factors representing the key spectrum information of HRTF by capturing the hidden interactions among different subspaces. To evaluate the reconstruction performance, the numerical experiments were conducted on the CIPIC HRTF database. Under the same compression ratio of nearly 98%, the results suggest that the proposed method has a better performance in spectral distortion and signal-to-distortion ratio than that of the usual tensor method and the standard method principal component analysis (PCA). Moreover, the subjective listening test shows that the TT-Tucker method performs better in that, the compressed and reconstructed HRTF is closer to the original HRTF in the sound localization similarity.

KW - Head-related transfer function

KW - Tucker model

KW - compression

KW - tensor train

UR - http://www.scopus.com/inward/record.url?scp=85065184534&partnerID=8YFLogxK

U2 - 10.1109/ACCESS.2019.2906364

DO - 10.1109/ACCESS.2019.2906364

M3 - Article

AN - SCOPUS:85065184534

SN - 2169-3536

VL - 7

SP - 39639

EP - 39651

JO - IEEE Access

JF - IEEE Access

M1 - 8672134

ER -

Compression of Head-Related Transfer Function Based on Tucker and Tensor Train Decomposition

Abstract

Keywords

Access to Document

Other files and links

Fingerprint

Cite this