TY - JOUR
T1 - 面向虚拟现实场景的房间脉冲响应计算模型
AU - Li, Zhiyu
AU - Wang, Jing
AU - Yue, Xinwen
AU - Yang, Lidong
AU - Zhao, Shenghui
AU - Xie, Xiang
N1 - Publisher Copyright:
© 2024 Science Press. All rights reserved.
PY - 2024/11
Y1 - 2024/11
N2 - This study proposes a room impulse response (RIR) computation model tailored for virtual reality applications, integrating deep learning neural network techniques with psychoacoustic perception parameters. This model can efficiently predict perceptually meaningful RIRs from virtual reality scene data while ensuring high-quality predictions. It meets the requirements for real-time generation, high sampling rate, unrestricted length, and lightweight implementation in virtual reality audio scenarios. The model first encodes the acoustic information from the scene using a graph convolutional neural network, then decodes this information through a neural sound field and transposed convolution model to obtain the RIR perception parameters. Finally, the RIR signal is reconstructed from these parameters. Experimental results demonstrate that the proposed model offers significant advantages in RIR generation quality, computational efficiency, and functionality, making it well-suited to meet the real-time RIR generation needs of virtual reality audio.
AB - This study proposes a room impulse response (RIR) computation model tailored for virtual reality applications, integrating deep learning neural network techniques with psychoacoustic perception parameters. This model can efficiently predict perceptually meaningful RIRs from virtual reality scene data while ensuring high-quality predictions. It meets the requirements for real-time generation, high sampling rate, unrestricted length, and lightweight implementation in virtual reality audio scenarios. The model first encodes the acoustic information from the scene using a graph convolutional neural network, then decodes this information through a neural sound field and transposed convolution model to obtain the RIR perception parameters. Finally, the RIR signal is reconstructed from these parameters. Experimental results demonstrate that the proposed model offers significant advantages in RIR generation quality, computational efficiency, and functionality, making it well-suited to meet the real-time RIR generation needs of virtual reality audio.
KW - Deep learning
KW - Perceptual parameter
KW - Room impulse response
KW - Virtual reality
UR - http://www.scopus.com/inward/record.url?scp=85211001424&partnerID=8YFLogxK
U2 - 10.12395/0371-0025.2024150
DO - 10.12395/0371-0025.2024150
M3 - 文章
AN - SCOPUS:85211001424
SN - 0371-0025
VL - 49
SP - 1186
EP - 1196
JO - Shengxue Xuebao/Acta Acustica
JF - Shengxue Xuebao/Acta Acustica
IS - 6
ER -