Individual HRTF Prediction Based on Anthropometric Data and Multi-Stage Model

Yinliang Qiu; Zhiyu Li; Jing Wang

doi:10.1109/ICMEW59549.2023.00060

Individual HRTF Prediction Based on Anthropometric Data and Multi-Stage Model

Yinliang Qiu, Zhiyu Li, Jing Wang^*

^*此作品的通讯作者

信息与电子学院

Beijing Institute of Technology

科研成果: 书/报告/会议事项章节 › 会议稿件 › 同行评审

1 引用（Scopus）

摘要

Getting individual head related transfer function (HRTF) is an important step in rendering binaural immersive audio. Individual HRTF can provide a more realistic experience than general HRTF. For more accurate prediction results, we propose a multi-stage model perform individual HRTF prediction based on anthropometric data. This model can combine global and local features through different stages. In the first stage, light gradient boosting machine(LightGBM) is chosen as decision tress model to predict HRTF according to anthropometric data and different angels. In the second stage, Transformer encoder is chosen to learn the global information between different frequency points. According to the experimental results, the effect of using a multi-stage model is better than that of a single model. The spectral distortion of the results predicted by our model is smaller, which can illustrate the effectiveness of our model.

源语言	英语
主期刊名	Proceedings - 2023 IEEE International Conference on Multimedia and Expo Workshops, ICMEW 2023
出版商	Institute of Electrical and Electronics Engineers Inc.
页	314-319
页数	6
ISBN（电子版）	9798350313154
DOI	https://doi.org/10.1109/ICMEW59549.2023.00060
出版状态	已出版 - 2023
活动	2023 IEEE International Conference on Multimedia and Expo Workshops, ICMEW 2023 - Brisbane, 澳大利亚期限: 10 7月 2023 → 14 7月 2023

出版系列

姓名	Proceedings - 2023 IEEE International Conference on Multimedia and Expo Workshops, ICMEW 2023

会议

会议	2023 IEEE International Conference on Multimedia and Expo Workshops, ICMEW 2023
国家/地区	澳大利亚
市	Brisbane
时期	10/07/23 → 14/07/23

访问文件

10.1109/ICMEW59549.2023.00060

其它文件与链接

链接到 Scopus 的出版物

引用此

Qiu, Y., Li, Z., & Wang, J. (2023). Individual HRTF Prediction Based on Anthropometric Data and Multi-Stage Model. 在 Proceedings - 2023 IEEE International Conference on Multimedia and Expo Workshops, ICMEW 2023 (页码 314-319). (Proceedings - 2023 IEEE International Conference on Multimedia and Expo Workshops, ICMEW 2023). Institute of Electrical and Electronics Engineers Inc.. https://doi.org/10.1109/ICMEW59549.2023.00060

Qiu, Yinliang ; Li, Zhiyu ; Wang, Jing. / Individual HRTF Prediction Based on Anthropometric Data and Multi-Stage Model. Proceedings - 2023 IEEE International Conference on Multimedia and Expo Workshops, ICMEW 2023. Institute of Electrical and Electronics Engineers Inc., 2023. 页码 314-319 (Proceedings - 2023 IEEE International Conference on Multimedia and Expo Workshops, ICMEW 2023).

@inproceedings{94ff5aacd8c54dfd8dc60213cf3080e1,

title = "Individual HRTF Prediction Based on Anthropometric Data and Multi-Stage Model",

abstract = "Getting individual head related transfer function (HRTF) is an important step in rendering binaural immersive audio. Individual HRTF can provide a more realistic experience than general HRTF. For more accurate prediction results, we propose a multi-stage model perform individual HRTF prediction based on anthropometric data. This model can combine global and local features through different stages. In the first stage, light gradient boosting machine(LightGBM) is chosen as decision tress model to predict HRTF according to anthropometric data and different angels. In the second stage, Transformer encoder is chosen to learn the global information between different frequency points. According to the experimental results, the effect of using a multi-stage model is better than that of a single model. The spectral distortion of the results predicted by our model is smaller, which can illustrate the effectiveness of our model.",

keywords = "Individual HRTF, LightGBM, Transformer encoder, multi-stage model",

author = "Yinliang Qiu and Zhiyu Li and Jing Wang",

note = "Publisher Copyright: {\textcopyright} 2023 IEEE.; 2023 IEEE International Conference on Multimedia and Expo Workshops, ICMEW 2023 ; Conference date: 10-07-2023 Through 14-07-2023",

year = "2023",

doi = "10.1109/ICMEW59549.2023.00060",

language = "English",

series = "Proceedings - 2023 IEEE International Conference on Multimedia and Expo Workshops, ICMEW 2023",

publisher = "Institute of Electrical and Electronics Engineers Inc.",

pages = "314--319",

booktitle = "Proceedings - 2023 IEEE International Conference on Multimedia and Expo Workshops, ICMEW 2023",

address = "United States",

}

Qiu, Y, Li, Z & Wang, J 2023, Individual HRTF Prediction Based on Anthropometric Data and Multi-Stage Model. 在 Proceedings - 2023 IEEE International Conference on Multimedia and Expo Workshops, ICMEW 2023. Proceedings - 2023 IEEE International Conference on Multimedia and Expo Workshops, ICMEW 2023, Institute of Electrical and Electronics Engineers Inc., 页码 314-319, 2023 IEEE International Conference on Multimedia and Expo Workshops, ICMEW 2023, Brisbane, 澳大利亚, 10/07/23. https://doi.org/10.1109/ICMEW59549.2023.00060

Individual HRTF Prediction Based on Anthropometric Data and Multi-Stage Model. / Qiu, Yinliang; Li, Zhiyu; Wang, Jing.
Proceedings - 2023 IEEE International Conference on Multimedia and Expo Workshops, ICMEW 2023. Institute of Electrical and Electronics Engineers Inc., 2023. 页码 314-319 (Proceedings - 2023 IEEE International Conference on Multimedia and Expo Workshops, ICMEW 2023).

科研成果: 书/报告/会议事项章节 › 会议稿件 › 同行评审

TY - GEN

T1 - Individual HRTF Prediction Based on Anthropometric Data and Multi-Stage Model

AU - Qiu, Yinliang

AU - Li, Zhiyu

AU - Wang, Jing

PY - 2023

Y1 - 2023

N2 - Getting individual head related transfer function (HRTF) is an important step in rendering binaural immersive audio. Individual HRTF can provide a more realistic experience than general HRTF. For more accurate prediction results, we propose a multi-stage model perform individual HRTF prediction based on anthropometric data. This model can combine global and local features through different stages. In the first stage, light gradient boosting machine(LightGBM) is chosen as decision tress model to predict HRTF according to anthropometric data and different angels. In the second stage, Transformer encoder is chosen to learn the global information between different frequency points. According to the experimental results, the effect of using a multi-stage model is better than that of a single model. The spectral distortion of the results predicted by our model is smaller, which can illustrate the effectiveness of our model.

AB - Getting individual head related transfer function (HRTF) is an important step in rendering binaural immersive audio. Individual HRTF can provide a more realistic experience than general HRTF. For more accurate prediction results, we propose a multi-stage model perform individual HRTF prediction based on anthropometric data. This model can combine global and local features through different stages. In the first stage, light gradient boosting machine(LightGBM) is chosen as decision tress model to predict HRTF according to anthropometric data and different angels. In the second stage, Transformer encoder is chosen to learn the global information between different frequency points. According to the experimental results, the effect of using a multi-stage model is better than that of a single model. The spectral distortion of the results predicted by our model is smaller, which can illustrate the effectiveness of our model.

KW - Individual HRTF

KW - LightGBM

KW - Transformer encoder

KW - multi-stage model

UR - http://www.scopus.com/inward/record.url?scp=85172356251&partnerID=8YFLogxK

U2 - 10.1109/ICMEW59549.2023.00060

DO - 10.1109/ICMEW59549.2023.00060

M3 - Conference contribution

AN - SCOPUS:85172356251

T3 - Proceedings - 2023 IEEE International Conference on Multimedia and Expo Workshops, ICMEW 2023

SP - 314

EP - 319

BT - Proceedings - 2023 IEEE International Conference on Multimedia and Expo Workshops, ICMEW 2023

PB - Institute of Electrical and Electronics Engineers Inc.

T2 - 2023 IEEE International Conference on Multimedia and Expo Workshops, ICMEW 2023

Y2 - 10 July 2023 through 14 July 2023

ER -

Qiu Y, Li Z, Wang J. Individual HRTF Prediction Based on Anthropometric Data and Multi-Stage Model. 在 Proceedings - 2023 IEEE International Conference on Multimedia and Expo Workshops, ICMEW 2023. Institute of Electrical and Electronics Engineers Inc. 2023. 页码 314-319. (Proceedings - 2023 IEEE International Conference on Multimedia and Expo Workshops, ICMEW 2023). doi: 10.1109/ICMEW59549.2023.00060

Individual HRTF Prediction Based on Anthropometric Data and Multi-Stage Model

摘要

出版系列

会议

访问文件

其它文件与链接

指纹

引用此