Binaural speech enhancement based on DNN for the application of virtual reality

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

1 Citation (Scopus)

Abstract

Binaural sound can increase the immersion in virtual reality scenes due to the sense of direction, but when recorded in real-world, it may be corrupted by noise. Some of the existing binaural speech enhancement or separation methods can only provide the single-channel output, which will lead to the loss of the sense of direction. Some methods can provide the dual-channel output, however, such methods will suffer performance loss when the binaural clean speeches and the binaural noise are in the same direction. In this paper, we propose a binaural speech enhancement method based on deep neural network, aiming at dealing with the situation that binaural clean speeches and binaural noises are in the same direction. By mapping the features of the binaural noisy speeches to the labels of the binaural clean speeches, the dual-channel output can be obtained. Besides, batch normalization layer is introduced to further improve the performance. Compared with the baseline methods, the proposed method can obtain better speech quality and intelligibility, and the sense of the direction of the estimated binaural speeches can also be better preserved.

Original languageEnglish
Title of host publication2018 14th IEEE International Conference on Signal Processing Proceedings, ICSP 2018
EditorsYuan Baozong, Ruan Qiuqi, Zhao Yao, An Gaoyun
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages629-633
Number of pages5
ISBN (Electronic)9781538646724
DOIs
Publication statusPublished - 2 Feb 2019
Event14th IEEE International Conference on Signal Processing, ICSP 2018 - Beijing, China
Duration: 12 Aug 201816 Aug 2018

Publication series

NameInternational Conference on Signal Processing Proceedings, ICSP
Volume2018-August

Conference

Conference14th IEEE International Conference on Signal Processing, ICSP 2018
Country/TerritoryChina
CityBeijing
Period12/08/1816/08/18

Keywords

  • Binaural speech enhancement
  • Deep neural network
  • Log-power spectra
  • Virtual reality

Fingerprint

Dive into the research topics of 'Binaural speech enhancement based on DNN for the application of virtual reality'. Together they form a unique fingerprint.

Cite this