Microphone array speech enhancement based on tensor filtering methods

Jing Wang, Xiang Xie*, Jingming Kuang

*Corresponding author for this work

Research output: Contribution to journalArticlepeer-review

9 Citations (Scopus)

Abstract

This paper proposes a novel microphone array speech denoising scheme based on tensor filtering methods including truncated HOSVD (High-Order Singular Value Decomposition), low rank tensor approximation and multi-mode Wiener filtering. Microphone array speech signal is represented in three-order tensor space with channel, time, and spectrum modes and then tensor filtering model can be designed to process the multiway array data. As to the first method, noise can be reduced through the truncated HOSVD which is a simple scheme in tensor processing. It is more accurate to find the lower-rank approximation of the three-order tensor with Tucker model. Then MDL (Minimum Description Length) criterion is used to estimate the optimal tensor rank in the second method. Further, multimode Wiener filtering approach upon tensor analysis can be considered as the spanning of one-mode wiener filtering. How to take advantages of tensor model to obtain a set of filters is the heart of the novel scheme. The performances of the proposed three approaches are evaluated with objective indexes and listening quality test. The experimental results indicate that the proposed tenor filtering methods have potential ability of retrieving the target signal from noisy microphone array signal and the multi-mode Wiener filtering method provides the best denoising results among the three ones.

Original languageEnglish
Pages (from-to)141-152
Number of pages12
JournalChina Communications
Volume15
Issue number4
DOIs
Publication statusPublished - Apr 2018

Keywords

  • low rank approximation
  • microphone array
  • multi-mode Wiener filtering
  • speech denoising
  • tensor filtering
  • truncated HOSVD

Fingerprint

Dive into the research topics of 'Microphone array speech enhancement based on tensor filtering methods'. Together they form a unique fingerprint.

Cite this