A threshold denoising algorithm based on mathematical morphology for speech enhancement

Guangyan Li, Caixia Zheng*, Tingfa Xu, Xiaolin Cao, Mao Xingpeng, Shuangwei Wang

*Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

The presence of noise in speech signals can significantly degrade the performance of speech recognition systems. A threshold denoising method based on mathematical morphology is proposed to reduce background white noise. In the method we consider speech spectrograms as images and construct binary images from a normalized 256-level gray scale spectrogram image. We take advantage of a sudden slowing in the average value (ratio of the number of ‘1’ pixels to the total pixel number) of the binary image, and use it as the threshold value to zero spectrogram elements below the threshold, normalize the spectrogram, and finally, reconstruct the original speech signal to achieve the goal of speech enhancement. The main advantage of the algorithm is fast speed that is highly desired in real-time speech processing.

Original languageEnglish
Title of host publicationCommunications, Signal Processing, and Systems - Proceedings of the 2017 International Conference on Communications, Signal Processing, and Systems
EditorsQilian Liang, Min Jia, Jiasong Mu, Wei Wang, Xuhong Feng, Baoju Zhang
PublisherSpringer Verlag
Pages1776-1784
Number of pages9
ISBN (Print)9789811065705
DOIs
Publication statusPublished - 2019
Event6th International Conference on Communications, Signal Processing, and Systems, CSPS 2017 - Harbin, China
Duration: 14 Jul 201716 Jul 2017

Publication series

NameLecture Notes in Electrical Engineering
Volume463
ISSN (Print)1876-1100
ISSN (Electronic)1876-1119

Conference

Conference6th International Conference on Communications, Signal Processing, and Systems, CSPS 2017
Country/TerritoryChina
CityHarbin
Period14/07/1716/07/17

Fingerprint

Dive into the research topics of 'A threshold denoising algorithm based on mathematical morphology for speech enhancement'. Together they form a unique fingerprint.

Cite this