A novel algorithm of seeking FrFT order for speech processing

Duo Jia Ma*, Xiang Xie, Jing Ming Kuang

*Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

6 Citations (Scopus)

Abstract

The determination of the optimal fractional Fourier transform (FrFT) order is a crucial issue for FrFT. This paper introduces a novel algorithm is proposed for the estimation of FrFT order. We use the information of pitches, harmonies and formants in the correlogram of Gammatone filterbanks to get a few candidates of the transform order. The proposed method reduces the computation complexity in the searching of optimal transform order. We apply this method for speech processing such as Mel-frequency cepstral coefficients (MFCC) extraction and speech enhancement. The experiment of MFCC extraction shows that the proposed method is superior to the traditional method based on Fourier Transform in the sense of Fisher distance improvement. The experimental results of speech enhancement also show the improvement in sense of SNR and the Itakura-Saito distance of LPC coefficients.

Original languageEnglish
Title of host publication2011 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2011 - Proceedings
Pages3832-3835
Number of pages4
DOIs
Publication statusPublished - 2011
Event36th IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2011 - Prague, Czech Republic
Duration: 22 May 201127 May 2011

Publication series

NameICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings
ISSN (Print)1520-6149

Conference

Conference36th IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2011
Country/TerritoryCzech Republic
CityPrague
Period22/05/1127/05/11

Keywords

  • FrFT order
  • MFCC
  • correlogram
  • spectral subtraction
  • speech enhancement

Fingerprint

Dive into the research topics of 'A novel algorithm of seeking FrFT order for speech processing'. Together they form a unique fingerprint.

Cite this