Adaptive-order fractional fourier transform features for speech recognition

Hui Yin*, Xiang Xie, Jingming Kuang

*Corresponding author for this work

Research output: Contribution to journalConference articlepeer-review

3 Citations (Scopus)

Abstract

We propose an acoustic feature for speech recognition based on the combination of MFCC and fractional Fourier transform (FrFT). Since the transform order is critical for the performance of FrFT, we use the ambiguity function to adaptively determine the optimal orders of FrFT for each frame. The performance of the proposed feature is compared with traditional MFCCs on recognizing speech of isolated and connected digits under both clean and noisy backgrounds. The recognition results and detailed confusion matrices are given and analyzed, which implies that the proposed feature is promising in certain speech processing fields.

Original languageEnglish
Pages (from-to)654-657
Number of pages4
JournalProceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH
Publication statusPublished - 2008
EventINTERSPEECH 2008 - 9th Annual Conference of the International Speech Communication Association - Brisbane, QLD, Australia
Duration: 22 Sept 200826 Sept 2008

Keywords

  • Ambiguity function
  • Feature extraction
  • Fractional fourier transform
  • Speech recognition

Fingerprint

Dive into the research topics of 'Adaptive-order fractional fourier transform features for speech recognition'. Together they form a unique fingerprint.

Cite this