摘要
We propose an acoustic feature for speech recognition based on the combination of MFCC and fractional Fourier transform (FrFT). Since the transform order is critical for the performance of FrFT, we use the ambiguity function to adaptively determine the optimal orders of FrFT for each frame. The performance of the proposed feature is compared with traditional MFCCs on recognizing speech of isolated and connected digits under both clean and noisy backgrounds. The recognition results and detailed confusion matrices are given and analyzed, which implies that the proposed feature is promising in certain speech processing fields.
| 源语言 | 英语 |
|---|---|
| 页(从-至) | 654-657 |
| 页数 | 4 |
| 期刊 | Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH |
| 出版状态 | 已出版 - 2008 |
| 活动 | INTERSPEECH 2008 - 9th Annual Conference of the International Speech Communication Association - Brisbane, QLD, 澳大利亚 期限: 22 9月 2008 → 26 9月 2008 |
指纹
探究 'Adaptive-order fractional fourier transform features for speech recognition' 的科研主题。它们共同构成独一无二的指纹。引用此
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver