Mandarin digits speech recognition using support vector machines

Xiang Xie*, Jing Ming Kuang

*Corresponding author for this work

Research output: Contribution to journalArticlepeer-review

2 Citations (Scopus)

Abstract

A method of applying support vector machine (SVM) in speech recognition was proposed, and a speech recognition system for mandarin digits was built up by SVMs. In the system, vectors were linearly extracted from speech feature sequence to make up time-aligned input patterns for SVM, and the decisions of several 2-class SVM classifiers were employed for constructing an N-class classifier. Four kinds of SVM kernel functions were compared in the experiments of speaker-independent speech recognition of mandarin digits. And the kernel of radial basis function has the highest accurate rate of 99.33%, which is better than that of the baseline system based on hidden Markov models (HMM) (97.08%). And the experiments also show that SVM can outperform HMM especially when the samples for learning were very limited.

Original languageEnglish
Pages (from-to)9-12
Number of pages4
JournalJournal of Beijing Institute of Technology (English Edition)
Volume14
Issue number1
Publication statusPublished - Mar 2005

Keywords

  • Kernel function
  • Speech recognition
  • Support vector machine (SVM)

Fingerprint

Dive into the research topics of 'Mandarin digits speech recognition using support vector machines'. Together they form a unique fingerprint.

Cite this