Large scale speaker recognition method that uses 2D-haar acoustic feature

Er Man Xie, Sen Lin Luo, Li Min Pan*

*Corresponding author for this work

Research output: Contribution to journalArticlepeer-review

Abstract

When we use the text-independent speaker recognition technology, the recognition accuracy degrades significantly as the number of target speakers increases. In order to improve the accuracy, a high accuracy large-scale speaker recognition method was proposed. This method combined certain number of continuous audio frames to be an acoustic feature figure, and then got the high-dimension 2D-Haar acoustic feature, which provide more probabilities to train a better classifier; AdaBoost.MH algorithm was employed to find out efficient 2D-Haar acoustic feature combination for classifier training. The experimental results show that recognition rate is 89.5% when the number of target speakers is 600, and average rate is 91.3% when the number of target speakers increases from 100 to 600. This method is suitable for large-scale speaker recognition and 2D-Haar acoustic feature is effective to yield higher performance. In addition, this method also has low algorithm complexity and time consumption.

Original languageEnglish
Pages (from-to)1196-1201
Number of pages6
JournalBeijing Ligong Daxue Xuebao/Transaction of Beijing Institute of Technology
Volume34
Issue number11
Publication statusPublished - 1 Nov 2014

Keywords

  • 2D-Haar acoustic feature
  • AdaBoost.MH
  • Speaker recognition

Fingerprint

Dive into the research topics of 'Large scale speaker recognition method that uses 2D-haar acoustic feature'. Together they form a unique fingerprint.

Cite this