Large scale speaker recognition method that uses 2D-haar acoustic feature

Er Man Xie; Sen Lin Luo; Li Min Pan

Large scale speaker recognition method that uses 2D-haar acoustic feature

Er Man Xie, Sen Lin Luo, Li Min Pan^*

^*Corresponding author for this work

School of Information and Electronics

Beijing Institute of Technology

Research output: Contribution to journal › Article › peer-review

Abstract

When we use the text-independent speaker recognition technology, the recognition accuracy degrades significantly as the number of target speakers increases. In order to improve the accuracy, a high accuracy large-scale speaker recognition method was proposed. This method combined certain number of continuous audio frames to be an acoustic feature figure, and then got the high-dimension 2D-Haar acoustic feature, which provide more probabilities to train a better classifier; AdaBoost.MH algorithm was employed to find out efficient 2D-Haar acoustic feature combination for classifier training. The experimental results show that recognition rate is 89.5% when the number of target speakers is 600, and average rate is 91.3% when the number of target speakers increases from 100 to 600. This method is suitable for large-scale speaker recognition and 2D-Haar acoustic feature is effective to yield higher performance. In addition, this method also has low algorithm complexity and time consumption.

Original language	English
Pages (from-to)	1196-1201
Number of pages	6
Journal	Beijing Ligong Daxue Xuebao/Transaction of Beijing Institute of Technology
Volume	34
Issue number	11
Publication status	Published - 1 Nov 2014

Keywords

2D-Haar acoustic feature
AdaBoost.MH
Speaker recognition

Cite this

@article{63dcc993e00f48efa2bda25578a850a9,

title = "Large scale speaker recognition method that uses 2D-haar acoustic feature",

abstract = "When we use the text-independent speaker recognition technology, the recognition accuracy degrades significantly as the number of target speakers increases. In order to improve the accuracy, a high accuracy large-scale speaker recognition method was proposed. This method combined certain number of continuous audio frames to be an acoustic feature figure, and then got the high-dimension 2D-Haar acoustic feature, which provide more probabilities to train a better classifier; AdaBoost.MH algorithm was employed to find out efficient 2D-Haar acoustic feature combination for classifier training. The experimental results show that recognition rate is 89.5% when the number of target speakers is 600, and average rate is 91.3% when the number of target speakers increases from 100 to 600. This method is suitable for large-scale speaker recognition and 2D-Haar acoustic feature is effective to yield higher performance. In addition, this method also has low algorithm complexity and time consumption.",

keywords = "2D-Haar acoustic feature, AdaBoost.MH, Speaker recognition",

author = "Xie, {Er Man} and Luo, {Sen Lin} and Pan, {Li Min}",

year = "2014",

month = nov,

day = "1",

language = "English",

volume = "34",

pages = "1196--1201",

journal = "Beijing Ligong Daxue Xuebao/Transaction of Beijing Institute of Technology",

issn = "1001-0645",

publisher = "Beijing Institute of Technology",

number = "11",

}

TY - JOUR

T1 - Large scale speaker recognition method that uses 2D-haar acoustic feature

AU - Xie, Er Man

AU - Luo, Sen Lin

AU - Pan, Li Min

PY - 2014/11/1

Y1 - 2014/11/1

N2 - When we use the text-independent speaker recognition technology, the recognition accuracy degrades significantly as the number of target speakers increases. In order to improve the accuracy, a high accuracy large-scale speaker recognition method was proposed. This method combined certain number of continuous audio frames to be an acoustic feature figure, and then got the high-dimension 2D-Haar acoustic feature, which provide more probabilities to train a better classifier; AdaBoost.MH algorithm was employed to find out efficient 2D-Haar acoustic feature combination for classifier training. The experimental results show that recognition rate is 89.5% when the number of target speakers is 600, and average rate is 91.3% when the number of target speakers increases from 100 to 600. This method is suitable for large-scale speaker recognition and 2D-Haar acoustic feature is effective to yield higher performance. In addition, this method also has low algorithm complexity and time consumption.

AB - When we use the text-independent speaker recognition technology, the recognition accuracy degrades significantly as the number of target speakers increases. In order to improve the accuracy, a high accuracy large-scale speaker recognition method was proposed. This method combined certain number of continuous audio frames to be an acoustic feature figure, and then got the high-dimension 2D-Haar acoustic feature, which provide more probabilities to train a better classifier; AdaBoost.MH algorithm was employed to find out efficient 2D-Haar acoustic feature combination for classifier training. The experimental results show that recognition rate is 89.5% when the number of target speakers is 600, and average rate is 91.3% when the number of target speakers increases from 100 to 600. This method is suitable for large-scale speaker recognition and 2D-Haar acoustic feature is effective to yield higher performance. In addition, this method also has low algorithm complexity and time consumption.

KW - 2D-Haar acoustic feature

KW - AdaBoost.MH

KW - Speaker recognition

UR - http://www.scopus.com/inward/record.url?scp=84920731559&partnerID=8YFLogxK

M3 - Article

AN - SCOPUS:84920731559

SN - 1001-0645

VL - 34

SP - 1196

EP - 1201

JO - Beijing Ligong Daxue Xuebao/Transaction of Beijing Institute of Technology

JF - Beijing Ligong Daxue Xuebao/Transaction of Beijing Institute of Technology

IS - 11

ER -

Large scale speaker recognition method that uses 2D-haar acoustic feature

Abstract

Keywords

Other files and links

Fingerprint

Cite this