Large scale speaker recognition method that uses 2D-haar acoustic feature

Er Man Xie; Sen Lin Luo; Li Min Pan

Large scale speaker recognition method that uses 2D-haar acoustic feature

Er Man Xie, Sen Lin Luo, Li Min Pan^*

^*此作品的通讯作者

信息与电子学院

Beijing Institute of Technology

科研成果: 期刊稿件 › 文章 › 同行评审

摘要

When we use the text-independent speaker recognition technology, the recognition accuracy degrades significantly as the number of target speakers increases. In order to improve the accuracy, a high accuracy large-scale speaker recognition method was proposed. This method combined certain number of continuous audio frames to be an acoustic feature figure, and then got the high-dimension 2D-Haar acoustic feature, which provide more probabilities to train a better classifier; AdaBoost.MH algorithm was employed to find out efficient 2D-Haar acoustic feature combination for classifier training. The experimental results show that recognition rate is 89.5% when the number of target speakers is 600, and average rate is 91.3% when the number of target speakers increases from 100 to 600. This method is suitable for large-scale speaker recognition and 2D-Haar acoustic feature is effective to yield higher performance. In addition, this method also has low algorithm complexity and time consumption.

源语言	英语
页（从-至）	1196-1201
页数	6
期刊	Beijing Ligong Daxue Xuebao/Transaction of Beijing Institute of Technology
卷	34
期	11
出版状态	已出版 - 1 11月 2014

其它文件与链接

链接到 Scopus 的出版物

引用此

@article{63dcc993e00f48efa2bda25578a850a9,

title = "Large scale speaker recognition method that uses 2D-haar acoustic feature",

abstract = "When we use the text-independent speaker recognition technology, the recognition accuracy degrades significantly as the number of target speakers increases. In order to improve the accuracy, a high accuracy large-scale speaker recognition method was proposed. This method combined certain number of continuous audio frames to be an acoustic feature figure, and then got the high-dimension 2D-Haar acoustic feature, which provide more probabilities to train a better classifier; AdaBoost.MH algorithm was employed to find out efficient 2D-Haar acoustic feature combination for classifier training. The experimental results show that recognition rate is 89.5% when the number of target speakers is 600, and average rate is 91.3% when the number of target speakers increases from 100 to 600. This method is suitable for large-scale speaker recognition and 2D-Haar acoustic feature is effective to yield higher performance. In addition, this method also has low algorithm complexity and time consumption.",

keywords = "2D-Haar acoustic feature, AdaBoost.MH, Speaker recognition",

author = "Xie, {Er Man} and Luo, {Sen Lin} and Pan, {Li Min}",

year = "2014",

month = nov,

day = "1",

language = "English",

volume = "34",

pages = "1196--1201",

journal = "Beijing Ligong Daxue Xuebao/Transaction of Beijing Institute of Technology",

issn = "1001-0645",

publisher = "Beijing Institute of Technology",

number = "11",

}

TY - JOUR

T1 - Large scale speaker recognition method that uses 2D-haar acoustic feature

AU - Xie, Er Man

AU - Luo, Sen Lin

AU - Pan, Li Min

PY - 2014/11/1

Y1 - 2014/11/1

N2 - When we use the text-independent speaker recognition technology, the recognition accuracy degrades significantly as the number of target speakers increases. In order to improve the accuracy, a high accuracy large-scale speaker recognition method was proposed. This method combined certain number of continuous audio frames to be an acoustic feature figure, and then got the high-dimension 2D-Haar acoustic feature, which provide more probabilities to train a better classifier; AdaBoost.MH algorithm was employed to find out efficient 2D-Haar acoustic feature combination for classifier training. The experimental results show that recognition rate is 89.5% when the number of target speakers is 600, and average rate is 91.3% when the number of target speakers increases from 100 to 600. This method is suitable for large-scale speaker recognition and 2D-Haar acoustic feature is effective to yield higher performance. In addition, this method also has low algorithm complexity and time consumption.

AB - When we use the text-independent speaker recognition technology, the recognition accuracy degrades significantly as the number of target speakers increases. In order to improve the accuracy, a high accuracy large-scale speaker recognition method was proposed. This method combined certain number of continuous audio frames to be an acoustic feature figure, and then got the high-dimension 2D-Haar acoustic feature, which provide more probabilities to train a better classifier; AdaBoost.MH algorithm was employed to find out efficient 2D-Haar acoustic feature combination for classifier training. The experimental results show that recognition rate is 89.5% when the number of target speakers is 600, and average rate is 91.3% when the number of target speakers increases from 100 to 600. This method is suitable for large-scale speaker recognition and 2D-Haar acoustic feature is effective to yield higher performance. In addition, this method also has low algorithm complexity and time consumption.

KW - 2D-Haar acoustic feature

KW - AdaBoost.MH

KW - Speaker recognition

UR - http://www.scopus.com/inward/record.url?scp=84920731559&partnerID=8YFLogxK

M3 - Article

AN - SCOPUS:84920731559

SN - 1001-0645

VL - 34

SP - 1196

EP - 1201

JO - Beijing Ligong Daxue Xuebao/Transaction of Beijing Institute of Technology

JF - Beijing Ligong Daxue Xuebao/Transaction of Beijing Institute of Technology

IS - 11

ER -

Large scale speaker recognition method that uses 2D-haar acoustic feature

摘要

其它文件与链接

指纹

引用此