TY - JOUR
T1 - 2D-Haar acoustic super feature vector fast generation method
AU - Xie, Er Man
AU - Luo, Sen Lin
AU - Pan, Li Min
N1 - Publisher Copyright:
© 2016, Beijing Institute of Technology. All right reserved.
PY - 2016/3/1
Y1 - 2016/3/1
N2 - A fast and efficient acoustic feature super vector generation method was proposed to effectively improve the recognition accuracy and speed yielded by traditional frame based acoustic features. This paper makes 3 contributions: firstly, certain number of acoustic feature vectors extracted from continuous audio frames was combined to be an acoustic feature image; secondly, AdaBoost. MH algorithm was used to select higher representative 2D-Haar pattern combinations to construct super feature vectors; thirdly, random feature selection method was proposed to further improve the processing speed. Experimental results show that under 3 kinds of audio recognition occasions such as audio events recognition, speaker recognition, speaker gender recognition, the use of 2D-Haar acoustic feature super vector can make SVM, C5.0, AdaBoost algorithms obtain higher recognition accuracy than ones that MFCC, PLP, LPCC and other traditional acoustic features yielded, and can make the training processing 7~20 times faster and the recognition processing 5~10 times faster.
AB - A fast and efficient acoustic feature super vector generation method was proposed to effectively improve the recognition accuracy and speed yielded by traditional frame based acoustic features. This paper makes 3 contributions: firstly, certain number of acoustic feature vectors extracted from continuous audio frames was combined to be an acoustic feature image; secondly, AdaBoost. MH algorithm was used to select higher representative 2D-Haar pattern combinations to construct super feature vectors; thirdly, random feature selection method was proposed to further improve the processing speed. Experimental results show that under 3 kinds of audio recognition occasions such as audio events recognition, speaker recognition, speaker gender recognition, the use of 2D-Haar acoustic feature super vector can make SVM, C5.0, AdaBoost algorithms obtain higher recognition accuracy than ones that MFCC, PLP, LPCC and other traditional acoustic features yielded, and can make the training processing 7~20 times faster and the recognition processing 5~10 times faster.
KW - 2D-Haar acoustic feature
KW - 2D-Haar feature super vector
KW - AdaBoost. MH
KW - Audio processing
KW - Audio recognition
UR - http://www.scopus.com/inward/record.url?scp=84964788284&partnerID=8YFLogxK
U2 - 10.15918/j.tbit1001-0645.2016.03.014
DO - 10.15918/j.tbit1001-0645.2016.03.014
M3 - Article
AN - SCOPUS:84964788284
SN - 1001-0645
VL - 36
SP - 295
EP - 301
JO - Beijing Ligong Daxue Xuebao/Transaction of Beijing Institute of Technology
JF - Beijing Ligong Daxue Xuebao/Transaction of Beijing Institute of Technology
IS - 3
ER -