TY - GEN
T1 - Bird sounds classification by large scale acoustic features and extreme learning machine
AU - Qian, Kun
AU - Zhang, Zixing
AU - Ringeval, Fabien
AU - Schuller, Bjorn
N1 - Publisher Copyright:
© 2015 IEEE.
PY - 2016/2/23
Y1 - 2016/2/23
N2 - Automatically classifying bird species by their sound signals is of crucial relevance for the research of ornithologists and ecologists. In this study, we present a novel framework for bird sounds classification from audio recordings. Firstly, the p-centre is used to detect the 'syllables' of bird songs, which are the units for the recognition task; then, we use our openSMILE toolkit to extract large scales of acoustic features from chunked units of analysis (the 'syllables'). ReliefF helps to reduce the dimension of the feature space. Lastly, an Extreme Learning Machine (ELM) serves for decision making. Results demonstrate that our system can achieve an excellent and robust performance scalable to different numbers of species (mean unweighted average recall of 93.82%, 89.56%, 85.30%, and 83.12% corresponding to 20, 30, 40, and 50 species of birds, respectively).
AB - Automatically classifying bird species by their sound signals is of crucial relevance for the research of ornithologists and ecologists. In this study, we present a novel framework for bird sounds classification from audio recordings. Firstly, the p-centre is used to detect the 'syllables' of bird songs, which are the units for the recognition task; then, we use our openSMILE toolkit to extract large scales of acoustic features from chunked units of analysis (the 'syllables'). ReliefF helps to reduce the dimension of the feature space. Lastly, an Extreme Learning Machine (ELM) serves for decision making. Results demonstrate that our system can achieve an excellent and robust performance scalable to different numbers of species (mean unweighted average recall of 93.82%, 89.56%, 85.30%, and 83.12% corresponding to 20, 30, 40, and 50 species of birds, respectively).
KW - Bird Sounds
KW - Extreme Learning Machine
KW - ReliefF
KW - openSMILE
KW - p-centre
UR - http://www.scopus.com/inward/record.url?scp=84964734785&partnerID=8YFLogxK
U2 - 10.1109/GlobalSIP.2015.7418412
DO - 10.1109/GlobalSIP.2015.7418412
M3 - Conference contribution
AN - SCOPUS:84964734785
T3 - 2015 IEEE Global Conference on Signal and Information Processing, GlobalSIP 2015
SP - 1317
EP - 1321
BT - 2015 IEEE Global Conference on Signal and Information Processing, GlobalSIP 2015
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - IEEE Global Conference on Signal and Information Processing, GlobalSIP 2015
Y2 - 13 December 2015 through 16 December 2015
ER -