Bird sounds classification by large scale acoustic features and extreme learning machine

Kun Qian, Zixing Zhang, Fabien Ringeval, Bjorn Schuller

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

26 Citations (Scopus)

Abstract

Automatically classifying bird species by their sound signals is of crucial relevance for the research of ornithologists and ecologists. In this study, we present a novel framework for bird sounds classification from audio recordings. Firstly, the p-centre is used to detect the 'syllables' of bird songs, which are the units for the recognition task; then, we use our openSMILE toolkit to extract large scales of acoustic features from chunked units of analysis (the 'syllables'). ReliefF helps to reduce the dimension of the feature space. Lastly, an Extreme Learning Machine (ELM) serves for decision making. Results demonstrate that our system can achieve an excellent and robust performance scalable to different numbers of species (mean unweighted average recall of 93.82%, 89.56%, 85.30%, and 83.12% corresponding to 20, 30, 40, and 50 species of birds, respectively).

Original languageEnglish
Title of host publication2015 IEEE Global Conference on Signal and Information Processing, GlobalSIP 2015
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages1317-1321
Number of pages5
ISBN (Electronic)9781479975914
DOIs
Publication statusPublished - 23 Feb 2016
Externally publishedYes
EventIEEE Global Conference on Signal and Information Processing, GlobalSIP 2015 - Orlando, United States
Duration: 13 Dec 201516 Dec 2015

Publication series

Name2015 IEEE Global Conference on Signal and Information Processing, GlobalSIP 2015

Conference

ConferenceIEEE Global Conference on Signal and Information Processing, GlobalSIP 2015
Country/TerritoryUnited States
CityOrlando
Period13/12/1516/12/15

Keywords

  • Bird Sounds
  • Extreme Learning Machine
  • ReliefF
  • openSMILE
  • p-centre

Fingerprint

Dive into the research topics of 'Bird sounds classification by large scale acoustic features and extreme learning machine'. Together they form a unique fingerprint.

Cite this