Evolving learning for analysing mood-related infant vocalisation

Zixing Zhang; Jing Han; Kun Qian; Björn Schuller

doi:10.21437/Interspeech.2018-1914

Evolving learning for analysing mood-related infant vocalisation

Zixing Zhang, Jing Han, Kun Qian, Björn Schuller

科研成果: 期刊稿件 › 会议文章 › 同行评审

5 引用（Scopus）

摘要

Infant vocalisation analysis plays an important role in the study of the development of pre-speech capability of infants, while machine-based approaches nowadays emerge with an aim to advance such an analysis. However, conventional machine learning techniques require heavy feature-engineering and refined architecture designing. In this paper, we present an evolving learning framework to automate the design of neural network structures for infant vocalisation analysis. In contrast to manually searching by trial and error, we aim to automate the search process in a given space with less interference. This framework consists of a controller and its child networks, where the child networks are built according to the controller's estimation. When applying the framework to the Interspeech 2018 Computational Paralinguistics (ComParE) Crying Sub-challenge, we discover several deep recurrent neural network structures, which are able to deliver competitive results to the best ComParE baseline method.

源语言	英语
页（从-至）	142-146
页数	5
期刊	Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH
卷	2018-September
DOI	https://doi.org/10.21437/Interspeech.2018-1914
出版状态	已出版 - 2018
已对外发布	是
活动	19th Annual Conference of the International Speech Communication, INTERSPEECH 2018 - Hyderabad, 印度期限: 2 9月 2018 → 6 9月 2018

访问文件

10.21437/Interspeech.2018-1914

其它文件与链接

链接到 Scopus 的出版物

引用此

Zhang, Z., Han, J., Qian, K., & Schuller, B. (2018). Evolving learning for analysing mood-related infant vocalisation. Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, 2018-September, 142-146. https://doi.org/10.21437/Interspeech.2018-1914

@article{21493092b88e417db3155d1636fc3b73,

title = "Evolving learning for analysing mood-related infant vocalisation",

abstract = "Infant vocalisation analysis plays an important role in the study of the development of pre-speech capability of infants, while machine-based approaches nowadays emerge with an aim to advance such an analysis. However, conventional machine learning techniques require heavy feature-engineering and refined architecture designing. In this paper, we present an evolving learning framework to automate the design of neural network structures for infant vocalisation analysis. In contrast to manually searching by trial and error, we aim to automate the search process in a given space with less interference. This framework consists of a controller and its child networks, where the child networks are built according to the controller's estimation. When applying the framework to the Interspeech 2018 Computational Paralinguistics (ComParE) Crying Sub-challenge, we discover several deep recurrent neural network structures, which are able to deliver competitive results to the best ComParE baseline method.",

keywords = "Evolving learning, Infant vocalisation, Neural network architecture, Speech/voice analysis",

author = "Zixing Zhang and Jing Han and Kun Qian and Bj{\"o}rn Schuller",

note = "Publisher Copyright: {\textcopyright} 2018 International Speech Communication Association. All rights reserved.; 19th Annual Conference of the International Speech Communication, INTERSPEECH 2018 ; Conference date: 02-09-2018 Through 06-09-2018",

year = "2018",

doi = "10.21437/Interspeech.2018-1914",

language = "English",

volume = "2018-September",

pages = "142--146",

journal = "Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH",

issn = "2308-457X",

}

TY - JOUR

T1 - Evolving learning for analysing mood-related infant vocalisation

AU - Zhang, Zixing

AU - Han, Jing

AU - Qian, Kun

AU - Schuller, Björn

PY - 2018

Y1 - 2018

N2 - Infant vocalisation analysis plays an important role in the study of the development of pre-speech capability of infants, while machine-based approaches nowadays emerge with an aim to advance such an analysis. However, conventional machine learning techniques require heavy feature-engineering and refined architecture designing. In this paper, we present an evolving learning framework to automate the design of neural network structures for infant vocalisation analysis. In contrast to manually searching by trial and error, we aim to automate the search process in a given space with less interference. This framework consists of a controller and its child networks, where the child networks are built according to the controller's estimation. When applying the framework to the Interspeech 2018 Computational Paralinguistics (ComParE) Crying Sub-challenge, we discover several deep recurrent neural network structures, which are able to deliver competitive results to the best ComParE baseline method.

AB - Infant vocalisation analysis plays an important role in the study of the development of pre-speech capability of infants, while machine-based approaches nowadays emerge with an aim to advance such an analysis. However, conventional machine learning techniques require heavy feature-engineering and refined architecture designing. In this paper, we present an evolving learning framework to automate the design of neural network structures for infant vocalisation analysis. In contrast to manually searching by trial and error, we aim to automate the search process in a given space with less interference. This framework consists of a controller and its child networks, where the child networks are built according to the controller's estimation. When applying the framework to the Interspeech 2018 Computational Paralinguistics (ComParE) Crying Sub-challenge, we discover several deep recurrent neural network structures, which are able to deliver competitive results to the best ComParE baseline method.

KW - Evolving learning

KW - Infant vocalisation

KW - Neural network architecture

KW - Speech/voice analysis

UR - http://www.scopus.com/inward/record.url?scp=85054972660&partnerID=8YFLogxK

U2 - 10.21437/Interspeech.2018-1914

DO - 10.21437/Interspeech.2018-1914

M3 - Conference article

AN - SCOPUS:85054972660

SN - 2308-457X

VL - 2018-September

SP - 142

EP - 146

JO - Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH

JF - Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH

T2 - 19th Annual Conference of the International Speech Communication, INTERSPEECH 2018

Y2 - 2 September 2018 through 6 September 2018

ER -

Evolving learning for analysing mood-related infant vocalisation

摘要

访问文件

其它文件与链接

指纹

引用此