跳到主要导航 跳到搜索 跳到主要内容

Speech recognition based on deep tensor neural network and multifactor feature

  • Beijing Institute of Technology

科研成果: 书/报告/会议事项章节会议稿件同行评审

摘要

This paper presents a speech recognition system based on deep tensor neural network which uses multifactor feature as input feature of acoustic model. First, a deep neural network is trained to estimate articulatory feature from input speech, where the training data is MOCHA database[1]. Mel frequency cepstrum coefficients in conjunction with articulatory feature are used as multifactor feature. Deep tensor neural network which involves tensor interactions among neurons is used as the acoustic model in this system. Speech recognition results indicate that the multifactor feature helps in improving speech recognition performance not only under clean conditions but also under noisy background conditions; deep tensor neural network is more capable of modeling multifactor features because of its tensor interactions than deep neural network.

源语言英语
主期刊名2019 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, APSIPA ASC 2019
出版商Institute of Electrical and Electronics Engineers Inc.
650-654
页数5
ISBN(电子版)9781728132488
DOI
出版状态已出版 - 11月 2019
活动2019 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, APSIPA ASC 2019 - Lanzhou, 中国
期限: 18 11月 201921 11月 2019

出版系列

姓名2019 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, APSIPA ASC 2019

会议

会议2019 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, APSIPA ASC 2019
国家/地区中国
Lanzhou
时期18/11/1921/11/19

指纹

探究 'Speech recognition based on deep tensor neural network and multifactor feature' 的科研主题。它们共同构成独一无二的指纹。

引用此